Monodnaviria | |
---|---|
Parvoviruses in blood | |
Virus classification | |
(unranked): | Virus |
Realm: | Monodnaviria |
Subtaxa | |
Monodnaviria is a realm of viruses that includes all single-stranded DNA viruses that encode an endonuclease of the HUH superfamily that initiates rolling circle replication of the circular viral genome. Viruses descended from such viruses are also included in the realm, including certain linear single-stranded DNA (ssDNA) viruses and circular double-stranded DNA (dsDNA) viruses. These atypical members typically replicate through means other than rolling circle replication.
Monodnaviria was established in 2019 and contains four kingdoms: Loebvirae , Sangervirae , Trapavirae , and Shotokuvirae . Viruses in the first three kingdoms infect prokaryotes, and viruses in Shotokuvirae infect eukaryotes and include the atypical members of the realm. Viruses in Monodnaviria appear to have come into existence independently multiple times from circular bacterial and archaeal plasmids that encode the HUH endonuclease. Eukaryotic viruses in the realm appear to have come into existence multiple times via genetic recombination events that merged deoxyribonucleic acid (DNA) from the aforementioned plasmids with capsid proteins of certain RNA viruses. Most identified ssDNA viruses belong to Monodnaviria.
The prototypic members of the realm are often called CRESS-DNA viruses. CRESS-DNA viruses are associated with a wide range of diseases, including diseases in economically important crops and a variety of diseases in animals. The atypical members of the realm include papillomaviruses and polyomaviruses, which are known to cause various cancers. Members of Monodnaviria are also known to frequently become integrated into the DNA of their hosts as well as experience a relatively high rate of genetic mutations and recombinations.
Monodnaviria is a portmanteau of mono, from Greek μόνος [mónos], meaning single, DNA from deoxyribonucleic acid (DNA), referencing single-stranded DNA, and the suffix -viria, which is the suffix used for virus realms. The prototypic members of Monodnaviria are often called CRESS-DNA, or CRESS DNA, viruses, which stands for "circular Rep-encoding ssDNA" viruses. [1]
All prototypical viruses in Monodnaviria encode an endonuclease of the HUH superfamily. [note 1] Endonucleases are enzymes that can cleave phosphodiester bonds within a polynucleotide chain. HUH, or HuH, endonucleases are endonucleases that contain a HUH motif made of two histidine residues separated by a bulky hydrophobic residue and a Y motif that contains one or two tyrosine residues. The HUH endonuclease of ssDNA viruses is often called the replication initiation protein, or simply Rep, because its cleavage of a specific site in the viral genome initiates replication. [1] [2]
Once the viral ssDNA is inside of the host cell, it is replicated by the host cell's DNA polymerase to produce a double-stranded form of the viral genome. Rep then recognizes a short sequence on the 3'-end ("three prime end") at the origin of replication. Upstream {description needed} from the recognition site, Rep binds to the DNA and nicks the positive-sense strand, creating a nick {description needed} site. In doing so, Rep binds to the 5'-end (five prime end) via a tyrosine residue that covalently bonds {source for covalent bonds needed} to the phosphate backbone of DNA, creating a phosphotyrosine molecule that connects Rep to the viral DNA. [2] [3] [4]
The 3'-end of the nicked strand remains as a free hydroxyl (OH) end that acts as a signal for the host DNA polymerase to replicate the genome. Replication commences at the 3'-OH end and is performed by extending the 3'-end of the positive strand using the negative strand as a template for replication. {refer to DNA replication} Synthesis of the new positive strand uses the negative strand as a template, and so synthesizes a newly connected strand of DNA which displaces the nicked positive strand to reform double-stranded DNA. The 3'-OH end of the displaced positive strand disrupts the original phosphotyrosine bond which releases and circularizes the displaced positive strand as its own circular copy of viral DNA. After one cycle of replicating the genome, Rep is able to recognize the newly replicated recognition site {recognize the recognition site?} on the reformed double-stranded viral DNA and nick it, which starts the whole process again. [2] [3] [4]
Rep may nick the positive strand a second time, doing so with a second tyrosine residue, or a new Rep may nick the DNA. Multiple copies of the genome may be produced in a single strand. After the positive strand is completely detached from the negative strand and nicked, the 3'-OH end bonds to the phosphotyrosine of the 5'-end, creating a free circular ssDNA genome that usually is either converted into dsDNA for transcription or further replication or is packaged into newly constructed viral capsids. The replication process can be repeated numerous times on the same circular genome to produce many copies of the original viral genome. [2] [3] [4]
While the prototypical viruses in Monodnaviria have circular ssDNA genomes and replicate via RCR, some have linear ssDNA genomes with different replication methods, including the families Parvoviridae and Bidnaviridae , assigned to the phylum Cossaviricota of the kingdom Shotokuvirae. [1] Parvoviruses use rolling hairpin replication, in which the ends of the genome have hairpin loops that repeatedly unfold and refold during replication to change the direction of DNA synthesis to move back and forth along the genome, producing numerous copies of the genome in a continuous process. Individual genomes are then excised from this molecule by the HUH endonuclease. [2] [5] In place of the HUH endonuclease, bidnaviruses encode their own protein-primed DNA polymerase that replicates the genome, which is bipartite and packaged into two separate virions, instead of using the host cell's DNA polymerase for replication. [1] [3] [6]
Additionally, some viruses in the realm are dsDNA viruses with circular genomes, including Polyomaviridae and Papillomaviridae, also assigned to the phylum Cossaviricota. Instead of replicating via RCR, these viruses use theta bidirectional DNA replication. This begins by unwinding the dsDNA at a site called the origin to separate the two DNA strands from each other. Two replication forks are established that move in opposite directions around the circular genome until they meet at the side opposite of the origin and replication is terminated. [7]
Apart from the aforementioned replication methods, ssDNA viruses in Monodnaviria share a number of other common characteristics. The capsids of ssDNA viruses, which store the viral DNA, are usually icosahedral in shape and composed of either one type of protein or, in the case of parvoviruses, multiple types of proteins. All ssDNA viruses that have had the structure of their capsid proteins analyzed in high resolution have shown to contain a single jelly roll fold in their folded structure. [1] [3]
Nearly all families of ssDNA viruses have a positive-sense genome, the sole exception being viruses in the family Anelloviridae , unassigned to a realm, which have a negative-sense genome. In any case, ssDNA viruses have their genomes converted to a dsDNA form prior to transcription, which creates the messenger RNA (mRNA) needed to produce viral proteins from ribosomal translation. CRESS-DNA viruses also have similar genome structures, genome lengths, and gene compositions. [1] [3] [4] [8]
Lastly, ssDNA viruses have a relatively high rate of genetic recombinations and substitution mutations. Genetic recombination, or mixture, of ssDNA genomes can occur between closely related viruses when a gene is replicated and transcribed at the same time, which may cause the host cell's DNA polymerases to switch DNA templates (negative strands) during the process, causing recombination. These recombinations usually occur in the negative strand and either outside of or at the peripheries of genes rather than toward the middle of genes. [3]
The high substitution rate seen in ssDNA viruses is unusual since replication is performed primarily by the host cell's DNA polymerase, which contains proofreading mechanisms to prevent mutations. Substitutions in ssDNA viral genomes may occur because the viral DNA may become oxidatively damaged while the genome is inside the capsid. The prevalence of recombinations and substitutions among ssDNA viruses means that eukaryotic ssDNA viruses can emerge as threatening pathogens. [3]
Comparison of genomes and phylogenetic analyses of the HUH endonucleases, superfamily 3 helicases (S3H), and capsid proteins of viruses in Monodnaviria have shown that they have multiple, chimeric origins. HUH endonucleases of CRESS-DNA viruses are most similar to those found in small, RCR bacterial and archael plasmids, extra-chromosomal DNA molecules inside bacteria and archaea, and appear to have evolved from them at least three times. HUH endonucleases of prokaryotic CRESS-DNA viruses seem to have originated from plasmid endonucleases that lacked the S3H domain, whereas eukaryotic CRESS-DNA viruses evolved from ones that had S3H domains. [1] [6]
The capsid proteins of eukaryotic CRESS-DNA viruses are most closely related those of various animal and plant positive-sense RNA viruses, which belong to the realm Riboviria. Because of this, eukaryotic CRESS-DNA viruses appear to have emerged multiple times from recombination events that merged DNA from bacterial and archaeal plasmids with complementary DNA (cDNA) copies of positive-sense RNA viruses. CRESS-DNA viruses therefore represent a notable instance of convergent evolution, whereby organisms that are not directly related evolve the same or similar traits. [1]
Linear ssDNA viruses, specifically parvoviruses, in Monodnaviria are likely to have evolved from CRESS-DNA viruses via loss of the joining activity used by CRSS-DNA viruses to create circular genomes. [6] In turn, the circular dsDNA viruses in Monodnaviria appear to have evolved from parvoviruses through inactivation of the endonuclease's HUH domain. The HUH domain then became a DNA-binding domain, changing these viruses' manner of replication to theta bidirectional replication. The capsid proteins of these circular dsDNA viruses are highly divergent, so it is unclear if they evolved from parvovirus capsid proteins or through other means. [1] Bidnaviruses, which are linear ssDNA viruses, appear to have been created as a result of a parvovirus genome becoming integrated into the genome of a polinton, a type of self-replicating genomic DNA molecule, which replaced the HUH endonuclease with a polinton's DNA polymerase. [6] [9]
Monodnaviria has four kingdoms: Loebvirae, Sangervirae, Shotokuvirae, and Trapavirae. Loebvirae is monotypic down to the rank of order, and Sangervirae and Trapavirae are monotypic down to the rank of family. This taxonomy is described further as follows: [1] [10]
Monodnaviria includes the vast majority of identified ssDNA viruses, which are Group II: ssDNA viruses in the Baltimore classification system, which groups viruses together based on how they produce mRNA, often used alongside standard virus taxonomy, which is based on evolutionary history.. Of the 16 ssDNA virus families, three are not assigned to Monodnaviria, all three being unassigned to a realm: Anelloviridae, Finnlakeviridae , a proposed member of the realm Varidnaviria, and Spiraviridae . The dsDNA viruses in Monodnaviria are assigned to Baltimore Group I: dsDNA viruses. Realms are the highest level of taxonomy used for viruses and Monodnaviria is one of four, the other three being Duplodnaviria , Riboviria , and Varidnaviria . [1] [3] [10]
Although Anelloviridae is currently unassigned to a realm, it is a potential member of Monodnaviria since it appears to be morphologically similar to circoviruses. It has been suggested that anelloviruses are essentially CRESS-DNA viruses with negative sense genomes, unlike the typical positive sense genomes. [11]
The eukaryotic CRESS-DNA viruses are associated with a variety of diseases. Plant viruses in the families Geminiviridae and Nanoviridae infect economically important crops, causing significant damage to agricultural productivity. Animal viruses in Circoviridae are associated with many diseases, including respiratory illness, intestinal illness, and reproductive problems. Bacilladnaviruses, which primarily infect diatoms, are thought to have a significant role in controlling algal blooms. [3]
The atypical members of the realm are also associated with many widely known diseases. Parvoviruses are most widely known for causing a lethal infection in canids as well as causing fifth disease in humans. [12] Papillomaviruses and polyomaviruses are known to cause different types of cancers and other diseases. A polyomavirus is responsible for Merkel-cell carcinoma, and papillomaviruses cause various genital and other cancers as well as warts. [13] [14]
The Rep protein lacks homologues in cellular organisms, so it can be searched for within an organism's genome to identify if viral DNA has become endogenized as part of the organism's genome. Among eukaryotes, endogenization is most often observed in plants, but it is also observed in animals, fungi, and various protozoans. Endogenization can occur through several means such as the integrase or transpose enzymes or by exploiting the host cell's recombination machinery. [3]
Most endogenized ssDNA viruses are in non-coding regions of the organism's genome, but sometimes the viral genes are expressed, and the Rep protein may be used by the organism. Because viral DNA can become a part of an organism's genome, this represents an example of horizontal gene transfer between unrelated organisms that can be used to study evolutionary history. By comparing related organisms, it is possible to estimate the approximate age of ssDNA viruses. For example, comparison of animal genomes has shown that circoviruses and parvoviruses first integrated into their hosts' genomes at least 40–50 million years ago. [3]
The earliest reference to a virus in Monodnaviria was made in a poem written in 752 by Japanese Empress Shotoku, describing a yellowing or vein clearing disease of Eupatorium plants that was likely caused by a geminivirus. Centuries later, a circovirus infection that caused balding in birds was observed in Australia in 1888, marking the first reference to ssDNA viruses in modern times. The first animal CRESS-DNA virus to be characterized was the porcine circovirus in 1974, and in 1977, the first genome of an ssDNA virus, the Bean golden mosaic virus , was detailed. Beginning in the 1970s, the families of related members in Monodnaviria began to be organized, Parvoviridae becoming the first ssDNA family recognized with additional families being continually discovered. [1] [3] [10]
In recent years, analyses of viral DNA in various contexts such as fecal matter and marine sediments have shown that ssDNA viruses are widespread throughout nature, and the increased knowledge of their diversity has helped to greater understand their evolutionary history. The relation between CRESS-DNA viruses was resolved from 2015 to 2017, [3] leading to the establishment of Monodnaviria in 2019 based on their shared relation, including viruses descended from them. Despite appearing to have polyphyletic origins, the similar genome structure, genome length, and gene compositions of CRESS-DNA viruses provided the justification to unite them under a realm. [1]
A DNA virus is a virus that has a genome made of deoxyribonucleic acid (DNA) that is replicated by a DNA polymerase. They can be divided between those that have two strands of DNA in their genome, called double-stranded DNA (dsDNA) viruses, and those that have one strand of DNA in their genome, called single-stranded DNA (ssDNA) viruses. dsDNA viruses primarily belong to two realms: Duplodnaviria and Varidnaviria, and ssDNA viruses are almost exclusively assigned to the realm Monodnaviria, which also includes some dsDNA viruses. Additionally, many DNA viruses are unassigned to higher taxa. Reverse transcribing viruses, which have a DNA genome that is replicated through an RNA intermediate by a reverse transcriptase, are classified into the kingdom Pararnavirae in the realm Riboviria.
An RNA virus is a virus—other than a retrovirus—that has ribonucleic acid (RNA) as its genetic material. The nucleic acid is usually single-stranded RNA (ssRNA) but it may be double-stranded (dsRNA). Notable human diseases caused by RNA viruses include the common cold, influenza, SARS, MERS, COVID-19, Dengue Virus, hepatitis C, hepatitis E, West Nile fever, Ebola virus disease, rabies, polio, mumps, and measles.
Virus classification is the process of naming viruses and placing them into a taxonomic system similar to the classification systems used for cellular organisms.
Parvoviruses are a family of animal viruses that constitute the family Parvoviridae. They have linear, single-stranded DNA (ssDNA) genomes that typically contain two genes encoding for a replication initiator protein, called NS1, and the protein the viral capsid is made of. The coding portion of the genome is flanked by telomeres at each end that form into hairpin loops that are important during replication. Parvovirus virions are small compared to most viruses, at 23–28 nanometers in diameter, and contain the genome enclosed in an icosahedral capsid that has a rugged surface.
Geminiviridae is a family of plant viruses that encode their genetic information on a circular genome of single-stranded (ss) DNA. There are 520 species in this family, assigned to 14 genera. Diseases associated with this family include: bright yellow mosaic, yellow mosaic, yellow mottle, leaf curling, stunting, streaks, reduced yields. They have single-stranded circular DNA genomes encoding genes that diverge in both directions from a virion strand origin of replication. According to the Baltimore classification they are considered class II viruses. It is the largest known family of single stranded DNA viruses.
Baltimore classification is a system used to classify viruses based on their manner of messenger RNA (mRNA) synthesis. By organizing viruses based on their manner of mRNA production, it is possible to study viruses that behave similarly as a distinct group. Seven Baltimore groups are described that take into consideration whether the viral genome is made of deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), whether the genome is single- or double-stranded, and whether the sense of a single-stranded RNA genome is positive or negative.
RNA-dependent RNA polymerase (RdRp) or RNA replicase is an enzyme that catalyzes the replication of RNA from an RNA template. Specifically, it catalyzes synthesis of the RNA strand complementary to a given RNA template. This is in contrast to typical DNA-dependent RNA polymerases, which all organisms use to catalyze the transcription of RNA from a DNA template.
Plectrovirus is a genus of viruses, in the family Plectroviridae. Bacteria in the phylum Mycoplasmatota serve as natural hosts, making these viruses bacteriophages. Acholeplasma virus L51 is the only species in the genus.
Bidensovirus is a genus of single stranded DNA viruses that infect invertebrates. The species in this genus were originally classified in the family Parvoviridae but were moved to a new genus because of significant differences in the genomes.
Spiraviridae is a family of incertae sedis viruses that replicate in hyperthermophilic archaea of the genus Aeropyrum, specifically Aeropyrum pernix. The family contains one genus, Alphaspiravirus, which contains one species, Aeropyrum coil-shaped virus. The virions of ACV are non-enveloped and in the shape of hollow cylinders that are formed by a coiling fiber that consists of two intertwining halves of the circular DNA strand inside a capsid. An appendage protrudes from each end of the cylindrical virion. The viral genome is ssDNA(+) and encodes for significantly more genes than other known ssDNA viruses. ACV is also unique in that it appears to lack its own enzymes to aid replication, instead likely using the host cell's replisomes. ACV has no known relation to any other archaea-infecting viruses, but it does share its coil-like morphology with some other archaeal viruses, suggesting that such viruses may be an ancient lineage that only infect archaea.
Positive-strand RNA viruses are a group of related viruses that have positive-sense, single-stranded genomes made of ribonucleic acid. The positive-sense genome can act as messenger RNA (mRNA) and can be directly translated into viral proteins by the host cell's ribosomes. Positive-strand RNA viruses encode an RNA-dependent RNA polymerase (RdRp) which is used during replication of the genome to synthesize a negative-sense antigenome that is then used as a template to create a new positive-sense viral genome.
Riboviria is a realm of viruses that includes all viruses that use a homologous RNA-dependent polymerase for replication. It includes RNA viruses that encode an RNA-dependent RNA polymerase, as well as reverse-transcribing viruses that encode an RNA-dependent DNA polymerase. RNA-dependent RNA polymerase (RdRp), also called RNA replicase, produces RNA from RNA. RNA-dependent DNA polymerase (RdDp), also called reverse transcriptase (RT), produces DNA from RNA. These enzymes are essential for replicating the viral genome and transcribing viral genes into messenger RNA (mRNA) for translation of viral proteins.
In virology, realm is the highest taxonomic rank established for viruses by the International Committee on Taxonomy of Viruses (ICTV), which oversees virus taxonomy. Six virus realms are recognized and united by specific highly conserved traits:
Smacoviridae is a family of single-stranded DNA viruses. The genomes of this family are small. The name Smacoviridae stands for 'small circular genome virus'. The genomes are circular single-stranded DNA and encode rolling-circle replication initiation proteins (Rep) and unique capsid proteins. As of 2021, 12 genera and 84 species are recognized in this family. The viruses in this taxon were isolated from faecal samples from insects and vertebrates by metagenomic methods. Little is known about their biology.
HUH endonucleases (HUH-tags) are sequence-specific single-stranded DNA (ssDNA) binding proteins originating from numerous species of bacteria and viruses. Viral HUH endonucleases are involved in initiating rolling circle replication while ones of bacterial origin initiate bacterial conjugation. In biotechnology, they can be used to create protein-DNA linkages, akin to other methods such as SNAP-tag. In doing so, they create a 5' covalent bond between the ssDNA and the protein. HUH endonucleases can be fused with other proteins or used as protein tags.
Cressdnaviricota is a phylum of viruses with small, circular single-stranded DNA genomes and encoding rolling circle replication-initiation proteins with the N-terminal HUH endonuclease and C-terminal superfamily 3 helicase domains. While the replication-associated proteins are homologous among viruses within the phylum, the capsid proteins are very diverse and have presumably been acquired from RNA viruses on multiple independent occasions. Nevertheless, all cressdnaviruses for which structural information is available appear to contain the jelly-roll fold.
Varidnaviria is a realm of viruses that includes all DNA viruses that encode major capsid proteins that contain a vertical jelly roll fold. The major capsid proteins (MCP) form into pseudohexameric subunits of the viral capsid, which stores the viral deoxyribonucleic acid (DNA), and are perpendicular, or vertical, to the surface of the capsid. Apart from this, viruses in the realm also share many other characteristics, such as minor capsid proteins (mCP) with the vertical jelly roll fold, an ATPase that packages viral DNA into the capsid, and a DNA polymerase that replicates the viral genome.
Bacilladnaviridae is a family of single-stranded DNA viruses that primarily infect diatoms.
Orthornavirae is a kingdom of viruses that have genomes made of ribonucleic acid (RNA), including genes which encode an RNA-dependent RNA polymerase (RdRp). The RdRp is used to transcribe the viral RNA genome into messenger RNA (mRNA) and to replicate the genome. Viruses in this kingdom share a number of characteristics which promote rapid evolution, including high rates of genetic mutation, recombination, and reassortment.
Rolling hairpin replication (RHR) is a unidirectional, strand displacement form of DNA replication used by parvoviruses, a group of viruses that constitute the family Parvoviridae. Parvoviruses have linear, single-stranded DNA (ssDNA) genomes in which the coding portion of the genome is flanked by telomeres at each end that form hairpin loops. During RHR, these hairpin loops repeatedly unfold and refold to change the direction of DNA replication so that replication progresses in a continuous manner back and forth across the genome. RHR is initiated and terminated by an endonuclease encoded by parvoviruses that is variously called NS1 or Rep, and RHR is similar to rolling circle replication, which is used by ssDNA viruses that have circular genomes.