A DNA virus is a virus that has a genome made of deoxyribonucleic acid (DNA) that is replicated by a DNA polymerase. They can be divided between those that have two strands of DNA in their genome, called double-stranded DNA (dsDNA) viruses, and those that have one strand of DNA in their genome, called single-stranded DNA (ssDNA) viruses. dsDNA viruses primarily belong to two realms: Duplodnaviria and Varidnaviria , and ssDNA viruses are almost exclusively assigned to the realm Monodnaviria , which also includes some dsDNA viruses. Additionally, many DNA viruses are unassigned to higher taxa. Reverse transcribing viruses, which have a DNA genome that is replicated through an RNA intermediate by a reverse transcriptase, are classified into the kingdom Pararnavirae in the realm Riboviria .
DNA viruses are ubiquitous worldwide, especially in marine environments where they form an important part of marine ecosystems, and infect both prokaryotes and eukaryotes. They appear to have multiple origins, as viruses in Monodnaviria appear to have emerged from archaeal and bacterial plasmids on multiple occasions, though the origins of Duplodnaviria and Varidnaviria are less clear.
Prominent disease-causing DNA viruses include herpesviruses, papillomaviruses, and poxviruses.
The Baltimore classification system is used to group viruses together based on their manner of messenger RNA (mRNA) synthesis and is often used alongside standard virus taxonomy, which is based on evolutionary history. DNA viruses constitute two Baltimore groups: Group I: double-stranded DNA viruses, and Group II: single-stranded DNA viruses. While Baltimore classification is chiefly based on transcription of mRNA, viruses in each Baltimore group also typically share their manner of replication. Viruses in a Baltimore group do not necessarily share genetic relation or morphology. [1]
The first Baltimore group of DNA viruses are those that have a double-stranded DNA genome. All dsDNA viruses have their mRNA synthesized in a three-step process. First, a transcription preinitiation complex binds to the DNA upstream of the site where transcription begins, allowing for the recruitment of a host RNA polymerase. Second, once the RNA polymerase is recruited, it uses the negative strand as a template for synthesizing mRNA strands. Third, the RNA polymerase terminates transcription upon reaching a specific signal, such as a polyadenylation site. [2] [3] [4]
dsDNA viruses make use of several mechanisms to replicate their genome. Bidirectional replication, in which two replication forks are established at a replication origin site and move in opposite directions of each other, is widely used. [5] A rolling circle mechanism that produces linear strands while progressing in a loop around the circular genome is also common. [6] [7] Some dsDNA viruses use a strand displacement method whereby one strand is synthesized from a template strand, and a complementary strand is then synthesized from the prior synthesized strand, forming a dsDNA genome. [8] Lastly, some dsDNA viruses are replicated as part of a process called replicative transposition whereby a viral genome in a host cell's DNA is replicated to another part of a host genome. [9]
dsDNA viruses can be subdivided between those that replicate in the cell nucleus, and as such are relatively dependent on host cell machinery for transcription and replication, and those that replicate in the cytoplasm, in which case they have evolved or acquired their own means of executing transcription and replication. [10] dsDNA viruses are also commonly divided between tailed dsDNA viruses, referring to members of the realm Duplodnaviria, usually the tailed bacteriophages of the order Caudovirales, and tailless or non-tailed dsDNA viruses of the realm Varidnaviria. [11] [12]
The second Baltimore group of DNA viruses are those that have a single-stranded DNA genome. ssDNA viruses have the same manner of transcription as dsDNA viruses. However, because the genome is single-stranded, it is first made into a double-stranded form by a DNA polymerase upon entering a host cell. mRNA is then synthesized from the double-stranded form. The double-stranded form of ssDNA viruses may be produced either directly after entry into a cell or as a consequence of replication of the viral genome. [13] [14] Eukaryotic ssDNA viruses are replicated in the nucleus. [10] [15]
Most ssDNA viruses contain circular genomes that are replicated via rolling circle replication (RCR). ssDNA RCR is initiated by an endonuclease that bonds to and cleaves the positive strand, allowing a DNA polymerase to use the negative strand as a template for replication. Replication progresses in a loop around the genome by means of extending the 3'-end of the positive strand, displacing the prior positive strand, and the endonuclease cleaves the positive strand again to create a standalone genome that is ligated into a circular loop. The new ssDNA may be packaged into virions or replicated by a DNA polymerase to form a double-stranded form for transcription or continuation of the replication cycle. [13] [16]
Parvoviruses contain linear ssDNA genomes that are replicated via rolling hairpin replication (RHR), which is similar to RCR. Parvovirus genomes have hairpin loops at each end of the genome that repeatedly unfold and refold during replication to change the direction of DNA synthesis to move back and forth along the genome, producing numerous copies of the genome in a continuous process. Individual genomes are then excised from this molecule by the viral endonuclease. For parvoviruses, either the positive or negative sense strand may be packaged into capsids, varying from virus to virus. [16] [17]
Nearly all ssDNA viruses have positive sense genomes, but a few exceptions and peculiarities exist. The family Anelloviridae is the only ssDNA family whose members have negative sense genomes, which are circular. [15] Parvoviruses, as previously mentioned, may package either the positive or negative sense strand into virions. [14] Lastly, bidnaviruses package both the positive and negative linear strands. [15] [18]
The International Committee on Taxonomy of Viruses (ICTV) oversees virus taxonomy and organizes viruses at the basal level at the rank of realm. Virus realms correspond to the rank of domain used for cellular life but differ in that viruses within a realm do not necessarily share common ancestry, nor do the realms share common ancestry with each other. As such, each virus realm represents at least one instance of viruses coming into existence. Within each realm, viruses are grouped together based on shared characteristics that are highly conserved over time. [19] Three DNA virus realms are recognized: Duplodnaviria, Monodnaviria, and Varidnaviria.
Duplodnaviria contains dsDNA viruses that encode a major capsid protein (MCP) that has the HK97 fold. Viruses in the realm also share a number of other characteristics involving the capsid and capsid assembly, including an icosahedral capsid shape and a terminase enzyme that packages viral DNA into the capsid during assembly. Two groups of viruses are included in the realm: tailed bacteriophages, which infect prokaryotes and are assigned to the order Caudovirales , and herpesviruses, which infect animals and are assigned to the order Herpesvirales . [11]
Duplodnaviria is a very ancient realm, perhaps predating the last universal common ancestor (LUCA) of cellular life. Its origins not known, nor whether it is monophyletic or polyphyletic. A characteristic feature is the HK97-fold found in the MCP of all members, which is found outside the realm only in encapsulins, a type of nanocompartment found in bacteria: this relation is not fully understood. [11] [20] [21]
The relation between caudoviruses and herpesviruses is also uncertain: they may share a common ancestor or herpesviruses may be a divergent clade from the realm Caudovirales. A common trait among duplodnaviruses is that they cause latent infections without replication while still being able to replicate in the future. [22] [23] Tailed bacteriophages are ubiquitous worldwide, [24] important in marine ecology, [25] and the subject of much research. [26] Herpesviruses are known to cause a variety of epithelial diseases, including herpes simplex, chickenpox and shingles, and Kaposi's sarcoma. [27] [28] [29]
Monodnaviria contains ssDNA viruses that encode an endonuclease of the HUH superfamily that initiates rolling circle replication and all other viruses descended from such viruses. The prototypical members of the realm are called CRESS-DNA viruses and have circular ssDNA genomes. ssDNA viruses with linear genomes are descended from them, and in turn some dsDNA viruses with circular genomes are descended from linear ssDNA viruses. [30]
Viruses in Monodnaviria appear to have emerged on multiple occasions from archaeal and bacterial plasmids, a type of extra-chromosomal DNA molecule that self-replicates inside its host. The kingdom Shotokuvirae in the realm likely emerged from recombination events that merged the DNA of these plasmids and complementary DNA encoding the capsid proteins of RNA viruses. [30] [31]
CRESS-DNA viruses include three kingdoms that infect prokaryotes: Loebvirae , Sangervirae , and Trapavirae . The kingdom Shotokuvirae contains eukaryotic CRESS-DNA viruses and the atypical members of Monodnaviria. [30] Eukaryotic monodnaviruses are associated with many diseases, and they include papillomaviruses and polyomaviruses, which cause many cancers, [32] [33] and geminiviruses, which infect many economically important crops. [34]
Varidnaviria contains DNA viruses that encode MCPs that have a jelly roll fold folded structure in which the jelly roll (JR) fold is perpendicular to the surface of the viral capsid. Many members also share a variety of other characteristics, including a minor capsid protein that has a single JR fold, an ATPase that packages the genome during capsid assembly, and a common DNA polymerase. Two kingdoms are recognized: Helvetiavirae , whose members have MCPs with a single vertical JR fold, and Bamfordvirae , whose members have MCPs with two vertical JR folds. [12]
Varidnaviria is either monophyletic or polyphyletic and may predate the LUCA. The kingdom Bamfordvirae is likely derived from the other kingdom Helvetiavirae via fusion of two MCPs to have an MCP with two jelly roll folds instead of one. The single jelly roll (SJR) fold MCPs of Helvetiavirae show a relation to a group of proteins that contain SJR folds, including the Cupin superfamily and nucleoplasmins. [12] [20] [21]
Marine viruses in Varidnaviria are ubiquitous worldwide and, like tailed bacteriophages, play an important role in marine ecology. [35] Most identified eukaryotic DNA viruses belong to the realm. [36] Notable disease-causing viruses in Varidnaviria include adenoviruses, poxviruses, and the African swine fever virus. [37] Poxviruses have been highly prominent in the history of modern medicine, especially Variola virus, which caused smallpox. [38] Many varidnaviruses can become endogenized in their host's genome; a peculiar example are virophages, which after infecting a host, can protect the host against giant viruses. [36]
dsDNA viruses are classified into three realms and include many taxa that are unassigned to a realm:
ssDNA viruses are classified into one realm and include several families that are unassigned to a realm:
An RNA virus is a virus—other than a retrovirus—that has ribonucleic acid (RNA) as its genetic material. The nucleic acid is usually single-stranded RNA (ssRNA) but it may be double-stranded (dsRNA). Notable human diseases caused by RNA viruses include the common cold, influenza, SARS, MERS, Covid-19, Dengue Virus, hepatitis C, hepatitis E, West Nile fever, Ebola virus disease, rabies, polio, mumps, and measles.
Virus classification is the process of naming viruses and placing them into a taxonomic system similar to the classification systems used for cellular organisms.
Parvoviruses are a family of animal viruses that constitute the family Parvoviridae. They have linear, single-stranded DNA (ssDNA) genomes that typically contain two genes encoding for a replication initiator protein, called NS1, and the protein the viral capsid is made of. The coding portion of the genome is flanked by telomeres at each end that form into hairpin loops that are important during replication. Parvovirus virions are small compared to most viruses, at 23–28 nanometers in diameter, and contain the genome enclosed in an icosahedral capsid that has a rugged surface.
Geminiviridae is a family of plant viruses that encode their genetic information on a circular genome of single-stranded (ss) DNA. There are 520 species in this family, assigned to 14 genera. Diseases associated with this family include: bright yellow mosaic, yellow mosaic, yellow mottle, leaf curling, stunting, streaks, reduced yields. They have single-stranded circular DNA genomes encoding genes that diverge in both directions from a virion strand origin of replication. According to the Baltimore classification they are considered class II viruses. It is the largest known family of single stranded DNA viruses.
Baltimore classification is a system used to classify viruses based on their manner of messenger RNA (mRNA) synthesis. By organizing viruses based on their manner of mRNA production, it is possible to study viruses that behave similarly as a distinct group. Seven Baltimore groups are described that take into consideration whether the viral genome is made of deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), whether the genome is single- or double-stranded, and whether the sense of a single-stranded RNA genome is positive or negative.
RNA-dependent RNA polymerase (RdRp) or RNA replicase is an enzyme that catalyzes the replication of RNA from an RNA template. Specifically, it catalyzes synthesis of the RNA strand complementary to a given RNA template. This is in contrast to typical DNA-dependent RNA polymerases, which all organisms use to catalyze the transcription of RNA from a DNA template.
Double-stranded RNA viruses are a polyphyletic group of viruses that have double-stranded genomes made of ribonucleic acid. The double-stranded genome is used to transcribe a positive-strand RNA by the viral RNA-dependent RNA polymerase (RdRp). The positive-strand RNA may be used as messenger RNA (mRNA) which can be translated into viral proteins by the host cell's ribosomes. The positive-strand RNA can also be replicated by the RdRp to create a new double-stranded viral genome.
Corticovirus is a genus of viruses in the family Corticoviridae. Corticoviruses are bacteriophages; that is, their natural hosts are bacteria. The genus contains two species. The name is derived from Latin cortex, corticis. However, prophages closely related to PM2 are abundant in the genomes of aquatic bacteria, suggesting that the ecological importance of corticoviruses might be underestimated. Bacteriophage PM2 was first described in 1968 after isolation from seawater sampled from the coast of Chile.
Bidensovirus is a genus of single stranded DNA viruses that infect invertebrates. The species in this genus were originally classified in the family Parvoviridae but were moved to a new genus because of significant differences in the genomes.
Positive-strand RNA viruses are a group of related viruses that have positive-sense, single-stranded genomes made of ribonucleic acid. The positive-sense genome can act as messenger RNA (mRNA) and can be directly translated into viral proteins by the host cell's ribosomes. Positive-strand RNA viruses encode an RNA-dependent RNA polymerase (RdRp) which is used during replication of the genome to synthesize a negative-sense antigenome that is then used as a template to create a new positive-sense viral genome.
Negative-strand RNA viruses are a group of related viruses that have negative-sense, single-stranded genomes made of ribonucleic acid. They have genomes that act as complementary strands from which messenger RNA (mRNA) is synthesized by the viral enzyme RNA-dependent RNA polymerase (RdRp). During replication of the viral genome, RdRp synthesizes a positive-sense antigenome that it uses as a template to create genomic negative-sense RNA. Negative-strand RNA viruses also share a number of other characteristics: most contain a viral envelope that surrounds the capsid, which encases the viral genome, −ssRNA virus genomes are usually linear, and it is common for their genome to be segmented.
Riboviria is a realm of viruses that includes all viruses that use a homologous RNA-dependent polymerase for replication. It includes RNA viruses that encode an RNA-dependent RNA polymerase, as well as reverse-transcribing viruses that encode an RNA-dependent DNA polymerase. RNA-dependent RNA polymerase (RdRp), also called RNA replicase, produces RNA from RNA. RNA-dependent DNA polymerase (RdDp), also called reverse transcriptase (RT), produces DNA from RNA. These enzymes are essential for replicating the viral genome and transcribing viral genes into messenger RNA (mRNA) for translation of viral proteins.
In virology, realm is the highest taxonomic rank established for viruses by the International Committee on Taxonomy of Viruses (ICTV), which oversees virus taxonomy. Six virus realms are recognized and united by specific highly conserved traits:
Duplodnaviria is a realm of viruses that includes all double-stranded DNA viruses that encode the HK97 fold major capsid protein. The HK97 fold major capsid protein is the primary component of the viral capsid, which stores the viral deoxyribonucleic acid (DNA). Viruses in the realm also share a number of other characteristics, such as an icosahedral capsid, an opening in the viral capsid called a portal, a protease enzyme that empties the inside of the capsid prior to DNA packaging, and a terminase enzyme that packages viral DNA into the capsid.
Monodnaviria is a realm of viruses that includes all single-stranded DNA viruses that encode an endonuclease of the HUH superfamily that initiates rolling circle replication of the circular viral genome. Viruses descended from such viruses are also included in the realm, including certain linear single-stranded DNA (ssDNA) viruses and circular double-stranded DNA (dsDNA) viruses. These atypical members typically replicate through means other than rolling circle replication.
Varidnaviria is a realm of viruses that includes all DNA viruses that encode major capsid proteins that contain a vertical jelly roll fold. The major capsid proteins (MCP) form into pseudohexameric subunits of the viral capsid, which stores the viral deoxyribonucleic acid (DNA), and are perpendicular, or vertical, to the surface of the capsid. Apart from this, viruses in the realm also share many other characteristics, such as minor capsid proteins (mCP) with the vertical jelly roll fold, an ATPase that packages viral DNA into the capsid, and a DNA polymerase that replicates the viral genome.
Orthornavirae is a kingdom of viruses that have genomes made of ribonucleic acid (RNA), those genomes encoding an RNA-dependent RNA polymerase (RdRp). The RdRp is used to transcribe the viral RNA genome into messenger RNA (mRNA) and to replicate the genome. Viruses in this kingdom also share a number of characteristics involving evolution, including high rates of genetic mutations, recombinations, and reassortments.
An archaeal virus is a virus that infects and replicates in archaea, a domain of unicellular, prokaryotic organisms. Archaeal viruses, like their hosts, are found worldwide, including in extreme environments inhospitable to most life such as acidic hot springs, highly saline bodies of water, and at the bottom of the ocean. They have been also found in the human body. The first known archaeal virus was described in 1974 and since then, a large diversity of archaeal viruses have been discovered, many possessing unique characteristics not found in other viruses. Little is known about their biological processes, such as how they replicate, but they are believed to have many independent origins, some of which likely predate the last archaeal common ancestor (LACA).
Portogloboviridae is a family of DNA viruses that infect archaea. It is a proposed family of the realm Varidnaviria. Viruses in the family are related to Halopanivirales. The capsid proteins of these viruses and their characteristics are of evolutionary importance for the origin of the other Varidnaviria viruses since they seem to retain primordial characters.
Adnaviria is a realm of viruses that includes archaeal viruses that have a filamentous virion and a linear, double-stranded DNA genome. The genome exists in A-form (A-DNA) and encodes a dimeric major capsid protein (MCP) that contains the SIRV2 fold, a type of alpha-helix bundle containing four helices. The virion consists of the genome encased in capsid proteins to form a helical nucleoprotein complex. For some viruses, this helix is surrounded by a lipid membrane called an envelope. Some contain an additional protein layer between the nucleoprotein helix and the envelope. Complete virions are long and thin and may be flexible or a stiff like a rod.