Adenovirus genome

Last updated

Adenovirus genomes are linear, non-segmented double-stranded (ds) DNA molecules that are typically 26-46 Kbp long, containing 23-46 protein-coding genes. [1] The example used for the following description is Human adenovirus E, a mastadenovirus with a 36 Kbp genome containing 38 protein-coding genes. [2] While the precise number and identity of genes varies among adenoviruses, the basic principles of genome organization and the functions of most of the genes described in this article are shared among all adenoviruses.

Contents

Transcription units

Transcription units (in green) and genes (protein-coding genes in red, other genes in blue) of human adenovirus E genome Human adenovirus E genes NCBI.jpg
Transcription units (in green) and genes (protein-coding genes in red, other genes in blue) of human adenovirus E genome

The 38 genes in the Human adenovirus E genome are organized in 17 transcription units, each containing 1-8 coding sequences. [3] Alternative splicing during processing of the pre-mRNAs produced by each transcription unit enable multiple different mRNAs to be produced from one transcription unit.[ citation needed ]

The E1A, E1B, E2A, E2B, E3, and E4 transcription units are successively transcribed early in the viral reproductive cycle. The proteins coded for by genes within these transcription units are mostly involved in regulation of viral transcription, in replication of viral DNA, and in suppression of the host response to infection. [4]

The L1-L5 transcription units are transcribed later in the viral reproductive cycle, and code mostly for proteins that make up components of the viral capsid or are involved in assembly of the capsid. The L1-L5 transcription units are all regulated by the same promoter region and share the same transcription start site. As a result, transcription of all five late transcription units begins at the same point in the viral reproductive cycle. [5]

Transcription of pre-mRNAs beginning at the late promoter is randomly terminated at one of five termination sites, producing a population of transcripts of five different lengths. The pre-mRNAs of any given length are then alternatively spliced to produce 1-4 different mRNAs coding for a corresponding number of proteins.[ citation needed ]

Protein-coding genes

The names, locations, and properties of the 38 protein-coding genes in the Human Adenovirus E genome are given in the following table. [6] [7]

Protein nameProtein identifierTranscription unitStart baseStop baseStrandLength (amino acids)
control protein E1AYP_068018.1E1A5761441+257
control protein E1B 19KYP_068019.1E1B16002115+171
control protein E1B 55KYP_068020.1E1B19053356+483
capsid protein IXYP_068021.1IX34413869+142
encapsidation protein IVa2YP_068022.1IVa239305554-448
DNA polymeraseYP_068023.1E2B503313773-1193
protein 13.6KYP_001661328.1L178149476+139
terminal protein precursor pTPYP_068024.1E2B840413773-642
encapsidation protein 52KYP_068025.1L11076511937+390
capsid protein precursor pIIIaYP_068026.1L11196113736+591
penton base (capsid protein III)YP_068027.1L21381515422+535
core protein precursor pVIIYP_068028.1L21542616007+193
core protein VYP_068029.1L21605517080+341
core protein precursor pXYP_068030.1L21710317336+77
capsid protein precursor pVIYP_068031.1L31741318141+242
hexon (capsid protein II)YP_068032.1L31824821058+936
proteaseYP_068033.1L32108221702+206
single-stranded DNA-binding proteinYP_068034.1E2A-L2177423312-512
hexon assembly protein 100KYP_068035.1L42334125716+791
protein 33KYP_068036.1L42543926252+214
encapsidation protein 22KYP_068037.1L42543925978+179
capsid protein precursor pVIIIYP_068038.1L42632127004+227
control protein E3 12.5KYP_068039.1E32700527325+106
membrane glycoprotein E3 CR1-alphaYP_068040.1E32727927911+210
membrane glycoprotein E3 gp19KYP_068041.1E32789328417+174
membrane glycoprotein E3 CR1-betaYP_068042.1E32844929111+220
membrane glycoprotein E3 CR1-deltaYP_068043.1E32944030264+274
membrane protein E3 RID-alphaYP_068044.1E33027330548+91
membrane protein E3 RID-betaYP_068045.1E33055430994+146
control protein E3 14.7KYP_068046.1E33098731388+133
protein UYP_068047.1U3148131632-50
fiber (capsid protein IV)YP_068048.1L53164932926+425
control protein E4orf6/7YP_068049.1E43302234169-141
control protein E4 34KYP_068050.1E43327034169-299
control protein E4orf4YP_068051.1E43407234440-122
control protein E4orf3YP_068052.1E43444934802-117
control protein E4orf2YP_068053.1E43479935188-129
control protein E4orf1YP_068054.1E43523635610-124

The functions of many adenovirus proteins are known: [5]

Related Research Articles

<i>Adenoviridae</i> Family of viruses

Adenoviruses are medium-sized, nonenveloped viruses with an icosahedral nucleocapsid containing a double-stranded DNA genome. Their name derives from their initial isolation from human adenoids in 1953.

<span class="mw-page-title-main">Viral protein</span>

The term viral protein refers to both the products of the genome of a virus and any host proteins incorporated into the viral particle. Viral proteins are grouped according to their functions, and groups of viral proteins include structural proteins, nonstructural proteins, regulatory proteins, and accessory proteins. Viruses are non-living and do not have the means to reproduce on their own, instead depending on their host cell's machinery to do this. Thus, viruses do not code for most of the proteins required for their replication and the translation of their mRNA into viral proteins, but use proteins encoded by the host cell for this purpose.

<span class="mw-page-title-main">Viral replication</span> Formation of biological viruses during the infection process

Viral replication is the formation of biological viruses during the infection process in the target host cells. Viruses must first get into the cell before viral replication can occur. Through the generation of abundant copies of its genome and packaging these copies, the virus continues infecting new hosts. Replication between viruses is greatly varied and depends on the type of genes involved in them. Most DNA viruses assemble in the nucleus while most RNA viruses develop solely in cytoplasm.

<i>Gammaretrovirus</i> Genus of viruses

Gammaretrovirus is a genus in the Retroviridae family. Example species are the murine leukemia virus and the feline leukemia virus. They cause various sarcomas, leukemias and immune deficiencies in mammals, reptiles and birds.

<i>Alphavirus</i> Genus of viruses

Alphavirus is a genus of RNA viruses, the sole genus in the Togaviridae family. Alphaviruses belong to group IV of the Baltimore classification of viruses, with a positive-sense, single-stranded RNA genome. There are 32 alphaviruses, which infect various vertebrates such as humans, rodents, fish, birds, and larger mammals such as horses, as well as invertebrates. Alphaviruses that could infect both vertebrates and arthropods are referred dual-host alphaviruses, while insect-specific alphaviruses such as Eilat virus and Yada yada virus are restricted to their competent arthropod vector. Transmission between species and individuals occurs mainly via mosquitoes, making the alphaviruses a member of the collection of arboviruses – or arthropod-borne viruses. Alphavirus particles are enveloped, have a 70 nm diameter, tend to be spherical, and have a 40 nm isometric nucleocapsid.

<span class="mw-page-title-main">Viral envelope</span> Outermost layer of many types of the infectious agent

A viral envelope is the outermost layer of many types of viruses. It protects the genetic material in their life cycle when traveling between host cells. Not all viruses have envelopes. A viral envelope protein or E protein is a protein in the envelope, which may be acquired by the capsid from an infected host cell.

<span class="mw-page-title-main">Porcine circovirus</span> Species of virus

Porcine circovirus (PCV) is a group of four single-stranded DNA viruses that are non-enveloped with an unsegmented circular genome. They are members of the genus Circovirus that can infect pigs. The viral capsid is icosahedral and approximately 17 nm in diameter.

<span class="mw-page-title-main">Herpes simplex virus</span> Species of virus

Herpes simplex virus1 and 2, also known by their taxonomic names Human alphaherpesvirus 1 and Human alphaherpesvirus 2, are two members of the human Herpesviridae family, a set of viruses that produce viral infections in the majority of humans. Both HSV-1 and HSV-2 are very common and contagious. They can be spread when an infected person begins shedding the virus.

Adenovirus E1B protein usually refers to one of two proteins transcribed from the E1B gene of the adenovirus: a 55kDa protein and a 19kDa protein. These two proteins are needed to block apoptosis in adenovirus-infected cells. E1B proteins work to prevent apoptosis that is induced by the small adenovirus E1A protein, which stabilizes p53, a tumor suppressor.

<span class="mw-page-title-main">Vectors in gene therapy</span>

Gene therapy utilizes the delivery of DNA into cells, which can be accomplished by several methods, summarized below. The two major classes of methods are those that use recombinant viruses and those that use naked DNA or DNA complexes.

Mason-Pfizer monkey virus (M-PMV), formerly Simian retrovirus (SRV), is a species of retroviruses that usually infect and cause a fatal immune deficiency in Asian macaques. The ssRNA virus appears sporadically in mammary carcinoma of captive macaques at breeding facilities which expected as the natural host, but the prevalence of this virus in feral macaques remains unknown. M-PMV was transmitted naturally by virus-containing body fluids, via biting, scratching, grooming, and fighting. Cross contaminated instruments or equipment (fomite) can also spread this virus among animals.

Adenovirus early region 1A (E1A) is a gene expressed during adenovirus replication to produce a variety of E1A proteins. It is expressed during the early phase of the viral life span.

Adenovirus varieties have been explored extensively as a viral vector for gene therapy and also as an oncolytic virus.

<span class="mw-page-title-main">Lentiviral vector in gene therapy</span>

Lentiviral vectors in gene therapy is a method by which genes can be inserted, modified, or deleted in organisms using lentiviruses.

Mastadenovirus is a genus of viruses in the family Adenoviridae. Humans and other mammals serve as natural hosts. There are 51 species in this genus. The genus as a whole includes many very common causes of human infection, estimated to be responsible for 2 to 5% of all respiratory infections, as well as gastrointestinal and eye infections. Symptoms are usually mild.

A kinetic class, also known as a temporal class, is a grouping of genes in a viral genome that are expressed at the same time during the viral replication cycle. Five of the human DNA viral families have multiple kinetic classes: Poxviridae, Herpesviridae, Adenoviridae, Papillomaviridae, and Polyomaviridae. All of the genes in a particular kinetic class are activated by the same mechanism: either by the process of the virus entering the cell and uncoating, or by the products of an earlier kinetic class in what is known as a transcriptional cascade. Generally speaking, earlier kinetic classes code for enzymes that direct the viral replication process, and later kinetic classes code for structural proteins to be packaged into virions

<i>Sepik virus</i> Mosquito transmitted virus endemic to Papua New Guinea

Sepik virus (SEPV) is an arthropod-borne virus (arbovirus) of the genus Flavivirus and family Flaviviridae. Flaviviridae is one of the most well characterized viral families, as it contains many well-known viruses that cause diseases that have become very prevalent in the world, like Dengue virus. The genus Flavivirus is one of the largest viral genera and encompasses over 50 viral species, including tick and mosquito borne viruses like Yellow fever virus and West Nile virus. Sepik virus is much less well known and has not been as well-classified as other viruses because it has not been known of for very long. Sepik virus was first isolated in 1966 from the mosquito Mansoniaseptempunctata, and it derives its name from the Sepik River area in Papua New Guinea, where it was first found. The geographic range of Sepik virus is limited to Papua New Guinea, due to its isolation.

<i>Woolly monkey hepatitis B virus</i> Species of virus

The woolly monkey hepatitis B virus (WMHBV) is a viral species of the Orthohepadnavirus genus of the Hepadnaviridae family. Its natural host is the woolly monkey (Lagothrix), an inhabitant of South America categorized as a New World primate. WMHBV, like other hepatitis viruses, infects the hepatocytes, or liver cells, of its host organism. It can cause hepatitis, liver necrosis, cirrhosis, and hepatocellular carcinoma. Because nearly all species of Lagothrix are threatened or endangered, researching and developing a vaccine and/or treatment for WMHBV is important for the protection of the whole woolly monkey genus.

<i>Varidnaviria</i> Realm of viruses

Varidnaviria is a realm of viruses that includes all DNA viruses that encode major capsid proteins that contain a vertical jelly roll fold. The major capsid proteins (MCP) form into pseudohexameric subunits of the viral capsid, which stores the viral deoxyribonucleic acid (DNA), and are perpendicular, or vertical, to the surface of the capsid. Apart from this, viruses in the realm also share many other characteristics, such as minor capsid proteins (mCP) with the vertical jelly roll fold, an ATPase that packages viral DNA into the capsid, and a DNA polymerase that replicates the viral genome.

Rio Negro virus is an alphavirus that was first isolated in Argentina in 1980. The virus was first called Ag80-663 but was renamed to Rio Negro virus in 2005. It is a former member of the Venezuelan equine encephalitis complex (VEEC), which are a group of alphaviruses in the Americas that have the potential to emerge and cause disease. Río Negro virus was recently reclassified as a distinct species. Closely related viruses include Mucambo virus and Everglades virus.

References

  1. "Viruses – Complete Genomes". NCBI. Retrieved 2013-01-17.
  2. "Human Adenovirus E Genome". NCBI. Retrieved 2013-01-17.
  3. "Human adenovirus E overview". NCBI. Retrieved 2013-01-17.
  4. "Adenoviruses". MicrobiologyBytes. Archived from the original on 2013-04-20. Retrieved 2013-01-17.
  5. 1 2 Branton, Philip; Marcellus, Richard C. (2011). "23". In Nicholas H. Acheson (ed.). Adenoviruses.{{cite book}}: |work= ignored (help)
  6. "Protein Details for Human adenovirus E". NCBI. Retrieved 2013-01-17.
  7. Russell, WC (Jan 2009). "Adenoviruses: update on structure and function". Journal of General Virology. 90 (Pt 1): 1–20. doi: 10.1099/vir.0.003087-0 . PMID   19088268.
  8. "PHA3612: receptor internalization and degradation protein alpha". NCBI. Retrieved 2013-01-17.
  9. "PHA3620: CR1 beta membrane glycoprotein". NCBI. Retrieved 2013-01-17.
  10. "PHA3615: E3 gp19K protein". NCBI. Retrieved 2013-01-17.
  11. "PHA3613: E3 14.7K protein". NCBI. Retrieved 2013-01-17.
  12. "PHA3605: control protein E4orf4". NCBI. Retrieved 2013-01-17.