Retroposon

Last updated
Phylogenetic tree of marsupials derived from retroposon data Phylogenetic tree of marsupials derived from retroposon data - journal.pbio.1000436.g002.png
Phylogenetic tree of marsupials derived from retroposon data

Retroposons are repetitive DNA fragments which are inserted into chromosomes after they had been reverse transcribed from any RNA molecule.

Contents

Difference between retroposons and retrotransposons

In contrast to retrotransposons, retroposons never encode reverse transcriptase (RT) (but see below). Therefore, they are non-autonomous elements with regard to transposition activity (as opposed to transposons). Non-long terminal repeat (LTR) retrotransposons such as the human LINE1 elements are sometimes falsely referred to as retroposons. However, this depends on the author. For example, Howard Temin published the following definition: Retroposons encode RT but are devoid of long terminal repeats (LTRs), for example long interspersed elements (LINEs). Retrotransposons also feature LTRs and retroviruses, in addition, are packaged as viral particles (virions). Retrosequences are non-autonomous elements devoid of RT. They are retroposed with the aid of the machinery of autonomous elements, such as LINEs; examples are short interspersed nuclear elements (SINEs) or mRNA-derived retro(pseudo)genes. [2] [3] [4]

Gene duplications

Retroposition accounts for approximately 10,000 gene-duplication events in the human genome, of which approximately 2-10% are likely to be functional. [5] Such genes are called retrogenes and represent a certain type of retroposon.

Horizontal gene transfer

A classical event is the retroposition of a spliced pre-mRNA molecule of the c-Src gene into the proviral ancestor of the Rous sarcoma virus (RSV). The retroposed c-src pre-mRNA still contained a single intron and within RSV is now referred to as v-Src gene. [6]

Related Research Articles

<span class="mw-page-title-main">Genome</span> All genetic material of an organism

In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA. The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as regulatory sequences, and often a substantial fraction of junk DNA with no evident function. Almost all eukaryotes have mitochondria and a small mitochondrial genome. Algae and plants also contain chloroplasts with a chloroplast genome.

<span class="mw-page-title-main">Transposable element</span> Semiparasitic DNA sequence

A transposable element is a nucleic acid sequence in DNA that can change its position within a genome, sometimes creating or reversing mutations and altering the cell's genetic identity and genome size. Transposition often results in duplication of the same genetic material. In the human genome, L1 and Alu elements are two examples. Barbara McClintock's discovery of them earned her a Nobel Prize in 1983. Its importance in personalized medicine is becoming increasingly relevant, as well as gaining more attention in data analytics given the difficulty of analysis in very high dimensional spaces.

An Alu element is a short stretch of DNA originally characterized by the action of the Arthrobacter luteus (Alu) restriction endonuclease. Alu elements are the most abundant transposable elements in the human genome, present in excess of one million copies. Alu elements were thought to be selfish or parasitic DNA, because their sole known function is self reproduction. However, they are likely to play a role in evolution and have been used as genetic markers. They are derived from the small cytoplasmic 7SL RNA, a component of the signal recognition particle. Alu elements are highly conserved within primate genomes and originated in the genome of an ancestor of Supraprimates.

Repeated sequences are short or long patterns of nucleic acids that occur in multiple copies throughout the genome. In many organisms, a significant fraction of the genomic DNA is repetitive, with over two-thirds of the sequence consisting of repetitive elements in humans. Some of these repeated sequences are necessary for maintaining important genome structures such as telomeres or centromeres.

<span class="mw-page-title-main">Retrotransposon</span> Type of genetic component

Retrotransposons are a type of genetic component that copy and paste themselves into different genomic locations (transposon) by converting RNA back into DNA through the reverse transcription process using an RNA transposition intermediate.

<span class="mw-page-title-main">Howard Martin Temin</span> 20th-century American geneticist

Howard Martin Temin was an American geneticist and virologist. He discovered reverse transcriptase in the 1970s at the University of Wisconsin–Madison, for which he shared the 1975 Nobel Prize in Physiology or Medicine with Renato Dulbecco and David Baltimore.

<span class="mw-page-title-main">Endogenous retrovirus</span> Inherited retrovirus encoded in an organisms genome

Endogenous retroviruses (ERVs) are endogenous viral elements in the genome that closely resemble and can be derived from retroviruses. They are abundant in the genomes of jawed vertebrates, and they comprise up to 5–8% of the human genome.

Rous sarcoma virus (RSV) is a retrovirus and is the first oncovirus to have been described. It causes sarcoma in chickens.

Retrotransposon markers are components of DNA which are used as cladistic markers. They assist in determining the common ancestry, or not, of related taxa. The "presence" of a given retrotransposon in related taxa suggests their orthologous integration, a derived condition acquired via a common ancestry, while the "absence" of particular elements indicates the plesiomorphic condition prior to integration in more distant taxa. The use of presence/absence analyses to reconstruct the systematic biology of mammals depends on the availability of retrotransposons that were actively integrating before the divergence of a particular species.

Exon shuffling is a molecular mechanism for the formation of new genes. It is a process through which two or more exons from different genes can be brought together ectopically, or the same exon can be duplicated, to create a new exon-intron structure. There are different mechanisms through which exon shuffling occurs: transposon mediated exon shuffling, crossover during sexual recombination of parental genomes and illegitimate recombination.

<span class="mw-page-title-main">Mobile genetic elements</span> DNA sequence whose position in the genome is variable

Mobile genetic elements (MGEs), sometimes called selfish genetic elements, are a type of genetic material that can move around within a genome, or that can be transferred from one species or replicon to another. MGEs are found in all organisms. In humans, approximately 50% of the genome is thought to be MGEs. MGEs play a distinct role in evolution. Gene duplication events can also happen through the mechanism of MGEs. MGEs can also cause mutations in protein coding regions, which alters the protein functions. These mechanisms can also rearrange genes in the host genome generating variation. These mechanism can increase fitness by gaining new or additional functions. An example of MGEs in evolutionary context are that virulence factors and antibiotic resistance genes of MGEs can be transported to share genetic code with neighboring bacteria. However, MGEs can also decrease fitness by introducing disease-causing alleles or mutations. The set of MGEs in an organism is called a mobilome, which is composed of a large number of plasmids, transposons and viruses.

<span class="mw-page-title-main">Long terminal repeat</span> DNA sequence

A long terminal repeat (LTR) is a pair of identical sequences of DNA, several hundred base pairs long, which occur in eukaryotic genomes on either end of a series of genes or pseudogenes that form a retrotransposon or an endogenous retrovirus or a retroviral provirus. All retroviral genomes are flanked by LTRs, while there are some retrotransposons without LTRs. Typically, an element flanked by a pair of LTRs will encode a reverse transcriptase and an integrase, allowing the element to be copied and inserted at a different location of the genome. Copies of such an LTR-flanked element can often be found hundreds or thousands of times in a genome. LTR retrotransposons comprise about 8% of the human genome.

The gag-onc fusion protein is a general term for a fusion protein formed from a group-specific antigen ('gag') gene and that of an oncogene ('onc'), a gene that plays a role in the development of a cancer. The name is also written as Gag-v-Onc, with "v" indicating that the Onc sequence resides in a viral genome. Onc is a generic placeholder for a given specific oncogene, such as C-jun..

v-Src is a gene found in Rous sarcoma virus (RSV) that encodes a tyrosine kinase that causes a type of cancer in chickens.

Avian sarcoma leukosis virus (ASLV) is an endogenous retrovirus that infects and can lead to cancer in chickens; experimentally it can infect other species of birds and mammals. ASLV replicates in chicken embryo fibroblasts, the cells that contribute to the formation of connective tissues. Different forms of the disease exist, including lymphoblastic, erythroblastic, and osteopetrotic.

<span class="mw-page-title-main">LTR retrotransposon</span> Class I transposable element

LTR retrotransposons are class I transposable element characterized by the presence of long terminal repeats (LTRs) directly flanking an internal coding region. As retrotransposons, they mobilize through reverse transcription of their mRNA and integration of the newly created cDNA into another location. Their mechanism of retrotransposition is shared with retroviruses, with the difference that most LTR-retrotransposons do not form infectious particles that leave the cells and therefore only replicate inside their genome of origin. Those that do (occasionally) form virus-like particles are classified under Ortervirales.

A conserved non-coding sequence (CNS) is a DNA sequence of noncoding DNA that is evolutionarily conserved. These sequences are of interest for their potential to regulate gene production.

<span class="mw-page-title-main">Long interspersed nuclear element</span>

Long interspersed nuclear elements (LINEs) are a group of non-LTR retrotransposons that are widespread in the genome of many eukaryotes. LINEs contain an internal Pol II promoter to initiate transcription into mRNA, and encode one or two proteins, ORF1 and ORF2. The functional domains present within ORF1 vary greatly among LINEs, but often exhibit RNA/DNA binding activity. ORF2 is essential to successful retrotransposition, and encodes a protein with both reverse transcriptase and endonuclease activity.

<span class="mw-page-title-main">Short interspersed nuclear element</span>

Short interspersed nuclear elements (SINEs) are non-autonomous, non-coding transposable elements (TEs) that are about 100 to 700 base pairs in length. They are a class of retrotransposons, DNA elements that amplify themselves throughout eukaryotic genomes, often through RNA intermediates. SINEs compose about 13% of the mammalian genome.

Cer6 is a LTR retrotransposon that is described from sequencing data in the chromosome III of C. elegans.

References

  1. Nilsson, M. A.; Churakov, G.; Sommer, M.; et al. (2010). "Tracking Marsupial Evolution Using Archaic Genomic Retroposon Insertions". PLOS Biology. 8 (7): e1000436. doi: 10.1371/journal.pbio.1000436 . PMC   2910653 . PMID   20668664.
  2. Temin HM (1989). "Retrons in bacteria". Nature. 339 (6222): 254–5. Bibcode:1989Natur.339..254T. doi: 10.1038/339254a0 . PMID   2471077. S2CID   29368479.
  3. Makałowski W, Toda Y (2007). "Modulation of Host Genes by Mammalian Transposable Elements". In Volff JN, Schmid M (eds.). Gene and Protein Evolution. Genome Dynamics. Vol. 3 (3rd ed.). Basel: S. Karger. p. 166. doi:10.1159/000107610. ISBN   978-3-8055-8341-1. OCLC   729848415. PMID   18753791.
  4. Makałowski W, Gotea V, Pande A, Makałowska I (2019). "Transposable Elements: Classification, Identification, and Their Use as a Tool for Comparative Genomics". In Anisimova M (ed.). Evolutionary Genomics. Methods in Molecular Biology. Vol. 1910. Clifton, NJ. pp. 185–86. doi: 10.1007/978-1-4939-9074-0_6 . ISBN   978-3-8055-8341-1. OCLC   145014779. PMID   31278665.{{cite book}}: CS1 maint: location missing publisher (link)
  5. Emerson JJ, Kaessmann H, Betrán E, Long M (January 2004). "Extensive gene traffic on the mammalian X chromosome". Science. 303 (5657): 537–40. Bibcode:2004Sci...303..537E. doi:10.1126/science.1090042. PMID   14739461. S2CID   41158232.
  6. Swanstrom, R; Parker, R C; Varmus, H E; Bishop, J M (May 1983). "Transduction of a cellular oncogene: the genesis of Rous sarcoma virus". Proceedings of the National Academy of Sciences. 80 (9): 2519–2523. Bibcode:1983PNAS...80.2519S. doi: 10.1073/pnas.80.9.2519 . PMC   393857 . PMID   6302692.