Retrotransposon

Last updated

Simplified representation of the life cycle of a retrotransposon Retrotransposons.png
Simplified representation of the life cycle of a retrotransposon

Retrotransposons (also called Class I transposable elements) are mobile elements which move in the host genome by converting their transcribed RNA into DNA through the reverse transcription. [1] Thus, they differ from Class II transposable elements, or DNA transposons, in utilizing an RNA intermediate for the transposition and leaving the transposition donor site unchanged. [2]

Contents

Through reverse transcription, retrotransposons amplify themselves quickly to become abundant in eukaryotic genomes such as maize (49–78%) [3] and humans (42%). [4] They are only present in eukaryotes but share features with retroviruses such as HIV, for example, discontinuous reverse transcriptase-mediated extrachromosomal recombination. [5] [6]

There are two main types of retrotransposons, long terminal repeats (LTRs) and non-long terminal repeats (non-LTRs). Retrotransposons are classified based on sequence and method of transposition. [7] Most retrotransposons in the maize genome are LTR, whereas in humans they are mostly non-LTR.

LTR retrotransposons

LTR retrotransposons are characterized by their long terminal repeats (LTRs), which are present at both the 5' and 3' ends of their sequences. These LTRs contain the promoters for these transposable elements (TEs), are essential for TE integration, and can vary in length from just over 100 base pairs (bp) to more than 1,000 bp. On average, LTR retrotransposons span several thousand base pairs, with the largest known examples reaching up to 30 kilobases (kb).

LTRs are highly functional sequences, and for that reason LTR and non-LTR retrotransposons differ greatly in their reverse transcription and integration mechanisms. Non-LTR retrotransposons use a target-primed reverse transcription (TPRT) process, which requires the RNA of the TE to be brought to the cleavage site of the retrotransposon’s integrase, where it is reverse transcribed. In contrast, LTR retrotransposons undergo reverse transcription in the cytoplasm, utilizing two rounds of template switching, and a formation of a pre-integration complex (PIC) composed of double-stranded DNA and an integrase dimer bound to LTRs. This complex then moves into the nucleus for integration into a new genomic location.

LTR retrotransposons typically encode the proteins gag and pol, which may be combined into a single open reading frame (ORF) or separated into distinct ORFs. Similar to retroviruses, the gag protein is essential for capsid assembly and the packaging of the TE's RNA and associated proteins. The pol protein is necessary for reverse transcription and includes these crucial domains: PR (protease), RT (reverse transcriptase), RH (RNase H), and INT (integrase). Additionally, some LTR retrotransposons have an ORF for an envelope (env) protein that is incorporated into the assembled capsid, facilitating attachment to cellular surfaces.

Endogenous retrovirus

An endogenous retrovirus is a retrovirus without virus pathogenic effects that has been integrated into the host genome by inserting their inheritable genetic information into cells that can be passed onto the next generation like a retrotransposon. [8] Because of this, they share features with retroviruses and retrotransposons. When the retroviral DNA is integrated into the host genome they evolve into endogenous retroviruses that influence eukaryotic genomes. So many endogenous retroviruses have inserted themselves into eukaryotic genomes that they allow insight into biology between viral-host interactions and the role of retrotransposons in evolution and disease. Many retrotransposons share features with endogenous retroviruses, the property of recognising and fusing with the host genome. However, there is a key difference between retroviruses and retrotransposons, which is indicated by the env gene. Although similar to the gene carrying out the same function in retroviruses, the env gene is used to determine whether the gene is retroviral or retrotransposon. If the gene is retroviral it can evolve from a retrotransposon into a retrovirus. They differ by the order of sequences in pol genes. Env genes are found in LTR retrotransposon types Ty1-copia (Pseudoviridae), Ty3-gypsy (Metaviridae) and BEL/Pao. [9] [8] They encode glycoproteins on the retrovirus envelope needed for entry into the host cell. Retroviruses can move between cells whereas LTR retrotransposons can only move themselves into the genome of the same cell. [10] Many vertebrate genes were formed from retroviruses and LTR retrotransposons. One endogenous retrovirus or LTR retrotransposon has the same function and genomic locations in different species, suggesting their role in evolution. [11]

Non-LTR retrotransposons

Like LTR retrotransposons, non-LTR retrotransposons contain genes for reverse transcriptase, RNA-binding protein, nuclease, and sometimes ribonuclease H domain [12] but they lack the long terminal repeats. RNA-binding proteins bind the RNA-transposition intermediate and nucleases are enzymes that break phosphodiester bonds between nucleotides in nucleic acids. Instead of LTRs, non-LTR retrotransposons have short repeats that can have an inverted order of bases next to each other aside from direct repeats found in LTR retrotransposons that is just one sequence of bases repeating itself.

Although they are retrotransposons, they cannot carry out reverse transcription using an RNA transposition intermediate in the same way as LTR retrotransposons. Those two key components of the retrotransposon are still necessary but the way they are incorporated into the chemical reactions is different. This is because unlike LTR retrotransposons, non-LTR retrotransposons do not contain sequences that bind tRNA.

They mostly fall into two types – LINEs (Long interspersed nuclear elements) and SINEs (Short interspersed nuclear elements). SVA elements are the exception between the two as they share similarities with both LINEs and SINEs, containing Alu elements and different numbers of the same repeat. SVAs are shorter than LINEs but longer than SINEs.

While historically viewed as "junk DNA", research suggests in some cases, both LINEs and SINEs were incorporated into novel genes to form new functions. [13]

LINEs

When a LINE is transcribed, the transcript contains an RNA polymerase II promoter that ensures LINEs can be copied into whichever location it inserts itself into. RNA polymerase II is the enzyme that transcribes genes into mRNA transcripts. The ends of LINE transcripts are rich in multiple adenines, [14] the bases that are added at the end of transcription so that LINE transcripts would not be degraded. This transcript is the RNA transposition intermediate.

The RNA transposition intermediate moves from the nucleus into the cytoplasm for translation. This gives the two coding regions of a LINE that in turn binds back to the RNA it is transcribed from. The LINE RNA then moves back into the nucleus to insert into the eukaryotic genome.

LINEs insert themselves into regions of the eukaryotic genome that are rich in bases AT. At AT regions LINE uses its nuclease to cut one strand of the eukaryotic double-stranded DNA. The adenine-rich sequence in LINE transcript base pairs with the cut strand to flag where the LINE will be inserted with hydroxyl groups. Reverse transcriptase recognises these hydroxyl groups to synthesise LINE retrotransposon where the DNA is cut. Like with LTR retrotransposons, this new inserted LINE contains eukaryotic genome information so it can be copied and pasted into other genomic regions easily. The information sequences are longer and more variable than those in LTR retrotransposons.

Most LINE copies have variable length at the start because reverse transcription usually stops before DNA synthesis is complete. In some cases this causes RNA polymerase II promoter to be lost so LINEs cannot transpose further. [15]

Genetic structure of murine LINE1 and SINEs. Bottom: proposed structure of L1 RNA-protein (RNP) complexes. ORF1 proteins form trimers, exhibiting RNA binding and nucleic acid chaperone activity. LINE1s and SINEs.png
Genetic structure of murine LINE1 and SINEs. Bottom: proposed structure of L1 RNA-protein (RNP) complexes. ORF1 proteins form trimers, exhibiting RNA binding and nucleic acid chaperone activity.

Human L1

LINE-1 (L1) retrotransposons make up a significant portion of the human genome, with an estimated 500,000 copies per genome. Genes encoding for human LINE1 usually have their transcription inhibited by methyl groups binding to its DNA carried out by PIWI proteins and enzymes DNA methyltransferases. L1 retrotransposition can disrupt the nature of genes transcribed by pasting themselves inside or near genes which could in turn lead to human disease. LINE1s can only retrotranspose in some cases to form different chromosome structures contributing to differences in genetics between individuals. [17] There is an estimate of 80–100 active L1s in the reference genome of the Human Genome Project, and an even smaller number of L1s within those active L1s retrotranspose often. L1 insertions have been associated with tumorigenesis by activating cancer-related genes oncogenes and diminishing tumor suppressor genes.

Each human LINE1 contains two regions from which gene products can be encoded. The first coding region contains a leucine zipper protein involved in protein-protein interactions and a protein that binds to the terminus of nucleic acids. The second coding region has a purine/pyrimidine nuclease, reverse transcriptase and protein rich in amino acids cysteines and histidines. The end of the human LINE1, as with other retrotransposons is adenine-rich. [18] [19] [20]

Human L1 actively retrotransposes in the human genome. A recent study identified 1,708 somatic L1 retrotransposition events, especially in colorectal epithelial cells. These events occur from early embryogenesis and retrotransposition rate is substantially increased during colorectal tumourigenesis. [21]

SINEs

SINEs are much shorter (300bp) than LINEs. [22] They share similarity with genes transcribed by RNA polymerase II, the enzyme that transcribes genes into mRNA transcripts, and the initiation sequence of RNA polymerase III, the enzyme that transcribes genes into ribosomal RNA, tRNA and other small RNA molecules. [23] SINEs such as mammalian MIR elements have tRNA gene at the start and adenine-rich at the end like in LINEs.

SINEs do not encode a functional reverse transcriptase protein and rely on other mobile transposons, especially LINEs. [24] SINEs exploit LINE transposition components despite LINE-binding proteins prefer binding to LINE RNA. SINEs cannot transpose by themselves because they cannot encode SINE transcripts. They usually consist of parts derived from tRNA and LINEs. The tRNA portion contains an RNA polymerase III promoter which the same kind of enzyme as RNA polymerase II. This makes sure the LINE copies would be transcribed into RNA for further transposition. The LINE component remains so LINE-binding proteins can recognise the LINE part of the SINE.

Alu elements

Alus are the most common SINE in primates. They are approximately 350 base pairs long, do not encode proteins and can be recognized by the restriction enzyme AluI (hence the name). Their distribution may be important in some genetic diseases and cancers. Copy and pasting Alu RNA requires the Alu's adenine-rich end and the rest of the sequence bound to a signal. The signal-bound Alu can then associate with ribosomes. LINE RNA associates on the same ribosomes as the Alu. Binding to the same ribosome allows Alus of SINEs to interact with LINE. This simultaneous translation of Alu element and LINE allows SINE copy and pasting.

SVA elements

SVA elements are present at lower levels than SINES and LINEs in humans. The starts of SVA and Alu elements are similar, followed by repeats and an end similar to endogenous retrovirus. LINEs bind to sites flanking SVA elements to transpose them. SVA are one of the youngest transposons in great apes genome and among the most active and polymorphic in the human population. SVA was created by a fusion between an Alu element, a VNTR (variable number tandem repeat), and an LTR fragment. [25]

Role in human disease

Retrotransposons ensure they are not lost by chance by occurring only in cell genetics that can be passed on from one generation to the next from parent gametes. However, LINEs can transpose into the human embryo cells that eventually develop into the nervous system, raising the question whether this LINE retrotransposition affects brain function. LINE retrotransposition is also a feature of several cancers, but it is unclear whether retrotransposition itself causes cancer instead of just a symptom. Uncontrolled retrotransposition is bad for both the host organism and retrotransposons themselves so they have to be regulated. Retrotransposons are regulated by RNA interference. RNA interference is carried out by a bunch of short non-coding RNAs. The short non-coding RNA interacts with protein Argonaute to degrade retrotransposon transcripts and change their DNA histone structure to reduce their transcription.

Role in evolution

LTR retrotransposons came about later than non-LTR retrotransposons, possibly from an ancestral non-LTR retrotransposon acquiring an integrase from a DNA transposon. Retroviruses gained additional properties to their virus envelopes by taking the relevant genes from other viruses using the power of LTR retrotransposon.

Due to their retrotransposition mechanism, retrotransposons amplify in number quickly, composing 40% of the human genome. The insertion rates for LINE1, Alu and SVA elements are 1/200 – 1/20, 1/20 and 1/900 respectively. The LINE1 insertion rates have varied a lot over the past 35 million years, so they indicate points in genome evolution.

Notably a large number of 100 kilobases in the maize genome show variety due to the presence or absence of retrotransposons. However since maize is unusual genetically as compared to other plants it cannot be used to predict retrotransposition in other plants.

Mutations caused by retrotransposons include:

Role in biotechnology

See also

Related Research Articles

<span class="mw-page-title-main">Genome</span> All genetic material of an organism

In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA. The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as regulatory sequences, and often a substantial fraction of junk DNA with no evident function. Almost all eukaryotes have mitochondria and a small mitochondrial genome. Algae and plants also contain chloroplasts with a chloroplast genome.

<span class="mw-page-title-main">Retroposon</span>

Retroposons are repetitive DNA fragments which are inserted into chromosomes after they had been reverse transcribed from any RNA molecule.

<span class="mw-page-title-main">Transposable element</span> Semiparasitic DNA sequence

A transposable element is a nucleic acid sequence in DNA that can change its position within a genome, sometimes creating or reversing mutations and altering the cell's genetic identity and genome size. Transposition often results in duplication of the same genetic material. In the human genome, L1 and Alu elements are two examples. Barbara McClintock's discovery of them earned her a Nobel Prize in 1983. Its importance in personalized medicine is becoming increasingly relevant, as well as gaining more attention in data analytics given the difficulty of analysis in very high dimensional spaces.

Non-coding DNA (ncDNA) sequences are components of an organism's DNA that do not encode protein sequences. Some non-coding DNA is transcribed into functional non-coding RNA molecules. Other functional regions of the non-coding DNA fraction include regulatory sequences that control gene expression; scaffold attachment regions; origins of DNA replication; centromeres; and telomeres. Some non-coding regions appear to be mostly nonfunctional, such as introns, pseudogenes, intergenic DNA, and fragments of transposons and viruses. Regions that are completely nonfunctional are called junk DNA.

An Alu element is a short stretch of DNA originally characterized by the action of the Arthrobacter luteus (Alu) restriction endonuclease. Alu elements are the most abundant transposable elements in the human genome, present in excess of one million copies. Alu elements were thought to be selfish or parasitic DNA, because their sole known function is self reproduction. However, they are likely to play a role in evolution and have been used as genetic markers. They are derived from the small cytoplasmic 7SL RNA, a component of the signal recognition particle. Alu elements are highly conserved within primate genomes and originated in the genome of an ancestor of Supraprimates.

Repeated sequences are short or long patterns of nucleic acids that occur in multiple copies throughout the genome. In many organisms, a significant fraction of the genomic DNA is repetitive, with over two-thirds of the sequence consisting of repetitive elements in humans. Some of these repeated sequences are necessary for maintaining important genome structures such as telomeres or centromeres.

Metaviridae is a family of viruses which exist as Ty3-gypsy LTR retrotransposons in a eukaryotic host's genome. They are closely related to retroviruses: members of the family Metaviridae share many genomic elements with retroviruses, including length, organization, and genes themselves. This includes genes that encode reverse transcriptase, integrase, and capsid proteins. The reverse transcriptase and integrase proteins are needed for the retrotransposon activity of the virus. In some cases, virus-like particles can be formed from capsid proteins.

<span class="mw-page-title-main">Endogenous retrovirus</span> Inherited retrovirus encoded in an organisms genome

Endogenous retroviruses (ERVs) are endogenous viral elements in the genome that closely resemble and can be derived from retroviruses. They are abundant in the genomes of jawed vertebrates, and they comprise up to 5–8% of the human genome.

Eukaryotic chromosome fine structure refers to the structure of sequences for eukaryotic chromosomes. Some fine sequences are included in more than one class, so the classification listed is not intended to be completely separate.

Exon shuffling is a molecular mechanism for the formation of new genes. It is a process through which two or more exons from different genes can be brought together ectopically, or the same exon can be duplicated, to create a new exon-intron structure. There are different mechanisms through which exon shuffling occurs: transposon mediated exon shuffling, crossover during sexual recombination of parental genomes and illegitimate recombination.

<span class="mw-page-title-main">Mobile genetic elements</span> DNA sequence whose position in the genome is variable

Mobile genetic elements (MGEs), sometimes called selfish genetic elements, are a type of genetic material that can move around within a genome, or that can be transferred from one species or replicon to another. MGEs are found in all organisms. In humans, approximately 50% of the genome is thought to be MGEs. MGEs play a distinct role in evolution. Gene duplication events can also happen through the mechanism of MGEs. MGEs can also cause mutations in protein coding regions, which alters the protein functions. These mechanisms can also rearrange genes in the host genome generating variation. These mechanism can increase fitness by gaining new or additional functions. An example of MGEs in evolutionary context are that virulence factors and antibiotic resistance genes of MGEs can be transported to share genetic code with neighboring bacteria. However, MGEs can also decrease fitness by introducing disease-causing alleles or mutations. The set of MGEs in an organism is called a mobilome, which is composed of a large number of plasmids, transposons and viruses.

<span class="mw-page-title-main">Long terminal repeat</span> DNA sequence

A long terminal repeat (LTR) is a pair of identical sequences of DNA, several hundred base pairs long, which occur in eukaryotic genomes on either end of a series of genes or pseudogenes that form a retrotransposon or an endogenous retrovirus or a retroviral provirus. All retroviral genomes are flanked by LTRs, while there are some retrotransposons without LTRs. Typically, an element flanked by a pair of LTRs will encode a reverse transcriptase and an integrase, allowing the element to be copied and inserted at a different location of the genome. Copies of such an LTR-flanked element can often be found hundreds or thousands of times in a genome. LTR retrotransposons comprise about 8% of the human genome.

<span class="mw-page-title-main">Knockout rat</span> Type of genetically engineered rat

A knockout rat is a genetically engineered rat with a single gene turned off through a targeted mutation used for academic and pharmaceutical research. Knockout rats can mimic human diseases and are important tools for studying gene function and for drug discovery and development. The production of knockout rats was not economically or technically feasible until 2008.

<span class="mw-page-title-main">LTR retrotransposon</span> Class I transposable element

LTR retrotransposons are class I transposable elements (TEs) characterized by the presence of long terminal repeats (LTRs) directly flanking an internal coding region. As retrotransposons, they mobilize through reverse transcription of their mRNA and integration of the newly created cDNA into another genomic location. Their mechanism of retrotransposition is shared with retroviruses, with the difference that the rate of horizontal transfer in LTR-retrotransposons is much lower than the vertical transfer by passing active TE insertions to the progeny. LTR retrotransposons that form virus-like particles are classified under Ortervirales.

A conserved non-coding sequence (CNS) is a DNA sequence of noncoding DNA that is evolutionarily conserved. These sequences are of interest for their potential to regulate gene production.

<span class="mw-page-title-main">Long interspersed nuclear element</span>

Long interspersed nuclear elements (LINEs) are a group of non-LTR retrotransposons that are widespread in the genome of many eukaryotes. LINEs contain an internal Pol II promoter to initiate transcription into mRNA, and encode one or two proteins, ORF1 and ORF2. The functional domains present within ORF1 vary greatly among LINEs, but often exhibit RNA/DNA binding activity. ORF2 is essential to successful retrotransposition, and encodes a protein with both reverse transcriptase and endonuclease activity.

Transposable elements are short strands of repetitive DNA that can self-replicate and translocate within the eukaryotic genome, and are generally perceived as parasitic in nature. Their transcription can lead to the production of dsRNAs, which resemble retroviruses transcripts. While most host cellular RNA has a singular, unpaired sense strand, dsRNA possesses sense and anti-sense transcripts paired together, and this difference in structure allows an host organism to detect dsRNA production, and thereby the presence of transposons. Plants lack distinct divisions between somatic cells and reproductive cells, and also have, generally, larger genomes than animals, making them an intriguing case-study kingdom to be used in attempting to better understand the epigenetics function of transposable elements.

<span class="mw-page-title-main">LINE1</span> Group of transposable elements

LINE1 is a family of related class I transposable elements in the DNA of some organisms, classified with the long interspersed elements (LINEs). L1 transposons comprise approximately 17% of the human genome. These active L1s can interrupt the genome through insertions, deletions, rearrangements, and copy number variations. L1 activity has contributed to the instability and evolution of genomes and is tightly regulated in the germline by DNA methylation, histone modifications, and piRNA. L1s can further impact genome variation through mispairing and unequal crossing over during meiosis due to its repetitive DNA sequences.

<span class="mw-page-title-main">Short interspersed nuclear element</span>

Short interspersed nuclear elements (SINEs) are non-autonomous, non-coding transposable elements (TEs) that are about 100 to 700 base pairs in length. They are a class of retrotransposons, DNA elements that amplify themselves throughout eukaryotic genomes, often through RNA intermediates. SINEs compose about 13% of the mammalian genome.

Cer6 is a LTR retrotransposon that is described from sequencing data in the chromosome III of C. elegans.

References

  1. Dombroski BA, Feng Q, Mathias SL, Sassaman DM, Scott AF, Kazazian HH, Boeke JD (July 1994). "An in vivo assay for the reverse transcriptase of human retrotransposon L1 in Saccharomyces cerevisiae". Molecular and Cellular Biology. 14 (7): 4485–92. doi:10.1128/mcb.14.7.4485. PMC   358820 . PMID   7516468.
  2. Craig, Nancy Lynn (2015). Mobile DNA III. Washington (D.C.): ASM press. ISBN   9781555819200.
  3. SanMiguel P, Bennetzen JL (1998). "Evidence that a recent increase in maize genome size was caused by the massive amplification of intergene retrotranposons". Annals of Botany. 82 (Suppl A): 37–44. doi: 10.1006/anbo.1998.0746 .
  4. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. (February 2001). "Initial sequencing and analysis of the human genome". Nature. 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi: 10.1038/35057062 . hdl: 2027.42/62798 . PMID   11237011.
  5. Sanchez DH, Gaubert H, Drost HG, Zabet NR, Paszkowski J (November 2017). "High-frequency recombination between members of an LTR retrotransposon family during transposition bursts". Nature Communications. 8 (1): 1283. Bibcode:2017NatCo...8.1283S. doi:10.1038/s41467-017-01374-x. PMC   5668417 . PMID   29097664.
  6. Drost HG, Sanchez DH (December 2019). "Becoming a Selfish Clan: Recombination Associated to Reverse-Transcription in LTR Retrotransposons". Genome Biology and Evolution. 11 (12): 3382–3392. doi:10.1093/gbe/evz255. PMC   6894440 . PMID   31755923.
  7. Xiong Y, Eickbush TH (October 1990). "Origin and evolution of retroelements based upon their reverse transcriptase sequences". The EMBO Journal. 9 (10): 3353–62. doi:10.1002/j.1460-2075.1990.tb07536.x. PMC   552073 . PMID   1698615.
  8. 1 2 Wicker T, Sabot F, Hua-Van A, Bennetzen JL, Capy P, Chalhoub B, Flavell A, Leroy P, Morgante M, Panaud O, Paux E, SanMiguel P, Schulman AH (December 2007). "A unified classification system for eukaryotic transposable elements". Nature Reviews. Genetics. 8 (12): 973–82. doi:10.1038/nrg2165. PMID   17984973. S2CID   32132898.
  9. Copeland CS, Mann VH, Morales ME, Kalinna BH, Brindley PJ (February 2005). "The Sinbad retrotransposon from the genome of the human blood fluke, Schistosoma mansoni, and the distribution of related Pao-like elements". BMC Evolutionary Biology. 5 (1): 20. doi: 10.1186/1471-2148-5-20 . PMC   554778 . PMID   15725362.
  10. Havecker ER, Gao X, Voytas DF (18 May 2004). "The diversity of LTR retrotransposons". Genome Biology. 5 (225): 225. doi: 10.1186/gb-2004-5-6-225 . PMC   463057 . PMID   15186483.
  11. Naville M, Warren IA, Haftek-Terreau Z, Chalopin D, Brunet F, Levin P, Galiana D, Volff JN (17 February 2016). "Not so bad after all: retroviruses and long terminal repeat retrotransposons as a source of new genes in vertebrates". Clinical Microbiology and Infection. 22 (4): 312–323. doi: 10.1016/j.cmi.2016.02.001 . PMID   26899828.
  12. Yadav VP, Mandal PK, Rao DN, Bhattacharya S (December 2009). "Characterization of the restriction enzyme-like endonuclease encoded by the Entamoeba histolytica non-long terminal repeat retrotransposon EhLINE1". The FEBS Journal. 276 (23): 7070–82. doi:10.1111/j.1742-4658.2009.07419.x. PMID   19878305. S2CID   30791213.
  13. Santangelo AM, de Souza FS, Franchini LF, Bumaschny VF, Low MJ, Rubinstein M (October 2007). "Ancient exaptation of a CORE-SINE retroposon into a highly conserved mammalian neuronal enhancer of the proopiomelanocortin gene". PLOS Genetics. 3 (10): 1813–26. doi: 10.1371/journal.pgen.0030166 . PMC   2000970 . PMID   17922573.
  14. Liang KH, Yeh CT (May 2013). "A gene expression restriction network mediated by sense and antisense Alu sequences located on protein-coding messenger RNAs". BMC Genomics. 14: 325. doi: 10.1186/1471-2164-14-325 . PMC   3655826 . PMID   23663499.
  15. Singer MF (March 1982). "SINEs and LINEs: highly repeated short and long interspersed sequences in mammalian genomes". Cell. 28 (3): 433–4. doi:10.1016/0092-8674(82)90194-5. PMID   6280868. S2CID   22129236.
  16. Walter M (2015). "Transposon regulation upon dynamic loss of DNA methylation (PDF Download Available)". ResearchGate. doi:10.13140/rg.2.2.18747.21286.
  17. Chueh AC, Northrop EL, Brettingham-Moore KH, Choo KH, Wong LH (January 2009). Bickmore WA (ed.). "LINE retrotransposon RNA is an essential structural and functional epigenetic component of a core neocentromeric chromatin". PLOS Genetics. 5 (1): e1000354. doi: 10.1371/journal.pgen.1000354 . PMC   2625447 . PMID   19180186.
  18. Doucet AJ, Hulme AE, Sahinovic E, Kulpa DA, Moldovan JB, Kopera HC, Athanikar JN, Hasnaoui M, Bucheton A, Moran JV, Gilbert N (October 2010). "Characterization of LINE-1 ribonucleoprotein particles". PLOS Genetics. 6 (10): e1001150. doi: 10.1371/journal.pgen.1001150 . PMC   2951350 . PMID   20949108.
  19. Denli AM, Narvaiza I, Kerman BE, Pena M, Benner C, Marchetto MC, Diedrich JK, Aslanian A, Ma J, Moresco JJ, Moore L, Hunter T, Saghatelian A, Gage FH (October 2015). "Primate-specific ORF0 contributes to retrotransposon-mediated diversity". Cell. 163 (3): 583–93. doi: 10.1016/j.cell.2015.09.025 . PMID   26496605. S2CID   10525450.
  20. Ohshima K, Okada N (2005). "SINEs and LINEs: symbionts of eukaryotic genomes with a common tail". Cytogenetic and Genome Research. 110 (1–4): 475–90. doi:10.1159/000084981. PMID   16093701. S2CID   42841487.
  21. Nam, Chang Hyun; Youk, Jeonghwan; Kim, Jeong Yeon; Lim, Joonoh; Park, Jung Woo; Oh, Soo A; Lee, Hyun Jung; Park, Ji Won; Won, Hyein; Lee, Yunah; Jeong, Seung-Yong; Lee, Dong-Sung; Oh, Ji Won; Han, Jinju; Lee, Junehawk (2023-05-18). "Widespread somatic L1 retrotransposition in normal colorectal epithelium". Nature. 617 (7961): 540–547. Bibcode:2023Natur.617..540N. doi: 10.1038/s41586-023-06046-z . ISSN   0028-0836. PMC   10191854 . PMID   37165195.
  22. Stansfield WD, King RC (1997). A dictionary of genetics (5th ed.). Oxford [Oxfordshire]: Oxford University Press. ISBN   978-0-19-509441-1.
  23. Kramerov DA, Vassetzky NS (2005). "Short retroposons in eukaryotic genomes". International Review of Cytology. 247: 165–221. doi:10.1016/s0074-7696(05)47004-7. PMID   16344113.
  24. Dewannieux M, Esnault C, Heidmann T (September 2003). "LINE-mediated retrotransposition of marked Alu sequences". Nature Genetics. 35 (1): 41–8. doi:10.1038/ng1223. PMID   12897783. S2CID   32151696.
  25. Wells, JN; Feschotte, C (23 November 2020). "A Field Guide to Eukaryotic Transposable Elements". Annual Review of Genetics. 54: 539–561. doi:10.1146/annurev-genet-040620-022145. PMC   8293684 . PMID   32955944.