Inverted repeat

Last updated

An inverted repeat (or IR) is a single stranded sequence of nucleotides followed downstream by its reverse complement. [1] The intervening sequence of nucleotides between the initial sequence and the reverse complement can be any length including zero. For example, 5'---TTACGnnnnnnCGTAA---3' is an inverted repeat sequence. When the intervening length is zero, the composite sequence is a palindromic sequence. [2]

Contents

Both inverted repeats and direct repeats constitute types of nucleotide sequences that occur repetitively. These repeated DNA sequences often range from a pair of nucleotides to a whole gene, while the proximity of the repeat sequences varies between widely dispersed and simple tandem arrays. [3] The short tandem repeat sequences may exist as just a few copies in a small region to thousands of copies dispersed all over the genome of most eukaryotes. [4] Repeat sequences with about 10–100 base pairs are known as minisatellites, while shorter repeat sequences having mostly 2–4 base pairs are known as microsatellites. [5] The most common repeats include the dinucleotide repeats, which have the bases AC on one DNA strand, and GT on the complementary strand. [3] Some elements of the genome with unique sequences function as exons, introns and regulatory DNA. [6] Though the most familiar loci of the repetitive sequences are the centromere and the telomere, [6] a large portion of the repeated sequences in the genome are found among the noncoding DNA. [5]

Inverted repeats have a number of important biological functions. They define the boundaries in transposons and indicate regions capable of self-complementary base pairing (regions within a single sequence which can base pair with each other). These properties play an important role in genome instability [7] and contribute not only to cellular evolution and genetic diversity [8] but also to mutation and disease. [9] In order to study these effects in detail, a number of programs and databases have been developed to assist in discovery and annotation of inverted repeats in various genomes.

Understanding inverted repeats

Example of an inverted repeat

The 5 base-pair sequence on the left is "repeated" and "inverted" to form sequence on the right. Inverted-Repeat.gif
The 5 base-pair sequence on the left is "repeated" and "inverted" to form sequence on the right.

Beginning with this initial sequence:
      5'-TTACG-3'

The complement created by base pairing is:
      3'-AATGC-5'

The reverse complement is:
      5'-CGTAA-3'

And, the inverted repeat sequence is:
      5'---TTACGnnnnnnCGTAA---3'

"nnnnnn" represents any number of intervening nucleotides.

Vs. direct repeat

A direct repeat occurs when a sequence is repeated with the same pattern downstream. [1] There is no inversion and no reverse complement associated with a direct repeat. The nucleotide sequence written in bold characters signifies the repeated sequence. It may or may not have intervening nucleotides.

TTACGnnnnnnTTACG 3´
AATGCnnnnnnAATGC 5´

Linguistically, a typical direct repeat is comparable to rhyming, as in "time on a dime".

Vs. tandem repeat

A direct repeat with no intervening nucleotides between the initial sequence and its downstream copy is a Tandem repeat. The nucleotide sequence written in bold characters signifies the repeated sequence.

TTACGTTACG 3´
AATGCAATGC 5´

Linguistically, a typical tandem repeat is comparable to stuttering, or deliberately repeated words, as in "bye-bye".

Vs. palindrome

An inverted repeat sequence with no intervening nucleotides between the initial sequence and its downstream reverse complement is a palindrome. [1]
   EXAMPLE:
     Step 1: start with an inverted repeat: 5' TTACGnnnnnnCGTAA 3'
     Step 2: remove intervening nucleotides: 5' TTACGCGTAA 3'
     This resulting sequence is palindromic because it is the reverse complement of itself. [1]

5' TTACGCGTAA 3'  test sequence (from Step 2 with intervening nucleotides removed)
3' AATGCGCATT 5'  complement of test sequence
5' TTACGCGTAA 3'  reverse complement    This is the same as the test sequence above, and thus, it is a palindrome.

Biological features and functionality

Conditions that favor synthesis

The diverse genome-wide repeats are derived from transposable elements, which are now understood to "jump" about different genomic locations, without transferring their original copies. [10] Subsequent shuttling of the same sequences over numerous generations ensures their multiplicity throughout the genome. [10] The limited recombination of the sequences between two distinct sequence elements known as conservative site-specific recombination (CSSR) results in inversions of the DNA segment, based on the arrangement of the recombination recognition sequences on the donor DNA and recipient DNA. [10] Again, the orientation of two of the recombining sites within the donor DNA molecule relative to the asymmetry of the intervening DNA cleavage sequences, known as the crossover region, is pivotal to the formation of either inverted repeats or direct repeats. [10] Thus, recombination occurring at a pair of inverted sites will invert the DNA sequence between the two sites. [10] Very stable chromosomes have been observed with comparatively fewer numbers of inverted repeats than direct repeats, suggesting a relationship between chromosome stability and the number of repeats. [11]

Regions where presence is obligatory

Terminal inverted repeats have been observed in the DNA of various eukaryotic transposons, even though their source remains unknown. [12] Inverted repeats are principally found at the origins of replication of cell organism and organelles that range from phage plasmids, mitochondria, and eukaryotic viruses to mammalian cells. [13] The replication origins of the phage G4 and other related phages comprise a segment of nearly 139 nucleotide bases that include three inverted repeats that are essential for replication priming. [13]

In the genome

To a large extent, portions of nucleotide repeats are quite often observed as part of rare DNA combinations. [14] The three main repeats which are largely found in particular DNA constructs include the closely precise homopurine-homopyrimidine inverted repeats, which is otherwise referred to as H palindromes, a common occurrence in triple helical H conformations that may comprise either the TAT or CGC nucleotide triads. The others could be described as long inverted repeats having the tendency to produce hairpins and cruciform, and finally direct tandem repeats, which commonly exist in structures described as slipped-loop, cruciform and left-handed Z-DNA. [14]

Common in different organisms

Past studies suggest that repeats are a common feature of eukaryotes unlike the prokaryotes and archaea. [14] Other reports suggest that irrespective of the comparative shortage of repeat elements in prokaryotic genomes, they nevertheless contain hundreds or even thousands of large repeats. [15] Current genomic analysis seem to suggest the existence of a large excess of perfect inverted repeats in many prokaryotic genomes as compared to eukaryotic genomes. [16]

Pseudoknot with four sets of inverted repeats. Inverted repeats 1 and 2 create the stem for stem-loop A and are part of the loop for stem-loop B. Similarly, inverted repeats 3 and 4 form the stem for stem-loop B and are part of the loop for stem-loop A. Pseudoknot-Inverted-Repeats.gif
Pseudoknot with four sets of inverted repeats. Inverted repeats 1 and 2 create the stem for stem-loop A and are part of the loop for stem-loop B. Similarly, inverted repeats 3 and 4 form the stem for stem-loop B and are part of the loop for stem-loop A.

For quantification and comparison of inverted repeats between several species, namely on archaea, see [17]

Inverted repeats in pseudoknots

Pseudoknots are common structural motifs found in RNA. They are formed by two nested stem-loops such that the stem of one structure is formed from the loop of the other. There are multiple folding topologies among pseudoknots and great variation in loop lengths, making them a structurally diverse group. [18]

Inverted repeats are a key component of pseudoknots as can be seen in the illustration of a naturally occurring pseudoknot found in the human telomerase RNA component. [19] Four different sets of inverted repeats are involved in this structure. Sets 1 and 2 are the stem of stem-loop A and are part of the loop for stem-loop B. Similarly, sets 3 and 4 are the stem for stem-loop B and are part of the loop for stem-loop A.

Pseudoknots play a number of different roles in biology. The telomerase pseudoknot in the illustration is critical to that enzyme's activity. [19] The ribozyme for the hepatitis delta virus (HDV) folds into a double-pseudoknot structure and self-cleaves its circular genome to produce a single-genome-length RNA. Pseudoknots also play a role in programmed ribosomal frameshifting found in some viruses and required in the replication of retroviruses. [18]

In riboswitches

Inverted repeats play an important role in riboswitches, which are RNA regulatory elements that control the expression of genes that produce the mRNA, of which they are part. [10] A simplified example of the flavin mononucleotide (FMN) riboswitch is shown in the illustration. This riboswitch exists in the mRNA transcript and has several stem-loop structures upstream from the coding region. However, only the key stem-loops are shown in the illustration, which has been greatly simplified to help show the role of the inverted repeats. There are multiple inverted repeats in this riboswitch as indicated in green (yellow background) and blue (orange background).

Ribo100r.gif

In the absence of FMN, the Anti-termination structure is the preferred conformation for the mRNA transcript. It is created by base-pairing of the inverted repeat region circled in red. When FMN is present, it may bind to the loop and prevent formation of the Anti-termination structure. This allows two different sets of inverted repeats to base-pair and form the Termination structure. [20] The stem-loop on the 3' end is a transcriptional terminator because the sequence immediately following it is a string of uracils (U). If this stem-loop forms (due to the presence of FMN) as the growing RNA strand emerges from the RNA polymerase complex, it will create enough structural tension to cause the RNA strand to dissociate and thus terminate transcription. The dissociation occurs easily because the base-pairing between the U's in the RNA and the A's in the template strand are the weakest of all base-pairings. [10] Thus, at higher concentration levels, FMN down-regulates its own transcription by increasing the formation of the termination structure.

Mutations and disease

Inverted repeats are often described as "hotspots" of eukaryotic and prokaryotic genomic instability. [7] Long inverted repeats are deemed to greatly influence the stability of the genome of various organisms. [21] This is exemplified in E. coli, where genomic sequences with long inverted repeats are seldom replicated, but rather deleted with rapidity. [21] Again, the long inverted repeats observed in yeast greatly favor recombination within the same and adjacent chromosomes, resulting in an equally very high rate of deletion. [21] Finally, a very high rate of deletion and recombination were also observed in mammalian chromosomes regions with inverted repeats. [21] Reported differences in the stability of genomes of interrelated organisms are always an indication of a disparity in inverted repeats. [11] The instability results from the tendency of inverted repeats to fold into hairpin- or cruciform-like DNA structures. These special structures can hinder or confuse DNA replication and other genomic activities. [7] Thus, inverted repeats lead to special configurations in both RNA and DNA that can ultimately cause mutations and disease. [9]

Inverted repeat changing to/from an extruded cruciform.   A: Inverted Repeat Sequences;   B: Loop;   C: Stem with base pairing of the inverted repeat sequences DNA palindrome.svg
Inverted repeat changing to/from an extruded cruciform.   A: Inverted Repeat Sequences;   B: Loop;   C: Stem with base pairing of the inverted repeat sequences

The illustration shows an inverted repeat undergoing cruciform extrusion. DNA in the region of the inverted repeat unwinds and then recombines, forming a four-way junction with two stem-loop structures. The cruciform structure occurs because the inverted repeat sequences self-pair to each other on their own strand. [22]

Extruded cruciforms can lead to frameshift mutations when a DNA sequence has inverted repeats in the form of a palindrome combined with regions of direct repeats on either side. During transcription, slippage and partial dissociation of the polymerase from the template strand can lead to both deletion and insertion mutations. [9] Deletion occurs when a portion of the unwound template strand forms a stem-loop that gets "skipped" by the transcription machinery. Insertion occurs when a stem-loop forms in a dissociated portion of the nascent (newly synthesized) strand causing a portion of the template strand to be transcribed twice. [9]

Antithrombin-gene-strand-switch.gif

Antithrombin deficiency from a point mutation

Imperfect inverted repeats can lead to mutations through intrastrand and interstrand switching. [9] The antithrombin III gene's coding region is an example of an imperfect inverted repeat as shown in the figure on the right. The stem-loop structure forms with a bump at the bottom because the G and T do not pair up. A strand switch event could result in the G (in the bump) being replaced by an A which removes the "imperfection" in the inverted repeat and provides a stronger stem-loop structure. However, the replacement also creates a point mutation converting the GCA codon to ACA. If the strand switch event is followed by a second round of DNA replication, the mutation may become fixed in the genome and lead to disease. Specifically, the missense mutation would lead to a defective gene and a deficiency in antithrombin which could result in the development of venous thromboembolism (blood clots within a vein). [9]

Collagen-gene-strand-switch.gif

Osteogenesis imperfecta from a frameshift mutation

Mutations in the collagen gene can lead to the disease Osteogenesis Imperfecta, which is characterized by brittle bones. [9] In the illustration, a stem-loop formed from an imperfect inverted repeat is mutated with a thymine (T) nucleotide insertion as a result of an inter- or intrastrand switch. The addition of the T creates a base-pairing "match up" with the adenine (A) that was previously a "bump" on the left side of the stem. While this addition makes the stem stronger and perfects the inverted repeat, it also creates a frameshift mutation in the nucleotide sequence which alters the reading frame and will result in an incorrect expression of the gene. [9]

Programs and databases

The following list provides information and external links to various programs and databases for inverted repeats:

Related Research Articles

<i>Parvoviridae</i> Family of viruses

Parvoviruses are a family of animal viruses that constitute the family Parvoviridae. They have linear, single-stranded DNA (ssDNA) genomes that typically contain two genes encoding for a replication initiator protein, called NS1, and the protein the viral capsid is made of. The coding portion of the genome is flanked by telomeres at each end that form into hairpin loops that are important during replication. Parvovirus virions are small compared to most viruses, at 23–28 nanometers in diameter, and contain the genome enclosed in an icosahedral capsid that has a rugged surface.

In a chain-like biological molecule, such as a protein or nucleic acid, a structural motif is a common three-dimensional structure that appears in a variety of different, evolutionarily unrelated molecules. A structural motif does not have to be associated with a sequence motif; it can be represented by different and completely unrelated sequences in different proteins or RNA.

Satellite DNA consists of very large arrays of tandemly repeating, non-coding DNA. Satellite DNA is the main component of functional centromeres, and form the main structural constituent of heterochromatin.

Repeated sequences are short or long patterns of nucleic acids that occur in multiple copies throughout the genome. In many organisms, a significant fraction of the genomic DNA is repetitive, with over two-thirds of the sequence consisting of repetitive elements in humans. Some of these repeated sequences are necessary for maintaining important genome structures such as telomeres or centromeres.

This is a list of topics in molecular biology. See also index of biochemistry articles.

Nuclear DNA (nDNA), or nuclear deoxyribonucleic acid, is the DNA contained within each cell nucleus of a eukaryotic organism. It encodes for the majority of the genome in eukaryotes, with mitochondrial DNA and plastid DNA coding for the rest. It adheres to Mendelian inheritance, with information coming from two parents, one male and one female—rather than matrilineally as in mitochondrial DNA.

<span class="mw-page-title-main">Transfer-messenger RNA</span>

Transfer-messenger RNA is a bacterial RNA molecule with dual tRNA-like and messenger RNA-like properties. The tmRNA forms a ribonucleoprotein complex (tmRNP) together with Small Protein B (SmpB), Elongation Factor Tu (EF-Tu), and ribosomal protein S1. In trans-translation, tmRNA and its associated proteins bind to bacterial ribosomes which have stalled in the middle of protein biosynthesis, for example when reaching the end of a messenger RNA which has lost its stop codon. The tmRNA is remarkably versatile: it recycles the stalled ribosome, adds a proteolysis-inducing tag to the unfinished polypeptide, and facilitates the degradation of the aberrant messenger RNA. In the majority of bacteria these functions are carried out by standard one-piece tmRNAs. In other bacterial species, a permuted ssrA gene produces a two-piece tmRNA in which two separate RNA chains are joined by base-pairing.

<span class="mw-page-title-main">Stem-loop</span> Intramolecular base-pairing pattern in RNA and DNA

Stem-loop intramolecular base pairing is a pattern that can occur in single-stranded RNA. The structure is also known as a hairpin or hairpin loop. It occurs when two regions of the same strand, usually complementary in nucleotide sequence when read in opposite directions, base-pair to form a double helix that ends in an unpaired loop. The resulting structure is a key building block of many RNA secondary structures. As an important secondary structure of RNA, it can direct RNA folding, protect structural stability for messenger RNA (mRNA), provide recognition sites for RNA binding proteins, and serve as a substrate for enzymatic reactions.

<span class="mw-page-title-main">Palindromic sequence</span> DNA or RNA sequence that matches its complement when read backwards

A palindromic sequence is a nucleic acid sequence in a double-stranded DNA or RNA molecule whereby reading in a certain direction on one strand is identical to the sequence in the same direction on the complementary strand. This definition of palindrome thus depends on complementary strands being palindromic of each other.

Ribosomal frameshifting, also known as translational frameshifting or translational recoding, is a biological phenomenon that occurs during translation that results in the production of multiple, unique proteins from a single mRNA. The process can be programmed by the nucleotide sequence of the mRNA and is sometimes affected by the secondary, 3-dimensional mRNA structure. It has been described mainly in viruses, retrotransposons and bacterial insertion elements, and also in some cellular genes.

<span class="mw-page-title-main">Nucleic acid structure</span> Biomolecular structure of nucleic acids such as DNA and RNA

Nucleic acid structure refers to the structure of nucleic acids such as DNA and RNA. Chemically speaking, DNA and RNA are very similar. Nucleic acid structure is often divided into four different levels: primary, secondary, tertiary, and quaternary.

<span class="mw-page-title-main">Nucleic acid secondary structure</span>

Nucleic acid secondary structure is the basepairing interactions within a single nucleic acid polymer or between two polymers. It can be represented as a list of bases which are paired in a nucleic acid molecule. The secondary structures of biological DNAs and RNAs tend to be different: biological DNA mostly exists as fully base paired double helices, while biological RNA is single stranded and often forms complex and intricate base-pairing interactions due to its increased ability to form hydrogen bonds stemming from the extra hydroxyl group in the ribose sugar.

mtDNA control region Non-coding region of the mitochondrial DNA

The mtDNA control region is an area of the mitochondrial genome which is non-coding DNA. This region controls RNA and DNA synthesis. It is the most polymorphic region of the human mtDNA genome, with polymorphism concentrated in hypervariable regions. The average nucleotide diversity in these regions is 1.7%. Despite this variability, an RNA transcript from this region has a conserved secondary structure (pictured) which has been found to be under selective pressure.

<span class="mw-page-title-main">Complementarity (molecular biology)</span> Lock-and-key pairing between two structures

In molecular biology, complementarity describes a relationship between two structures each following the lock-and-key principle. In nature complementarity is the base principle of DNA replication and transcription as it is a property shared between two DNA or RNA sequences, such that when they are aligned antiparallel to each other, the nucleotide bases at each position in the sequences will be complementary, much like looking in the mirror and seeing the reverse of things. This complementary base pairing allows cells to copy information from one generation to another and even find and repair damage to the information stored in the sequences.

<span class="mw-page-title-main">Kissing stem-loop</span>

In genetics, a kissing stem-loop, or kissing stem loop interaction, is formed in ribonucleic acid (RNA) when two bases between two hairpin loops pair. These intra- and intermolecular kissing interactions are important in forming the tertiary or quaternary structure of many RNAs.

<span class="mw-page-title-main">Cruciform DNA</span>

Cruciform DNA is a form of non-B DNA, or an alternative DNA structure. The formation of cruciform DNA requires the presence of palindromes called inverted repeat sequences. These inverted repeats contain a sequence of DNA in one strand that is repeated in the opposite direction on the other strand. As a result, inverted repeats are self-complementary and can give rise to structures such as hairpins and cruciforms. Cruciform DNA structures require at least a six nucleotide sequence of inverted repeats to form a structure consisting of a stem, branch point and loop in the shape of a cruciform, stabilized by negative DNA supercoiling.

Coronavirus genomes are positive-sense single-stranded RNA molecules with an untranslated region (UTR) at the 5′ end which is called the 5′ UTR. The 5′ UTR is responsible for important biological functions, such as viral replication, transcription and packaging. The 5′ UTR has a conserved RNA secondary structure but different Coronavirus genera have different structural features described below.

Coronavirus genomes are positive-sense single-stranded RNA molecules with an untranslated region (UTR) at the 3′ end which is called the 3′ UTR. The 3′ UTR is responsible for important biological functions, such as viral replication. The 3′ UTR has a conserved RNA secondary structure but different Coronavirus genera have different structural features described below.

Rolling hairpin replication (RHR) is a unidirectional, strand displacement form of DNA replication used by parvoviruses, a group of viruses that constitute the family Parvoviridae. Parvoviruses have linear, single-stranded DNA (ssDNA) genomes in which the coding portion of the genome is flanked by telomeres at each end that form hairpin loops. During RHR, these hairpin loops repeatedly unfold and refold to change the direction of DNA replication so that replication progresses in a continuous manner back and forth across the genome. RHR is initiated and terminated by an endonuclease encoded by parvoviruses that is variously called NS1 or Rep, and RHR is similar to rolling circle replication, which is used by ssDNA viruses that have circular genomes.

This glossary of genetics is a list of definitions of terms and concepts commonly used in the study of genetics and related disciplines in biology, including molecular biology, cell biology, and evolutionary biology. It is intended as introductory material for novices; for more specific and technical detail, see the article corresponding to each term. For related terms, see Glossary of evolutionary biology.

References

  1. 1 2 3 4 Ussery, David W.; Wassenaar, Trudy; Borini, Stefano (2008-12-22). "Word Frequencies, Repeats, and Repeat-related Structures in Bacterial Genomes". Computing for Comparative Microbial Genomics: Bioinformatics for Microbiologists. Computational Biology. Vol. 8 (1 ed.). Springer. pp. 133–144. ISBN   978-1-84800-254-8.
  2. Ye, Congting; Ji, Guoli; Liang, Chun (2014). "detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation". PLOS ONE. 9 (11): e113349. Bibcode:2014PLoSO...9k3349Y. doi: 10.1371/journal.pone.0113349 . PMC   4237412 . PMID   25409465.
  3. 1 2 Richards, GR; Richards, RI (Apr 25, 1995). "Simple tandem DNA repeats and human genetic disease". Proceedings of the National Academy of Sciences of the United States of America. 92 (9): 3636–41. Bibcode:1995PNAS...92.3636S. doi: 10.1073/pnas.92.9.3636 . PMC   42017 . PMID   7731957.
  4. van Belkum, A; Scherer, S; van Alphen, L; Verbrugh, H (June 1998). "Short-sequence DNA repeats in prokaryotic genomes". Microbiology and Molecular Biology Reviews. 62 (2): 275–93. doi:10.1128/MMBR.62.2.275-293.1998. PMC   98915 . PMID   9618442.
  5. 1 2 Ramel, C (June 1997). "Mini- and microsatellites". Environmental Health Perspectives. 105 Suppl 4 (Suppl 4): 781–9. doi:10.2307/3433284. JSTOR   3433284. PMC   1470042 . PMID   9255562.
  6. 1 2 Eichler, EE (August 1998). "Masquerading repeats: paralogous pitfalls of the human genome". Genome Research. 8 (8): 758–62. doi: 10.1101/gr.8.8.758 . PMID   9724321.
  7. 1 2 3 Mirkin, I; Narayanan, V; Lobachev, KS; Mirkin, SM (Jul 22, 2008). "Replication stalling at unstable inverted repeats: interplay between DNA hairpins and fork stabilizing proteins". Proceedings of the National Academy of Sciences of the United States of America. 105 (29): 9936–41. Bibcode:2008PNAS..105.9936V. doi: 10.1073/pnas.0804510105 . PMC   2481305 . PMID   18632578.
  8. Lin, CT; Lin, WH; Lyu, YL; Whang-Peng, J (Sep 1, 2001). "Inverted repeats as genetic elements for promoting DNA inverted duplication: implications in gene amplification". Nucleic Acids Research. 29 (17): 3529–38. doi:10.1093/nar/29.17.3529. PMC   55881 . PMID   11522822.
  9. 1 2 3 4 5 6 7 8 Bissler, JJ (Mar 27, 1998). "DNA inverted repeats and human disease" (PDF). Frontiers in Bioscience. 3 (4): d408–18. doi:10.2741/a284. PMID   9516381. S2CID   12982. Archived from the original (PDF) on March 3, 2019.
  10. 1 2 3 4 5 6 7 School, James D. Watson, Cold Spring Harbor Laboratory, Tania A. Baker, Massachusetts Institute of Technology, Stephen P. Bell, Massachusetts Institute of Technology, Alexander Gann, Cold Spring Harbor Laboratory, Michael Levine, University of California, Berkeley, Richard Losik, Harvard University; with Stephen C. Harrison, Harvard Medical (2014). Molecular biology of the gene (Seventh ed.). Boston: Benjamin-Cummings Publishing Company. ISBN   9780321762436.
  11. 1 2 Achaz, G; Coissac, E; Netter, P; Rocha, EP (August 2003). "Associations between inverted repeats and the structural evolution of bacterial genomes". Genetics. 164 (4): 1279–89. doi:10.1093/genetics/164.4.1279. PMC   1462642 . PMID   12930739.
  12. Zhang, HH; Xu, HE; Shen, YH; Han, MJ; Zhang, Z (January 2013). "The Origin and Evolution of Six Miniature Inverted-Repeat Transposable Elements in Bombyx mori and Rhodnius prolixus". Genome Biology and Evolution. 5 (11): 2020–31. doi:10.1093/gbe/evt153. PMC   3845634 . PMID   24115603.
  13. 1 2 Pearson, CE; Zorbas, H; Price, GB; Zannis-Hadjopoulos, M (October 1996). "Inverted repeats, stem-loops, and cruciforms: significance for initiation of DNA replication". Journal of Cellular Biochemistry. 63 (1): 1–22. doi:10.1002/(SICI)1097-4644(199610)63:1<1::AID-JCB1>3.0.CO;2-3. PMID   8891900. S2CID   22204780.
  14. 1 2 3 Heringa, J (June 1998). "Detection of internal repeats: how common are they?". Current Opinion in Structural Biology. 8 (3): 338–45. doi:10.1016/S0959-440X(98)80068-7. PMID   9666330.
  15. Treangen, TJ; Abraham, AL; Touchon, M; Rocha, EP (May 2009). "Genesis, effects and fates of repeats in prokaryotic genomes" (PDF). FEMS Microbiology Reviews. 33 (3): 539–71. doi: 10.1111/j.1574-6976.2009.00169.x . PMID   19396957.
  16. Ladoukakis, ED; Eyre-Walker, A (September 2008). "The excess of small inverted repeats in prokaryotes" (PDF). Journal of Molecular Evolution. 67 (3): 291–300. Bibcode:2008JMolE..67..291L. CiteSeerX   10.1.1.578.7466 . doi:10.1007/s00239-008-9151-z. PMID   18696026. S2CID   29953202.
  17. Hosseini, M; Pratas, D; Pinho, AJ (2017). "On the Role of Inverted Repeats in DNA Sequence Similarity". 11th International Conference on Practical Applications of Computational Biology & Bioinformatics. Advances in Intelligent Systems and Computing. Vol. 616. Springer. pp. 228–236. doi:10.1007/978-3-319-60816-7_28. ISBN   978-3-319-60815-0.
  18. 1 2 Staple, DW; Butcher, SE (June 2005). "Pseudoknots: RNA structures with diverse functions". PLOS Biology. 3 (6): e213. doi:10.1371/journal.pbio.0030213. PMC   1149493 . PMID   15941360. Open Access logo PLoS transparent.svg
  19. 1 2 Chen, JL; Greider, CW (Jun 7, 2005). "Functional analysis of the pseudoknot structure in human telomerase RNA". Proceedings of the National Academy of Sciences of the United States of America. 102 (23): 8080–5, discussion 8077–9. Bibcode:2005PNAS..102.8080C. doi: 10.1073/pnas.0502259102 . PMC   1149427 . PMID   15849264.
  20. Winkler, WC; Cohen-Chalamish, S; Breaker, RR (Dec 10, 2002). "An mRNA structure that controls gene expression by binding FMN". Proceedings of the National Academy of Sciences of the United States of America. 99 (25): 15908–13. Bibcode:2002PNAS...9915908W. doi: 10.1073/pnas.212628899 . PMC   138538 . PMID   12456892.
  21. 1 2 3 4 Stormo, G; Chang, KY; Varley, K; Stormo, GD (Feb 28, 2007). Hall, Neil (ed.). "Evidence for active maintenance of inverted repeat structures identified by a comparative genomic approach". PLOS ONE. 2 (2): e262. Bibcode:2007PLoSO...2..262Z. doi: 10.1371/journal.pone.0000262 . PMC   1803023 . PMID   17327921. Open Access logo PLoS transparent.svg
  22. Ramreddy, T; Sachidanandam, R; Strick, TR (May 2011). "Real-time detection of cruciform extrusion by single-molecule DNA nanomanipulation". Nucleic Acids Research. 39 (10): 4275–83. doi:10.1093/nar/gkr008. PMC   3105387 . PMID   21266478.
  23. 1 2 Cer, RZ; Donohue, DE; Mudunuri, US; Temiz, NA; Loss, MA; Starner, NJ; Halusa, GN; Volfovsky, N; Yi, M; Luke, BT; Bacolla, A; Collins, JR; Stephens, RM (January 2013). "Non-B DB v2.0: a database of predicted non-B DNA-forming motifs and its associated tools". Nucleic Acids Research. 41 (Database issue): D94–D100. doi:10.1093/nar/gks955. PMC   3531222 . PMID   23125372.
  24. Gelfand, Y; Rodriguez, A; Benson, G (January 2007). "TRDB--the Tandem Repeats Database". Nucleic Acids Research. 35 (Database issue): D80–7. doi:10.1093/nar/gkl1013. PMC   1781109 . PMID   17175540.
  25. Chen, J; Hu, Q; Zhang, Y; Lu, C; Kuang, H (Oct 29, 2013). "P-MITE: a database for plant miniature inverted-repeat transposable elements". Nucleic Acids Research. 42 (1): D1176–81. doi:10.1093/nar/gkt1000. PMC   3964958 . PMID   24174541.
  26. 1 2 3 Rice, P; Longden, I; Bleasby, A (June 2000). "EMBOSS: the European Molecular Biology Open Software Suite". Trends in Genetics. 16 (6): 276–7. doi:10.1016/S0168-9525(00)02024-2. PMID   10827456.