Slipped strand mispairing

Last updated
Schematic representation of the slippage process at a replication fork.png

Slipped strand mispairing (SSM, also known as replication slippage) is a mutation process which occurs during DNA replication. It involves denaturation and displacement of the DNA strands, resulting in mispairing of the complementary bases. Slipped strand mispairing is one explanation for the origin and evolution of repetitive DNA sequences. [1]

Contents

It is a form of mutation that leads to either a trinucleotide or dinucleotide expansion, or sometimes contraction, during DNA replication. [2] A slippage event normally occurs when a sequence of repetitive nucleotides (tandem repeats) are found at the site of replication. Tandem repeats are unstable regions of the genome where frequent insertions and deletions of nucleotides can take place, resulting in genome rearrangements. [3] DNA polymerase, the main enzyme to catalyze the polymerization of free deoxyribonucleotides into a newly forming DNA strand, plays a significant role in the occurrence of this mutation. When DNA polymerase encounters a direct repeat, it can undergo a replication slippage. [4]

Strand slippage may also occur during the DNA synthesis step of DNA repair processes. Within DNA trinucleotide repeat sequences, the repair of DNA damage by the processes of homologous recombination, non-homologous end joining, DNA mismatch repair or base excision repair may involve strand slippage mispairing leading to trinucleotide repeat expansion when the repair is completed. [5]

Slipped strand mispairing has also been shown to function as a phase variation mechanism in certain bacteria. [6]

Stages

Slippage occurs through five main stages:

  1. In the first step, DNA polymerase encounters the direct repeat during the replication process.
  2. The polymerase complex suspends replication and is temporarily released from the template strand.
  3. The newly synthesized strand then detaches from the template strand and pairs with another direct repeat upstream.
  4. DNA polymerase reassembles its position on the template strand and resumes normal replication, but during the course of reassembling, the polymerase complex backtracks and repeats the insertion of deoxyribonucleotides that were previously added. This results in some repeats found in the template strand being replicated twice into the daughter strand. This expands the replication region with newly inserted nucleotides. The template and the daughter strand can no longer pair correctly. [4]
  5. Nucleotide excision repair proteins are mobilized to this area where one likely outcome is the expansion of nucleotides in the template strand while the other is the absence of nucleotides. Although trinucleotide contraction is possible, trinucleotide expansion occurs more frequently. [2]

Effects

Tandem repeats (the main influence for slippage replication) can be found in coding and non-coding regions. If these repeats are found in coding regions then the variations to the polynucleotide sequence can result in the formation of abnormal proteins in eukaryotes. Many human diseases have been reported to be associated with trinucleotide repeat expansions including Huntington's disease. [7] The HD gene [8] is found in all human genomes. In the event that a slippage event occurs there can be a large expansion in the tandem repeats of the HD gene. [8] An individual who is not affected by Huntington’s disease will have 6-35 tandem repeats at the HD locus. However, an affected individual will have 36- 121 repeats present. [7] The expansion of the HD locus results in a dysfunctional protein leading to Huntington’s disease.

Disease associations

Huntington disease is normally progressive and results in movement, cognitive and psychiatric disorders. These disorders can lead to a severe impact on an individual’s daily activities, making it hard for proper communication and independent actions to take place. [9] Replication slippage can also lead to other neurodegenerative diseases in humans. These include spinal and bulbar muscular atrophy ( trinucleotide expansion in the AR gene), dentatorubral–pallidoluysian atrophy ( trinucleotide expansion in the DRPLA gene), spinocerebellar ataxia type 1 ( trinucleotide expansion in the SCA1gene), Machado-Joseph disease ( trinucleotide expansion in the SCA3 gene), myotonic dystrophy ( trinucleotide expansion in the DMPK gene), and Friedreich's ataxia ( a trinuncleotide expansion in the X25 gene). [7] Therefore, replication slippage leads to a form of trinucleotide expansion which results in serious changes to protein structure.

Self-acceleration

SSM events can result in either insertions or deletions. Insertions are thought to be self-accelerating: as repeats grow longer, the probability of subsequent mispairing events increases. Insertions can expand simple tandem repeats by one or more units. In long repeats, expansions may involve two or more units. For example, insertion of a single repeat unit in GAGAGA expands the sequence to GAGAGAGA, while insertion of two repeat units in [GA]6 would produce [GA]8. Genomic regions with a high proportion of repeated DNA sequences (tandem repeats, microsatellites) are prone to strand slippage during DNA replication and DNA repair.

Trinucleotide repeat expansion is a cause of a number of human diseases including fragile X syndrome, Huntington’s disease, several spinocerebellar ataxias, myotonic dystrophy and Friedrich ataxia. [5]

Evolution of diverse adjacent repeats

The combination of SSM events with point mutation is thought to account for the evolution of more complex repeat units. Mutations followed by expansion would result in the formation of new types of adjacent short tandem repeat units. For example, a transversion could change the simple two- base repeat [GA]10 to [GA]4GATA[GA]2. This could then be expanded to[GA]4[GATA]3[GA]2 by two subsequent SSM events. Simple repetitive DNA sequences containing a variety of adjacent short tandem repeats are commonly observed in non-protein coding regions of eukaryotic genomes.

Related Research Articles

A microsatellite is a tract of repetitive DNA in which certain DNA motifs are repeated, typically 5–50 times. Microsatellites occur at thousands of locations within an organism's genome. They have a higher mutation rate than other areas of DNA leading to high genetic diversity. Microsatellites are often referred to as short tandem repeats (STRs) by forensic geneticists and in genetic genealogy, or as simple sequence repeats (SSRs) by plant geneticists.

An inverted repeat is a single stranded sequence of nucleotides followed downstream by its reverse complement. The intervening sequence of nucleotides between the initial sequence and the reverse complement can be any length including zero. For example, 5'---TTACGnnnnnnCGTAA---3' is an inverted repeat sequence. When the intervening length is zero, the composite sequence is a palindromic sequence.

Tandem repeats occur in DNA when a pattern of one or more nucleotides is repeated and the repetitions are directly adjacent to each other. Several protein domains also form tandem repeats within their amino acid primary structure, such as armadillo repeats. However, in proteins, perfect tandem repeats are unlikely in most in vivo proteins, and most known repeats are in proteins which have been designed.

<span class="mw-page-title-main">Ribosomal DNA</span>

Ribosomal DNA (rDNA) is a DNA sequence that codes for ribosomal RNA. These sequences regulate transcription initiation and amplification, and contain both transcribed and non-transcribed spacer segments.

Repeated sequences are short or long patterns of nucleic acids that occur in multiple copies throughout the genome. In many organisms, a significant fraction of the genomic DNA is repetitive, with over two-thirds of the sequence consisting of repetitive elements in humans. Some of these repeated sequences are necessary for maintaining important genome structures such as telomeres or centromeres.

<span class="mw-page-title-main">Frameshift mutation</span> Mutation that shifts codon alignment

A frameshift mutation is a genetic mutation caused by indels of a number of nucleotides in a DNA sequence that is not divisible by three. Due to the triplet nature of gene expression by codons, the insertion or deletion can change the reading frame, resulting in a completely different translation from the original. The earlier in the sequence the deletion or insertion occurs, the more altered the protein. A frameshift mutation is not the same as a single-nucleotide polymorphism in which a nucleotide is replaced, rather than inserted or deleted. A frameshift mutation will in general cause the reading of the codons after the mutation to code for different amino acids. The frameshift mutation will also alter the first stop codon encountered in the sequence. The polypeptide being created could be abnormally short or abnormally long, and will most likely not be functional.

<span class="mw-page-title-main">DNA repair</span> Cellular mechanism

DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA damage, resulting in tens of thousands of individual molecular lesions per cell per day. Many of these lesions cause structural damage to the DNA molecule and can alter or eliminate the cell's ability to transcribe the gene that the affected DNA encodes. Other lesions induce potentially harmful mutations in the cell's genome, which affect the survival of its daughter cells after it undergoes mitosis. As a consequence, the DNA repair process is constantly active as it responds to damage in the DNA structure. When normal repair processes fail, and when cellular apoptosis does not occur, irreparable DNA damage may occur, including double-strand breaks and DNA crosslinkages. This can eventually lead to malignant tumors, or cancer as per the two hit hypothesis.

In genetics, anticipation is a phenomenon whereby as a genetic disorder is passed on to the next generation, the symptoms of the genetic disorder become apparent at an earlier age with each generation. In most cases, an increase in the severity of symptoms is also noted. Anticipation is common in trinucleotide repeat disorders, such as Huntington's disease and myotonic dystrophy, where a dynamic mutation in DNA occurs. All of these diseases have neurological symptoms. Prior to the understanding of the genetic mechanism for anticipation, it was debated whether anticipation was a true biological phenomenon or whether the earlier age of diagnosis was related to heightened awareness of disease symptoms within a family.

<span class="mw-page-title-main">Germline mutation</span> Inherited genetic variation

A germline mutation, or germinal mutation, is any detectable variation within germ cells. Mutations in these cells are the only mutations that can be passed on to offspring, when either a mutated sperm or oocyte come together to form a zygote. After this fertilization event occurs, germ cells divide rapidly to produce all of the cells in the body, causing this mutation to be present in every somatic and germline cell in the offspring; this is also known as a constitutional mutation. Germline mutation is distinct from somatic mutation.

This is a list of topics in molecular biology. See also index of biochemistry articles.

Nuclear DNA (nDNA), or nuclear deoxyribonucleic acid, is the DNA contained within each cell nucleus of a eukaryotic organism. It encodes for the majority of the genome in eukaryotes, with mitochondrial DNA and plastid DNA coding for the rest. It adheres to Mendelian inheritance, with information coming from two parents, one male and one female—rather than matrilineally as in mitochondrial DNA.

<span class="mw-page-title-main">Insertion (genetics)</span> Type of mutation

In genetics, an insertion is the addition of one or more nucleotide base pairs into a DNA sequence. This can often happen in microsatellite regions due to the DNA polymerase slipping. Insertions can be anywhere in size from one base pair incorrectly inserted into a DNA sequence to a section of one chromosome inserted into another. The mechanism of the smallest single base insertion mutations is believed to be through base-pair separation between the template and primer strands followed by non-neighbor base stacking, which can occur locally within the DNA polymerase active site. On a chromosome level, an insertion refers to the insertion of a larger sequence into a chromosome. This can happen due to unequal crossover during meiosis.

Trinucleotide repeat disorders, also known as microsatellite expansion diseases, are a set of over 50 genetic disorders caused by trinucleotide repeat expansion, a kind of mutation in which repeats of three nucleotides increase in copy numbers until they cross a threshold above which they become unstable. Depending on its location, the unstable trinucleotide repeat may cause defects in a protein encoded by a gene; change the regulation of gene expression; produce a toxic RNA, or lead to chromosome instability. In general, the larger the expansion the faster the onset of disease, and the more severe the disease becomes.

<span class="mw-page-title-main">Copy number variation</span> Repeated DNA variation between individuals

Copy number variation (CNV) is a phenomenon in which sections of the genome are repeated and the number of repeats in the genome varies between individuals. Copy number variation is a type of structural variation: specifically, it is a type of duplication or deletion event that affects a considerable number of base pairs. Approximately two-thirds of the entire human genome may be composed of repeats and 4.8–9.5% of the human genome can be classified as copy number variations. In mammals, copy number variations play an important role in generating necessary variation in the population as well as disease phenotype.

A trinucleotide repeat expansion, also known as a triplet repeat expansion, is the DNA mutation responsible for causing any type of disorder categorized as a trinucleotide repeat disorder. These are labelled in dynamical genetics as dynamic mutations. Triplet expansion is caused by slippage during DNA replication, also known as "copy choice" DNA replication. Due to the repetitive nature of the DNA sequence in these regions, 'loop out' structures may form during DNA replication while maintaining complementary base pairing between the parent strand and daughter strand being synthesized. If the loop out structure is formed from the sequence on the daughter strand this will result in an increase in the number of repeats. However, if the loop out structure is formed on the parent strand, a decrease in the number of repeats occurs. It appears that expansion of these repeats is more common than reduction. Generally, the larger the expansion the more likely they are to cause disease or increase the severity of disease. Other proposed mechanisms for expansion and reduction involve the interaction of RNA and DNA molecules.

<span class="mw-page-title-main">Microsatellite instability</span> Condition of genetic hypermutability

Microsatellite instability (MSI) is the condition of genetic hypermutability that results from impaired DNA mismatch repair (MMR). The presence of MSI represents phenotypic evidence that MMR is not functioning normally.

In genetics, a dynamic mutation is an unstable heritable element where the probability of expression of a mutant phenotype is a function of the number of copies of the mutation. That is, the replication product (progeny) of a dynamic mutation has a different likelihood of mutation than its predecessor. These mutations, typically short sequences repeated many times, give rise to numerous known diseases, including the trinucleotide repeat disorders.

A polyglutamine tract or polyQ tract is a portion of a protein consisting of a sequence of several glutamine units. A tract typically consists of about 10 to a few hundred such units.

Genome instability refers to a high frequency of mutations within the genome of a cellular lineage. These mutations can include changes in nucleic acid sequences, chromosomal rearrangements or aneuploidy. Genome instability does occur in bacteria. In multicellular organisms genome instability is central to carcinogenesis, and in humans it is also a factor in some neurodegenerative diseases such as amyotrophic lateral sclerosis or the neuromuscular disease myotonic dystrophy.

This glossary of genetics is a list of definitions of terms and concepts commonly used in the study of genetics and related disciplines in biology, including molecular biology, cell biology, and evolutionary biology. It is intended as introductory material for novices; for more specific and technical detail, see the article corresponding to each term. For related terms, see Glossary of evolutionary biology.

References

  1. Levinson G, Gutman GA (May 1987). "Slipped-strand mispairing: a major mechanism for DNA sequence evolution". Mol. Biol. Evol. 4 (3): 203–21. doi: 10.1093/oxfordjournals.molbev.a040442 . PMID   3328815.
  2. 1 2 Hartl L.D and Ruvolo M, 2012, Genetic Analysis of Genes and Genomes, Jones & Bartlett Learning, Burlington, pg. 529
  3. Lovett, S.T.; Drapkin, P.T.; Sutera, V.A. Jr.; Gluckman-Peskind, T.J. (1993). "A sister-strand exchange mechanism for recA-independent deletion of repeated DNA sequences in Escherichia coli". Genetics. 135 (3): 631–642. doi:10.1093/genetics/135.3.631. PMC   1205708 . PMID   8293969.
  4. 1 2 Viguera, E; Canceill, D; Ehrlich, SD. (2001). "Replication slippage involves DNA polymerase pausing and dissociation". The EMBO Journal. 20 (10): 2587–2595. doi:10.1093/emboj/20.10.2587. PMC   125466 . PMID   11350948.
  5. 1 2 Usdin K, House NC, Freudenreich CH (2015). "Repeat instability during DNA repair: Insights from model systems". Crit. Rev. Biochem. Mol. Biol. 50 (2): 142–67. doi:10.3109/10409238.2014.999192. PMC   4454471 . PMID   25608779.
  6. Torres-Cruz J, van der Woude MW (December 2003). "Slipped-strand mispairing can function as a phase variation mechanism in Escherichia coli". J. Bacteriol. 185 (23): 6990–4. doi:10.1128/jb.185.23.6990-6994.2003. PMC   262711 . PMID   14617664.
  7. 1 2 3 Brown TA. Genomes. 2nd edition. Oxford: Wiley-Liss; 2002. Chapter 14, Mutation, Repair and Recombination. Available from: https://www.ncbi.nlm.nih.gov/books/NBK21114/ Accessed November 3, 2012
  8. 1 2 Petruska J, Hartenstine MJ, Goodman MF (February 1998). "Analysis of strand slippage in DNA polymerase expansions of CAG/CTG triplet repeats associated with neurodegenerative disease". J. Biol. Chem. 273 (9): 5204–10. doi: 10.1074/jbc.273.9.5204 . PMID   9478975.
  9. "Stages of HD". Archived from the original on 2013-11-01. Retrieved 2013-10-30. Huntington's Disease

Further reading