Non-homologous end joining (NHEJ) is a pathway that repairs double-strand breaks in DNA. It is called "non-homologous" because the break ends are directly ligated without the need for a homologous template, in contrast to homology directed repair (HDR), which requires a homologous sequence to guide repair. NHEJ is active in both non-dividing and proliferating cells, while HDR is not readily accessible in non-dividing cells. [1] The term "non-homologous end joining" was coined in 1996 by Moore and Haber. [2]
NHEJ is typically guided by short homologous DNA sequences called microhomologies. These microhomologies are often present in single-stranded overhangs on the ends of double-strand breaks. When the overhangs are perfectly compatible, NHEJ usually repairs the break accurately. [2] [3] [4] [5] Imprecise repair leading to loss of nucleotides can also occur, but is much more common when the overhangs are not compatible. Inappropriate NHEJ can lead to translocations and telomere fusion, hallmarks of tumor cells. [6]
NHEJ implementations are understood to have been existent throughout nearly all biological systems and it is the predominant double-strand break repair pathway in mammalian cells. [7] In budding yeast ( Saccharomyces cerevisiae ), however, homologous recombination dominates when the organism is grown under common laboratory conditions.
When the NHEJ pathway is inactivated, double-strand breaks can be repaired by a more error-prone pathway called microhomology-mediated end joining (MMEJ). In this pathway, end resection reveals short microhomologies on either side of the break, which are then aligned to guide repair. [8] This contrasts with classical NHEJ, which typically uses microhomologies already exposed in single-stranded overhangs on the DSB ends. Repair by MMEJ therefore leads to deletion of the DNA sequence between the microhomologies.
Many species of bacteria, including Escherichia coli , lack an end joining pathway and thus rely completely on homologous recombination to repair double-strand breaks. NHEJ proteins have been identified in a number of bacteria, including Bacillus subtilis , Mycobacterium tuberculosis , and Mycobacterium smegmatis . [9] [10] Bacteria utilize a remarkably compact version of NHEJ in which all of the required activities are contained in only two proteins: a Ku homodimer and the multifunctional ligase/polymerase/nuclease LigD. [11] In mycobacteria, NHEJ is much more error prone than in yeast, with bases often added to and deleted from the ends of double-strand breaks during repair. [10] Many of the bacteria that possess NHEJ proteins spend a significant portion of their life cycle in a stationary haploid phase, in which a template for recombination is not available. [9] NHEJ may have evolved to help these organisms survive DSBs induced during desiccation. It preferentially use rNTPs (RNA nucleotides), possibly advantageous in dormant cells. [12]
The archaeal NHEJ system in Methanocella paludicola have a homodimeric Ku, but the three functions of LigD are broken up into three single-domain proteins sharing an operon. All three genes retain substantial homology with their LigD counterparts and the polymerase retains the preference for rNTP. [13] NHEJ has been lost and acquired multiple times in bacteria and archaea, with a significant amount of horizontal gene transfer shuffling the system around taxa. [14]
Corndog and Omega, two related mycobacteriophages of Mycobacterium smegmatis, also encode Ku homologs and exploit the NHEJ pathway to recircularize their genomes during infection. [15] Unlike homologous recombination, which has been studied extensively in bacteria, NHEJ was originally discovered in eukaryotes and was only identified in prokaryotes in the past decade.
In contrast to bacteria, NHEJ in eukaryotes utilizes a number of proteins, which participate in the following steps:
In yeast, the Mre11-Rad50-Xrs2 (MRX) complex is recruited to DSBs early and is thought to promote bridging of the DNA ends. [16] The corresponding mammalian complex of Mre11-Rad50-Nbs1 (MRN) is also involved in NHEJ, but it may function at multiple steps in the pathway beyond simply holding the ends in proximity. [17] DNA-PKcs is also thought to participate in end bridging during mammalian NHEJ. [18]
Eukaryotic Ku is a heterodimer consisting of Ku70 and Ku80, and forms a complex with DNA-PKcs, which is present in mammals but absent in yeast. Ku is a basket-shaped molecule that slides onto the DNA end and translocates inward. Ku may function as a docking site for other NHEJ proteins, and is known to interact with the DNA ligase IV complex and XLF. [19] [20]
End processing involves removal of damaged or mismatched nucleotides by nucleases and resynthesis by DNA polymerases. This step is not necessary if the ends are already compatible and have 3' hydroxyl and 5' phosphate termini.
Little is known about the function of nucleases in NHEJ. Artemis is required for opening the hairpins that are formed on DNA ends during V(D)J recombination, a specific type of NHEJ, and may also participate in end trimming during general NHEJ. [21] Mre11 has nuclease activity, but it seems to be involved in homologous recombination, not NHEJ.
The X family DNA polymerases Pol λ and Pol μ (Pol4 in yeast) fill gaps during NHEJ. [4] [22] [23] Yeast lacking Pol4 are unable to join 3' overhangs that require gap filling, but remain proficient for gap filling at 5' overhangs. [24] This is because the primer terminus used to initiate DNA synthesis is less stable at 3' overhangs, necessitating a specialized NHEJ polymerase.
The DNA ligase IV complex, consisting of the catalytic subunit DNA ligase IV and its cofactor XRCC4 (Dnl4 and Lif1 in yeast), performs the ligation step of repair. [25] XLF, also known as Cernunnos, is homologous to yeast Nej1 and is also required for NHEJ. [26] [27] While the precise role of XLF is unknown, it interacts with the XRCC4/DNA ligase IV complex and likely participates in the ligation step. [28] Recent evidence suggests that XLF promotes re-adenylation of DNA ligase IV after ligation, recharging the ligase and allowing it to catalyze a second ligation. [29]
In yeast, Sir2 was originally identified as an NHEJ protein, but is now known to be required for NHEJ only because it is required for the transcription of Nej1. [30]
NHEJ and heat-labile sites
Induction of heat-labile sites (HLS) is a signature of ionizing radiation. The DNA clustered damage sites consist of different types of DNA lesions. Some of these lesions are not prompt DSBs but they convert to DSB after heating. HLS are not evolved to DSB under physiological temperature (37 C). Also, the interaction of HLS with other lesions and their role in living cells is yet elusive. The repair mechanisms of these sites are not fully revealed. The NHEJ is the dominant DNA repair pathway throughout the cell cycle. The DNA-PKcs protein is the critical element in the center of NHEJ. Using DNA-PKcs KO cell lines or inhibition of DNA-PKcs does not affect the repair capacity of HLS. Also blocking both HR and NHEJ repair pathways by dactolisib (NVP-BEZ235) inhibitor showed that repair of HLS is not dependent on HR and NHEJ. These results showed that the repair mechanism of HLS is independent of NHEJ and HR pathways [31]
The choice between NHEJ and homologous recombination for repair of a double-strand break is regulated at the initial step in recombination, 5' end resection. In this step, the 5' strand of the break is degraded by nucleases to create long 3' single-stranded tails. DSBs that have not been resected can be rejoined by NHEJ, but resection of even a few nucleotides strongly inhibits NHEJ and effectively commits the break to repair by recombination. [23] NHEJ is active throughout the cell cycle, but is most important during G1 when no homologous template for recombination is available. This regulation is accomplished by the cyclin-dependent kinase Cdk1 (Cdc28 in yeast), which is turned off in G1 and expressed in S and G2. Cdk1 phosphorylates the nuclease Sae2, allowing resection to initiate. [32]
NHEJ plays a critical role in V(D)J recombination, the process by which B-cell and T-cell receptor diversity is generated in the vertebrate immune system. [33] In V(D)J recombination, hairpin-capped double-strand breaks are created by the RAG1/RAG2 nuclease, which cleaves the DNA at recombination signal sequences. [34] These hairpins are then opened by the Artemis nuclease and joined by NHEJ. [21] A specialized DNA polymerase called terminal deoxynucleotidyl transferase (TdT), which is only expressed in lymph tissue, adds nontemplated nucleotides to the ends before the break is joined. [35] [36] This process couples "variable" (V), "diversity" (D), and "joining" (J) regions, which when assembled together create the variable region of a B-cell or T-cell receptor gene. Unlike typical cellular NHEJ, in which accurate repair is the most favorable outcome, error-prone repair in V(D)J recombination is beneficial in that it maximizes diversity in the coding sequence of these genes. Patients with mutations in NHEJ genes are unable to produce functional B cells and T cells and suffer from severe combined immunodeficiency (SCID). [37]
Telomeres are normally protected by a "cap" that prevents them from being recognized as double-strand breaks. Loss of capping proteins causes telomere shortening and inappropriate joining by NHEJ, producing dicentric chromosomes which are then pulled apart during mitosis. Paradoxically, some NHEJ proteins are involved in telomere capping. For example, Ku localizes to telomeres and its deletion leads to shortened telomeres. [38] Ku is also required for subtelomeric silencing, the process by which genes located near telomeres are turned off.[ citation needed ]
Several human syndromes are associated with dysfunctional NHEJ. [39] Hypomorphic mutations in LIG4 and XLF cause LIG4 syndrome and XLF-SCID, respectively. These syndromes share many features including cellular radiosensitivity, microcephaly and severe combined immunodeficiency (SCID) due to defective V(D)J recombination. Loss-of-function mutations in Artemis also cause SCID, but these patients do not show the neurological defects associated with LIG4 or XLF mutations. The difference in severity may be explained by the roles of the mutated proteins. Artemis is a nuclease and is thought to be required only for repair of DSBs with damaged ends, whereas DNA Ligase IV and XLF are required for all NHEJ events. Mutations in genes that participate in non-homologous end joining lead to ataxia-telangiectasia (ATM gene), Fanconi anemia (multiple genes), as well as hereditary breast and ovarian cancers (BRCA1 gene).[ citation needed ]
Many NHEJ genes have been knocked out in mice. Deletion of XRCC4 or LIG4 causes embryonic lethality in mice, indicating that NHEJ is essential for viability in mammals. In contrast, mice lacking Ku or DNA-PKcs are viable, probably because low levels of end joining can still occur in the absence of these components. [40] All NHEJ mutant mice show a SCID phenotype, sensitivity to ionizing radiation, and neuronal apoptosis.[ citation needed ]
A system was developed for measuring NHEJ efficiency in the mouse. [41] NHEJ efficiency could be compared across tissues of the same mouse and in mice of different age. Efficiency was higher in the skin, lung and kidney fibroblasts, and lower in heart fibroblasts and brain astrocytes. Furthermore, NHEJ efficiency declined with age. The decline was 1.8 to 3.8-fold, depending on the tissue, in the 5-month-old compared to the 24-month-old mice. Reduced capability for NHEJ can lead to an increase in the number of unrepaired or faultily repaired DNA double-strand breaks that may then contribute to aging. [42] (Also see DNA damage theory of aging.) An analysis of the level of NHEJ protein Ku80 in human, cow, and mouse indicated that Ku80 levels vary dramatically between species, and that these levels are strongly correlated with species longevity. [43]
DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA damage, resulting in tens of thousands of individual molecular lesions per cell per day. Many of these lesions cause structural damage to the DNA molecule and can alter or eliminate the cell's ability to transcribe the gene that the affected DNA encodes. Other lesions induce potentially harmful mutations in the cell's genome, which affect the survival of its daughter cells after it undergoes mitosis. As a consequence, the DNA repair process is constantly active as it responds to damage in the DNA structure. When normal repair processes fail, and when cellular apoptosis does not occur, irreparable DNA damage may occur. This can eventually lead to malignant tumors, or cancer as per the two-hit hypothesis.
RecQ helicase is a family of helicase enzymes initially found in Escherichia coli that has been shown to be important in genome maintenance. They function through catalyzing the reaction ATP + H2O → ADP + P and thus driving the unwinding of paired DNA and translocating in the 3' to 5' direction. These enzymes can also drive the reaction NTP + H2O → NDP + P to drive the unwinding of either DNA or RNA.
Mycobacterium smegmatis is an acid-fast bacterial species in the phylum Actinomycetota and the genus Mycobacterium. It is 3.0 to 5.0 μm long with a bacillus shape and can be stained by Ziehl–Neelsen method and the auramine-rhodamine fluorescent method. It was first reported in November 1884, who found a bacillus with the staining appearance of tubercle bacilli in syphilitic chancres. Subsequent to this, Alvarez and Tavel found organisms similar to that described by Lustgarten also in normal genital secretions (smegma). This organism was later named M. smegmatis.
Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids.
V(D)J recombination is the mechanism of somatic recombination that occurs only in developing lymphocytes during the early stages of T and B cell maturation. It results in the highly diverse repertoire of antibodies/immunoglobulins and T cell receptors (TCRs) found in B cells and T cells, respectively. The process is a defining feature of the adaptive immune system.
Ku is a dimeric protein complex that binds to DNA double-strand break ends and is required for the non-homologous end joining (NHEJ) pathway of DNA repair. Ku is evolutionarily conserved from bacteria to humans. The ancestral bacterial Ku is a homodimer. Eukaryotic Ku is a heterodimer of two polypeptides, Ku70 (XRCC6) and Ku80 (XRCC5), so named because the molecular weight of the human Ku proteins is around 70 kDa and 80 kDa. The two Ku subunits form a basket-shaped structure that threads onto the DNA end. Once bound, Ku can slide down the DNA strand, allowing more Ku molecules to thread onto the end. In higher eukaryotes, Ku forms a complex with the DNA-dependent protein kinase catalytic subunit (DNA-PKcs) to form the full DNA-dependent protein kinase, DNA-PK. Ku is thought to function as a molecular scaffold to which other proteins involved in NHEJ can bind, orienting the double-strand break for ligation.
DNA repair protein XRCC4 (hXRCC4) also known as X-ray repair cross-complementing protein 4 is a protein that in humans is encoded by the XRCC4 gene. XRCC4 is also expressed in many other animals, fungi and plants. hXRCC4 is one of several core proteins involved in the non-homologous end joining (NHEJ) pathway to repair DNA double strand breaks (DSBs).
Double-strand break repair protein MRE11 is an enzyme that in humans is encoded by the MRE11 gene. The gene has been designated MRE11A to distinguish it from the pseudogene MRE11B that is nowadays named MRE11P1.
DNA ligase 4 also DNA ligase IV, is an enzyme that in humans is encoded by the LIG4 gene.
DNA polymerase lambda, also known as Pol λ, is an enzyme found in all eukaryotes. In humans, it is encoded by the POLL gene.
DNA polymerase mu is a polymerase enzyme found in eukaryotes. In humans, this protein is encoded by the POLM gene.
Non-homologous end-joining factor 1 (NHEJ1), also known as Cernunnos or XRCC4-like factor (XLF), is a protein that in humans is encoded by the NHEJ1 gene. XLF was originally discovered as the protein mutated in five patients with growth retardation, microcephaly, and immunodeficiency. The protein is required for the non-homologous end joining (NHEJ) pathway of DNA repair. Patients with XLF mutations also have immunodeficiency due to a defect in V(D)J recombination, which uses NHEJ to generate diversity in the antibody repertoire of the immune system. XLF interacts with DNA ligase IV and XRCC4 and is thought to be involved in the end-bridging or ligation steps of NHEJ. The yeast homolog of XLF is Nej1.
Homology-directed repair (HDR) is a mechanism in cells to repair double-strand DNA lesions. The most common form of HDR is homologous recombination. The HDR mechanism can only be used by the cell when there is a homologous piece of DNA present in the nucleus, mostly in G2 and S phase of the cell cycle. Other examples of homology-directed repair include single-strand annealing and breakage-induced replication. When the homologous DNA is absent, another process called non-homologous end joining (NHEJ) takes place instead.
Microhomology-mediated end joining (MMEJ), also known as alternative nonhomologous end-joining (Alt-NHEJ) is one of the pathways for repairing double-strand breaks in DNA. As reviewed by McVey and Lee, the foremost distinguishing property of MMEJ is the use of microhomologous sequences during the alignment of broken ends before joining, thereby resulting in deletions flanking the original break. MMEJ is frequently associated with chromosome abnormalities such as deletions, translocations, inversions and other complex rearrangements.
DNA ligase 3 also DNA ligase III, is an enzyme that, in humans, is encoded by the LIG3 gene. LIG3 encodes ATP-dependent DNA ligases that seal interruptions in the phosphodiester backbone of duplex DNA.
Synthesis-dependent strand annealing (SDSA) is a major mechanism of homology-directed repair of DNA double-strand breaks (DSBs). Although many of the features of SDSA were first suggested in 1976, the double-Holliday junction model proposed in 1983 was favored by many researchers. In 1994, studies of double-strand gap repair in Drosophila were found to be incompatible with the double-Holliday junction model, leading researchers to propose a model they called synthesis-dependent strand annealing. Subsequent studies of meiotic recombination in S. cerevisiae found that non-crossover products appear earlier than double-Holliday junctions or crossover products, challenging the previous notion that both crossover and non-crossover products are produced by double-Holliday junctions and leading the authors to propose that non-crossover products are generated through SDSA.
Telomeres, the caps on the ends of eukaryotic chromosomes, play critical roles in cellular aging and cancer. An important facet to how telomeres function in these roles is their involvement in cell cycle regulation.
DNA end resection, also called 5′–3′ degradation, is a biochemical process where the blunt end of a section of double-stranded DNA (dsDNA) is modified by cutting away some nucleotides from the 5' end to produce a 3' single-stranded sequence. The presence of a section of single-stranded DNA (ssDNA) allows the broken end of the DNA to line up accurately with a matching sequence, so that it can be accurately repaired.
A double-strand break repair model refers to the various models of pathways that cells undertake to repair double strand-breaks (DSB). DSB repair is an important cellular process, as the accumulation of unrepaired DSB could lead to chromosomal rearrangements, tumorigenesis or even cell death. In human cells, there are two main DSB repair mechanisms: Homologous recombination (HR) and non-homologous end joining (NHEJ). HR relies on undamaged template DNA as reference to repair the DSB, resulting in the restoration of the original sequence. NHEJ modifies and ligates the damaged ends regardless of homology. In terms of DSB repair pathway choice, most mammalian cells appear to favor NHEJ rather than HR. This is because the employment of HR may lead to gene deletion or amplification in cells which contains repetitive sequences. In terms of repair models in the cell cycle, HR is only possible during the S and G2 phases, while NHEJ can occur throughout whole process. These repair pathways are all regulated by the overarching DNA damage response mechanism. Besides HR and NHEJ, there are also other repair models which exists in cells. Some are categorized under HR, such as synthesis-dependent strain annealing, break-induced replication, and single-strand annealing; while others are an entirely alternate repair model, namely, the pathway microhomology-mediated end joining (MMEJ).
LigD is a multifunctional ligase/polymerase/nuclease (3'-phosphoesterase) found in bacterial non-homologous end joining (NHEJ) DNA repair systems. It is much more error-prone than the more complex eukaryotic system of NHEJ, which uses multiple enzymes to fill its role. The polymerase preferentially use rNTPs, possibly advantageous in dormant cells.