Human somatic variations are somatic mutations (mutations that occur in somatic cells) both at early stages of development and in adult cells. These variations can lead either to pathogenic phenotypes or not, even if their function in healthy conditions is not completely clear yet. [1]
The term mosaic (from medieval Latin musaicum, meaning "work of the Muses") [2] has been used since antiquity to refer to an artistic patchwork of ornamental stones, glass, gems, or other precious material. At a distance, the collective image appears as it would in a painting; only on close inspection do the individual components become recognizable. [2] In biological systems, mosaicism implies the presence of more than one genetically distinct cell line in a single organism. Occurrence of this phenomenon not only can result in major phenotypic changes but also reveal the expression of otherwise lethal genetic mutations. [3]
Genetic mutations involved in mosaicism may be due to endogenous factors, such as transposons and ploidy changes, or exogenous factors, such as UV radiation and nicotine. [4]
Somatic mosaicism arises a result of somatic mutations: genomic (or even mitochondrial) alterations of different sizes ranging from a single nucleotide to chromosome gains or loss within somatic cells. These alterations within somatic cells begin at an early stage (pre-implantation or conception) and continue during aging, giving rise to phenotypic heterogeneity within cells, which may lead to the development of diseases such as cancer. [4] Novel array based techniques for screening genome-wide copy number variants and loss of heterozygosity in single cells showed that chromosome aneuploidies, uniparental disomies, segmental deletions, duplications, and amplifications frequently occur during embryogenesis. [5] Yet not all somatic mutations are propagated to the adult individual, due to the phenomenon of cell competition. [6]
Genetic alterations involving gains or loss of entire chromosomes predominantly occur during anaphase stage of cell division. But these are uncommon in somatic cells because they are usually selected against due to their deleterious consequences. [7] Somatic variations during embryonic development can be represented by monozygous twins since they carry different copy number profiles and epigenetic marks that keep on increasing with age. [8]
Early research on somatic mutations in aging showed that deletions, inversion, and translocations of genetic material are common in aging mice and aging genomes tend to contain visible chromosomal changes, mitotic recombination, whole gene deletions, intragenic deletions, and point mutations. Other factors include the loss of methylation, increasing gene expression heterogeneity correlating to genomic abnormalities, [4] and telomere shortening. [9] It is uncertain if transcription-based DNA repair takes part in the maintaining of somatic mutations in aging tissues. [4]
In some cells, the somatically acquired alterations can be reversed back to wild type alleles by reversion mosaicism. This can be due to endogenous mechanism such as homologous recombination, codon substitution, second-site suppressor mutations, DNA slippage, and mobile elements. [10]
The advent of Next-Generation Sequencing technologies has increased the resolution of mutation detection and has led to the revelation that older individuals not only accumulate chromosomal alterations but also abundant mutations in cancer driver genes. [11]
Age-associated accumulation of chromosomal alterations has been documented with a variety of cytogenetic approaches, from chromosome painting to single nucleotide polymorphism (SNP) arrays. [12]
Numerous studies demonstrated that the clonal populations might lead to loss of organismal health through the functional decline of tissue and/or the promotion of disease processes, such as cancer. This is the reason why the aberrant clonal expansion (ACE) resulting from cancer-associated mutations are common in noncancerous tissue and accumulate with age. This is universal in most organisms and affects multiple tissues. [11]
In the hematopoietic compartment mutations include both large structural chromosomal alterations and point mutations affecting cancer-associated genes. Some translocations appear to occur very early in life. The frequency of these events is low in people younger than 50 years (<0.5%), but this frequency rapidly increases to 2% to 3% of individuals in their 70s and 80s. This phenomenon was termed clonal hematopoiesis. A number of environmental factors, such as smoking, viral infections, and pesticide exposure, may contribute not only through mutation induction but also by modulation of clonal expansion. [13]
Otherwise, the detection of somatic variants in normal solid tissues has historically proved difficult. The main reasons are the generally slower replicative index, clonally restrictive tissue architecture, difficulty of tissue access, and low frequency of mutation occurrence. Recently, the analysis of somatic mutations in benign tissues adjacent to tumors revealed that 80% of samples harbors clonal mutations, with increased frequency associated with older age, smoking, and concurrent mutations in DNA repair genes. With the advent of NGS, it has become increasingly clear that somatic mutations accumulate with aging in normal tissue, even in individuals who are cancer-free. [11]
This suggested that clonal expansions driven by cancer genes are a near-universal feature of aging. NGS technologies revealed that the clonal expansions of cancer-associated mutations are very common condition in somatic tissues. [11]
Through several recent studies a prevalence of somatic variations, both in pathological and healthy nervous systems, has been highlighted. [14] [15]
Somatic aneuploidy such as SNVs (single-nucleotide variations) and CNVs (copy number variations) have been particularly observed and linked to brain disfunctions when arising in prenatal brain development; anyway those somatic aneuploidy have been observed in rates of 1,3-40%, potentially increasing with age and for this reason they have been proposed as a mechanism to generate normal genetic diversity among neurons. [16]
The confirmation of that hypothesis has been obtained through studies of single-cell sequencing, which allow a direct assessment of single neuronal genomes, so that a systematic characterization of somatic aneuploidies and subchromosomal CNVs of these cells is possible. Using postmortem brains of both healthy and diseased humans it has been possible to study how CNVs change among these two groups. It emerged that somatic aneuploidies in healthy brains are quite rare, but somatic CNVs instead aren't. [17]
These studies also showed that clonal CNVs exist in both pathological and healthy brains. This means that some CNVs can arise in early development without causing diseases, even though, when compared to the CNVs arising in other cell types such as lymphoblast, the brain's ones are more often private. This evidence could be given by the fact that, while lymphoblasts can generate clonal CNVs for a long period as they continue to proliferate, adult neurons do not replicate anymore, so the clonal CNVs they are carrying must have been generated in an early development stage. [17]
Data highlighted a tendency in neurons for the loss, rather than for the gain of copies when compared to lymphoblasts. These differences could suggest that the molecular mechanisms of CNVs arising in that two cell types are completely different. [17]
The retrotransposon LINE-1 (long interspersed element 1, L1) is a transposable element that has colonized the mammalian germline. L1 retrotransposition can happen also in somatic cells causing mosaicism (SLAVs – L1-associated variations) and in cancer. Retrotransposition is a copy and paste process in which the RNA template is retrotranscribed in DNA and integrated randomly in the genome. In humans there are around 500.000 copies of L1 and occupy 17% of genome. Its mRNA encodes for two proteins; one of them in particular has a reverse transcriptase and endonuclease activity that allows the retrotransposition in cis. Anyway most part of these copies are rendered immobile by mutations or 5’ truncation, leaving just about 80–100 mobile L1 per human genome and just about 10 are considered hot L1s so able to mobilize efficiently. [18]
L1 transpose using a mechanism called TPRT (target primed reverse transcription) it's able to insert a L1 endonuclease motif, target site duplications (TSD) and a poly-A tail with a cis preference. [19]
It has been seen in the past that there's L1 mobilization in neural progenitors during foetal and adult neurogenesis suggesting that the brain may be a L1 mosaicism hotspot. Moreover, some studies suggested that also non-dividing neurons can support L1 mobilization. This has been confirmed by single-cell genomic studies. [18]
Single-cell paired-end sequencing experiments found out that SLAVs are present both in neurons and glia of hippocampus and frontal cortex. Any neural cell has a similar probability to contain a SLAV, suggesting that somatic variations are a random phaenomenon, not focused on a specific group of cells. SLAVs occurrence in the brain is estimated to be of 0.58–1 SLAVs per cell and to involve 44–63% of the brain cells.[ citation needed ]
Since experiments showed that a half of the analyzed SLAVs lack target site duplication (TSD), another kind of L1-associated variant might occur. In fact those sequences don't have an endonuclease activity, but still have endonuclease motifs so that they can be retrotransposed in trans.[ citation needed ]
An application of the study of somatic mosaicism in the brain could be the tracing of specific brain cells. Indeed, if the somatic L1 insertions occurs in a progenitor cell, the unique variant could be used to trace the progenitor cell's development, localization, and spreading through the brain. On the contrary, if the somatic L1 insertion occurs late in development, it will be present just in a single cell or in a small group of cells. Therefore, tracing somatic variations could be useful to understand at which point of development they have occurred. Further experiments are necessary to understand the role of somatic mosaicism in brain function, since small groups of cells or even single cells can affect network activity. [20]
Human somatic mutations (HSMs) are intensively exploited by the immune system for the production of antibodies. HSMs, recombination in particular, are indeed the reason why antibodies can identify an epitope with such high specificity and sensitivity. [21]
Antibodies are encoded by B cells. Each antibody is composed of two heavy chains (IgH, encoded by IGH gene) and two light chains (IgL, encoded by either IGL or IGK gene). Each chain is then composed of a constant region (C) and a variable region (V). The constant region (C) on the heavy chain is important in the BCR signaling and determines the type of immunoglobuline (IgA, IgD, IgE, IgG, or IgM). The variable region (V) is responsible for the recognition of the target epitope and is the product of recombination processes in the related loci. [22]
After exposure of an antigen, B cells start developing. B cells genome undergoes repeated recombination processing on the Ig genes until the recognition of the epitope is perfectioned. The recombination involves the IGH locus first and then the IGL and IGK loci. All IGL, IGK, and IGH genes are the product of the V(D)J recombination process. This recombination involves the variable (V), diversity (D) and joining (J) segments. All three segments (V, D, J) are involved in the formation of the heavy chain, while only V and J recombination products encode for the light chain. [23]
The recombination between these regions allows the formation of 1012–1018 potential different sequences. However, this number is an overestimation, since many factors contribute to limit the diversity of the B cell repertoire, first of all the actual number of B cell in the organism. [23]
Somatic mosaicism has been noted in the heart. Sequencing suggested mosaic variation in the gap junction protein connexin in three patients out of 15 might contribute to atrial fibrillation [24] although subsequent reports in larger numbers of patients found no examples among a large panel of genes. [25] At Stanford, a team led by Euan Ashley demonstrated somatic mosaicism in the heart of a newborn presenting with life threatening arrhythmia. Family-based genome sequencing as well as tissue RNA sequencing and single cell genomics techniques were used to verify the finding. A model combining partial and ordinary differential equations with inputs from heterologous single channel electrophysiology experiments of the genetic variant recapitulated certain aspects of the clinical presentation. [26]
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, mitosis, or meiosis or other types of damage to DNA, which then may undergo error-prone repair, cause an error during other forms of repair, or cause an error during replication. Mutations may also result from substitution,insertion or deletion of segments of DNA due to mobile genetic elements.
Genetic recombination is the exchange of genetic material between different organisms which leads to production of offspring with combinations of traits that differ from those found in either parent. In eukaryotes, genetic recombination during meiosis can lead to a novel set of genetic information that can be further passed on from parents to offspring. Most recombination occurs naturally and can be classified into two types: (1) interchromosomal recombination, occurring through independent assortment of alleles whose loci are on different but homologous chromosomes ; & (2) intrachromosomal recombination, occurring through crossing over.
Aneuploidy is the presence of an abnormal number of chromosomes in a cell, for example a human cell having 45 or 47 chromosomes instead of the usual 46. It does not include a difference of one or more complete sets of chromosomes. A cell with any number of complete chromosome sets is called a euploid cell.
Gene duplication is a major mechanism through which new genetic material is generated during molecular evolution. It can be defined as any duplication of a region of DNA that contains a gene. Gene duplications can arise as products of several types of errors in DNA replication and repair machinery as well as through fortuitous capture by selfish genetic elements. Common sources of gene duplications include ectopic recombination, retrotransposition event, aneuploidy, polyploidy, and replication slippage.
Nondisjunction is the failure of homologous chromosomes or sister chromatids to separate properly during cell division (mitosis/meiosis). There are three forms of nondisjunction: failure of a pair of homologous chromosomes to separate in meiosis I, failure of sister chromatids to separate during meiosis II, and failure of sister chromatids to separate during mitosis. Nondisjunction results in daughter cells with abnormal chromosome numbers (aneuploidy).
Mosaicism or genetic mosaicism is a condition in which a multicellular organism possesses more than one genetic line as the result of genetic mutation. This means that various genetic lines resulted from a single fertilized egg. Mosaicism is one of several possible causes of chimerism, wherein a single organism is composed of cells with more than one distinct genotype.
A germline mutation, or germinal mutation, is any detectable variation within germ cells. Mutations in these cells are the only mutations that can be passed on to offspring, when either a mutated sperm or oocyte come together to form a zygote. After this fertilization event occurs, germ cells divide rapidly to produce all of the cells in the body, causing this mutation to be present in every somatic and germline cell in the offspring; this is also known as a constitutional mutation. Germline mutation is distinct from somatic mutation.
Heteroplasmy is the presence of more than one type of organellar genome within a cell or individual. It is an important factor in considering the severity of mitochondrial diseases. Because most eukaryotic cells contain many hundreds of mitochondria with hundreds of copies of mitochondrial DNA, it is common for mutations to affect only some mitochondria, leaving most unaffected.
Genetics, a discipline of biology, is the science of heredity and variation in living organisms.
L1, also known as L1CAM, is a transmembrane protein member of the L1 protein family, encoded by the L1CAM gene. This protein, of 200 to 220 kDa, is a neuronal cell adhesion molecule with a strong implication in cell migration, adhesion, neurite outgrowth, myelination and neuronal differentiation. It also plays a key role in treatment-resistant cancers due to its function. It was first identified in 1984 by M. Schachner who found the protein in post-mitotic mice neurons.
Mitotic recombination is a type of genetic recombination that may occur in somatic cells during their preparation for mitosis in both sexual and asexual organisms. In asexual organisms, the study of mitotic recombination is one way to understand genetic linkage because it is the only source of recombination within an individual. Additionally, mitotic recombination can result in the expression of recessive alleles in an otherwise heterozygous individual. This expression has important implications for the study of tumorigenesis and lethal recessive alleles. Mitotic homologous recombination occurs mainly between sister chromatids subsequent to replication. Inter-sister homologous recombination is ordinarily genetically silent. During mitosis the incidence of recombination between non-sister homologous chromatids is only about 1% of that between sister chromatids.
Somatic evolution is the accumulation of mutations and epimutations in somatic cells during a lifetime, and the effects of those mutations and epimutations on the fitness of those cells. This evolutionary process has first been shown by the studies of Bert Vogelstein in colon cancer. Somatic evolution is important in the process of aging as well as the development of some diseases, including cancer.
Cancer genome sequencing is the whole genome sequencing of a single, homogeneous or heterogeneous group of cancer cells. It is a biochemical laboratory method for the characterization and identification of the DNA or RNA sequences of cancer cell(s).
Genomic structural variation is the variation in structure of an organism's chromosome, such as deletions, duplications, copy-number variants, insertions, inversions and translocations. Originally, a structure variation affects a sequence length about 1kb to 3Mb, which is larger than SNPs and smaller than chromosome abnormality. However, the operational range of structural variants has widened to include events > 50bp. Some structural variants are associated with genetic diseases, however most are not. Approximately 13% of the human genome is defined as structurally variant in the normal population, and there are at least 240 genes that exist as homozygous deletion polymorphisms in human populations, suggesting these genes are dispensable in humans. While humans carry a median of 3.6 Mbp in SNPs, a median of 8.9 Mbp is affected by structural variation which thus causes most genetic differences between humans in terms of raw sequence data.
Chromothripsis is a mutational process by which up to thousands of clustered chromosomal rearrangements occur in a single event in localised and confined genomic regions in one or a few chromosomes, and is known to be involved in both cancer and congenital diseases. It occurs through one massive genomic rearrangement during a single catastrophic event in the cell's history. It is believed that for the cell to be able to withstand such a destructive event, the occurrence of such an event must be the upper limit of what a cell can tolerate and survive. The chromothripsis phenomenon opposes the conventional theory that cancer is the gradual acquisition of genomic rearrangements and somatic mutations over time.
Genome instability refers to a high frequency of mutations within the genome of a cellular lineage. These mutations can include changes in nucleic acid sequences, chromosomal rearrangements or aneuploidy. Genome instability does occur in bacteria. In multicellular organisms genome instability is central to carcinogenesis, and in humans it is also a factor in some neurodegenerative diseases such as amyotrophic lateral sclerosis or the neuromuscular disease myotonic dystrophy.
Chromosomal instability (CIN) is a type of genomic instability in which chromosomes are unstable, such that either whole chromosomes or parts of chromosomes are duplicated or deleted. More specifically, CIN refers to the increase in rate of addition or loss of entire chromosomes or sections of them. The unequal distribution of DNA to daughter cells upon mitosis results in a failure to maintain euploidy leading to aneuploidy. In other words, the daughter cells do not have the same number of chromosomes as the cell they originated from. Chromosomal instability is the most common form of genetic instability and cause of aneuploidy.
The Network of Cancer Genes (NCG) is a freely accessible web resource of genes that, when altered in their sequence, drive clonal expansion of normal tissues or cancer. The project was launched in 2010 and has reached its 7th release in 2022. In 2023 the additional annotation of cancer drivers that interact with the tumour immune microenvironment was added. NCG7.1 reports information on 3,347 cancer drivers and 95 healthy drivers. Of these, 596 are also TIME drivers. NCG7.1 also reports the system-level properties and the associated publications of each driver, as well as a list possible false positives. NCG7.1 enables advanced searches on the primary anatomical site, cancer type, type of sequencing screens and literature supports.
LINE1 is a family of related class I transposable elements in the DNA of many groups of eukaryotes, including animals and plants, classified with the long interspersed nuclear elements (LINEs). L1 transposons are most ubiquitous in mammals, where they make up a significant fraction of the total genome length, for example they comprise approximately 17% of the human genome. These active L1s can interrupt the genome through insertions, deletions, rearrangements, and copy number variations. L1 activity has contributed to the instability and evolution of genomes and is tightly regulated in the germline by DNA methylation, histone modifications, and piRNA. L1s can further impact genome variation through mispairing and unequal crossing over during meiosis due to its repetitive DNA sequences.
A somatic mutation is a change in the DNA sequence of a somatic cell of a multicellular organism with dedicated reproductive cells; that is, any mutation that occurs in a cell other than a gamete, germ cell, or gametocyte. Unlike germline mutations, which can be passed on to the descendants of an organism, somatic mutations are not usually transmitted to descendants. This distinction is blurred in plants, which lack a dedicated germline, and in those animals that can reproduce asexually through mechanisms such as budding, as in members of the cnidarian genus Hydra.
{{cite book}}
: |journal=
ignored (help)