Transcriptional bursting, also known as transcriptional pulsing, is a fundamental property of genes in which transcription from DNA to RNA can occur in "bursts" or "pulses", which has been observed in diverse organisms, from bacteria to mammals. [1] [2] [3] [4] [5]
This phenomenon came to light with the advent of technologies, such as MS2 tagging and single molecule RNA fluorescence in situ hybridisation, to detect RNA production in single cells, through precise measurements of RNA number or RNA appearance at the gene. Other, more widespread techniques, such as Northern blotting, microarrays, RT-PCR and RNA-Seq, measure bulk RNA levels from homogenous population extracts. These techniques lose dynamic information from individual cells and give the impression that transcription is a continuous smooth process. Observed at an individual cell level, transcription is irregular, with strong periods of activity interspersed by long periods of inactivity.
Bursting may result from the stochastic nature of biochemical events superimposed upon a two step fluctuation. In its simplest form, the gene is proposed to exist in two states, one where activity is negligible and one where there is a certain probability of activation. [6] Only in the second state does transcription readily occur. It seems likely that some rudimentary eukaryotes have genes which do not show bursting. The genes are always in the permissive state, with a simple probability describing the numbers of RNAs generated. [7]
More recent data indicate the two state model can be an oversimplification. Transcription of the c-Fos gene in response to serum stimulation can, for the most part, be summarised by two states, although at certain times after stimulation, a third state better explains the variance in the data. [8] Another model suggests a two state model can apply, but with each cell having a different transcription rate in the active state. [9] Other analyses indicate a spectrum or continuum of activity states. [10] [11] The nuclear and signaling landscapes of complex eukaryotic nuclei may favour more than two simple states- for example, there are over several dozen post-translational modifications of nucleosomes and perhaps a hundred different proteins involved in the average eukaryotic transcription reaction.
What do the repressive and permissive states represent? An attractive idea is that the repressed state is a closed chromatin conformation whilst the permissive states are more open. Another hypothesis is that the fluctuations between states reflect reversible transitions in the binding and dissociation of pre-initiation complexes. [12] Bursts may also result from bursty signalling, cell cycle effects or movement of chromatin to and from transcription factories. Bursting dynamics have been demonstrated to be influenced by cell size [13] and the frequency of extracellular signalling. [14] Recent data suggest different degrees of supercoiling distinguish the permissive and inactive states. [15]
The bursting phenomenon, as opposed to simple probabilistic models of transcription, can account for the high variability (see transcriptional noise) in gene expression occurring between cells in isogenic populations. This variability in turn can have tremendous consequences on cell behaviour, and must be mitigated or integrated. Suggested mechanisms by which noise can be dampened include strong extracellular signalling, [16] diffusion of RNA and protein in cell syncitia, [17] promoter proximal pausing, [18] and nuclear retention of transcripts. [19] In certain contexts, such as the survival of microbes in rapidly changing stressful environments, the expression variability may be essential. [20] Variability also impacts upon the effectiveness of clinical treatment, with resistance of bacteria to antibiotics demonstrably caused by non-genetic differences. [21] [22] Similar phenomena may contribute to the resistance of sub-populations of cancer cells to chemotherapy. [23] Spontaneous variability in gene expression is also proposed to act as a source of cell fate diversity in self-organizing differentiation processes, [24] and may act as a barrier to effective cellular reprogramming strategies. [25]
The cell nucleus is a membrane-bound organelle found in eukaryotic cells. Eukaryotic cells usually have a single nucleus, but a few cell types, such as mammalian red blood cells, have no nuclei, and a few others including osteoclasts have many. The main structures making up the nucleus are the nuclear envelope, a double membrane that encloses the entire organelle and isolates its contents from the cellular cytoplasm; and the nuclear matrix, a network within the nucleus that adds mechanical support.
Chromatin is a complex of DNA and protein found in eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important roles in reinforcing the DNA during cell division, preventing DNA damage, and regulating gene expression and DNA replication. During mitosis and meiosis, chromatin facilitates proper segregation of the chromosomes in anaphase; the characteristic shapes of chromosomes visible during this stage are the result of DNA being coiled into highly condensed chromatin.
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei. They act as spools around which DNA winds to create structural units called nucleosomes. Nucleosomes in turn are wrapped into 30-nanometer fibers that form tightly packed chromatin. Histones prevent DNA from becoming tangled and protect it from DNA damage. In addition, histones play important roles in gene regulation and DNA replication. Without histones, unwound DNA in chromosomes would be very long. For example, each human cell has about 1.8 meters of DNA if completely stretched out; however, when wound about histones, this length is reduced to about 90 micrometers (0.09 mm) of 30 nm diameter chromatin fibers.
Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, and ultimately affect a phenotype. These products are often proteins, but in non-protein-coding genes such as transfer RNA (tRNA) and small nuclear RNA (snRNA), the product is a functional non-coding RNA. Gene expression is summarized in the central dogma of molecular biology first formulated by Francis Crick in 1958, further developed in his 1970 article, and expanded by the subsequent discoveries of reverse transcription and RNA replication.
Transcription is the process of copying a segment of DNA into RNA. The segments of DNA transcribed into RNA molecules that can encode proteins are said to produce messenger RNA (mRNA). Other segments of DNA are copied into RNA molecules called non-coding RNAs (ncRNAs). mRNA comprises only 1–3% of total RNA samples. Less than 2% of the human genome can be transcribed into mRNA, while at least 80% of mammalian genomic DNA can be actively transcribed, with the majority of this 80% considered to be ncRNA.
A generegulatory network (GRN) is a collection of molecular regulators that interact with each other and with other substances in the cell to govern the gene expression levels of mRNA and proteins which, in turn, determine the function of the cell. GRN also play a central role in morphogenesis, the creation of body structures, which in turn is central to evolutionary developmental biology (evo-devo).
In molecular biology and genetics, transcriptional regulation is the means by which a cell regulates the conversion of DNA to RNA (transcription), thereby orchestrating gene activity. A single gene can be regulated in a range of ways, from altering the number of copies of RNA that are transcribed, to the temporal control of when the gene is transcribed. This control allows the cell or organism to respond to a variety of intra- and extracellular signals and thus mount a response. Some examples of this include producing the mRNA that encode enzymes to adapt to a change in a food source, producing the gene products involved in cell cycle specific activities, and producing the gene products responsible for cellular differentiation in multicellular eukaryotes, as studied in evolutionary developmental biology.
Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products. Sophisticated programs of gene expression are widely observed in biology, for example to trigger developmental pathways, respond to environmental stimuli, or adapt to new food sources. Virtually any step of gene expression can be modulated, from transcriptional initiation, to RNA processing, and to the post-translational modification of a protein. Often, one gene regulator controls another, and so on, in a gene regulatory network.
A circadian clock, or circadian oscillator, is a biochemical oscillator that cycles with a stable phase and is synchronized with solar time.
Eukaryotic transcription is the elaborate process that eukaryotic cells use to copy genetic information stored in DNA into units of transportable complementary RNA replica. Gene transcription occurs in both eukaryotic and prokaryotic cells. Unlike prokaryotic RNA polymerase that initiates the transcription of all different types of RNA, RNA polymerase in eukaryotes comes in three variations, each translating a different type of gene. A eukaryotic cell has a nucleus that separates the processes of transcription and translation. Eukaryotic transcription occurs within the nucleus where DNA is packaged into nucleosomes and higher order chromatin structures. The complexity of the eukaryotic genome necessitates a great variety and complexity of gene expression control.
AT-rich interactive domain-containing protein 1A is a protein that in humans is encoded by the ARID1A gene.
Developmental noise or stochastic noise is a concept within developmental biology in which the observable characteristics or traits (phenotype) varies between individuals even though both individuals share the same genetic code (genotypes) and the other environmental factors are completely the same. Factors that influence the effect include stochastic, or randomized, gene expression and other cellular noise.
Long non-coding RNAs are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as microRNAs (miRNAs), small interfering RNAs (siRNAs), Piwi-interacting RNAs (piRNAs), small nucleolar RNAs (snoRNAs), and other short RNAs. Long intervening/intergenic noncoding RNAs (lincRNAs) are sequences of lncRNA which do not overlap protein-coding genes.
MS2 tagging is a technique based upon the natural interaction of the MS2 bacteriophage coat protein with a stem-loop structure from the phage genome, which is used for biochemical purification of RNA-protein complexes and partnered to GFP for detection of RNA in living cells. More recently, the technique has been used to monitor the appearance of RNA in living cells, at the site of transcription, or simply by observing the changes in RNA number in the cytoplasm. This has revealed that transcription of both prokaryotic and eukaryotic genes occurs in a discontinuous fashion with bursts of transcription separated by irregular intervals.
Transcriptional noise is a primary cause of the variability (noise) in gene expression occurring between cells in isogenic populations. A proposed source of transcriptional noise is transcriptional bursting although other sources of heterogeneity, such as unequal separation of cell contents at mitosis are also likely to contribute considerably. Bursting transcription, as opposed to simple probabilistic models of transcription, reflects multiple states of gene activity, with fluctuations between states separated by irregular intervals, generating uneven protein expression between cells. Noise in gene expression can have tremendous consequences on cell behaviour, and must be mitigated or integrated. In certain contexts, such as establishment of viral latency, the survival of microbes in rapidly changing stressful environments, or several types of scattered differentiation, the variability may be essential. Variability also impacts upon the effectiveness of clinical treatment, with resistance of bacteria and yeast to antibiotics demonstrably caused by non-genetic differences. Variability in gene expression may also contribute to resistance of sub-populations of cancer cells to chemotherapy and appears to be a barrier to curing HIV.
Cellular noise is random variability in quantities arising in cellular biology. For example, cells which are genetically identical, even within the same tissue, are often observed to have different expression levels of proteins, different sizes and structures. These apparently random differences can have important biological and medical consequences.
CRISPR interference (CRISPRi) is a genetic perturbation technique that allows for sequence-specific repression of gene expression in prokaryotic and eukaryotic cells. It was first developed by Stanley Qi and colleagues in the laboratories of Wendell Lim, Adam Arkin, Jonathan Weissman, and Jennifer Doudna. Sequence-specific activation of gene expression refers to CRISPR activation (CRISPRa).
Nuclear organization refers to the spatial distribution of chromatin within a cell nucleus. There are many different levels and scales of nuclear organisation. Chromatin is a higher order structure of DNA.
Monoallelic gene expression (MAE) is the phenomenon of the gene expression, when only one of the two gene copies (alleles) is actively expressed (transcribed), while the other is silent. Diploid organisms bear two homologous copies of each chromosome (one from each parent), a gene can be expressed from both chromosomes (biallelic expression) or from only one (monoallelic expression). MAE can be Random monoallelic expression (RME) or Constitutive monoallelic expression (constitutive). Constitutive monoallelic expression occurs from the same specific allele throughout the whole organism or tissue, as a result of genomic imprinting. RME is a broader class of monoallelic expression, which is defined by random allelic choice in somatic cells, so that different cells of the multi-cellular organism express different alleles.
David Suter is a Swiss physician and molecular and cell biologist. His research focuses on quantitative approaches to study gene expression and developmental cell fate decisions. He is currently a professor at EPFL, where he heads the Suter Lab at the Institute of Bioengineering of the School of Life Sciences.