Transcription-translation coupling is a mechanism of gene expression regulation in which synthesis of an mRNA (transcription) is affected by its concurrent decoding (translation). In prokaryotes, mRNAs are translated while they are transcribed. This allows communication between RNA polymerase, the multisubunit enzyme that catalyzes transcription, and the ribosome, which catalyzes translation. Coupling involves both direct physical interactions between RNA polymerase and the ribosome ("expressome" complexes), as well as ribosome-induced changes to the structure and accessibility of the intervening mRNA that affect transcription ("attenuation" and "polarity"). [1] [2] [3]
Bacteria depend on transcription-translation coupling for genome integrity, termination of transcription and control of mRNA stability. Consequently, artificial disruption of transcription-translation coupling impairs the fitness of bacteria. Without coupling, genome integrity is compromised as stalled transcription complexes interfere with DNA replication and induce DNA breaks. [4] Lack of coupling produces premature transcription termination, likely due to increased binding of termination factor Rho. [5] Degradation of prokaryotic mRNAs is accelerated by loss of coupled translation due to increased availability of target sites of RNase E. [6] It has also been suggested that coupling of transcription with translation is an important mechanism of preventing formation of deleterious R-loops. [7] While transcription-translation coupling is likely prevalent across prokaryotic organisms, not all species are dependent on it. Unlike Escherichia coli , in Bacillus subtilis transcription significantly outpaces translation, and coupling consequently does not occur. [8]
Translation promotes transcription elongation and regulates transcription termination. Functional coupling between transcription and translation is caused by direct physical interactions between the ribosome and RNA polymerase ("expressome complex"), ribosome-dependent changes to nascent mRNA secondary structure which affect RNA polymerase activity (e.g. "attenuation"), and ribosome-dependent changes to nascent mRNA availability to transcription termination factor Rho ("polarity").
The expressome is a supramolecular complex consisting of RNA polymerase and a trailing ribosome linked by a shared mRNA transcript. It is supported by the transcription factors NusG and NusA, which interact with both RNA polymerase and the ribosome to couple the complexes together. [9] [10] [11] When coupled by transcription factor NusG, the ribosome binds newly synthesized mRNA and prevents formation of secondary structures that inhibit transcription. [9] Formation of an expressome complex also aids transcription elongation by the trailing ribosome opposing back-tracking of RNA polymerase. [12] [13] Three-dimensional models of ribosome-RNA polymerase expressome complexes have been determined by cryo-electron microscopy. [14] [10] [11] [9]
Ribosome-mediated attenuation is a gene expression mechanism in which a transcriptional termination signal is regulated by translation. [15] [16] [17] Attenuation occurs at the start of some prokaryotic operons at sequences called "attenuators", which have been identified in operons encoding amino acid biosynthesis enzymes, pyrimidine biosynthesis enzymes and antibiotic resistance factors. The attenuator functions via a set of mRNA sequence elements that coordinate the status of translation to a transcription termination signal:
Once the start of the leader open reading frame has been transcribed, RNA polymerase pauses due to folding of the nascent mRNA. This programmed arrest of transcription gives time for translation of the leader peptide to commence, and transcription to resume once coupled to translation. The downstream "control region" then modulates the elongation rate of either the ribosome or RNA polymerase. The factor determining this depends on the function of the downstream genes (e.g. the operon encoding enzymes involved in the synthesis of histidine contains a series of histidine codons is the control region). The role of the control region is to modulate whether transcription remains coupled to translation depending on the cellular state (e.g. a low availability of histidine slows translation leading to uncoupling, while high availability of histidine permits efficient translation and maintains coupling). Finally, the transcription terminator sequence is transcribed. Whether transcription is coupled to translation determines whether this stops transcription. The terminator requires folding of the mRNA, and by unwinding mRNA structures the ribosome elects the formation of either of two alternative structures: the terminator, or a competing fold termed the "antiterminator".
For amino acid biosynthesis operons, these allow the gene expression machinery to sense the abundance of the amino acid produced by the encoded enzymes, and adjust the level of downstream gene expression accordingly: transcription occurring only if the amino acid abundance is low and the demand for the enzymes is therefore high. Examples include the histidine (his) [18] [19] and tryptophan (trp) [20] biosynthetic operons.
The term "attenuation" was introduced to describe the his operon. [18] While it is typically used to describe biosynthesis operons of amino acids and other metabolites, programmed transcription termination that does not occur at the end of a gene was first identified in λ phage. [21] The discovery of attenuation was significant as it represented a regulatory mechanism distinct from repression. [22] [23] The trp operon is regulated by both attenuation and repression, and was the first evidence that gene expression regulation mechanisms can be overlapping or redundant. [17]
"Polarity" is a gene expression mechanism in which transcription terminates prematurely due to a loss of coupling between transcription and translation. Transcription outpaces translation when the ribosome pauses[ citation needed ] or encounters a premature stop codon. [24] This allows the transcription termination factor Rho to bind the mRNA and terminate mRNA synthesis. Consequently, genes that are downstream in the operon are not transcribed, and therefore not expressed. Polarity serves as mRNA quality control, allowing unused transcripts to be terminated prematurely, rather than synthesized and degraded. [25]
The term "polarity" was introduced to describe the observation that the order of genes within an operon is important: a nonsense mutation within an upstream gene effects the transcription of downstream genes. [24] Furthermore, the position of the nonsense mutation within the upstream gene modulates the "degree of polarity", with nonsense mutations at the start of the upstream genes exerting stronger polarity (more reduced transcription) on downstream genes.
Unlike the mechanism of attenuation, which involves intrinsic termination of transcription at well-defined programmed sites, polarity is Rho-dependent and termination occurs at variable position.
The potential for transcription and translation to regulate each other was recognized by the team of Marshall Nirenberg, who discovered that the processes are physically connected through the formation of a DNA-ribosome complex. [26] [27] As part of the efforts of Nirenberg's group to determine the genetic code that underlies protein synthesis, they pioneered the use of cell-free in vitro protein synthesis reactions. Analysis of these reactions revealed that protein synthesis is mRNA-dependent, and that the sequence of the mRNA strictly defines the sequence of the protein product. For this work in breaking in the genetic code, Nirenberg was jointly awarded the Nobel Prize in Physiology or Medicine in 1968. Having established that transcription and translation are linked biochemically (translation depends on the product of transcription), an outstanding question remained whether they were linked physically - whether the newly synthesized mRNA released from the DNA before it is translated, or if can translation occur concurrently with transcription. Electron micrographs of stained cell-free protein synthesis reactions revealed branched assemblies in which strings of ribosomes are linked to a central DNA fibre. [27] DNA isolated from bacterial cells co-sediment with ribosomes, further supporting the conclusion that transcription and translation occur together. [26] Direct contact between ribosomes and RNA polymerase are observable within these early micrographs. [3] The potential for simultaneous regulation of transcription and translation at this junction was noted in Nirenberg's work as early as 1964. [26]
Enterobacteria phage λ is a bacterial virus, or bacteriophage, that infects the bacterial species Escherichia coli. It was discovered by Esther Lederberg in 1950. The wild type of this virus has a temperate life cycle that allows it to either reside within the genome of its host through lysogeny or enter into a lytic phase, during which it kills and lyses the cell to produce offspring. Lambda strains, mutated at specific sites, are unable to lysogenize cells; instead, they grow and enter the lytic cycle after superinfecting an already lysogenized cell.
In genetics, an operon is a functioning unit of DNA containing a cluster of genes under the control of a single promoter. The genes are transcribed together into an mRNA strand and either translated together in the cytoplasm, or undergo splicing to create monocistronic mRNAs that are translated separately, i.e. several strands of mRNA that each encode a single gene product. The result of this is that the genes contained in the operon are either expressed together or not at all. Several genes must be co-transcribed to define an operon.
In molecular biology, RNA polymerase, or more specifically DNA-directed/dependent RNA polymerase (DdRP), is an enzyme that catalyzes the chemical reactions that synthesize RNA from a DNA template.
A ρ factor is a bacterial protein involved in the termination of transcription. Rho factor binds to the transcription terminator pause site, an exposed region of single stranded RNA after the open reading frame at C-rich/G-poor sequences that lack obvious secondary structure.
In genetics, a transcription terminator is a section of nucleic acid sequence that marks the end of a gene or operon in genomic DNA during transcription. This sequence mediates transcriptional termination by providing signals in the newly synthesized transcript RNA that trigger processes which release the transcript RNA from the transcriptional complex. These processes include the direct interaction of the mRNA secondary structure with the complex and/or the indirect activities of recruited termination factors. Release of the transcriptional complex frees RNA polymerase and related transcriptional machinery to begin transcription of new mRNAs.
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosomal DNA (rDNA) and then bound to ribosomal proteins to form small and large ribosome subunits. rRNA is the physical and mechanical factor of the ribosome that forces transfer RNA (tRNA) and messenger RNA (mRNA) to process and translate the latter into proteins. Ribosomal RNA is the predominant form of RNA found in most cells; it makes up about 80% of cellular RNA despite never being translated into proteins itself. Ribosomes are composed of approximately 60% rRNA and 40% ribosomal proteins by mass.
In molecular biology, a termination factor is a protein that mediates the termination of RNA transcription by recognizing a transcription terminator and causing the release of the newly made mRNA. This is part of the process that regulates the transcription of RNA to preserve gene expression integrity and are present in both eukaryotes and prokaryotes, although the process in bacteria is more widely understood. The most extensively studied and detailed transcriptional termination factor is the Rho (ρ) protein of E. coli.
Bacterial translation is the process by which messenger RNA is translated into proteins in bacteria.
In genetics, attenuation is a regulatory mechanism for some bacterial operons that results in premature termination of transcription. The canonical example of attenuation used in many introductory genetics textbooks, is ribosome-mediated attenuation of the trp operon. Ribosome-mediated attenuation of the trp operon relies on the fact that, in bacteria, transcription and translation proceed simultaneously. Attenuation involves a provisional stop signal (attenuator), located in the DNA segment that corresponds to the leader sequence of mRNA. During attenuation, the ribosome becomes stalled (delayed) in the attenuator region in the mRNA leader. Depending on the metabolic conditions, the attenuator either stops transcription at that point or allows read-through to the structural gene part of the mRNA and synthesis of the appropriate protein.
Gene structure is the organisation of specialised sequence elements within a gene. Genes contain most of the information necessary for living cells to survive and reproduce. In most organisms, genes are made of DNA, where the particular DNA sequence determines the function of the gene. A gene is transcribed (copied) from DNA into RNA, which can either be non-coding (ncRNA) with a direct function, or an intermediate messenger (mRNA) that is then translated into protein. Each of these steps is controlled by specific sequence elements, or regions, within the gene. Every gene, therefore, requires multiple sequence elements to be functional. This includes the sequence that actually encodes the functional protein or ncRNA, as well as multiple regulatory sequence regions. These regions may be as short as a few base pairs, up to many thousands of base pairs long.
A termination signal is a sequence that signals the end of transcription or translation. Termination signals are found at the end of the part of the chromosome being transcribed during transcription of mRNA. Termination signals bring a stop to transcription, ensuring that only gene-encoding parts of the chromosome are transcribed. Transcription begins at the promoter when RNA polymerase, an enzyme that facilitates transcription of DNA into mRNA, binds to a promoter, unwinds the helical structure of the DNA, and uses the single-stranded DNA as a template to synthesize RNA. Once RNA polymerase reaches the termination signal, transcription is terminated. In bacteria, there are two main types of termination signals: intrinsic and factor-dependent terminators. In the context of translation, a termination signal is the stop codon on the mRNA that elicits the release of the growing peptide from the ribosome.
The trp operon is a group of genes that are transcribed together, encoding the enzymes that produce the amino acid tryptophan in bacteria. The trp operon was first characterized in Escherichia coli, and it has since been discovered in many other bacteria. The operon is regulated so that, when tryptophan is present in the environment, the genes for tryptophan synthesis are repressed.
Amino acid synthesis is the set of biochemical processes by which the amino acids are produced. The substrates for these processes are various compounds in the organism's diet or growth media. Not all organisms are able to synthesize all amino acids. For example, humans can synthesize 11 of the 20 standard amino acids. These 11 are called the non-essential amino acids).
The L-arabinose operon, also called the ara or araBAD operon, is an operon required for the breakdown of the five-carbon sugar L-arabinose in Escherichia coli. The L-arabinose operon contains three structural genes: araB, araA, araD, which encode for three metabolic enzymes that are required for the metabolism of L-arabinose. AraB (ribulokinase), AraA, and AraD produced by these genes catalyse conversion of L-arabinose to an intermediate of the pentose phosphate pathway, D-xylulose-5-phosphate.
Eukaryotic transcription is the elaborate process that eukaryotic cells use to copy genetic information stored in DNA into units of transportable complementary RNA replica. Gene transcription occurs in both eukaryotic and prokaryotic cells. Unlike prokaryotic RNA polymerase that initiates the transcription of all different types of RNA, RNA polymerase in eukaryotes comes in three variations, each translating a different type of gene. A eukaryotic cell has a nucleus that separates the processes of transcription and translation. Eukaryotic transcription occurs within the nucleus where DNA is packaged into nucleosomes and higher order chromatin structures. The complexity of the eukaryotic genome necessitates a great variety and complexity of gene expression control.
fis is an E. coli gene encoding the Fis protein. The regulation of this gene is more complex than most other genes in the E. coli genome, as Fis is an important protein which regulates expression of other genes. It is supposed that fis is regulated by H-NS, IHF and CRP. It also regulates its own expression (autoregulation). Fis is one of the most abundant DNA binding proteins in Escherichia coli under nutrient-rich growth conditions.
Post-transcriptional regulation is the control of gene expression at the RNA level. It occurs once the RNA polymerase has been attached to the gene's promoter and is synthesizing the nucleotide sequence. Therefore, as the name indicates, it occurs between the transcription phase and the translation phase of gene expression. These controls are critical for the regulation of many genes across human tissues. It also plays a big role in cell physiology, being implicated in pathologies such as cancer and neurodegenerative diseases.
The gua operon is responsible for regulating the synthesis of guanosine mono phosphate (GMP), a purine nucleotide, from inosine monophosphate. It consists of two structural genes guaB (encodes for IMP dehydrogenase or and guaA apart from the promoter and operator region.
Paul Babitzke is a professor of biochemistry and molecular biology and director of the Center for RNA Molecular Biology at Pennsylvania State University.
Archaeal transcription is the process in which a segment of archaeal DNA is copied into a newly synthesized strand of RNA using the sole Pol II-like RNA polymerase (RNAP). The process occurs in three main steps: initiation, elongation, and termination; and the end result is a strand of RNA that is complementary to a single strand of DNA. A number of transcription factors govern this process with homologs in both bacteria and eukaryotes, with the core machinery more similar to eukaryotic transcription.