L-arabinose operon

Last updated

The L-arabinose operon, also called the ara or araBAD operon, is an operon required for the breakdown of the five-carbon sugar L-arabinose in Escherichia coli . [1] The L-arabinose operon contains three structural genes: araB, araA, araD (collectively known as araBAD), which encode for three metabolic enzymes that are required for the metabolism of L-arabinose. [2] AraB (ribulokinase), AraA (an isomerase), and AraD (an epimerase) produced by these genes catalyse conversion of L-arabinose to an intermediate of the pentose phosphate pathway, D-xylulose-5-phosphate. [2]

Contents

The structural genes of the L-arabinose operon are transcribed from a common promoter into a single transcript, a mRNA. [3] The expression of the L-arabinose operon is controlled as a single unit by the product of regulatory gene araC and the catabolite activator protein (CAP)-cAMP complex. [4] The regulator protein AraC is sensitive to the level of arabinose and plays a dual role as both an activator in the presence of arabinose and a repressor in the absence of arabinose to regulate the expression of araBAD. [5] AraC protein not only controls the expression of araBAD but also auto-regulates its own expression at high AraC levels. [6]

Structure

L-arabinose operon is composed of structural genes and regulatory regions including the operator region (araO1, araO2) and the initiator region (araI1, araI2). [7] The structural genes, araB, araA and araD, encode enzymes for L-arabinose catabolism. There is also a CAP binding site where CAP-cAMP complex binds to and facilitates catabolite repression, and results in positive regulation of araBAD when the cell is starved of glucose. [8]

Structure of L-arabinose operon of E. coli. Structure of L-arabinose operon of E. coli.tif
Structure of L-arabinose operon of E. coli.

The regulatory gene, araC, is located upstream of the L-arabinose operon and encodes the arabinose-responsive regulatory protein AraC. Both araC and araBAD have a discrete promoter where RNA polymerase binds and initiates transcription. [4] araBAD and araC are transcribed in opposite directions from the araBAD promoter (PBAD) and araC promoter (PC) respectively. [2]

Function

Metabolic pathway of L-arabinose via the action of three enzymes, which are encoded by the araBAD operon. Process of L-arabinose catabolism.png
Metabolic pathway of L-arabinose via the action of three enzymes, which are encoded by the araBAD operon.
Catabolism of arabinose in E. coli
SubstrateEnzyme(s)FunctionReversibleProduct
L-arabinose AraA Isomerase YesL-ribulose
L-ribulose AraB Ribulokinase NoL-ribulose-5-phosphate
L-ribulose-5-phosphate AraD Epimerase YesD-xylulose-5-phosphate

Both L-ribulose 5-phosphate and D-xylulose-5-phosphate are metabolites of the pentose phosphate pathway, which links the metabolism of 5-carbon sugars to that of 6-carbon sugars. [6]

Regulation

Structure of AraC monomer Structure of AraC protein.png
Structure of AraC monomer

The L-arabinose system is not only under the control of CAP-cAMP activator, but also positively or negatively regulated through binding of AraC protein. AraC functions as a homodimer, which can control transcription of araBAD through interaction with the operator and the initiator region on L-arabinose operon. Each AraC monomer is composed of two domains including a DNA binding domain and a dimerisation domain. [9] The dimerisation domain is responsible for arabinose-binding. [10] AraC undergoes conformational change upon arabinose-binding, in which, it has two distinct conformations. [6] The conformation is purely determined by the binding of allosteric inducer arabinose. [11]

AraC can also negatively autoregulate its own expression when the concentration of AraC becomes too high. AraC synthesis is repressed through binding of dimeric AraC to the operator region (araO1).

Negative regulation of araBAD

Negative regulation of L-arabinose operon via AraC protein Negative regulation of L-arabinose operon via AraC protein.png
Negative regulation of L-arabinose operon via AraC protein

When arabinose is absent, cells do not need the araBAD products for breaking down arabinose. Therefore, dimeric AraC acts as a repressor: one monomer binds to the operator of the araBAD gene (araO2), another monomer binds to a distant DNA half site known as araI1. [12] This leads to the formation of a DNA loop. [13] This orientation blocks RNA polymerase from binding to the araBAD promoter. [14] Therefore, transcription of structural gene araBAD is inhibited. [15]

Positive regulation of araBAD

Positive regulation of L-arabinose operon via dimeric AraC and CAP/cAMP Positive regulation of L-arabinose operon via AraC and CAP.png
Positive regulation of L-arabinose operon via dimeric AraC and CAP/cAMP

Expression of the araBAD operon is activated in the absence of glucose and in the presence of arabinose. When arabinose is present, both AraC and CAP work together and function as activators. [16]

Via AraC

AraC acts as an activator in the presence of arabinose. AraC undergoes a conformational change when arabinose binds to the dimerization domain of AraC. As a result, the AraC-arabinose complex falls off from araO2 and breaks the DNA loop. Hence, it is more energetically favourable for AraC-arabinose to bind to two adjacent DNA half sites: araI1 and araI2 in the presence of arabinose. One of the monomers binds araI1, the remaining monomer binds araI2 - in other words, binding of AraC to araI2 is allosterically induced by arabinose. One of the AraC monomers places near to the araBAD promoter in this configuration, which helps to recruit RNA polymerase to the promoter to initiate transcription. [17]

Via CAP/cAMP (catabolite repression)

CAP act as a transcriptional activator only in the absence of E. coli's preferred sugar, glucose. [18] When glucose is absent, high level of CAP protein/cAMP complex bind to CAP binding site, a site between araI1 and araO1. [19] Binding of CAP/cAMP is responsible for opening up the DNA loop between araI1 and araO2, increasing the binding affinity of AraC protein for araI2 and thereby promoting RNA polymerase to bind to araBAD promoter to switch on the expression of the araBAD required for metabolising L-arabinose.

Autoregulation of araC expression Autoregulation of araC expression.png
Autoregulation of araC expression

Autoregulation of AraC

The expression of araC is negatively regulated by its own protein product, AraC. The excess AraC binds to the operator of the araC gene, araO1, at high AraC levels, which physically blocks the RNA polymerase from accessing the araC promoter. [20] Therefore, the AraC protein inhibits its own expression at high concentrations. [16]

Use in protein expression system

The L-arabinose operon has been a focus for research in molecular biology since 1970, and has been investigated extensively at its genetic, biochemical, physiological and biotechnical levels. [3] The L-arabinose operon has been commonly used in protein expression system, as the araBAD promoter can be used for producing targeted expression under tight regulation. By fusing the araBAD promoter to a gene of interest, the expression of the target gene can be solely regulated by arabinose: for example, the pGLO plasmid contains a green fluorescent protein gene under the control of the PBAD promoter, allowing GFP production to be induced by arabinose.

See also

Other operon systems in E. coli:

Related Research Articles

<span class="mw-page-title-main">Lambda phage</span> Bacteriophage that infects Escherichia coli

Enterobacteria phage λ is a bacterial virus, or bacteriophage, that infects the bacterial species Escherichia coli. It was discovered by Esther Lederberg in 1950. The wild type of this virus has a temperate life cycle that allows it to either reside within the genome of its host through lysogeny or enter into a lytic phase, during which it kills and lyses the cell to produce offspring. Lambda strains, mutated at specific sites, are unable to lysogenize cells; instead, they grow and enter the lytic cycle after superinfecting an already lysogenized cell.

In genetics, an operon is a functioning unit of DNA containing a cluster of genes under the control of a single promoter. The genes are transcribed together into an mRNA strand and either translated together in the cytoplasm, or undergo splicing to create monocistronic mRNAs that are translated separately, i.e. several strands of mRNA that each encode a single gene product. The result of this is that the genes contained in the operon are either expressed together or not at all. Several genes must be co-transcribed to define an operon.

<span class="mw-page-title-main">Lac repressor</span> DNA-binding protein

The lac repressor (LacI) is a DNA-binding protein that inhibits the expression of genes coding for proteins involved in the metabolism of lactose in bacteria. These genes are repressed when lactose is not available to the cell, ensuring that the bacterium only invests energy in the production of machinery necessary for uptake and utilization of lactose when lactose is present. When lactose becomes available, it is firstly converted into allolactose by β-Galactosidase (lacZ) in bacteria. The DNA binding ability of lac repressor bound with allolactose is inhibited due to allosteric regulation, thereby genes coding for proteins involved in lactose uptake and utilization can be expressed.

<i>lac</i> operon Set genes encoding proteins and enzymes for lactose metabolism

The lactose operon is an operon required for the transport and metabolism of lactose in E. coli and many other enteric bacteria. Although glucose is the preferred carbon source for most enteric bacteria, the lac operon allows for the effective digestion of lactose when glucose is not available through the activity of beta-galactosidase. Gene regulation of the lac operon was the first genetic regulatory mechanism to be understood clearly, so it has become a foremost example of prokaryotic gene regulation. It is often discussed in introductory molecular and cellular biology classes for this reason. This lactose metabolism system was used by François Jacob and Jacques Monod to determine how a biological cell knows which enzyme to synthesize. Their work on the lac operon won them the Nobel Prize in Physiology in 1965.

A transcriptional activator is a protein that increases transcription of a gene or set of genes. Activators are considered to have positive control over gene expression, as they function to promote gene transcription and, in some cases, are required for the transcription of genes to occur. Most activators are DNA-binding proteins that bind to enhancers or promoter-proximal elements. The DNA site bound by the activator is referred to as an "activator-binding site". The part of the activator that makes protein–protein interactions with the general transcription machinery is referred to as an "activating region" or "activation domain".

<span class="mw-page-title-main">Repressor</span> Sort of RNA-binding protein in molecular genetics

In molecular genetics, a repressor is a DNA- or RNA-binding protein that inhibits the expression of one or more genes by binding to the operator or associated silencers. A DNA-binding repressor blocks the attachment of RNA polymerase to the promoter, thus preventing transcription of the genes into messenger RNA. An RNA-binding repressor binds to the mRNA and prevents translation of the mRNA into protein. This blocking or reducing of expression is called repression.

<span class="mw-page-title-main">Silencer (genetics)</span> Type of DNA sequence

In genetics, a silencer is a DNA sequence capable of binding transcription regulation factors, called repressors. DNA contains genes and provides the template to produce messenger RNA (mRNA). That mRNA is then translated into proteins. When a repressor protein binds to the silencer region of DNA, RNA polymerase is prevented from transcribing the DNA sequence into RNA. With transcription blocked, the translation of RNA into proteins is impossible. Thus, silencers prevent genes from being expressed as proteins.

<span class="mw-page-title-main">Regulator gene</span> Gene involved in controlling expression of other genes

In genetics, a regulator gene, regulator, or regulatory gene is a gene involved in controlling the expression of one or more other genes. Regulatory sequences, which encode regulatory genes, are often at the five prime end (5') to the start site of transcription of the gene they regulate. In addition, these sequences can also be found at the three prime end (3') to the transcription start site. In both cases, whether the regulatory sequence occurs before (5') or after (3') the gene it regulates, the sequence is often many kilobases away from the transcription start site. A regulator gene may encode a protein, or it may work at the level of RNA, as in the case of genes encoding microRNAs. An example of a regulator gene is a gene that codes for a repressor protein that inhibits the activity of an operator.

In molecular biology, an inducer is a molecule that regulates gene expression. An inducer functions in two ways; namely:

<span class="mw-page-title-main">TetR</span>

Tet Repressor proteins are proteins playing an important role in conferring antibiotic resistance to large categories of bacterial species.

Gene structure is the organisation of specialised sequence elements within a gene. Genes contain most of the information necessary for living cells to survive and reproduce. In most organisms, genes are made of DNA, where the particular DNA sequence determines the function of the gene. A gene is transcribed (copied) from DNA into RNA, which can either be non-coding (ncRNA) with a direct function, or an intermediate messenger (mRNA) that is then translated into protein. Each of these steps is controlled by specific sequence elements, or regions, within the gene. Every gene, therefore, requires multiple sequence elements to be functional. This includes the sequence that actually encodes the functional protein or ncRNA, as well as multiple regulatory sequence regions. These regions may be as short as a few base pairs, up to many thousands of base pairs long.

<i>trp</i> operon Operon that codes for the components for production of tryptophan

The trp operon is a group of genes that are transcribed together, encoding the enzymes that produce the amino acid tryptophan in bacteria. The trp operon was first characterized in Escherichia coli, and it has since been discovered in many other bacteria. The operon is regulated so that, when tryptophan is present in the environment, the genes for tryptophan synthesis are repressed.

In molecular genetics, a regulon is a group of genes that are regulated as a unit, generally controlled by the same regulatory gene that expresses a protein acting as a repressor or activator. This terminology is generally, although not exclusively, used in reference to prokaryotes, whose genomes are often organized into operons; the genes contained within a regulon are usually organized into more than one operon at disparate locations on the chromosome. Applied to eukaryotes, the term refers to any group of non-contiguous genes controlled by the same regulatory gene.

fis E. coli gene

fis is an E. coli gene encoding the Fis protein. The regulation of this gene is more complex than most other genes in the E. coli genome, as Fis is an important protein which regulates expression of other genes. It is supposed that fis is regulated by H-NS, IHF and CRP. It also regulates its own expression (autoregulation). Fis is one of the most abundant DNA binding proteins in Escherichia coli under nutrient-rich growth conditions.

<span class="mw-page-title-main">L-ribulose-5-phosphate 4-epimerase</span>

In enzymology, a L-ribulose-5-phosphate 4-epimerase is an enzyme that catalyzes the interconversion of ribulose 5-phosphate and xylulose 5-phosphate in the oxidative phase of the Pentose phosphate pathway.

The gal operon is a prokaryotic operon, which encodes enzymes necessary for galactose metabolism. Repression of gene expression for this operon works via binding of repressor molecules to two operators. These repressors dimerize, creating a loop in the DNA. The loop as well as hindrance from the external operator prevent RNA polymerase from binding to the promoter, and thus prevent transcription. Additionally, since the metabolism of galactose in the cell is involved in both anabolic and catabolic pathways, a novel regulatory system using two promoters for differential repression has been identified and characterized within the context of the gal operon.

The gua operon is responsible for regulating the synthesis of guanosine mono phosphate (GMP), a purine nucleotide, from inosine monophosphate. It consists of two structural genes guaB (encodes for IMP dehydrogenase or and guaA apart from the promoter and operator region.

<i>gab</i> operon

The gab operon is responsible for the conversion of γ-aminobutyrate (GABA) to succinate. The gab operon comprises three structural genes – gabD, gabT and gabP – that encode for a succinate semialdehyde dehydrogenase, GABA transaminase and a GABA permease respectively. There is a regulatory gene csiR, downstream of the operon, that codes for a putative transcriptional repressor and is activated when nitrogen is limiting.

<span class="mw-page-title-main">PBAD promoter</span>

PBAD is a promoter found in bacteria and especially as part of plasmids used in laboratory studies. The promoter is a part of the arabinose operon whose name derives from the genes it regulates transcription of: araB, araA, and araD. In E. coli, the PBAD promoter is adjacent to the PC promoter, which transcribes the araC gene in the opposite direction. araC encodes the AraC protein, which regulates activity of both the PBAD and PC promoters. The cyclic AMP receptor protein CAP binds between the PBAD and PC promoters, stimulating transcription of both when bound by cAMP.

The locus of enterocyte effacement-encoded regulator (Ler) is a regulatory protein that controls bacterial pathogenicity of enteropathogenic Escherichia coli (EPEC) and enterohemorrhagic Escherichia coli (EHEC). More specifically, Ler regulates the locus of enterocyte effacement (LEE) pathogenicity island genes, which are responsible for creating intestinal attachment and effacing lesions and subsequent diarrhea: LEE1, LEE2, and LEE3. LEE1, 2, and 3 carry the information necessary for a type III secretion system. The transcript encoding the Ler protein is the open reading frame 1 on the LEE1 operon.

References

  1. Voet, Donald & Voet, Judith G. (2011). Biochemistry (4th. ed.). Hoboken, NJ: John Wiley & Sons. pp.  1291–1294. ISBN   978-0470-57095-1.{{cite book}}: CS1 maint: multiple names: authors list (link)
  2. 1 2 3 Schleif, Robert (2000). "Regulation of the L-arabinose operon of Escherichia coli". Trends in Genetics . 16 (12): 559–565. doi: 10.1016/S0168-9525(00)02153-3 . PMID   11102706.
  3. 1 2 Watson, James D. (2008). Molecular biology of the gene (6th. ed.). Harlow: Addison-Wesley. pp. 634–635. ISBN   9780321507815.
  4. 1 2 Schleif, Robert (2010). "AraC protein, regulation of the l-arabinose operon in, and the light switch mechanism of AraC action". FEMS Microbiology Reviews. 34 (5): 779–796. doi: 10.1111/j.1574-6976.2010.00226.x . PMID   20491933.
  5. Lobell, R. B.; Schleif, R. F. (1990). "DNA looping and unlooping by AraC protein". Science. 250 (4980): 528–532. Bibcode:1990Sci...250..528L. doi:10.1126/science.2237403. PMID   2237403. S2CID   25017204.
  6. 1 2 3 Schleif, Robert (2003). "AraC protein: A love-hate relationship". BioEssays . 25 (3): 274–282. doi:10.1002/bies.10237. PMID   12596232.
  7. Schleif, Robert; Lis, John T. (1975). "The regulatory region of the l-arabinose operon: A physical, genetic and physiological study". Journal of Molecular Biology . 95 (3): 417–431. doi:10.1016/0022-2836(75)90200-4. PMID   168391.
  8. Ogden, S; Haggerty, D; Stoner, CM; Kolodrubetz, D; Schleif, R (1980). "The Escherichia coli L-arabinose operon: binding sites of the regulatory proteins and a mechanism of positive and negative regulation". Proceedings of the National Academy of Sciences of the United States of America . 77 (6): 3346–3350. Bibcode:1980PNAS...77.3346O. doi: 10.1073/pnas.77.6.3346 . PMC   349612 . PMID   6251457.
  9. Bustos, S. A; Schleif, R. F (1993). "Functional domains of the AraC protein". Proceedings of the National Academy of Sciences of the United States of America . 90 (12): 5638–5642. Bibcode:1993PNAS...90.5638B. doi: 10.1073/pnas.90.12.5638 . PMC   46776 . PMID   8516313.
  10. Saviola, B; Seabold, R; Schleif, R. F (1998). "Arm-domain interactions in AraC". Journal of Molecular Biology . 278 (3): 539–548. doi: 10.1006/jmbi.1998.1712 . PMID   9600837.
  11. Griffiths, Anthony J.; Wessler, Susan R. (2015). Introduction to genetic analysis (11th ed.). New York, NY: Freeman. pp. 413–414. ISBN   9781429276344.
  12. Casadaban, Malcolm J. (1976). "Regulation of the regulatory gene for the arabinose pathway, araC". Journal of Molecular Biology . 104 (3): 557–566. doi:10.1016/0022-2836(76)90120-0. PMID   781294.
  13. Seabold, Robert R; Schleif, Robert F (1998). "Apo-AraC actively seeks to loop". Journal of Molecular Biology . 278 (3): 529–538. doi: 10.1006/jmbi.1998.1713 . PMID   9600836.
  14. Hendrickson, William; Schleif, Robert (1984). "Regulation of the Escherichia coli L-arabinose operon studied by gel electrophoresis DNA binding assay". Journal of Molecular Biology . 178 (3): 611–628. doi:10.1016/0022-2836(84)90241-9. PMID   6387154.
  15. Weaver, Robert Franklin (2012). Molecular biology (5th int. student ed.). New York: McGraw-Hill. pp.  183–186. ISBN   9780071316866.
  16. 1 2 Snyder, Larry (2013). Molecular genetics of bacteria (4th. ed.). Washington, DC: ASM Press. pp. 487–494. ISBN   9781555816278.
  17. Hartwell, Leland; Hood, Leroy (2010). Genetics : from genes to genomes (4th ed.). Boston: McGraw-Hill Education. p.  528. ISBN   9780071102155.
  18. Cox, Michael M.; Doudna, Jennifer A.; O'Donnell, Michael E. (2012). Molecular biology : principles and practice (International ed.). New York: W.H. Freeman. pp. 707–708. ISBN   9781464102257.
  19. Griffiths, Anthony J.F. (2002). Modern genetic analysis: integrating genes and genomes (2nd. ed.). New York: W.H. Freeman. pp.  432–433. ISBN   0716743825.
  20. Lee, N. L; Gielow, W.O; Wallace, R. G (1981). "Mechanism of araC autoregulation and the domains of two overlapping promoters, Pc and PBAD, in the L-arabinose regulatory region of Escherichia coli". Proceedings of the National Academy of Sciences of the United States of America . 78 (2): 752–756. Bibcode:1981PNAS...78..752L. doi: 10.1073/pnas.78.2.752 . PMC   319880 . PMID   6262769.