The bacterial, archaeal and plant plastid code (translation table 11) is the DNA code used by bacteria, archaea and prokaryotic viruses, and for chloroplast proteins. It is essentially the same as the standard code, although it differs in its alternative start codons.
1st base | 2nd base: U | 2nd base: C | 2nd base: A | 2nd base: G | 3rd base
---|---|---|---|---|---
U | UUU (Phe/F) Phenylalanine | UCU (Ser/S) Serine | UAU (Tyr/Y) Tyrosine | UGU (Cys/C) Cysteine | U
U | UUC (Phe/F) Phenylalanine | UCC (Ser/S) Serine | UAC (Tyr/Y) Tyrosine | UGC (Cys/C) Cysteine | C
U | UUA (Leu/L) Leucine | UCA (Ser/S) Serine | UAA Stop (Ochre) [B] | UGA Stop (Opal) [B] | A
U | UUG [A] (Leu/L) Leucine | UCG (Ser/S) Serine | UAG Stop (Amber) [B] | UGG (Trp/W) Tryptophan | G
C | CUU (Leu/L) Leucine | CCU (Pro/P) Proline | CAU (His/H) Histidine | CGU (Arg/R) Arginine | U
C | CUC (Leu/L) Leucine | CCC (Pro/P) Proline | CAC (His/H) Histidine | CGC (Arg/R) Arginine | C
C | CUA (Leu/L) Leucine | CCA (Pro/P) Proline | CAA (Gln/Q) Glutamine | CGA (Arg/R) Arginine | A
C | CUG [A] (Leu/L) Leucine | CCG (Pro/P) Proline | CAG (Gln/Q) Glutamine | CGG (Arg/R) Arginine | G
A | AUU (Ile/I) Isoleucine | ACU (Thr/T) Threonine | AAU (Asn/N) Asparagine | AGU (Ser/S) Serine | U
A | AUC (Ile/I) Isoleucine | ACC (Thr/T) Threonine | AAC (Asn/N) Asparagine | AGC (Ser/S) Serine | C
A | AUA (Ile/I) Isoleucine | ACA (Thr/T) Threonine | AAA (Lys/K) Lysine | AGA (Arg/R) Arginine | A
A | AUG [A] (Met/M) Methionine | ACG (Thr/T) Threonine | AAG (Lys/K) Lysine | AGG (Arg/R) Arginine | G
G | GUU (Val/V) Valine | GCU (Ala/A) Alanine | GAU (Asp/D) Aspartic acid | GGU (Gly/G) Glycine | U
G | GUC (Val/V) Valine | GCC (Ala/A) Alanine | GAC (Asp/D) Aspartic acid | GGC (Gly/G) Glycine | C
G | GUA (Val/V) Valine | GCA (Ala/A) Alanine | GAA (Glu/E) Glutamic acid | GGA (Gly/G) Glycine | A
G | GUG (Val/V) Valine | GCG (Ala/A) Alanine | GAG (Glu/E) Glutamic acid | GGG (Gly/G) Glycine | G
As in the standard code, initiation is most efficient at AUG. In addition, GUG and UUG starts are documented in archaea and bacteria. [5] [6] [7] [8] [9] [10] [11] In Escherichia coli, UUG is estimated to serve as the initiator for about 3% of the bacterium's proteins. [12] CUG is known to function as an initiator for one plasmid-encoded protein (RepA) in E. coli. [13] In addition to these NUG initiations, bacteria can in rare cases initiate translation from an AUU codon, for example in the case of poly(A) polymerase PcnB and of the InfC gene, which codes for translation initiation factor IF3. [14] [15] [9] [16] The internal assignments are the same as in the standard code, though UGA codes at low efficiency for tryptophan in Bacillus subtilis and, presumably, in Escherichia coli. [17]
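The practical consequence of alternative initiation is that the first codon of a coding sequence is read as the initiator methionine even when it is not AUG. The Python sketch below illustrates this under stated assumptions: the names `START_CODONS` and `translate_cds` are hypothetical, the start set contains only the codons named in the text above (AUG, GUG, UUG, CUG, AUU) rather than any authoritative list, and the low-efficiency reading of UGA as tryptophan is not modelled.

```python
from itertools import product

BASES = "UCAG"
# One-letter codes in the order UUU, UUC, UUA, UUG, UCU, ... GGG; "*" is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the initiator codons discussed in the text are accepted.
START_CODONS = {"AUG", "GUG", "UUG", "CUG", "AUU"}


def translate_cds(mrna: str) -> str:
    """Translate a coding sequence that begins at its start codon.

    The first codon must be one of START_CODONS and is always emitted as
    "M", whatever it would encode internally. Later codons use the internal
    assignments, and translation stops at the first stop codon.
    """
    first = mrna[:3]
    if first not in START_CODONS:
        raise ValueError(f"{first!r} is not a recognised initiator codon")
    protein = ["M"]
    for i in range(3, len(mrna) - 2, 3):
        aa = CODON_TABLE[mrna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

Here `translate_cds("GUGAAAUUGUAA")` starts with methionine although GUG encodes valine internally, while the internal UUG is still read as leucine.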
This article incorporates text from the United States National Library of Medicine, which is in the public domain. [18]