Wobble base pair

Last updated October 06, 2024

A wobble base pair is a pairing between two nucleotides in RNA molecules that does not follow Watson-Crick base pair rules.^[1] The four main wobble base pairs are guanine-uracil (G-U), hypoxanthine-uracil (I-U), hypoxanthine-adenine (I-A), and hypoxanthine-cytosine (I-C). In order to maintain consistency of nucleic acid nomenclature, "I" is used for hypoxanthine because hypoxanthine is the nucleobase of inosine;^[2] nomenclature otherwise follows the names of nucleobases and their corresponding nucleosides (e.g., "G" for both guanine and guanosine – as well as for deoxyguanosine). The thermodynamic stability of a wobble base pair is comparable to that of a Watson-Crick base pair. Wobble base pairs are fundamental in RNA secondary structure and are critical for the proper translation of the genetic code.

Brief history

In the genetic code, there are 4³ = 64 possible codons (three-nucleotide sequences). For translation, each of these codons requires a tRNA molecule with an anticodon with which it can stably complement. If each tRNA molecule is paired with its complementary mRNA codon using canonical Watson-Crick base pairing, then 64 types of tRNA molecule would be required. In the standard genetic code, three of these 64 mRNA codons (UAA, UAG and UGA) are stop codons. These terminate translation by binding to release factors rather than tRNA molecules, so canonical pairing would require 61 species of tRNA. Since most organisms have fewer than 45 types of tRNA, ⁣^[3] some tRNA types can pair with multiple, synonymous codons, all of which encode the same amino acid. In 1966, Francis Crick proposed the Wobble Hypothesis to account for this. He postulated that the 5' base on the anticodon, which binds to the 3' base on the mRNA, was not as spatially confined as the other two bases and could, thus, have non-standard base pairing.^[4] Crick creatively named it for the small amount of "play" or wobble that occurs at this third codon position. Movement ("wobble") of the base in the 5' anticodon position is necessary for small conformational adjustments that affect the overall pairing geometry of anticodons of tRNA.^[5]^[6]

As an example, yeast tRNA^Phe has the anticodon 5'-GmAA-3' and can recognize the codons 5'-UUC-3' and 5'-UUU-3'. It is, therefore, possible for non-Watson–Crick base pairing to occur at the third codon position, i.e., the 3' nucleotide of the mRNA codon and the 5' nucleotide of the tRNA anticodon.^[7]

Wobble hypothesis

These notions led Francis Crick to the creation of the wobble hypothesis, a set of four relationships explaining these naturally occurring attributes.

The first two bases in the codon create the coding specificity, for they form strong Watson-Crick base pairs and bond strongly to the anticodon of the tRNA.
When reading 5' to 3' the first nucleotide in the anticodon (which is on the tRNA and pairs with the last nucleotide of the codon on the mRNA) determines how many nucleotides the tRNA actually distinguishes.
If the first nucleotide in the anticodon is a C or an A, pairing is specific and acknowledges original Watson-Crick pairing, that is: only one specific codon can be paired to that tRNA. If the first nucleotide is U or G, the pairing is less specific and in fact two bases can be interchangeably recognized by the tRNA. Inosine displays the true qualities of wobble, in that if that is the first nucleotide in the anticodon, any of three bases in the original codon can be matched with the tRNA.
Due to the specificity inherent in the first two nucleotides of the codon, if one amino acid is coded for by multiple anticodons and those anticodons differ in either the second or third position (first or second position in the codon) then a different tRNA is required for that anticodon.
The minimum requirement to satisfy all possible codons (61 excluding three stop codons) is 32 tRNAs. That is 31 tRNAs for the amino acids and one initiation codon.^[8]

tRNA base pairing schemes

Wobble pairing rules. Watson-Crick base pairs are shown in bold. Parentheses denote bindings that work but will be favoured less. A leading x denotes derivatives (in general) of the base that follows.

tRNA 5' anticodon base	mRNA 3' codon base (Crick)^{[note 1]}	mRNA 3' codon base (Revised)^[9]
A	U	U, C, G, or (A)
C	G	G
G	C or U	C or U
U	A or G	A, G, U, or (C)
I	A, C, or U	A, C, or U
k²C		A
xm⁵s²U, xm⁵Um, Um, xm⁵U		A or (G)
xo⁵U		U, A, or G

Biological importance

Aside from the necessity of wobble, that our cells have a limited amount of tRNAs and wobble allows for more flexibility, wobble base pairs have been shown to facilitate many biological functions, most clearly demonstrated in the bacterium Escherichia coli , a model organism. In fact, in a study of E. coli's tRNA for alanine there is a wobble base pair that determines whether the tRNA will be aminoacylated. When a tRNA reaches an aminoacyl tRNA synthetase, the job of the synthetase is to join the t-shaped RNA with its amino acid. These aminoacylated tRNAs go on to the translation of an mRNA transcript, and are the fundamental elements that connect to the codon of the amino acid.^[1] The necessity of the wobble base pair is illustrated through experimentation where the Guanine-Uracil pairing is changed to its natural Guanine-Cytosine pairing. Oligoribonucleotides were synthesized on a Gene Assembler Plus, and then spread across a DNA sequence known to code a tRNA for alanine, 2D-NMRs are then run on the products of these new tRNAs and compared to the wobble tRNAs. The results indicate that with that wobble base pair changed, structure is also changed and an alpha helix can no longer be formed. The alpha helix was the recognizable structure for the aminoacyl tRNA synthetase and thus the synthetase does not connect the amino acid alanine with the tRNA for alanine. This wobble base pairing is essential for the use of the amino acid alanine in E. coli and its significance here would imply significance in many related species.^[10] More information can be seen on aminoacyl tRNA synthetase and the genomes of E. coli tRNA at the External links, Information on Aminoacyl tRNA Synthetases and Genomic tRNA Database.

Footnotes

↑ These relationships can be further observed, as well as full codons and anticodons in the correct reading frame at: SBDR (2008-04-15). "Genetic Code and Amino Acid Translation". Society for Biomedical Diabetes Research. Archived from the original on 2014-11-04. Retrieved 2014-09-14. For a modern view on the pairings, see doi:10.1093/nar/gkh185.

Related Research Articles

A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA and RNA. Dictated by specific hydrogen bonding patterns, "Watson–Crick" base pairs allow the DNA helix to maintain a regular helical structure that is subtly dependent on its nucleotide sequence. The complementary nature of this based-paired structure provides a redundant copy of the genetic information encoded within each strand of DNA. The regular structure and data redundancy provided by the DNA double helix make DNA well suited to the storage of genetic information, while base-pairing between DNA and incoming nucleotides provides the mechanism through which DNA polymerase replicates DNA and RNA polymerase transcribes DNA into RNA. Many DNA-binding proteins can recognize specific base-pairing patterns that identify particular regulatory regions of genes.

<span class="mw-page-title-main">Genetic code</span> Rules by which information encoded within genetic material is translated into proteins

The genetic code is the set of rules used by living cells to translate information encoded within genetic material into proteins. Translation is accomplished by the ribosome, which links proteinogenic amino acids in an order specified by messenger RNA (mRNA), using transfer RNA (tRNA) molecules to carry amino acids and to read the mRNA three nucleotides at a time. The genetic code is highly similar among all organisms and can be expressed in a simple table with 64 entries.

Nucleotides are organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecules within all life-forms on Earth. Nucleotides are obtained in the diet and are also synthesized from common nutrients by the liver.

Protein biosynthesis is a core biological process, occurring inside cells, balancing the loss of cellular proteins through the production of new proteins. Proteins perform a number of critical functions as enzymes, structural proteins or hormones. Protein synthesis is a very similar process for both prokaryotes and eukaryotes but there are some distinct differences.

Nucleotide bases are nitrogen-containing biological compounds that form nucleosides, which, in turn, are components of nucleotides, with all of these monomers constituting the basic building blocks of nucleic acids. The ability of nucleobases to form base pairs and to stack one upon another leads directly to long-chain helical structures such as ribonucleic acid (RNA) and deoxyribonucleic acid (DNA). Five nucleobases—adenine (A), cytosine (C), guanine (G), thymine (T), and uracil (U)—are called primary or canonical. They function as the fundamental units of the genetic code, with the bases A, G, C, and T being found in DNA while A, G, C, and U are found in RNA. Thymine and uracil are distinguished by merely the presence or absence of a methyl group on the fifth carbon (C5) of these heterocyclic six-membered rings. In addition, some viruses have aminoadenine (Z) instead of adenine. It differs in having an extra amine group, creating a more stable bond to thymine.

In biology, translation is the process in living cells in which proteins are produced using RNA molecules as templates. The generated protein is a sequence of amino acids. This sequence is determined by the sequence of nucleotides in the RNA. The nucleotides are considered three at a time. Each such triple results in addition of one specific amino acid to the protein being generated. The matching from nucleotide triple to amino acid is called the genetic code. The translation is performed by a large complex of functional RNA and proteins called ribosomes. The entire process is called gene expression.

A nucleic acid sequence is a succession of bases within the nucleotides forming alleles within a DNA or RNA (GACU) molecule. This succession is denoted by a series of a set of five different letters that indicate the order of the nucleotides. By convention, sequences are usually presented from the 5' end to the 3' end. For DNA, with its double helix, there are two possible directions for the notated sequence; of these two, the sense strand is used. Because nucleic acids are normally linear (unbranched) polymers, specifying the sequence is equivalent to defining the covalent structure of the entire molecule. For this reason, the nucleic acid sequence is also termed the primary structure.

In biochemistry, a ribonucleotide is a nucleotide containing ribose as its pentose component. It is considered a molecular precursor of nucleic acids. Nucleotides are the basic building blocks of DNA and RNA. Ribonucleotides themselves are basic monomeric building blocks for RNA. Deoxyribonucleotides, formed by reducing ribonucleotides with the enzyme ribonucleotide reductase (RNR), are essential building blocks for DNA. There are several differences between DNA deoxyribonucleotides and RNA ribonucleotides. Successive nucleotides are linked together via phosphodiester bonds.

Transfer RNA is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length. In a cell, it provides the physical link between the genetic code in messenger RNA (mRNA) and the amino acid sequence of proteins, carrying the correct sequence of amino acids to be combined by the protein-synthesizing machinery, the ribosome. Each three-nucleotide codon in mRNA is complemented by a three-nucleotide anticodon in tRNA. As such, tRNAs are a necessary component of translation, the biological synthesis of new proteins in accordance with the genetic code.

The Nirenberg and Matthaei experiment was a scientific experiment performed in May 1961 by Marshall W. Nirenberg and his post-doctoral fellow, J. Heinrich Matthaei, at the National Institutes of Health (NIH). The experiment deciphered the first of the 64 triplet codons in the genetic code by using nucleic acid homopolymers to translate specific amino acids.

An aminoacyl-tRNA synthetase, also called tRNA-ligase, is an enzyme that attaches the appropriate amino acid onto its corresponding tRNA. It does so by catalyzing the transesterification of a specific cognate amino acid or its precursor to one of all its compatible cognate tRNAs to form an aminoacyl-tRNA. In humans, the 20 different types of aa-tRNA are made by the 20 different aminoacyl-tRNA synthetases, one for each amino acid of the genetic code.

Marshall Warren Nirenberg was an American biochemist and geneticist. He shared a Nobel Prize in Physiology or Medicine in 1968 with Har Gobind Khorana and Robert W. Holley for "breaking the genetic code" and describing how it operates in protein synthesis. In the same year, together with Har Gobind Khorana, he was awarded the Louisa Gross Horwitz Prize from Columbia University.

Biosynthesis, i.e., chemical synthesis occurring in biological contexts, is a term most often referring to multi-step, enzyme-catalyzed processes where chemical substances absorbed as nutrients serve as enzyme substrates, with conversion by the living organism either into simpler or more complex products. Examples of biosynthetic pathways include those for the production of amino acids, lipid membrane components, and nucleotides, but also for the production of all classes of biological macromolecules, and of acetyl-coenzyme A, adenosine triphosphate, nicotinamide adenine dinucleotide and other key intermediate and transactional molecules needed for metabolism. Thus, in biosynthesis, any of an array of compounds, from simple to complex, are converted into other compounds, and so it includes both the catabolism and anabolism of complex molecules. Biosynthetic processes are often represented via charts of metabolic pathways. A particular biosynthetic pathway may be located within a single cellular organelle, while others involve enzymes that are located across an array of cellular organelles and structures.

Nucleic acid analogues are compounds which are analogous to naturally occurring RNA and DNA, used in medicine and in molecular biology research. Nucleic acids are chains of nucleotides, which are composed of three parts: a phosphate backbone, a pentose sugar, either ribose or deoxyribose, and one of four nucleobases. An analogue may have any of these altered. Typically the analogue nucleobases confer, among other things, different base pairing and base stacking properties. Examples include universal bases, which can pair with all four canonical bases, and phosphate-sugar backbone analogues such as PNA, which affect the properties of the chain . Nucleic acid analogues are also called xeno nucleic acids and represent one of the main pillars of xenobiology, the design of new-to-nature forms of life based on alternative biochemistries.

Ribosomal frameshifting, also known as translational frameshifting or translational recoding, is a biological phenomenon that occurs during translation that results in the production of multiple, unique proteins from a single mRNA. The process can be programmed by the nucleotide sequence of the mRNA and is sometimes affected by the secondary, 3-dimensional mRNA structure. It has been described mainly in viruses, retrotransposons and bacterial insertion elements, and also in some cellular genes.

An expanded genetic code is an artificially modified genetic code in which one or more specific codons have been re-allocated to encode an amino acid that is not among the 22 common naturally-encoded proteinogenic amino acids.

Amino acid activation refers to the attachment of an amino acid to its respective transfer RNA (tRNA). The reaction occurs in the cell cytosol and consists of two steps: first, the enzyme aminoacyl tRNA synthetase catalyzes the binding of adenosine triphosphate (ATP) to a corresponding amino acid, forming a reactive aminoacyl adenylate intermediate and releasing inorganic pyrophosphate (PP_i). Subsequently, aminoacyl tRNA synthetase binds the AMP-amino acid to a tRNA molecule, releasing AMP and attaching the amino acid to the tRNA. The resulting aminoacyl-tRNA is said to be charged.

The RNA Tie Club was an informal scientific club, meant partly to be humorous, of select scientists who were interested in how proteins were synthesised from genes, specifically the genetic code. It was created by George Gamow upon a suggestion by James Watson in 1954 when the relationship between nucleic acids and amino acids in genetic information was unknown. The club consisted of 20 full members, each representing an amino acid, and four honorary members, representing the four nucleotides. The function of the club members was to think up possible solutions and share with the other members.

An alloprotein is a novel synthetic protein containing one or more "non-natural" amino acids. Non-natural in the context means an amino acid either not occurring in nature, or occurring in nature but not naturally occurring within proteins.

Degeneracy or redundancy of codons is the redundancy of the genetic code, exhibited as the multiplicity of three-base pair codon combinations that specify an amino acid. The degeneracy of the genetic code is what accounts for the existence of synonymous mutations.

References

1 2 Campbell, Neil; Reece, Jane B. (2011). Biology (9th ed.). Boston: Benjamin Cummings. pp. 339–342. ISBN 978-0321558237.
↑ Kuchin, Sergei (19 May 2011). "Covering All the Bases in Genetics: Simple Shorthands and Diagrams for Teaching Base Pairing to Biology Undergraduates". Journal of Microbiology & Biology Education. 12 (1): 64–66. doi:10.1128/jmbe.v12i1.267. PMC 3577215 . PMID 23653747. Archived from the original on 17 October 2013. The correct name of the base in inosine (which is a nucleoside) is hypoxanthine, however, for consistency with the nucleic acid nomenclature, the shorthand [I] is more appropriate...
↑ Lowe, Todd; Chan, Patricia (18 April 2011). "Genomic tRNA Database". University of California, Santa Cruz. Archived from the original on 30 May 2015. Retrieved 31 October 2015.
↑ Crick, F.H.C. (August 1966). "Codon—anticodon pairing: The wobble hypothesis" (PDF). Journal of Molecular Biology. 19 (2): 548–555. CiteSeerX 10.1.1.693.2333 . doi:10.1016/S0022-2836(66)80022-0. PMID 5969078. Archived (PDF) from the original on 4 March 2016. Retrieved 31 October 2015.
↑ Mathews, Christopher K.; Van Holde, K.E.; Appling, Dean; et al., eds. (2012). Biochemistry (4th ed.). Toronto: Prentice Hall. p. 1181. ISBN 978-0-13-800464-4.
↑ Voet, Donald; Voet, Judith (2011). Biochemistry (4th ed.). Hoboken, NJ: John Wiley & Sons. pp. 1360–1361. ISBN 9780470570951.
↑ Varani, Gabriele; McClain, William H (July 2000). "The G·U wobble base pair". EMBO Reports. 1 (1): 18–23. doi:10.1093/embo-reports/kvd001. PMC 1083677 . PMID 11256617.
↑ Cox, Michael M.; Nelson, David L. (2013). "Protein Metabolism: Wobble Allows Some tRNA's to Recognize More than One Codon". Lehninger Principles of Biochemistry (6th ed.). New York: W.H. Freeman. pp. 1108–1110. ISBN 9780716771081 . Retrieved 31 October 2015.
↑ Murphy IV, Frank V; Ramakrishnan, V (21 November 2004). "Structure of a purine-purine wobble base pair in the decoding center of the ribosome". Nature Structural & Molecular Biology. 11 (12): 1251–1252. doi:10.1038/nsmb866. PMID 15558050. S2CID 27022506.
↑ Limmer, S.; Reif, B.; Ott, G.; Arnold, L.; Sprinzl, M. (1996). "NMR evidence for helix geometry modifications by a G-U wobble base pair in the acceptor arm of E. Coli tRNA(Ala)". FEBS Letters. 385 (1–2): 15–20. Bibcode:1996FEBSL.385...15L. doi: 10.1016/0014-5793(96)00339-0 . PMID 8641457.

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[9] These relationships can be further observed, as well as full codons and anticodons in the correct reading frame at: SBDR (2008-04-15). "Genetic Code and Amino Acid Translation". Society for Biomedical Diabetes Research. Archived from the original on 2014-11-04. Retrieved 2014-09-14. For a modern view on the pairings, see doi:10.1093/nar/gkh185.

[Campbell_9th-1] 1 2 Campbell, Neil; Reece, Jane B. (2011). Biology (9th ed.). Boston: Benjamin Cummings. pp. 339–342. ISBN 978-0321558237.

[Kuchin-JMBE-2] Kuchin, Sergei (19 May 2011). "Covering All the Bases in Genetics: Simple Shorthands and Diagrams for Teaching Base Pairing to Biology Undergraduates". Journal of Microbiology & Biology Education. 12 (1): 64–66. doi:10.1128/jmbe.v12i1.267. PMC 3577215 . PMID 23653747. Archived from the original on 17 October 2013. The correct name of the base in inosine (which is a nucleoside) is hypoxanthine, however, for consistency with the nucleic acid nomenclature, the shorthand [I] is more appropriate...

[3] Lowe, Todd; Chan, Patricia (18 April 2011). "Genomic tRNA Database". University of California, Santa Cruz. Archived from the original on 30 May 2015. Retrieved 31 October 2015.

[4] Crick, F.H.C. (August 1966). "Codon—anticodon pairing: The wobble hypothesis" (PDF). Journal of Molecular Biology. 19 (2): 548–555. CiteSeerX 10.1.1.693.2333 . doi:10.1016/S0022-2836(66)80022-0. PMID 5969078. Archived (PDF) from the original on 4 March 2016. Retrieved 31 October 2015.

[5] Mathews, Christopher K.; Van Holde, K.E.; Appling, Dean; et al., eds. (2012). Biochemistry (4th ed.). Toronto: Prentice Hall. p. 1181. ISBN 978-0-13-800464-4.

[6] Voet, Donald; Voet, Judith (2011). Biochemistry (4th ed.). Hoboken, NJ: John Wiley & Sons. pp. 1360–1361. ISBN 9780470570951.

[7] Varani, Gabriele; McClain, William H (July 2000). "The G·U wobble base pair". EMBO Reports. 1 (1): 18–23. doi:10.1093/embo-reports/kvd001. PMC 1083677 . PMID 11256617.

[8] Cox, Michael M.; Nelson, David L. (2013). "Protein Metabolism: Wobble Allows Some tRNA's to Recognize More than One Codon". Lehninger Principles of Biochemistry (6th ed.). New York: W.H. Freeman. pp. 1108–1110. ISBN 9780716771081 . Retrieved 31 October 2015.

[10] Murphy IV, Frank V; Ramakrishnan, V (21 November 2004). "Structure of a purine-purine wobble base pair in the decoding center of the ribosome". Nature Structural & Molecular Biology. 11 (12): 1251–1252. doi:10.1038/nsmb866. PMID 15558050. S2CID 27022506.

[11] Limmer, S.; Reif, B.; Ott, G.; Arnold, L.; Sprinzl, M. (1996). "NMR evidence for helix geometry modifications by a G-U wobble base pair in the acceptor arm of E. Coli tRNA(Ala)". FEBS Letters. 385 (1–2): 15–20. Bibcode:1996FEBSL.385...15L. doi: 10.1016/0014-5793(96)00339-0 . PMID 8641457.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[note 1]

[9]

[10]