The adaptor hypothesis is a theoretical scheme in molecular biology to explain how information encoded in the nucleic acid sequences of messenger RNA (mRNA) is used to specify the amino acids that make up proteins during the process of translation. It was formulated by Francis Crick in 1955 in an informal publication of the RNA Tie Club, and later elaborated in 1957 along with the central dogma of molecular biology and the sequence hypothesis. It was formally published as an article "On protein synthesis" in 1958. The name "adaptor hypothesis" was given by Sydney Brenner.
Crick postulated that there must exist a small molecule to precisely recognise and bind the mRNA sequences while amino acids are being synthesised. The hypothetical adaptor molecule was later established to be a hitherto unknown nucleic acid, transfer RNA (tRNA).
In 1953, English biophysicist Francis Crick and American biologist James Watson, working together at the Cavendish Laboratory of the University of Cambridge, worked out the correct description of the structure of DNA, one of the major genetic materials. [1] In their follow-up paper the same year, they introduced the concept of genetic information alongside the notion that DNA and protein cloud be related. [2] By 1954, it was becoming to be understood that DNA, RNA (only messenger RNA was understood at the time, but only as a vague nucleic acid, and identified as such only in 1960 [3] ) and proteins were related as components of the same genetic information pathway. However, the structure of RNA and details of how these biological molecules related and interact with each other were still a mystery, especially on how proteins could be synthesised from nucleic acids. [4] Watson called this problem "the mysteries of life" in his letter to Crick. [3] Watson and Alexander Rich discussed in the PNAS , saying, "We shall not be able to check a structural relationship between RNA and protein synthesis or between RNA and DNA until we know the structure of RNA." [5]
Evidences had been accumulating since the 1940s that protein synthesis occurs simultaneously with increased level of RNA in the cytoplasm. [6] [7] [8] The relationship between DNA and RNA for protein synthesis was first hypothesised by French biologist André Boivin and Roger Vendrely in 1947. They wrote in Experientia (later renamed Cellular and Molecular Life Sciences ): "Through catalytic actions the macromolecular desoxyribonucleic acids govern the building of macromolecular ribonucleic acids, and, in turn, these control the production of cytoplasmic enzymes. In truth, the enzymic equipment results simultaneously from the effect of ribonucleic acids (catalytic action) and from the effect of substrates (mass action). [9]
Alexander Dounce, of the University of Rochester Medical Center in New York, came close to the correct interpretation of protein synthesis from RNA. [10] In 1952, he proposed the first notion, naming it the "nucleic acid template hypotheses," [11] that proteins were made from RNA, rather than from DNA. However, since the exact chemical structure of DNA or RNA was not known at the time, he suggested erroneously that proteins and RNA have complimentary sequences, as he described in Enzymologia: "the specific arrangement of amino acid residues in a given peptide chain is derived from the specific arrangement of nucleotide residues in a corresponding specific nucleic acid molecule." [12] It was met with criticism in Nature the next year, [13] to which Dounce defended his hypothesis:
In the hypothesis proposed by me, ribonucleic acid was deliberately chosen as the template, since it seems probable from a number of considerations and experimental investigations that protein syn- thesis may occur in cytoplasm as well as in nuclei. There can be no objection at the present time to assuming that the genes are templates, but it is not necessary to assume that the genes act directly as templates for protein synthesis. [11]
Dounce was also close to a correct and fundamental explanation of the gene protein relationship, as he said: "the relationship between plasma genes and nuclear genes [postulated by Sol Spiegelman and Martin David Kamen in 1946 [14] ] could be explained by the template sequence deoxyribonucleic acid - ribonucleic acid - protein, with the additional postulate of a co-existing independent ribonucleic acid synthesis with ribonucleic acid acting as its own template." [11] The correct interpretation of genetic information transfer (DNA → RNA → protein) developed by Crick in 1957 became the central dogma of molecular biology. [15] [16]
Soviet-American physicist George Gamow was the first to come up with a coherent scheme for protein synthesis from DNA. [17] Based on the Watson-Crick model, he envisaged that the DNA itself is a direct template for protein synthesis. [18] Assuming that the four bases of DNA could produce 20 different combinations as triplets, he suggested that the different amino acids must correspond to a twenty-letter alphabet of the nucleotide sequence. [19] In such configuration, the DNA directly produces proteins from the free molecules of amino acids. [20] In the 13 February 1954 issue of Nature he explained:
It seems to me that such translation procedure can be easily established by considering the 'key-and-lock' relation between various amino-acids, and the rhomb-shaped 'holes' formed by various nucleotides in the deoxyribonucleic acid chain... One can speculate that free amino-acids from the surrounding medium get caught into the 'holes' of deoxyribonucleic acid molecules, and thus unite into the corresponding peptide chains. [21]
Watson, then at the California Institute of Technology in Massachusetts, also thought up the issue. [22] Unlike Gamow, he realised the possible importance of RNA as an intermediate stage in protein synthesis. [3] He initially imagined that DNA was first converted by chemical reaction to RNA, but then changed his view to DNA as a template for RNA synthesis. [22] This latter model, which he called "not ugly" was starting to become more convincing. In a letter to Crick on 11 December 1954, discussing how RNA could be produced from DNA since they contain fundamentally similar nucleotide compositions, he wrote: "I suspect the answer is staring us in the face." [22]
To solve the riddle of protein synthesis, Gamow (as the Synthesiser) created an informal team of scientists which he called the RNA Tide Club in 1954. As his prediction to denote 20 amino acids, the club could have only 20 members, with the designated officers: Crick (as the Pessimist), Rich (the Lord Privy Seal) and Watson (the Optimist) among them. It was meant partly to be humorous. [23]
Coming across Gamow's paper, Crick immediately noticed some issues in the scheme. The first was that he and Watson never had before counted the 20 amino acids or the possible 20 nucleotide triplets [19] (although Gamow was subsequently proven to be correct and his model was the first prediction of the genetic code [17] [23] ). The other issue was that RNA and protein synthesis appeared to exclusively take place in the cytoplasm, and not in the nucleus, indicating that DNA to protein pathway requires just not direct chemical binding, but some kind of mediator must be involved. [10]
Crick began to think of a solution to this problem in the line of Watson's idea that the role of RNA was overlooked. He rejected Gamow's assumption that molecules like amino acids that are too different chemically from nucleic acid that it would be structurally impossible to bind to DNA. Instead, he visualised a certain kind but unknown molecule that linked nucleic acids and carry amino acids to ribosomes to form peptide chains. [3] As he discussed it with his colleague Sydney Brenner about the features of such molecule, Brenner was impressed with the idea and named the concept as the "adaptor hypothesis". [24] The adaptor was then a hypothetical molecule that apparently help in an indirect interaction between DNA, RNA and amino acids during protein synthesis. [3] As Crick explained:
I don't think that anybody looking at DNA or RNA would think of them as templates for amino acids were it not for other, indirect evidence... [In one possible scheme] each amino acid would combine chemically, at a special enzyme, with a small molecule which, having a specific hydrogen-bonding surface, would combine specifically with the nucleic acid template, This combination would also supply the energy necessary for polymerisation. In its simplest form there would be 20 different kinds of adaptor molecule, one for each amino acid, and 20 different enzymes to join the amino acid to their adaptors, Sydney Brenner, with whom I have discussed this idea, calls this the "adaptor hypothesis", since each amino acid is fitted with an adaptor to go on to the template... The adaptor hypothesis implies that the actual set of twenty amino acids found in proteins is due either to a historical accident or to biological selection at an extremely primitive stage." [24]
Crick admitted that he had no way knowing what kind of chemical substance was an adaptor, but then prudently suggested that it was composed of nucleotides, a sort of RNA, that had common features of both DNA and RNA. [3] He even speculated that "insulin, for example, are probably RNA-made proteins. Perhaps a special class of DNA-made proteins exists, almost always in small quantities (and thus normally overlooked). [24]
The template could consist of perhaps a single chain of RNA…Each adaptor molecule containing, say, a di- or trinucleotide would each be joined to its own amino acid by a special enzyme. These molecules would then diffuse to the microsomal particles and attach to the proper place on the basis of the RNA by base-pairing.
The article circulated to the members of the RNA Tie Club in January 1955 as "On Degenerate Templates and the Adaptor Hypothesis: A Note for the RNA Tie Club" is described as "one of the most important unpublished articles in the history of science", [25] [26] and "the most famous unpublished paper in the annals of molecular biology." [27]
The adaptor hypothesis was fully formalised in 1957. Crick presented the elaborate explanation in a lecture titled "Protein synthesis" on 19 September 1957 at the Society for Experimental Biology symposium on the Biological Replication of Macromolecules, held at University College London.
The lecture was further expanded and published as "On protein synthesis" in 1958, which science historian Horace Freeland Judson remarked: "The paper permanently altered the logic of biology." [28] This is because it was the first comprehensive insight into genetic information (later called the central dogma of molecular biology), protein synthesis (known as the sequence hypothesis), the role of RNA (the adaptor hypothesis) as well as the existence of genetic code. [3]
The adaptor hypothesis was framed to explain how information could be extracted from a nucleic acid and used to put together a string of amino acids in a specific sequence, that sequence being determined by the nucleotide sequence of the nucleic acid (DNA or RNA) template. Crick proposed that each amino acid is first attached to its own specific “adaptor” piece of nucleic acid (in an enzyme-catalysed reaction). The order of assembly of the amino acids is then determined by a specific recognition between the adaptor and the nucleic acid which is serving as the informational template. In this way the amino acids could be lined up by the template in a specific order. Coupling between adjacent amino acids would then lead to the synthesis of a polypeptide whose sequence is determined by the template nucleic acid.
Crick's thinking behind this proposal was based on a general consideration of the chemical properties of the two classes of molecule — nucleic acids and proteins. The amino acids are characterised by having a variety of side chains which vary from being hydrophilic to hydrophobic: their individual characters reside in the very different properties these side chains have. By contrast, a nucleic acid is composed of a string of nucleotides whose sequence presents a geometrically defined surface for hydrogen bonding. This makes nucleic acids good at recognising each other, but poor at distinguishing the varied side chains of amino acids. It was this apparent lack of any possibility of specific recognition of amino acid side chains by a nucleotide sequence which led Crick to conclude that amino acids would first become attached to a small nucleic acid — the adaptor — and that this, by base-pairing with the template (presumably as occurs between DNA strands in the double helix), would carry the amino acids to be lined up on the template.
Gamow was not entirely wrong. Although his concept about the direct synthesis of amino acid from the DNA double strands was proven false, his prediction on the nature of amino acids and how they were coded by the DNA sequences (triplet nucleotides) were confirmed the key components of protein synthesis. [17] This broad concept is now known as the genetic code. It was Crick, along with J.S. Griffith and Leslie Orgel (member of the RNA Tie Club), describing the 20-amino acid prediction as the "magic number," introduced the proper concept in 1957:
[The] order of the amino acids is determined by the order of the nucleotides of the nucleic acid. There are some twenty naturally occurring amino acids commonly found in proteins, but (usually) only four different nucleotides. The problem of how a sequence of four things (nucleotides) can determine a sequence of twenty things (amino acids) is known as the 'coding' problem. [29]
As Crick was lecturing on his hypothesis, such adaptors do exist in nature was already discovered by the team of Mahlon Hoagland and Paul Zamecnik, whose paper was published the following year in March 1958. [30] These “soluble RNAs” are now called transfer RNAs and mediate the translation of messenger RNAs on ribosomes according to the rules contained in the genetic code. Crick imagined that his adaptors would be small, perhaps 5-10 nucleotides long. In fact, they are much larger, having a more complex role to play in protein synthesis, and are closer to 100 nucleotides in length.
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA and RNA. Dictated by specific hydrogen bonding patterns, "Watson–Crick" base pairs allow the DNA helix to maintain a regular helical structure that is subtly dependent on its nucleotide sequence. The complementary nature of this based-paired structure provides a redundant copy of the genetic information encoded within each strand of DNA. The regular structure and data redundancy provided by the DNA double helix make DNA well suited to the storage of genetic information, while base-pairing between DNA and incoming nucleotides provides the mechanism through which DNA polymerase replicates DNA and RNA polymerase transcribes DNA into RNA. Many DNA-binding proteins can recognize specific base-pairing patterns that identify particular regulatory regions of genes.
Deoxyribonucleic acid is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of all known organisms and many viruses. DNA and ribonucleic acid (RNA) are nucleic acids. Alongside proteins, lipids and complex carbohydrates (polysaccharides), nucleic acids are one of the four major types of macromolecules that are essential for all known forms of life.
Francis Harry Compton Crick was an English molecular biologist, biophysicist, and neuroscientist. He, James Watson, Rosalind Franklin, and Maurice Wilkins played crucial roles in deciphering the helical structure of the DNA molecule.
The genetic code is the set of rules used by living cells to translate information encoded within genetic material into proteins. Translation is accomplished by the ribosome, which links proteinogenic amino acids in an order specified by messenger RNA (mRNA), using transfer RNA (tRNA) molecules to carry amino acids and to read the mRNA three nucleotides at a time. The genetic code is highly similar among all organisms and can be expressed in a simple table with 64 entries.
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). If the sugar is ribose, the polymer is RNA; if the sugar is deoxyribose, a variant of ribose, the polymer is DNA.
The RNA world is a hypothetical stage in the evolutionary history of life on Earth, in which self-replicating RNA molecules proliferated before the evolution of DNA and proteins. The term also refers to the hypothesis that posits the existence of this stage.
The central dogma of molecular biology is an explanation of the flow of genetic information within a biological system. It is often stated as "DNA makes RNA, and RNA makes protein", although this is not its original meaning. It was first stated by Francis Crick in 1957, then published in 1958:
The Central Dogma. This states that once "information" has passed into protein it cannot get out again. In more detail, the transfer of information from nucleic acid to nucleic acid, or from nucleic acid to protein may be possible, but transfer from protein to protein, or from protein to nucleic acid is impossible. Information here means the precise determination of sequence, either of bases in the nucleic acid or of amino acid residues in the protein.
A nucleic acid sequence is a succession of bases within the nucleotides forming alleles within a DNA or RNA (GACU) molecule. This succession is denoted by a series of a set of five different letters that indicate the order of the nucleotides. By convention, sequences are usually presented from the 5' end to the 3' end. For DNA, with its double helix, there are two possible directions for the notated sequence; of these two, the sense strand is used. Because nucleic acids are normally linear (unbranched) polymers, specifying the sequence is equivalent to defining the covalent structure of the entire molecule. For this reason, the nucleic acid sequence is also termed the primary structure.
Leslie Eleazer Orgel FRS was a British chemist. He is known for his theories on the origin of life.
A wobble base pair is a pairing between two nucleotides in RNA molecules that does not follow Watson-Crick base pair rules. The four main wobble base pairs are guanine-uracil (G-U), hypoxanthine-uracil (I-U), hypoxanthine-adenine (I-A), and hypoxanthine-cytosine (I-C). In order to maintain consistency of nucleic acid nomenclature, "I" is used for hypoxanthine because hypoxanthine is the nucleobase of inosine; nomenclature otherwise follows the names of nucleobases and their corresponding nucleosides. The thermodynamic stability of a wobble base pair is comparable to that of a Watson-Crick base pair. Wobble base pairs are fundamental in RNA secondary structure and are critical for the proper translation of the genetic code.
The Nirenberg and Matthaei experiment was a scientific experiment performed in May 1961 by Marshall W. Nirenberg and his post-doctoral fellow, J. Heinrich Matthaei, at the National Institutes of Health (NIH). The experiment deciphered the first of the 64 triplet codons in the genetic code by using nucleic acid homopolymers to translate specific amino acids.
The Nirenberg and Leder experiment was a scientific experiment performed in 1964 by Marshall W. Nirenberg and Philip Leder. The experiment elucidated the triplet nature of the genetic code and allowed the remaining ambiguous codons in the genetic code to be deciphered.
The history of molecular biology begins in the 1930s with the convergence of various, previously distinct biological and physical disciplines: biochemistry, genetics, microbiology, virology and physics. With the hope of understanding life at its most fundamental level, numerous physicists and chemists also took an interest in what would become molecular biology.
In molecular biology and genetics, the sense of a nucleic acid molecule, particularly of a strand of DNA or RNA, refers to the nature of the roles of the strand and its complement in specifying a sequence of amino acids. Depending on the context, sense may have slightly different meanings. For example, the negative-sense strand of DNA is equivalent to the template strand, whereas the positive-sense strand is the non-template strand whose nucleotide sequence is equivalent to the sequence of the mRNA transcript.
The sequence hypothesis was first formally proposed in the review "On Protein Synthesis" by Francis Crick in 1958. It states that the sequence of bases in the genetic material determines the sequence of amino acids for which that segment of nucleic acid codes, and this amino acid sequence determines the three-dimensional structure into which the protein folds. The three-dimensional structure of a protein is required for a protein to be functional. This hypothesis then lays the essential link between information stored and inherited in nucleic acids to the chemical processes which enable life to exist.
The RNA Tie Club was an informal scientific club, meant partly to be humorous, of select scientists who were interested in how proteins were synthesised from genes, specifically the genetic code. It was created by George Gamow upon a suggestion by James Watson in 1954 when the relationship between nucleic acids and amino acids in genetic information was unknown. The club consisted of 20 full members, each representing an amino acid, and four honorary members, representing the four nucleotides. The function of the club members was to think up possible solutions and share with the other members.
Numerous key discoveries in biology have emerged from studies of RNA, including seminal work in the fields of biochemistry, genetics, microbiology, molecular biology, molecular evolution and structural biology. As of 2010, 30 scientists have been awarded Nobel Prizes for experimental work that includes studies of RNA. Specific discoveries of high biological significance are discussed in this article.
Alexander Latham Dounce was an American professor of biochemistry. Among his fields of study were the isolation and purification of cellular organelles, protein crystallization, enzymes, DNA binding proteins, and the chemical basis of protein synthesis. He also invented the Dounce homogenizer, which was named after him.
In molecular biology, hybridization is a phenomenon in which single-stranded deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) molecules anneal to complementary DNA or RNA. Though a double-stranded DNA sequence is generally stable under physiological conditions, changing these conditions in the laboratory will cause the molecules to separate into single strands. These strands are complementary to each other but may also be complementary to other sequences present in their surroundings. Lowering the surrounding temperature allows the single-stranded molecules to anneal or “hybridize” to each other.
The polyelectrolyte theory of the gene proposes that for a linear genetic biopolymer dissolved in water, such as DNA, to undergo Darwinian evolution anywhere in the universe, it must be a polyelectrolyte, a polymer containing repeating ionic charges. These charges maintain the uniform physical properties needed for Darwinian evolution, regardless of the information encoded in the genetic biopolymer. DNA is such a molecule. Regardless of its nucleic acid sequence, the negative charges on its backbone dominate the physical interactions of the molecule to such a degree that it maintains uniform physical properties such as its aqueous solubility and double-helix structure.