Sequence hypothesis

Last updated

The sequence hypothesis was first formally proposed in the review "On Protein Synthesis" by Francis Crick in 1958. [1] It states that the sequence of bases in the genetic material (DNA or RNA) determines the sequence of amino acids for which that segment of nucleic acid codes, and this amino acid sequence determines the three-dimensional structure into which the protein folds. The three-dimensional structure of a protein is required for a protein to be functional. This hypothesis then lays the essential link between information stored and inherited in nucleic acids to the chemical processes which enable life to exist. [2]

Or, as Crick put it in 1958:

In its simplest form it [the Sequence Hypothesis] assumes that the specificity of a piece of nucleic acid is expressed solely by the sequence of its bases, and that this sequence is a (simple) code for the amino acid sequence of a particular protein. This hypothesis appears to be rather widely held. Its virtue is that it unites several remarkable pairs of generalisations: the central biochemical importance of proteins and the dominating role of genes, and in particular of their nucleic acid; the linearity of protein molecules (considered covalently) and the genetic linearity within the functional gene [...]; the simplicity of the composition of protein molecules and the simplicity of the nucleic acids.

[1] :152

This description is further amplified in the article and, in discussing how a protein folds up into its three-dimensional structure, Crick suggested that "the folding is simply a function of the order of the amino acids" in the protein. [1] :144

Related Research Articles

<span class="mw-page-title-main">Base pair</span> Unit consisting of two nucleobases bound to each other by hydrogen bonds

A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA and RNA. Dictated by specific hydrogen bonding patterns, "Watson–Crick" base pairs allow the DNA helix to maintain a regular helical structure that is subtly dependent on its nucleotide sequence. The complementary nature of this based-paired structure provides a redundant copy of the genetic information encoded within each strand of DNA. The regular structure and data redundancy provided by the DNA double helix make DNA well suited to the storage of genetic information, while base-pairing between DNA and incoming nucleotides provides the mechanism through which DNA polymerase replicates DNA and RNA polymerase transcribes DNA into RNA. Many DNA-binding proteins can recognize specific base-pairing patterns that identify particular regulatory regions of genes.

<span class="mw-page-title-main">Genetic code</span> Rules by which information encoded within genetic material is translated into proteins

The genetic code is the set of rules used by living cells to translate information encoded within genetic material into proteins. Translation is accomplished by the ribosome, which links proteinogenic amino acids in an order specified by messenger RNA (mRNA), using transfer RNA (tRNA) molecules to carry amino acids and to read the mRNA three nucleotides at a time. The genetic code is highly similar among all organisms and can be expressed in a simple table with 64 entries.

<span class="mw-page-title-main">Nucleic acid</span> Class of large biomolecules essential to all known life

Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). If the sugar is ribose, the polymer is RNA; if the sugar is deoxyribose, a variant of ribose, the polymer is DNA.

<span class="mw-page-title-main">Protein</span> Biomolecule consisting of chains of amino acid residues

Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, responding to stimuli, providing structure to cells and organisms, and transporting molecules from one location to another. Proteins differ from one another primarily in their sequence of amino acids, which is dictated by the nucleotide sequence of their genes, and which usually results in protein folding into a specific 3D structure that determines its activity.

<span class="mw-page-title-main">Protein biosynthesis</span> Assembly of proteins inside biological cells

Protein biosynthesis is a core biological process, occurring inside cells, balancing the loss of cellular proteins through the production of new proteins. Proteins perform a number of critical functions as enzymes, structural proteins or hormones. Protein synthesis is a very similar process for both prokaryotes and eukaryotes but there are some distinct differences.

The central dogma of molecular biology deals with the flow of genetic information within a biological system. It is often stated as "DNA makes RNA, and RNA makes protein", although this is not its original meaning. It was first stated by Francis Crick in 1957, then published in 1958:

The Central Dogma. This states that once "information" has passed into protein it cannot get out again. In more detail, the transfer of information from nucleic acid to nucleic acid, or from nucleic acid to protein may be possible, but transfer from protein to protein, or from protein to nucleic acid is impossible. Information here means the precise determination of sequence, either of bases in the nucleic acid or of amino acid residues in the protein.

<span class="mw-page-title-main">Macromolecule</span> Very large molecule, such as a protein

A macromolecule is a very large molecule important to biological processes, such as a protein or nucleic acid. It is composed of thousands of covalently bonded atoms. Many macromolecules are polymers of smaller molecules called monomers. The most common macromolecules in biochemistry are biopolymers and large non-polymeric molecules such as lipids, nanogels and macrocycles. Synthetic fibers and experimental materials such as carbon nanotubes are also examples of macromolecules.

<span class="mw-page-title-main">Nucleic acid sequence</span> Succession of nucleotides in a nucleic acid

A nucleic acid sequence is a succession of bases within the nucleotides forming alleles within a DNA or RNA (GACU) molecule. This succession is denoted by a series of a set of five different letters that indicate the order of the nucleotides. By convention, sequences are usually presented from the 5' end to the 3' end. For DNA, with its double helix, there are two possible directions for the notated sequence; of these two, the sense strand is used. Because nucleic acids are normally linear (unbranched) polymers, specifying the sequence is equivalent to defining the covalent structure of the entire molecule. For this reason, the nucleic acid sequence is also termed the primary structure.

<span class="mw-page-title-main">Molecular genetics</span> Scientific study of genes at the molecular level

Molecular genetics is a branch of biology that addresses how differences in the structures or expression of DNA molecules manifests as variation among organisms. Molecular genetics often applies an "investigative approach" to determine the structure and/or function of genes in an organism's genome using genetic screens. 

<span class="mw-page-title-main">Wobble base pair</span> RNA base pair that does not follow Watson-Crick base pair rules

A wobble base pair is a pairing between two nucleotides in RNA molecules that does not follow Watson-Crick base pair rules. The four main wobble base pairs are guanine-uracil (G-U), hypoxanthine-uracil (I-U), hypoxanthine-adenine (I-A), and hypoxanthine-cytosine (I-C). In order to maintain consistency of nucleic acid nomenclature, "I" is used for hypoxanthine because hypoxanthine is the nucleobase of inosine; nomenclature otherwise follows the names of nucleobases and their corresponding nucleosides. The thermodynamic stability of a wobble base pair is comparable to that of a Watson-Crick base pair. Wobble base pairs are fundamental in RNA secondary structure and are critical for the proper translation of the genetic code.

<span class="mw-page-title-main">Nirenberg and Matthaei experiment</span> 1961 scientific experiment instrumental in deciphering the genetic code

The Nirenberg and Matthaei experiment was a scientific experiment performed in May 1961 by Marshall W. Nirenberg and his post-doctoral fellow, J. Heinrich Matthaei, at the National Institutes of Health (NIH). The experiment deciphered the first of the 64 triplet codons in the genetic code by using nucleic acid homopolymers to translate specific amino acids.

<span class="mw-page-title-main">Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid</span> 1953 scientific paper on DNA

"Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid" was the first article published to describe the discovery of the double helix structure of DNA, using X-ray diffraction and the mathematics of a helix transform. It was published by Francis Crick and James D. Watson in the scientific journal Nature on pages 737–738 of its 171st volume.

A gene product is the biochemical material, either RNA or protein, resulting from the expression of a gene. A measurement of the amount of gene product is sometimes used to infer how active a gene is. Abnormal amounts of gene product can be correlated with disease-causing alleles, such as the overactivity of oncogenes, which can cause cancer. A gene is defined as "a hereditary unit of DNA that is required to produce a functional product". Regulatory elements include:

The history of molecular biology begins in the 1930s with the convergence of various, previously distinct biological and physical disciplines: biochemistry, genetics, microbiology, virology and physics. With the hope of understanding life at its most fundamental level, numerous physicists and chemists also took an interest in what would become molecular biology.

<span class="mw-page-title-main">Adaptor hypothesis</span>

The adaptor hypothesis is a theoretical scheme in molecular biology to explain how information encoded in the nucleic acid sequences of messenger RNA (mRNA) is used to specify the amino acids that make up proteins during the process of translation. It was formulated by Francis Crick in 1955 in an informal publication of the RNA Tie Club, and later elaborated in 1957 along with the central dogma of molecular biology and the sequence hypothesis. It was formally published as an article "On protein synthesis" in 1958. The name "adaptor hypothesis" was given by Sydney Brenner.

RNA-based evolution is a theory that posits that RNA is not merely an intermediate between Watson and Crick model of the DNA molecule and proteins, but rather a far more dynamic and independent role-player in determining phenotype. By regulating the transcription in DNA sequences, the stability of RNA, and the capability of messenger RNA to be translated, RNA processing events allow for a diverse array of proteins to be synthesized from a single gene. Since RNA processing is heritable, it is subject to natural selection suggested by Darwin and contributes to the evolution and diversity of most eukaryotic organisms.

The RNA Tie Club was an informal scientific club, meant partly to be humorous, of select scientists who were interested in how proteins were synthesised from genes, specifically the genetic code. It was created by George Gamow upon a suggestion by James Watson in 1954 when the relationship between nucleic acids and amino acids in genetic information was unknown. The club consisted of 20 full members, each representing an amino acid, and four honorary members, representing the four nucleotides. The function of the club members was to think up possible solutions and share with the other members.

Numerous key discoveries in biology have emerged from studies of RNA, including seminal work in the fields of biochemistry, genetics, microbiology, molecular biology, molecular evolution, and structural biology. As of 2010, 30 scientists have been awarded Nobel Prizes for experimental work that includes studies of RNA. Specific discoveries of high biological significance are discussed in this article.

This glossary of cellular and molecular biology is a list of definitions of terms and concepts commonly used in the study of cell biology, molecular biology, and related disciplines, including molecular genetics, biochemistry, and microbiology. It is split across two articles:

The polyelectrolyte theory of the gene proposes that for a linear genetic biopolymer dissolved in water, such as DNA, to undergo Darwinian evolution anywhere in the universe, it must be a polyelectrolyte, a polymer containing repeating ionic charges. These charges maintain the uniform physical properties needed for Darwinian evolution, regardless of the information encoded in the genetic biopolymer. DNA is such a molecule. Regardless of its nucleic acid sequence, the negative charges on its backbone dominate the physical interactions of the molecule to such a degree that it maintains uniform physical properties such as its aqueous solubility and double-helix structure.

References

  1. 1 2 3 Crick, F. H. (1958). "On protein synthesis". Symposia of the Society for Experimental Biology. 12: 138–163. PMID   13580867.
  2. Brenner, Sydney (2014). "Retrospective Frederick Sanger (1918-2013)". Science. 343 (6168): 262. Bibcode:2014Sci...343..262B. doi: 10.1126/science.1249912 . PMID   24436413. S2CID   45663657.

See also