Protein quaternary structure [lower-alpha 1] is the fourth (and highest) classification level of protein structure. Protein quaternary structure refers to the structure of proteins which are themselves composed of two or more smaller protein chains (also referred to as subunits). Protein quaternary structure describes the number and arrangement of multiple folded protein subunits in a multi-subunit complex. It includes organizations from simple dimers to large homooligomers and complexes with defined or variable numbers of subunits. [1] In contrast to the first three levels of protein structure, not all proteins will have a quaternary structure since some proteins function as single units. Protein quaternary structure can also refer to biomolecular complexes of proteins with nucleic acids and other cofactors.
Many proteins are actually assemblies of multiple polypeptide chains. The quaternary structure refers to the number and arrangement of the protein subunits with respect to one another. [2] Examples of proteins with quaternary structure include hemoglobin, DNA polymerase, ribosomes, antibodies, and ion channels.
Enzymes composed of subunits with diverse functions are sometimes called holoenzymes, in which some parts may be known as regulatory subunits and the functional core is known as the catalytic subunit. Other assemblies referred to instead as multiprotein complexes also possess quaternary structure. Examples include nucleosomes and microtubules. Changes in quaternary structure can occur through conformational changes within individual subunits or through reorientation of the subunits relative to each other. It is through such changes, which underlie cooperativity and allostery in "multimeric" enzymes, that many proteins undergo regulation and perform their physiological function.
The above definition follows a classical approach to biochemistry, established at times when the distinction between a protein and a functional, proteinaceous unit was difficult to elucidate. More recently, people refer to protein–protein interaction when discussing quaternary structure of proteins and consider all assemblies of proteins as protein complexes.
The number of subunits in an oligomeric complex is described using names that end in -mer (Greek for "part, subunit"). Formal and Greco-Latinate names are generally used for the first ten types and can be used for up to twenty subunits, whereas higher order complexes are usually described by the number of subunits, followed by -meric.
|
|
The smallest unit forming a homo-oligomer, i.e. one protein chain or subunit, is designated as a monomer, subunit or protomer. The latter term was originally devised to specify the smallest unit of hetero-oligomeric proteins, but is also applied to homo-oligomeric proteins in current literature. The subunits usually arrange in cyclic symmetry to form closed point group symmetries.
Although complexes higher than octamers are rarely observed for most proteins, there are some important exceptions. Viral capsids are often composed of multiples of 60 proteins. Several molecular machines are also found in the cell, such as the proteasome (four heptameric rings = 28 subunits), the transcription complex and the spliceosome. The ribosome is probably the largest molecular machine, and is composed of many RNA and protein molecules.
In some cases, proteins form complexes that then assemble into even larger complexes. In such cases, one uses the nomenclature, e.g., "dimer of dimers" or "trimer of dimers". This may suggest that the complex might dissociate into smaller sub-complexes before dissociating into monomers. This usually implies that the complex consists of different oligomerisation interfaces. For example, a tetrameric protein may have one four-fold rotation axis, i.e. point group symmetry 4 or C4. In this case the four interfaces between the subunits are identical. It may also have point group symmetry 222 or D2. This tetramer has different interfaces and the tetramer can dissociate into two identical homodimers. Tetramers of 222 symmetry are "dimer of dimers". Hexamers of 32 point group symmetry are "trimer of dimers" or "dimer of trimers". Thus, the nomenclature "dimer of dimers" is used to specify the point group symmetry or arrangement of the oligomer, independent of information relating to its dissociation properties.
Another distinction often made when referring to oligomers is whether they are homomeric or heteromeric, referring to whether the smaller protein subunits that come together to make the protein complex are the same (homomeric) or different (heteromeric) from each other. For example, two identical protein monomers would come together to form a homo-dimer, whereas two different protein monomers would create a hetero-dimer.
Protein quaternary structure can be determined using a variety of experimental techniques that require a sample of protein in a variety of experimental conditions. The experiments often provide an estimate of the mass of the native protein and, together with knowledge of the masses and/or stoichiometry of the subunits, allow the quaternary structure to be predicted with a given accuracy. It is not always possible to obtain a precise determination of the subunit composition for a variety of reasons.
The number of subunits in a protein complex can often be determined by measuring the hydrodynamic molecular volume or mass of the intact complex, which requires native solution conditions. For folded proteins, the mass can be inferred from its volume using the partial specific volume of 0.73 ml/g. However, volume measurements are less certain than mass measurements, since unfolded proteins appear to have a much larger volume than folded proteins; additional experiments are required to determine whether a protein is unfolded or has formed an oligomer.
Methods that measure the mass or volume under unfolding conditions (such as MALDI-TOF mass spectrometry and SDS-PAGE) are generally not useful, since non-native conditions usually cause the complex to dissociate into monomers. However, these may sometimes be applicable; for example, the experimenter may apply SDS-PAGE after first treating the intact complex with chemical cross-link reagents.
Some bioinformatics methods have been developed for predicting the quaternary structural attributes of proteins based on their sequence information by using various modes of pseudo amino acid composition. [2] [8] [9]
Protein folding prediction programs used to predict protein tertiary structure have also been expanding to better predict protein quaternary structure. One such development is AlphaFold-Multimer [10] built upon the AlphaFold model for predicting protein tertiary structure.
Protein quaternary structure also plays an important role in certain cell signaling pathways. The G-protein coupled receptor pathway involves a heterotrimeric protein known as a G-protein. G-proteins contain three distinct subunits known as the G-alpha, G-beta, and G-gamma subunits. When the G-protein is activated, it binds to the G-protein coupled receptor protein and the cell signaling pathway is initiated. Another example is the receptor tyrosine kinase (RTK) pathway, which is initiated by the dimerization of two receptor tyrosine kinase monomers. When the dimer is formed, the two kinases can phosphorylate each other and initiate a cell signaling pathway. [11]
Proteins are capable of forming very tight but also only transient complexes. For example, ribonuclease inhibitor binds to ribonuclease A with a roughly 20 fM dissociation constant. Other proteins have evolved to bind specifically to unusual moieties on another protein, e.g., biotin groups (avidin), phosphorylated tyrosines (SH2 domains) or proline-rich segments (SH3 domains). Protein–protein interactions can be engineered to favor certain oligomerization states. [12]
When multiple copies of a polypeptide encoded by a gene form a quaternary complex, this protein structure is referred to as a multimer. [13] When a multimer is formed from polypeptides produced by two different mutant alleles of a particular gene, the mixed multimer may exhibit greater functional activity than the unmixed multimers formed by each of the mutants alone. In such a case, the phenomenon is referred to as intragenic complementation (also called inter-allelic complementation). Intragenic complementation appears to be common and has been studied in many different genes in a variety of organisms including the fungi Neurospora crassa , Saccharomyces cerevisiae and Schizosaccharomyces pombe ; the bacterium Salmonella typhimurium; the virus bacteriophage T4, [14] an RNA virus, [15] and humans. [16] The intermolecular forces likely responsible for self-recognition and multimer formation were discussed by Jehle. [17]
Direct interaction of two nascent proteins emerging from nearby ribosomes appears to be a general mechanism for oligomer formation. [18] Hundreds of protein oligomers were identified that assemble in human cells by such an interaction. [18] The most prevalent form of interaction was between the N-terminal regions of the interacting proteins. Dimer formation appears to be able to occur independently of dedicated assembly machines.
In biochemistry, allosteric regulation is the regulation of an enzyme by binding an effector molecule at a site other than the enzyme's active site.
In chemistry, dimerization is the process of joining two identical or similar molecular entities by bonds. The resulting bonds can be either strong or weak. Many symmetrical chemical species are described as dimers, even when the monomer is unknown or highly unstable.
A protein complex or multiprotein complex is a group of two or more associated polypeptide chains. Protein complexes are distinct from multidomain enzymes, in which multiple catalytic domains are found in a single polypeptide chain.
The insulin receptor (IR) is a transmembrane receptor that is activated by insulin, IGF-I, IGF-II and belongs to the large class of receptor tyrosine kinase. Metabolically, the insulin receptor plays a key role in the regulation of glucose homeostasis; a functional process that under degenerate conditions may result in a range of clinical manifestations including diabetes and cancer. Insulin signalling controls access to blood glucose in body cells. When insulin falls, especially in those with high insulin sensitivity, body cells begin only to have access to lipids that do not require transport across the membrane. So, in this way, insulin is the key regulator of fat metabolism as well. Biochemically, the insulin receptor is encoded by a single gene INSR, from which alternate splicing during transcription results in either IR-A or IR-B isoforms. Downstream post-translational events of either isoform result in the formation of a proteolytically cleaved α and β subunit, which upon combination are ultimately capable of homo or hetero-dimerisation to produce the ≈320 kDa disulfide-linked transmembrane insulin receptor.
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers – specifically polypeptides – formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid monomer may also be called a residue, which indicates a repeating unit of a polymer. Proteins form by amino acids undergoing condensation reactions, in which the amino acids lose one water molecule per reaction in order to attach to one another with a peptide bond. By convention, a chain under 30 amino acids is often identified as a peptide, rather than a protein. To be able to perform their biological function, proteins fold into one or more specific spatial conformations driven by a number of non-covalent interactions, such as hydrogen bonding, ionic interactions, Van der Waals forces, and hydrophobic packing. To understand the functions of proteins at a molecular level, it is often necessary to determine their three-dimensional structure. This is the topic of the scientific field of structural biology, which employs techniques such as X-ray crystallography, NMR spectroscopy, cryo-electron microscopy (cryo-EM) and dual polarisation interferometry, to determine the structure of proteins.
Streptavidin is a 52 kDa protein (tetramer) purified from the bacterium Streptomyces avidinii. Streptavidin homo-tetramers have an extraordinarily high affinity for biotin. With a dissociation constant (Kd) on the order of ≈10−14 mol/L, the binding of biotin to streptavidin is one of the strongest non-covalent interactions known in nature. Streptavidin is used extensively in molecular biology and bionanotechnology due to the streptavidin-biotin complex's resistance to organic solvents, denaturants, detergents, proteolytic enzymes, and extremes of temperature and pH.
Protein–protein interactions (PPIs) are physical contacts of high specificity established between two or more protein molecules as a result of biochemical events steered by interactions that include electrostatic forces, hydrogen bonding and the hydrophobic effect. Many are physical contacts with molecular associations between chains that occur in a cell or in a living organism in a specific biomolecular context.
A dodecameric protein has a quaternary structure consisting of 12 protein subunits in a complex. Dodecameric complexes can have a number of subunit 'topologies', but typically only a few of the theoretically possible subunit arrangements are observed in protein structures.
A tetrameric protein is a protein with a quaternary structure of four subunits (tetrameric). Homotetramers have four identical subunits, and heterotetramers are complexes of different subunits. A tetramer can be assembled as dimer of dimers with two homodimer subunits, or two heterodimer subunits.
The enzyme argininosuccinate lyase (EC 4.3.2.1, ASL, argininosuccinase; systematic name 2-(N ω-L-arginino)succinate arginine-lyase (fumarate-forming)) catalyzes the reversible breakdown of argininosuccinate:
In biochemistry, a protein trimer is a macromolecular complex formed by three, usually non-covalently bound, macromolecules like proteins or nucleic acids. A protein trimer often occurs from the assembly of a protein's quaternary structure. The non-covalent interactions between the hydrophobic and hydrophilic regions on the polypeptides units help to stabilize the quaternary structure. Since a protein trimer is composed of multiple polypeptide subunits, it is considered an oligomer.
Heat shock protein 27 (Hsp27) also known as heat shock protein beta-1 (HSPB1) is a protein that in humans is encoded by the HSPB1 gene.
Propionyl-CoA carboxylase (EC 6.4.1.3, PCC) catalyses the carboxylation reaction of propionyl-CoA in the mitochondrial matrix. PCC has been classified both as a ligase and a lyase. The enzyme is biotin-dependent. The product of the reaction is (S)-methylmalonyl CoA.
Malate dehydrogenase, mitochondrial also known as malate dehydrogenase 2 is an enzyme that in humans is encoded by the MDH2 gene.
Morpheeins are proteins that can form two or more different homo-oligomers, but must come apart and change shape to convert between forms. The alternate shape may reassemble to a different oligomer. The shape of the subunit dictates which oligomer is formed. Each oligomer has a finite number of subunits (stoichiometry). Morpheeins can interconvert between forms under physiological conditions and can exist as an equilibrium of different oligomers. These oligomers are physiologically relevant and are not misfolded protein; this distinguishes morpheeins from prions and amyloid. The different oligomers have distinct functionality. Interconversion of morpheein forms can be a structural basis for allosteric regulation, an idea noted many years ago, and later revived. A mutation that shifts the normal equilibrium of morpheein forms can serve as the basis for a conformational disease. Features of morpheeins can be exploited for drug discovery. The dice image represents a morpheein equilibrium containing two different monomeric shapes that dictate assembly to a tetramer or a pentamer. The one protein that is established to function as a morpheein is porphobilinogen synthase, though there are suggestions throughout the literature that other proteins may function as morpheeins.
The G beta-gamma complex (Gβγ) is a tightly bound dimeric protein complex, composed of one Gβ and one Gγ subunit, and is a component of heterotrimeric G proteins. Heterotrimeric G proteins, also called guanosine nucleotide-binding proteins, consist of three subunits, called alpha, beta, and gamma subunits, or Gα, Gβ, and Gγ. When a G protein-coupled receptor (GPCR) is activated, Gα dissociates from Gβγ, allowing both subunits to perform their respective downstream signaling effects. One of the major functions of Gβγ is the inhibition of the Gα subunit.
In biochemistry, a protein dimer is a macromolecular complex or multimer formed by two protein monomers, or single proteins, which are usually non-covalently bound. Many macromolecules, such as proteins or nucleic acids, form dimers. The word dimer has roots meaning "two parts", di- + -mer. A protein dimer is a type of protein quaternary structure.
Nucleic acidquaternary structure refers to the interactions between separate nucleic acid molecules, or between nucleic acid molecules and proteins. The concept is analogous to protein quaternary structure, but as the analogy is not perfect, the term is used to refer to a number of different concepts in nucleic acids and is less commonly encountered. Similarly other biomolecules such as proteins, nucleic acids have four levels of structural arrangement: primary, secondary, tertiary, and quaternary structure. Primary structure is the linear sequence of nucleotides, secondary structure involves small local folding motifs, and tertiary structure is the 3D folded shape of nucleic acid molecule. In general, quaternary structure refers to 3D interactions between multiple subunits. In the case of nucleic acids, quaternary structure refers to interactions between multiple nucleic acid molecules or between nucleic acids and proteins. Nucleic acid quaternary structure is important for understanding DNA, RNA, and gene expression because quaternary structure can impact function. For example, when DNA is packed into heterochromatin, therefore exhibiting a type of quaternary structure, gene transcription will be inhibited.
Autophosphorylation is a type of post-translational modification of proteins. It is generally defined as the phosphorylation of the kinase by itself. In eukaryotes, this process occurs by the addition of a phosphate group to serine, threonine or tyrosine residues within protein kinases, normally to regulate the catalytic activity. Autophosphorylation may occur when a kinases' own active site catalyzes the phosphorylation reaction, or when another kinase of the same type provides the active site that carries out the chemistry. The latter often occurs when kinase molecules dimerize. In general, the phosphate groups introduced are gamma phosphates from nucleoside triphosphates, most commonly ATP.
A GPCR oligomer is a protein complex that consists of a small number of G protein-coupled receptors (GPCRs). It is held together by covalent bonds or by intermolecular forces. The subunits within this complex are called protomers, while unconnected receptors are called monomers. Receptor homomers consist of identical protomers, while heteromers consist of different protomers.
Section 4: Nuclear Magnetic Resonance Spectroscopy