The beta hairpin (sometimes also called beta-ribbon or beta-beta unit) is a simple protein structural motif involving two beta strands that look like a hairpin. The motif consists of two strands that are adjacent in primary structure, oriented in an antiparallel direction (the N-terminus of one sheet is adjacent to the C-terminus of the next), and linked by a short loop of two to five amino acids. Beta hairpins can occur in isolation or as part of a series of hydrogen bonded strands that collectively comprise a beta sheet.
Researchers such as Francisco Blanco et al. have used protein NMR to show that beta-hairpins can be formed from isolated short peptides in aqueous solution, suggesting that hairpins could form nucleation sites for protein folding. [1]
Beta hairpins were originally categorized solely by the number of amino acid residues in their loop sequences, such that they were named one-residue, two-residue, etc. [2] This system, however, is somewhat ambiguous as it does not take into account whether the residues that signal the end of the hairpin are singly or doubly hydrogen bonded to one another. An improved means of classification has since been proposed by Milner-White and Poet. [3] Beta hairpins are broken into four distinct classes as depicted in the publication's Figure 1. Each class begins with the smallest possible number of loop residues and progressively increases the loop size by removing hydrogen bonds in the beta sheet. The primary hairpin of class 1 is a one-residue loop where the bound residues share two hydrogen bonds. One hydrogen bond is then removed to create a three-residue loop, which is the secondary hairpin of class 1. Singly bound residues are counted in the loop sequence but also signal the end of the loop, thus defining this hairpin as a three-residue loop. This single hydrogen bond is then removed to create the tertiary hairpin; a five-residue loop with doubly bound residues. This pattern continues indefinitely and defines all beta hairpins within the class. Class 2 follows the same pattern beginning with a two-residue loop with terminating residues that share two hydrogen bonds. Class 3 begins with a three-residue, and class 4 with a four-residue. Class 5 does not exist as that primary hairpin is already defined in class 1. Pi This classification scheme not only accounts for various degrees of hydrogen bonding, but also says something about the biological behavior of the hairpin. Single amino acid replacements may destroy a particular hydrogen bond, but will not unfold the hairpin or change its class. On the other hand, amino acid insertions and deletions will have to unfold and reform the entire beta strand in order to avoid a beta bulge in the secondary structure. This will change the class of the hairpin in the process. As substitutions are the most common amino acid mutations, a protein could potentially undergo a conversion without affecting the functionality of the beta hairpin. [3]
Understanding the mechanism through which micro-domains fold can help to shed light onto the folding patterns of whole proteins. Studies of a beta hairpin called chignolin (see Chignolin on Proteopedia) have uncovered a stepwise folding process that drives beta-hairpin folding. This hairpin has sequence features similar to over 13,000 known hairpins, and thus may serve as a more general model for beta hairpin formation. The formation of a native turn region signals the folding cascade to start, where a native turn is one that is present in the final folded structure.
In the folding of overall proteins, the turn may originate not in the native turn region but in the C-strand of the beta-hairpin. This turn then propagates through the C-strand (the beta strand leading to C-terminus) until it reaches the native turn region. Sometimes the residue interactions leading up to the native turn region are too strong, causing reverse propagation. However, once the native turn does form, interactions between prolines and tryptophan residues (seen in image at right) in the region help to stabilize the turn, preventing "roll back" or dissolution.
Researchers believe that turns do not originate in the N-strand, due to increased rigidity (often caused by a proline leading up to the native turn region) and less conformational options. The initial turn formation takes place in about 1 μs. Once the initial turn has been established, two mechanisms have been proposed as to how the rest of the beta-hairpin folds: a hydrophobic collapse with side-chain level rearrangements, or the more accepted zipper-like mechanism. [4]
The β-hairpin loop motif can be found in many macromolecular proteins. However, small and simple β-hairpins can exist on their own as well. To see this clearly, the Pin1 Domain protein is shown to the left as an example.
Proteins that are β-sheet rich, also called WW domains, function by adhering to proline-rich and/or phosphorylated peptides to mediate protein–protein interactions. The "WW" refers to two tryptophan (W) residues that are conserved within the sequence and aid in the folding of the β-sheets to produce a small hydrophobic core. [5] These tryptophan residues can be seen below (right) in red.
This enzyme binds its ligand through van der Waals forces of the conserved tryptophans and the proline-rich areas of the ligand. Other amino acids can then associate with the hydrophobic core of the β-hairpin structure to enforce secure binding. [6]
It is also common to find proline residues within the actual loop portion of the β-hairpin, since this amino acid is rigid and contributes to the "turn" formation. These proline residues can be seen as red side chains in the image of the Pin1 WW domain below (left).
The design of peptides that adopt β-hairpin structure (without relying on metal binding, unusual amino acids, or disulfide crosslinks) has made significant progress and yielded insights into protein dynamics. Unlike α-helices, β-hairpins are not stabilized by a regular hydrogen bonding pattern. As a result, early attempts required at least 20–30 amino acid residues to attain stable tertiary folds of β-hairpins. However, this lower limit was reduced to 12 amino acids by the stability gains conferred by the incorporation of tryptophan-tryptophan cross-strand pairs. Two nonhydrogen-bonding tryptophan pairs have been shown to interlock in a zipper-like motif, stabilizing the β-hairpin structure while still allowing it to remain water-soluble. The NMR structure of a tryptophan zipper (trpzip) β-peptide shows the stabilizing effect of favorable interactions between adjacent indole rings. [7]
The synthesis of trpzip β-hairpin peptides has incorporated photoswitches that facilitate precise control over folding. Several amino acids in the turn are replaced by azobenzene, which can be induced to switch from the trans to the cis conformation by light at 360 nm. When the azobenzene moiety is in the cis conformation, the amino acid residues align correctly to adopt a β-hairpin formation. However, the trans conformation does not have proper turn geometry for the β-hairpin. [8] This phenomenon can be used to investigate peptide conformational dynamics with femtosecond absorption spectroscopy. [8]
The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues earlier along the protein sequence.
The beta sheet, (β-sheet) is a common motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a generally twisted, pleated sheet. A β-strand is a stretch of polypeptide chain typically 3 to 10 amino acids long with backbone in an extended conformation. The supramolecular association of β-sheets has been implicated in the formation of the fibrils and protein aggregates observed in amyloidosis, notably Alzheimer's disease.
Protein secondary structure is the three dimensional form of local segments of proteins. The two most common secondary structural elements are alpha helices and beta sheets, though beta turns and omega loops occur as well. Secondary structure elements typically spontaneously form as an intermediate before the protein folds into its three dimensional tertiary structure.
Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group -NH
2 but is rather a secondary amine. The secondary amine nitrogen is in the protonated NH2+ form under biological conditions, while the carboxy group is in the deprotonated −COO− form. The "side chain" from the α carbon connects to the nitrogen forming a pyrrolidine loop, classifying it as a aliphatic amino acid. It is non-essential in humans, meaning the body can synthesize it from the non-essential amino acid L-glutamate. It is encoded by all the codons starting with CC (CCU, CCC, CCA, and CCG).
Protein structure prediction is the inference of the three-dimensional structure of a protein from its amino acid sequence—that is, the prediction of its secondary and tertiary structure from primary structure. Structure prediction is different from the inverse problem of protein design. Protein structure prediction is one of the most important goals pursued by computational biology; and it is important in medicine and biotechnology.
In a chain-like biological molecule, such as a protein or nucleic acid, a structural motif is a common three-dimensional structure that appears in a variety of different, evolutionarily unrelated molecules. A structural motif does not have to be associated with a sequence motif; it can be represented by different and completely unrelated sequences in different proteins or RNA.
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers – specifically polypeptides – formed from sequences of amino acids, the monomers of the polymer. A single amino acid monomer may also be called a residue indicating a repeating unit of a polymer. Proteins form by amino acids undergoing condensation reactions, in which the amino acids lose one water molecule per reaction in order to attach to one another with a peptide bond. By convention, a chain under 30 amino acids is often identified as a peptide, rather than a protein. To be able to perform their biological function, proteins fold into one or more specific spatial conformations driven by a number of non-covalent interactions such as hydrogen bonding, ionic interactions, Van der Waals forces, and hydrophobic packing. To understand the functions of proteins at a molecular level, it is often necessary to determine their three-dimensional structure. This is the topic of the scientific field of structural biology, which employs techniques such as X-ray crystallography, NMR spectroscopy, cryo electron microscopy (cryo-EM) and dual polarisation interferometry to determine the structure of proteins.
A supersecondary structure is a compact three-dimensional protein structure of several adjacent elements of a secondary structure that is smaller than a protein domain or a subunit. Supersecondary structures can act as nucleations in the process of protein folding.
In protein structures, a beta barrel is a beta-sheet composed of tandem repeats that twists and coils to form a closed toroidal structure in which the first strand is bonded to the last strand. Beta-strands in many beta-barrels are arranged in an antiparallel fashion. Beta barrel structures are named for resemblance to the barrels used to contain liquids. Most of them are water-soluble proteins and frequently bind hydrophobic ligands in the barrel center, as in lipocalins. Others span cell membranes and commonly found in porins. Porin-like barrel structures are encoded by as many as 2–3% of the genes in Gram-negative bacteria.
A turn is an element of secondary structure in proteins where the polypeptide chain reverses its overall direction.
Bovine pancreatic ribonuclease, also often referred to as bovine pancreatic ribonuclease A or simply RNase A, is a pancreatic ribonuclease enzyme that cleaves single-stranded RNA. Bovine pancreatic ribonuclease is one of the classic model systems of protein science. Two Nobel Prizes in Chemistry have been awarded in recognition of work on bovine pancreatic ribonuclease: in 1972, the Prize was awarded to Christian Anfinsen for his work on protein folding and to Stanford Moore and William Stein for their work on the relationship between the protein's structure and its chemical mechanism; in 1984, the Prize was awarded to Robert Bruce Merrifield for development of chemical synthesis of proteins.
A beta bulge can be described as a localized disruption of the regular hydrogen bonding of beta sheet by inserting extra residues into one or both hydrogen bonded β-strands.
A polyproline helix is a type of protein secondary structure which occurs in proteins comprising repeating proline residues. A left-handed polyproline II helix is formed when sequential residues all adopt (φ,ψ) backbone dihedral angles of roughly and have trans isomers of their peptide bonds. This PPII conformation is also common in proteins and polypeptides with other amino acids apart from proline. Similarly, a more compact right-handed polyproline I helix is formed when sequential residues all adopt (φ,ψ) backbone dihedral angles of roughly and have cis isomers of their peptide bonds. Of the twenty common naturally occurring amino acids, only proline is likely to adopt the cis isomer of the peptide bond, specifically the X-Pro peptide bond; steric and electronic factors heavily favor the trans isomer in most other peptide bonds. However, peptide bonds that replace proline with another N-substituted amino acid are also likely to adopt the cis isomer.
Prolyl isomerase is an enzyme found in both prokaryotes and eukaryotes that interconverts the cis and trans isomers of peptide bonds with the amino acid proline. Proline has an unusually conformationally restrained peptide bond due to its cyclic structure with its side chain bonded to its secondary amine nitrogen. Most amino acids have a strong energetic preference for the trans peptide bond conformation due to steric hindrance, but proline's unusual structure stabilizes the cis form so that both isomers are populated under biologically relevant conditions. Proteins with prolyl isomerase activity include cyclophilin, FKBPs, and parvulin, although larger proteins can also contain prolyl isomerase domains.
A leucine-rich repeat (LRR) is a protein structural motif that forms an α/β horseshoe fold. It is composed of repeating 20–30 amino acid stretches that are unusually rich in the hydrophobic amino acid leucine. These tandem repeats commonly fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Typically, each repeat unit has beta strand-turn-alpha helix structure, and the assembled domain, composed of many such repeats, has a horseshoe shape with an interior parallel beta sheet and an exterior array of helices. One face of the beta sheet and one side of the helix array are exposed to solvent and are therefore dominated by hydrophilic residues. The region between the helices and sheets is the protein's hydrophobic core and is tightly sterically packed with leucine residues.
Protegrins are small peptides containing 16-18 amino acid residues. Protegrins were first discovered in porcine leukocytes and were found to have antimicrobial activity against bacteria, fungi, and some enveloped viruses. The amino acid composition of protegrins contains six positively charged arginine residues and four cysteine residues. Their secondary structure is classified as cysteine-rich β-sheet antimicrobial peptides, AMPs, that display limited sequence similarity to certain defensins and tachyplesins. In solution, the peptides fold to form an anti-parallel β-strand with the structure stabilized by two cysteine bridges formed among the four cysteine residues. Recent studies suggest that protegrins can bind to lipopolysaccharide, a property that may help them to insert into the membranes of gram-negative bacteria and permeabilize them.
β turns are the most common form of turns—a type of non-regular secondary structure in proteins that cause a change in direction of the polypeptide chain. They are very common motifs in proteins and polypeptides. Each consists of four amino acid residues. They can be defined in two ways: 1. By the possession of an intra-main-chain hydrogen bond between the CO of residue i and the NH of residue i+3; Alternatively, 2. By having a distance of less than 7Å between the Cα atoms of residues i and i+3. The hydrogen bond criterion is the one most appropriate for everyday use, partly because it gives rise to four distinct categories; the distance criterion gives rise to the same four categories but yields additional turn types.
The Nest is a type of protein structural motif. It is a small recurring anion-binding feature of both proteins and peptides. Each consists of the main chain atoms of three consecutive amino acid residues. The main chain NH groups bind the anions while the side chain atoms are often not involved. Proline residues lack NH groups so are rare in nests. About one in 12 of amino acid residues in proteins, on average, belongs to a nest.
Schellman loops are commonly occurring structural features of proteins and polypeptides. Each has six amino acid residues with two specific inter-mainchain hydrogen bonds and a characteristic main chain dihedral angle conformation. The CO group of residue i is hydrogen-bonded to the NH of residue i+5, and the CO group of residue i+1 is hydrogen-bonded to the NH of residue i+4. Residues i+1, i+2, and i+3 have negative φ (phi) angle values and the phi value of residue i+4 is positive. Schellman loops incorporate a three amino acid residue RL nest, in which three mainchain NH groups form a concavity for hydrogen bonding to carbonyl oxygens. About 2.5% of amino acids in proteins belong to Schellman loops. Two websites are available for examining small motifs in proteins, Motivated Proteins: ; or PDBeMotif:.
Beta bulge loops are commonly occurring motifs in proteins and polypeptides consisting of five to six amino acids. There are two types: type 1, which is a pentapeptide; and type 2, with six amino acids. They are regarded as a type of beta bulge, and have the alternative name of type G1 beta bulge. Compared to other beta bulges, beta bulge loops give rise to chain reversal such that they often occur at the loop ends of beta hairpins; hairpins of this sort can be described as 3:5 or 4:6. Two websites are available for finding and examining β bulge loops in proteins, Motivated Proteins: and PDBeMotif:.