Extensins are a family of flexuous, rodlike, hydroxyproline-rich glycoproteins (HRGPs) of the plant cell wall. [1] [2]
They are highly abundant proteins. There are around 20 extensins in Arabidopsis thaliana . They form crosslinked networks in the young cell wall. Typically they have two major diagnostic repetitive peptide motifs, one hydrophilic and the other hydrophobic, with potential for crosslinking. Extensins are thought to act as self-assembling amphiphiles [3] essential for cell-wall assembly and growth by cell extension and expansion. The name "extensin" encapsulates the hypothesis that they are involved in cell extension. [4]
This pentapeptide consists of serine (Ser) and four hydroxyprolines (Hyp): Ser-Hyp-Hyp-Hyp-Hyp. [5] [6] [7] Hydroxyproline is unusual not only as a cyclic amino acid that restricts peptide flexibility but as an amino acid with no codon, being encoded as proline. Polypeptides targeted for secretion are subsequently hydroxylated by direct addition of molecular oxygen to proline at C-4. [4] Extensin hydroxyproline is uniquely glycosylated with short chains of L-arabinose [8] that further rigidify [9] and increase hydrophilicity. Generally the serine has a single galactose attached. [10]
Two tyrosines separated by a single amino acid, typically valine or another tyrosine, form a short intra-molecular diphenylether crosslink. [11] This can be crosslinked further by the enzyme extensin peroxidase [12] [13] [14] to form an inter-molecular bridge between extensin molecules and thus form networks and sheets.
Amino acids are organic compounds that contain amine (–NH2) and carboxyl (–COOH) functional groups, along with a side chain (R group) specific to each amino acid. The key elements of an amino acid are carbon (C), hydrogen (H), oxygen (O), and nitrogen (N), although other elements are found in the side chains of certain amino acids. About 500 naturally occurring amino acids are known (though only 20 appear in the genetic code) and can be classified in many ways. They can be classified according to the core structural functional groups' locations as alpha- (α-), beta- (β-), gamma- (γ-) or delta- (δ-) amino acids; other categories relate to polarity, pH level, and side chain group type (aliphatic, acyclic, aromatic, containing hydroxyl or sulfur, etc.). In the form of proteins, amino acid residues form the second-largest component (water is the largest) of human muscles and other tissues. Beyond their role as residues in proteins, amino acids participate in a number of processes such as neurotransmitter transport and biosynthesis.
Collagen is the main structural protein in the extracellular matrix found in the body's various connective tissues. As the main component of connective tissue, it is the most abundant protein in mammals, making up from 25% to 35% of the whole-body protein content. Collagen consists of amino acids bound together to form a triple helix of elongated fibril known as a collagen helix. It is mostly found in connective tissue such as cartilage, bones, tendons, ligaments, and skin.
A cell wall is a structural layer surrounding some types of cells, just outside the cell membrane. It can be tough, flexible, and sometimes rigid. It provides the cell with both structural support and protection, and also acts as a filtering mechanism. Cell walls are present in most prokaryotes, in algae, fungi and eukaryotes including plants but are absent in animals. A major function is to act as pressure vessels, preventing over-expansion of the cell when water enters.
Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesis is most commonly performed by ribosomes in cells. Peptides can also be synthesized in the laboratory. Protein primary structures can be directly sequenced, or inferred from DNA sequences.
Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group -NH
2 but is rather a secondary amine. The secondary amine nitrogen is in the protonated NH2+ form under biological conditions, while the carboxy group is in the deprotonated −COO− form. The "side chain" from the α carbon connects to the nitrogen forming a pyrrolidine loop, classifying it as a aliphatic amino acid. It is non-essential in humans, meaning the body can synthesize it from the non-essential amino acid L-glutamate. It is encoded by all the codons starting with CC (CCU, CCC, CCA, and CCG).
Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula HOOC-CH-(NH2)-CH2-SH. The thiol side chain in cysteine often participates in enzymatic reactions, as a nucleophile. The thiol is susceptible to oxidation to give the disulfide derivative cystine, which serves an important structural role in many proteins. When used as a food additive, it has the E number E920. It is encoded by the codons UGU and UGC.
(2S,4R)-4-Hydroxyproline, or L-hydroxyproline (C5H9O3N), is an amino acid, abbreviated as Hyp or O, e.g., in Protein Data Bank.
Hydroxylation is a chemical process that introduces a hydroxyl group (-OH) into an organic compound. In biochemistry, hydroxylation reactions are often facilitated by enzymes called hydroxylases. Hydroxylation is the first step in the oxidative degradation of organic compounds in air. It is extremely important in detoxification since hydroxylation converts lipophilic compounds into water-soluble (hydrophilic) products that are more readily removed by the kidneys or liver and excreted. Some drugs are activated or deactivated by hydroxylation.
Proteinogenic amino acids are amino acids that are incorporated biosynthetically into proteins during translation. The word "proteinogenic" means "protein creating". Throughout known life, there are 22 genetically encoded (proteinogenic) amino acids, 20 in the standard genetic code and an additional 2 that can be incorporated by special translation mechanisms.
An oligosaccharide is a saccharide polymer containing a small number of monosaccharides. Oligosaccharides can have many functions including cell recognition and cell binding. For example, glycolipids have an important role in the immune response.
DD-transpeptidase is a bacterial enzyme that catalyzes the transfer of the R-L-aca-D-alanyl moiety of R-L-aca-D-alanyl-D-alanine carbonyl donors to the γ-OH of their active-site serine and from this to a final acceptor. It is involved in bacterial cell wall biosynthesis, namely, the transpeptidation that crosslinks the peptide side chains of peptidoglycan strands.
Resilin is an elastomeric protein found in many insects and other arthropods. It provides soft rubber-elasticity to mechanically active organs and tissue; for example, it enables insects of many species to jump or pivot their wings efficiently. Resilin was first discovered by Torkel Weis-Fogh in locust wing-hinges.
Systemin is a plant peptide hormone involved in the wound response in the family Solanaceae. It was the first plant hormone that was proven to be a peptide having been isolated from tomato leaves in 1991 by a group led by Clarence A. Ryan. Since then, other peptides with similar functions have been identified in tomato and outside of the Solanaceae. Hydroxyproline-rich glycopeptides were found in tobacco in 2001 and AtPEPs were found in Arabidopsis thaliana in 2006. Their precursors are found both in the cytoplasm and cell walls of plant cells, upon insect damage, the precursors are processed to produce one or more mature peptides. The receptor for systemin was first thought to be the same as the brassinolide receptor but this is now uncertain. The signal transduction processes that occur after the peptides bind are similar to the cytokine-mediated inflammatory immune response in animals. Early experiments showed that systemin travelled around the plant after insects had damaged the plant, activating systemic acquired resistance, now it is thought that it increases the production of jasmonic acid causing the same result. The main function of systemins is to coordinate defensive responses against insect herbivores but they also affect plant development. Systemin induces the production of protease inhibitors which protect against insect herbivores, other peptides activate defensins and modify root growth. They have also been shown to affect plants' responses to salt stress and UV radiation. AtPEPs have been shown to affect resistance against oomycetes and may allow A. thaliana to distinguish between different pathogens. In Nicotiana attenuata, some of the peptides have stopped being involved in defensive roles and instead affect flower morphology.
Procollagen-proline dioxygenase, commonly known as prolyl hydroxylase, is a member of the class of enzymes known as alpha-ketoglutarate-dependent hydroxylases. These enzymes catalyze the incorporation of oxygen into organic substrates through a mechanism that requires alpha-Ketoglutaric acid, Fe2+, and ascorbate. This particular enzyme catalyzes the formation of (2S, 4R)-4-hydroxyproline, a compound that represents the most prevalent post-translational modification in the human proteome.
T-cell surface glycoprotein CD3 gamma chain is a protein that in humans is encoded by the CD3G gene.
Glycopeptides are peptides that contain carbohydrate moieties (glycans) covalently attached to the side chains of the amino acid residues that constitute the peptide.
Arabinogalactan-proteins (AGP) are members of the hydroxyproline (Hyp)-rich cell wall glycoprotein superfamily, and are extensively glycosylated, meaning that each one consists of a protein molecule attached to sugar molecules.
In molecular biology, the protein domain SdrG C terminal refers to the C terminus domain of an adhesin found only on the cell walls of bacteria. More specifically, SdrG is only found in gram-positive bacteria. This particular domain binds to a glycoprotein named fibrinogen. SdrG stands for serine-aspartate dipeptide repeats, which as its name suggests, contains repeats of two amino acids, serine and aspartate.
In biochemistry, non-coded or non-proteinogenic amino acids are those not naturally encoded or found in the genetic code of any organism. Despite the use of only 22 amino acids by the translational machinery to assemble proteins, over 140 amino acids are known to occur naturally in proteins and thousands more may occur in nature or be synthesized in the laboratory. Many non-proteinogenic amino acids are noteworthy because they are;
Zingibain, zingipain, or ginger protease is a cysteine protease enzyme found in ginger rhizomes. It catalyses the preferential cleavage of peptides with a proline residue at the P2 position. It has two distinct forms, ginger protease I (GP-I) and ginger protease II (GP-II).