Armadillo repeat

Last updated
Armadillo repeat domain
PDB 3bct EBI.jpg
Structure of the armadillo domain of β-catenin. [1]
Identifiers
SymbolArm
Pfam PF00514
Pfam clan CL0020
InterPro IPR000225
SMART SM00185
PROSITE PS50176
SCOP2 3bct / SCOPe / SUPFAM
CDD cd00020
Membranome 350
Available protein structures:
Pfam   structures / ECOD  
PDB RCSB PDB; PDBe; PDBj
PDBsum structure summary

An armadillo repeat is a characteristic, repetitive amino acid sequence of about 42 residues in length that is found in many proteins. Proteins that contain armadillo repeats typically contain several tandemly repeated copies. [2] [3] Each armadillo repeat is composed of a pair of alpha helices that form a hairpin structure. Multiple copies of the repeat form what is known as an alpha solenoid structure.

Contents

Examples of proteins that contain armadillo repeats include β-catenin, Sarm1 (SARM1), [4] α-importin, [5] plakoglobin, [6] adenomatous polyposis coli (APC), [7] and many others.

The term armadillo derives from the historical name of the β-catenin gene in the fruitfly Drosophila where the armadillo repeat was first discovered. Although β-catenin was previously believed to be a protein involved in linking cadherin cell adhesion proteins to the cytoskeleton, recent work indicates that β-catenin regulates the homodimerization of alpha-catenin, which in turn controls actin branching and bundling. [8] But, the armadillo repeat is found in a wide range of proteins with other functions. This type of protein domain is important in transducing WNT signals during embryonic development.

Structure

The 3-dimensional fold of an armadillo repeat was first observed in the crystal structure of β-catenin, where the 12 tandem repeats form a superhelix of alpha helices with three helices per unit. [1] The cylindrical structure features a positively charged groove, which presumably interacts with the acidic surfaces of the known interaction partners of β-catenin. [9]

Related Research Articles

<span class="mw-page-title-main">SH3 domain</span> Small protein domain found in some kinases and GTPases

The SRC Homology 3 Domain is a small protein domain of about 60 amino acid residues. Initially, SH3 was described as a conserved sequence in the viral adaptor protein v-Crk. This domain is also present in the molecules of phospholipase and several cytoplasmic tyrosine kinases such as Abl and Src. It has also been identified in several other protein families such as: PI3 Kinase, Ras GTPase-activating protein, CDC24 and cdc25. SH3 domains are found in proteins of signaling pathways regulating the cytoskeleton, the Ras protein, and the Src kinase and many others. The SH3 proteins interact with adaptor proteins and tyrosine kinases. Interacting with tyrosine kinases, SH3 proteins usually bind far away from the active site. Approximately 300 SH3 domains are found in proteins encoded in the human genome. In addition to that, the SH3 domain was responsible for controlling protein-protein interactions in the signal transduction pathways and regulating the interactions of proteins involved in the cytoplasmic signaling.

<span class="mw-page-title-main">Desmosome</span> Cell junction involved in cell-to-cell adhesion

A desmosome, also known as a macula adherens, is a cell structure specialized for cell-to-cell adhesion. A type of junctional complex, they are localized spot-like adhesions randomly arranged on the lateral sides of plasma membranes. Desmosomes are one of the stronger cell-to-cell adhesion types and are found in tissue that experience intense mechanical stress, such as cardiac muscle tissue, bladder tissue, gastrointestinal mucosa, and epithelia.

<span class="mw-page-title-main">Cadherin</span>

Cadherins (named for "calcium-dependent adhesion") are cell adhesion molecules important in forming adherens junctions that let cells adhere to each other. Cadherins are a class of type-1 transmembrane proteins, and they depend on calcium (Ca2+) ions to function, hence their name. Cell-cell adhesion is mediated by extracellular cadherin domains, whereas the intracellular cytoplasmic tail associates with numerous adaptors and signaling proteins, collectively referred to as the cadherin adhesome.

A nuclear localization signalorsequence (NLS) is an amino acid sequence that 'tags' a protein for import into the cell nucleus by nuclear transport. Typically, this signal consists of one or more short sequences of positively charged lysines or arginines exposed on the protein surface. Different nuclear localized proteins may share the same NLS. An NLS has the opposite function of a nuclear export signal (NES), which targets proteins out of the nucleus.

<span class="mw-page-title-main">Catenin</span>

Catenins are a family of proteins found in complexes with cadherin cell adhesion molecules of animal cells. The first two catenins that were identified became known as α-catenin and β-catenin. α-Catenin can bind to β-catenin and can also bind filamentous actin (F-actin). β-Catenin binds directly to the cytoplasmic tail of classical cadherins. Additional catenins such as γ-catenin and δ-catenin have been identified. The name "catenin" was originally selected because it was suspected that catenins might link cadherins to the cytoskeleton.

<span class="mw-page-title-main">Desmoglein-2</span> Protein found in humans

Desmoglein-2 is a protein that in humans is encoded by the DSG2 gene. Desmoglein-2 is highly expressed in epithelial cells and cardiomyocytes. Desmoglein-2 is localized to desmosome structures at regions of cell-cell contact and functions to structurally adhere adjacent cells together. In cardiac muscle, these regions are specialized regions known as intercalated discs. Mutations in desmoglein-2 have been associated with arrhythmogenic right ventricular cardiomyopathy and familial dilated cardiomyopathy.

<span class="mw-page-title-main">Catenin beta-1</span> Mammalian protein found in Homo sapiens

Catenin beta-1, also known as β-catenin (beta-catenin), is a protein that in humans is encoded by the CTNNB1 gene.

Importin is a type of karyopherin that transports protein molecules from the cell's cytoplasm to the nucleus. It does so by binding to specific recognition sequences, called nuclear localization sequences (NLS).

<span class="mw-page-title-main">Alpha solenoid</span>

An alpha solenoid is a protein fold composed of repeating alpha helix subunits, commonly helix-turn-helix motifs, arranged in antiparallel fashion to form a superhelix. Alpha solenoids are known for their flexibility and plasticity. Like beta propellers, alpha solenoids are a form of solenoid protein domain commonly found in the proteins comprising the nuclear pore complex. They are also common in membrane coat proteins known as coatomers, such as clathrin, and in regulatory proteins that form extensive protein-protein interactions with their binding partners. Examples of alpha solenoid structures binding RNA and lipids have also been described.

<span class="mw-page-title-main">Leucine-rich repeat</span>

A leucine-rich repeat (LRR) is a protein structural motif that forms an α/β horseshoe fold. It is composed of repeating 20–30 amino acid stretches that are unusually rich in the hydrophobic amino acid leucine. These tandem repeats commonly fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Typically, each repeat unit has beta strand-turn-alpha helix structure, and the assembled domain, composed of many such repeats, has a horseshoe shape with an interior parallel beta sheet and an exterior array of helices. One face of the beta sheet and one side of the helix array are exposed to solvent and are therefore dominated by hydrophilic residues. The region between the helices and sheets is the protein's hydrophobic core and is tightly sterically packed with leucine residues.

<span class="mw-page-title-main">Plakoglobin</span> Mammalian protein found in Homo sapiens

Plakoglobin, also known as junction plakoglobin or gamma-catenin, is a protein that in humans is encoded by the JUP gene. Plakoglobin is a member of the catenin protein family and homologous to β-catenin. Plakoglobin is a cytoplasmic component of desmosomes and adherens junctions structures located within intercalated discs of cardiac muscle that function to anchor sarcomeres and join adjacent cells in cardiac muscle. Mutations in plakoglobin are associated with arrhythmogenic right ventricular dysplasia.

δ-Catenin is a subfamily of catenin proteins with ten armadillo-repeats and includes the proteins catenin delta-1 and catenin delta-2. Catenin delta-2 is expressed in the brain where it is important for normal cognitive development. Like β-catenin and γ-catenin, δ-catenins seem to interact with presenilins. These catenin-presenilin interaction have implications for cadherin function and regulation of cell-to-cell adhesion.

α-Catenin Primary protein link between cadherins and the actin cytoskeleton

α-Catenin (alpha-catenin) functions as the primary protein link between cadherins and the actin cytoskeleton. It has been reported that the actin binding proteins vinculin and α-actinin can bind to alpha-catenin. It has been suggested that alpha-catenin does not bind with high affinity to both actin filaments and the E-cadherin-beta-catenin complex at the same time. It has been observed that when α-catenin is not in a molecular complex with β-catenin, it dimerizes and functions to regulate actin filament assembly, possibly by competing with Arp2/3 protein. α-Catenin exhibits significant protein dynamics. However, a protein complex including a cadherin, actin, β-catenin and α-catenin has not been isolated.

<span class="mw-page-title-main">Cadherin-1</span> Human protein-coding gene

Cadherin-1 or Epithelial cadherin(E-cadherin), is a protein that in humans is encoded by the CDH1 gene. Mutations are correlated with gastric, breast, colorectal, thyroid, and ovarian cancers. CDH1 has also been designated as CD324. It is a tumor suppressor gene.

<span class="mw-page-title-main">Catenin alpha-1</span> Protein-coding gene in the species Homo sapiens

αE-catenin, also known as Catenin alpha-1 is a protein that in humans is encoded by the CTNNA1 gene. αE-catenin is highly expressed in cardiac muscle and localizes to adherens junctions at intercalated disc structures where it functions to mediate the anchorage of actin filaments to the sarcolemma. αE-catenin also plays a role in tumor metastasis and skin cell function.

<span class="mw-page-title-main">HEAT repeat</span> Protein tandem repeat

A HEAT repeat is a protein tandem repeat structural motif composed of two alpha helices linked by a short loop. HEAT repeats can form alpha solenoids, a type of solenoid protein domain found in a number of cytoplasmic proteins. The name "HEAT" is an acronym for four proteins in which this repeat structure is found: Huntingtin, elongation factor 3 (EF3), protein phosphatase 2A (PP2A), and the yeast kinase TOR1. HEAT repeats form extended superhelical structures which are often involved in intracellular transport; they are structurally related to armadillo repeats. The nuclear transport protein importin beta contains 19 HEAT repeats.

<span class="mw-page-title-main">WD40 repeat</span> Short protein motif that forms a solenoid domain

The WD40 repeat is a short structural motif of approximately 40 amino acids, often terminating in a tryptophan-aspartic acid (W-D) dipeptide. Tandem copies of these repeats typically fold together to form a type of circular solenoid protein domain called the WD40 domain.

<span class="mw-page-title-main">Inhibitor of apoptosis domain</span>

The inhibitor of apoptosis domain -- also known as IAP repeat, Baculovirus Inhibitor of apoptosis protein Repeat, or BIR -- is a structural motif found in proteins with roles in apoptosis, cytokine production, and chromosome segregation. Proteins containing BIR are known as inhibitor of apoptosis proteins (IAPs), or BIR-containing proteins, and include BIRC1 (NAIP), BIRC2 (cIAP1), BIRC3 (cIAP2), BIRC4 (xIAP), BIRC5 (survivin) and BIRC6.

<span class="mw-page-title-main">Tetratricopeptide repeat</span> Protein tandem repeat

The tetratricopeptide repeat (TPR) is a structural motif. It consists of a degenerate 34 amino acid tandem repeat identified in a wide variety of proteins. It is found in tandem arrays of 3–16 motifs, which form scaffolds to mediate protein–protein interactions and often the assembly of multiprotein complexes. These alpha-helix pair repeats usually fold together to produce a single, linear solenoid domain called a TPR domain. Proteins with such domains include the anaphase-promoting complex (APC) subunits cdc16, cdc23 and cdc27, the NADPH oxidase subunit p67-phox, hsp90-binding immunophilins, transcription factors, the protein kinase R (PKR), the major receptor for peroxisomal matrix protein import PEX5, protein arginine methyltransferase 9 (PRMT9), and mitochondrial import proteins.

Importin alpha, or karyopherin alpha refers to a class of adaptor proteins that are involved in the import of proteins into the cell nucleus. They are a sub-family of karyopherin proteins.

References

  1. 1 2 Huber AH, Nelson WJ, Weis WI (September 1997). "Three-dimensional structure of the armadillo repeat region of β-catenin". Cell. 90 (5): 871–82. doi: 10.1016/S0092-8674(00)80352-9 . PMID   9298899. S2CID   18612343.
  2. Peifer M, Berg S, Reynolds AB (1994). "A repeating amino acid motif shared by proteins with diverse cellular roles". Cell. 76 (5): 789–91. doi:10.1016/0092-8674(94)90353-0. PMID   7907279. S2CID   26528190.
  3. Groves MR, Barford D (1999). "Topological characteristics of helical repeat proteins". Current Opinion in Structural Biology. 9 (3): 383–9. doi:10.1016/S0959-440X(99)80052-9. PMID   10361086.
  4. "Scopus preview - Scopus - Welcome to Scopus". www.scopus.com. Retrieved 2023-03-21.
  5. Herold A, Truant R, Wiegand H, Cullen BR (October 1998). "Determination of the functional domain organization of the importin alpha nuclear import factor". J. Cell Biol. 143 (2): 309–18. doi:10.1083/jcb.143.2.309. PMC   2132842 . PMID   9786944.
  6. McCrea PD, Turck CW, Gumbiner B (November 1991). "A homolog of the armadillo protein in Drosophila (plakoglobin) associated with E-cadherin". Science. 254 (5036): 1359–61. Bibcode:1991Sci...254.1359M. doi:10.1126/science.1962194. PMID   1962194.
  7. Hirschl D, Bayer P, Müller O (March 1996). "Secondary structure of an armadillo single repeat from the APC protein". FEBS Lett. 383 (1–2): 31–6. doi:10.1016/0014-5793(96)00215-3. PMID   8612785. S2CID   36190869.
  8. Nusse, Roel, and Hans Clevers. “Wnt/β-Catenin Signaling, Disease, and Emerging Therapeutic Modalities.” Cell, vol. 169, no. 6, 1 June 2017, pp. 985–999., doi:10.1016/j.cell.2017.05.016.
  9. "Armadillo (IPR000225)". InterPro. EMBL-EBI.