Fuzzy complexes are protein complexes, where structural ambiguity or multiplicity exists and is required for biological function. [1] [2] Alteration, truncation or removal of conformationally ambiguous regions impacts the activity of the corresponding complex. [3] [4] [5] Fuzzy complexes are generally formed by intrinsically disordered proteins. [6] [7] Structural multiplicity usually underlies functional multiplicity of protein complexes [8] [9] [10] following a fuzzy logic. Distinct binding modes of the nucleosome are also regarded as a special case of fuzziness. [11] [12]
For almost 50 years molecular biology was based on two dogmas: (i) equating biological function of the protein with a unique three-dimensional structure and (ii) assuming exquisite specificity in protein complexes. Specificity/selectivity is ensured by unambiguous set of interactions formed between the protein and its ligand (another protein, DNA, RNA or small molecule). Many protein complexes however, contain functionally important/critical regions, which remain highly dynamic in the complex or adopt different conformations. [13] This phenomenon is defined fuzziness. The most pertinent example is the cyclin-dependent kinase inhibitor Sic1, which binds to the SCF subunit of Cdc4 in a phosphorylation dependent manner. [14] No regular secondary structures are gained upon phosphorylation and the different phosphorylation sites interchange in the complex. [15]
Structural ambiguity in protein complexes covers a wide spectrum. [1] In a polymorphic complex, the protein adopts two or more different conformations upon binding to the same partner, and these conformations can be resolved. [16] Clamp, [17] flanking [18] [19] and random complexes [20] [21] are dynamic, where ambiguous conformations interchange with each other and cannot be resolved. Interactions in fuzzy complexes are usually mediated by short motifs. [22] Flanking regions are tolerant to sequence changes as long as the amino acid composition is maintained, for example in case of linker histone C-terminal domains [23] and H4 histone N-terminal domains. [24]
Fuzzy regions modulate the conformational equilibrium [25] or flexibility [3] [26] of the binding interface via transient interactions. [27] Dynamic regions can also compete with binding sites [28] or tether them to the target. [29] Modifications of fuzzy regions by further interactions, [8] [30] or posttranslational modifications [31] [32] impact binding affinity or specificity. Alternative splicing can modulate the length of fuzzy regions resulting in context-dependent binding (e.g. tissue-specificity) on the complex. [33] [34] [35] EGF/MAPK, TGF-β and WNT/Wingless signaling pathways employ tissue-specific fuzzy regions.
p53, also known as Tumor protein P53, cellular tumor antigen p53, or transformation-related protein 53 (TRP53) is a regulatory protein that is often mutated in human cancers. The p53 proteins are crucial in vertebrates, where they prevent cancer formation. As such, p53 has been described as "the guardian of the genome" because of its role in conserving stability by preventing genome mutation. Hence TP53 is classified as a tumor suppressor gene.
The insulin receptor (IR) is a transmembrane receptor that is activated by insulin, IGF-I, IGF-II and belongs to the large class of receptor tyrosine kinase. Metabolically, the insulin receptor plays a key role in the regulation of glucose homeostasis; a functional process that under degenerate conditions may result in a range of clinical manifestations including diabetes and cancer. Insulin signalling controls access to blood glucose in body cells. When insulin falls, especially in those with high insulin sensitivity, body cells begin only to have access to lipids that do not require transport across the membrane. So, in this way, insulin is the key regulator of fat metabolism as well. Biochemically, the insulin receptor is encoded by a single gene INSR, from which alternate splicing during transcription results in either IR-A or IR-B isoforms. Downstream post-translational events of either isoform result in the formation of a proteolytically cleaved α and β subunit, which upon combination are ultimately capable of homo or hetero-dimerisation to produce the ≈320 kDa disulfide-linked transmembrane insulin receptor.
The androgen receptor (AR), also known as NR3C4, is a type of nuclear receptor that is activated by binding any of the androgenic hormones, including testosterone and dihydrotestosterone, in the cytoplasm and then translocating into the nucleus. The androgen receptor is most closely related to the progesterone receptor, and progestins in higher dosages can block the androgen receptor.
In molecular biology, an intrinsically disordered protein (IDP) is a protein that lacks a fixed or ordered three-dimensional structure, typically in the absence of its macromolecular interaction partners, such as other proteins or RNA. IDPs range from fully unstructured to partially structured and include random coil, molten globule-like aggregates, or flexible linkers in large multi-domain proteins. They are sometimes considered as a separate class of proteins along with globular, fibrous and membrane proteins.
The glucocorticoid receptor also known as NR3C1 is the receptor to which cortisol and other glucocorticoids bind.
Catenin beta-1, also known as β-catenin (beta-catenin), is a protein that in humans is encoded by the CTNNB1 gene.
Paxillin is a protein that in humans is encoded by the PXN gene. Paxillin is expressed at focal adhesions of non-striated cells and at costameres of striated muscle cells, and it functions to adhere cells to the extracellular matrix. Mutations in PXN as well as abnormal expression of paxillin protein has been implicated in the progression of various cancers.
GRB2-associated-binding protein 2 also known as GAB2 is a protein that in humans is encoded by the GAB2 gene.
Cell division control protein 42 homolog is a protein that in humans is encoded by the CDC42 gene. Cdc42 is involved in regulation of the cell cycle. It was originally identified in S. cerevisiae (yeast) as a mediator of cell division, and is now known to influence a variety of signaling events and cellular processes in a variety of organisms from yeast to mammals.
Transcription factor p65 also known as nuclear factor NF-kappa-B p65 subunit is a protein that in humans is encoded by the RELA gene.
High-mobility group protein HMG-I/HMG-Y is a protein that in humans is encoded by the HMGA1 gene.
Tripartite motif-containing 28 (TRIM28), also known as transcriptional intermediary factor 1β (TIF1β) and KAP1, is a protein that in humans is encoded by the TRIM28 gene.
General transcription factor II-I is a protein that in humans is encoded by the GTF2I gene.
Syntaxin-binding protein 1 is a protein that in humans is encoded by the STXBP1 gene. This gene encodes a syntaxin-binding protein. The encoded protein appears to play a role in release of neurotransmitters via regulation of syntaxin, a transmembrane attachment protein receptor. Mutations in this gene have been associated with neurological disorders including epilepsy, intellectual disability, and movement disorders.
Cytoplasmic protein NCK2 is a protein that in humans is encoded by the NCK2 gene.
Transcriptional enhancer factor TEF-1 also known as TEA domain family member 1 (TEAD1) and transcription factor 13 (TCF-13) is a protein that in humans is encoded by the TEAD1 gene. TEAD1 was the first member of the TEAD family of transcription factors to be identified.
Pyruvate dehydrogenase lipoamide kinase isozyme 3, mitochondrial is an enzyme that in humans is encoded by the PDK3 gene. It codes for an isozyme of pyruvate dehydrogenase kinase.The pyruvate dehydrogenase (PDH) complex is a nuclear-encoded mitochondrial multienzyme complex that catalyzes the overall conversion of pyruvate to acetyl-CoA and CO2. It provides the primary link between glycolysis and the tricarboxylic acid (TCA) cycle, and thus is one of the major enzymes responsible for the regulation of glucose metabolism. The enzymatic activity of PDH is regulated by a phosphorylation/dephosphorylation cycle, and phosphorylation results in inactivation of PDH. The protein encoded by this gene is one of the four pyruvate dehydrogenase kinases that inhibits the PDH complex by phosphorylation of the E1 alpha subunit. This gene is predominantly expressed in the heart and skeletal muscles. Alternatively spliced transcript variants encoding different isoforms have been found for this gene.
Cyclin-K is a protein that in humans is encoded by the CCNK gene.
Serine/arginine-rich splicing factor 1 (SRSF1) also known as alternative splicing factor 1 (ASF1), pre-mRNA-splicing factor SF2 (SF2) or ASF1/SF2 is a protein that in humans is encoded by the SRSF1 gene. ASF/SF2 is an essential sequence specific splicing factor involved in pre-mRNA splicing. SRSF1 is the gene that codes for ASF/SF2 and is found on chromosome 17. The resulting splicing factor is a protein of approximately 33 kDa. ASF/SF2 is necessary for all splicing reactions to occur, and influences splice site selection in a concentration-dependent manner, resulting in alternative splicing. In addition to being involved in the splicing process, ASF/SF2 also mediates post-splicing activities, such as mRNA nuclear export and translation.
GCN2 is a serine/threonine-protein kinase that senses amino acid deficiency through binding to uncharged transfer RNA (tRNA). It plays a key role in modulating amino acid metabolism as a response to nutrient deprivation.