Thyroglobulin domain

Last updated
Thyroglobulin type-1 repeat

Protein CD74 PDB 1icf.png

Thyroglobulin type-1 domain
Identifiers
Symbol Thyroglobulin_1
Pfam PF00086
InterPro IPR000716
PROSITE PDOC00377
SCOP 1icf
SUPERFAMILY 1icf
CDD cd00191

Thyroglobulin type-1 repeat is an evolutionary conserved protein domain. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 and 5 with 6.

Protein domain

A protein domain is a conserved part of a given protein sequence and (tertiary) structure that can evolve, function, and exist independently of the rest of the protein chain. Each domain forms a compact three-dimensional structure and often can be independently stable and folded. Many proteins consist of several structural domains. One domain may appear in a variety of different proteins. Molecular evolution uses domains as building blocks and these may be recombined in different arrangements to create proteins with different functions. In general, domains vary in length from between about 50 amino acids up to 250 amino acids in length. The shortest domains, such as zinc fingers, are stabilized by metal ions or disulfide bridges. Domains often form functional units, such as the calcium-binding EF hand domain of calmodulin. Because they are independently stable, domains can be "swapped" by genetic engineering between one protein and another to make chimeric proteins.

Human proteins containing this domain

CD74; IGFBP-4; IGFBP1; IGFBP2; IGFBP3; IGFBP4; IGFBP5; IGFBP6; NID1; NID2; SMOC1; SMOC2; SPOCK1; SPOCK2; SPOCK3; TACSTD1; TACSTD2; TG;

CD74 protein-coding gene in the species Homo sapiens

HLA class II histocompatibility antigen gamma chain also known as HLA-DR antigens-associated invariant chain or CD74, is a protein that in humans is encoded by the CD74 gene. The invariant chain is a polypeptide involved in the formation and transport of MHC class II protein. The cell surface form of the invariant chain is known as CD74.

IGFBP1 protein-coding gene in the species Homo sapiens

Insulin-like growth factor-binding protein 1 (IBP-1) also known as placental protein 12 (PP12) is a protein that in humans is encoded by the IGFBP1 gene.

IGFBP2 protein-coding gene in the species Homo sapiens

Insulin-like growth factor-binding protein 2 is a protein that in humans is encoded by the IGFBP2 gene.

Related Research Articles

Thyroglobulin protein-coding gene in the species Homo sapiens

Thyroglobulin (Tg) is a 660 kDa, dimeric protein produced by the follicular cells of the thyroid and used entirely within the thyroid gland. Thyroglobulin protein accounts for approximately half of the protein content of the thyroid gland. Human TG (HTG) is a homodimer of subunits each containing 2768 amino acids as synthesized.

Cathepsin

Cathepsins are proteases found in all animals as well as other organisms. There are approximately a dozen members of this family, which are distinguished by their structure, catalytic mechanism, and which proteins they cleave. Most of the members become activated at the low pH found in lysosomes. Thus, the activity of this family lies almost entirely within those organelles. There are, however, exceptions such as cathepsin K, which works extracellularly after secretion by osteoclasts in bone resorption. Cathepsins have a vital role in mammalian cellular turnover.

Low-density lipoprotein receptor gene family

The low-density lipoprotein receptor gene family codes for a class of structurally related cell surface receptors that fulfill diverse biological functions in different organs, tissues, and cell types. The role that is most commonly associated with this evolutionarily ancient family is cholesterol homeostasis. In humans, excess cholesterol in the blood is captured by low-density lipoprotein (LDL) and removed by the liver via endocytosis of the LDL receptor. Recent evidence indicates that the members of the LDL receptor gene family are active in the cell signalling pathways between specialized cells in many, if not all, multicellular organisms.

Cathepsin S protein-coding gene in the species Homo sapiens

Cathepsin S is a protein that in humans is encoded by the CTSS gene. Transcript variants utilizing alternative polyadenylation signals exist for this gene.

Cathepsin C protein-coding gene in the species Homo sapiens

Cathepsin C (CTSC) also known as dipeptidyl peptidase I (DPP-I) is a lysosomal exo-cysteine protease belonging to the peptidase C1 family. In humans, it is encoded by the CTSC gene.

Cystatin

The cystatins are a family of cysteine protease inhibitors which share a sequence homology and a common tertiary structure of an alpha helix lying on top of an anti-parallel beta sheet. The family is subdivided as described below.

Cathepsin B protein-coding gene in the species Homo sapiens

Cathepsin B is in humans encoded by the CTSB gene. Cathepsin B belongs to a family of lysosomal cysteine proteases and plays an important role in intracellular proteolysis. Upregulation of cathepsin B is found in premalignant lesions and various pathological conditions, as well as cancers.

Frizzled

Frizzled is a family of G protein-coupled receptor proteins that serves as receptors in the Wnt signaling pathway and other signaling pathways. When activated, Frizzled leads to activation of Dishevelled in the cytosol.

The mannose receptor is a C-type lectin primarily present on the surface of macrophages and immature dendritic cells, but is also expressed on the surface of skin cells such as human dermal fibroblasts and keratinocytes. It is the first member of a family of endocytic receptors that includes Endo180 (CD280), M-type PLA2R, and DEC-205 (CD205).

IGFBP3 protein-coding gene in the species Homo sapiens

Insulin-like growth factor-binding protein 3, also known as IGFBP-3, is a protein that in humans is encoded by the IGFBP3 gene. IGFBP-3 is one of six IGF binding proteins that have highly conserved structures and bind the insulin-like growth factors IGF-1 and IGF-2 with high affinity. IGFBP-7, sometimes inappropriately included in this family, shares neither the conserved structural features nor the high IGF affinity.

IGFBP4 protein-coding gene in the species Homo sapiens

Insulin-like growth factor-binding protein 4 is a protein that in humans is encoded by the IGFBP4 gene.

Cathepsin D protein-coding gene in the species Homo sapiens

Cathepsin D is a protein that in humans is encoded by the CTSD gene. This gene encodes a lysosomal aspartyl protease composed of a protein dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. Cathepsin D is an aspartic endo-protease that is ubiquitously distributed in lysosomes. The main function of cathepsin D is to degrade proteins and activate precursors of bioactive proteins in pre-lysosomal compartments. This proteinase, which is a member of the peptidase A1 family, has a specificity similar to but narrower than that of pepsin A. Transcription of the CTSD gene is initiated from several sites, including one that is a start site for an estrogen-regulated transcript. Mutations in this gene are involved in the pathogenesis of several diseases, including breast cancer and possibly Alzheimer disease. Homozygous deletion of the CTSD gene leads to early lethality in the post-natal phase. Deficiency of CTSD gene has been reported an underlying cause of neuronal ceroid lipofuscinosis (NCL).

Cathepsin L1 protein-coding gene in the species Homo sapiens

Cathepsin L1 is a protein that in humans is encoded by the CTSL1 gene.

Cystatin B protein-coding gene in the species Homo sapiens

Cystatin-B is a protein that in humans is encoded by the CSTB gene.

Cathepsin H protein-coding gene in the species Homo sapiens

Cathepsin H is a protein that in humans is encoded by the CTSH gene.

Cathepsin Z protein-coding gene in the species Homo sapiens

Cathepsin Z, also called cathepsin X or cathepsin P, is a protein that in humans is encoded by the CTSZ gene. It is a member of the cysteine cathepsin protease family, which has 11 members. As one of the 11 cathepsins, cathepsin Z contains distinctive features from others. Cathepsin Z has been reported involved in cancer malignancy and inflammation.

Cathepsin F protein-coding gene in the species Homo sapiens

Cathepsin F is a protein that in humans is encoded by the CTSF gene.

CUB domain is an evolutionarily conserved protein domain.

CCN proteins are a family of extracellular matrix (ECM)-associated proteins involved in intercellular signaling. Due to their dynamic role within the ECM they are considered matricellular proteins.

Kazal domain

The Kazal domain is an evolutionary conserved protein domain usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors.

References

Digital object identifier Character string used as a permanent identifier for a digital object, in a format controlled by the International DOI Foundation

In computing, a Digital Object Identifier or DOI is a persistent identifier or handle used to uniquely identify objects, standardized by the International Organization for Standardization (ISO). An implementation of the Handle System, DOIs are in wide use mainly to identify academic, professional, and government information, such as journal articles, research reports and data sets, and official publications though they also have been used to identify other types of information resources, such as commercial videos.

PubMed Central (PMC) is a free digital repository that archives publicly accessible full-text scholarly articles that have been published within the biomedical and life sciences journal literature. As one of the major research databases within the suite of resources that have been developed by the National Center for Biotechnology Information (NCBI), PubMed Central is much more than just a document repository. Submissions into PMC undergo an indexing and formatting procedure which results in enhanced metadata, medical ontology, and unique identifiers which all enrich the XML structured data for each article on deposit. Content within PMC can easily be interlinked to many other NCBI databases and accessed via Entrez search and retrieval systems, further enhancing the public's ability to freely discover, read and build upon this portfolio of biomedical knowledge.

This article incorporates text from the public domain Pfam and InterPro IPR000519

The public domain consists of all the creative works to which no exclusive intellectual property rights apply. Those rights may have expired, been forfeited, expressly waived, or may be inapplicable.

Pfam Database of protein families

Pfam is a database of protein families that includes their annotations and multiple sequence alignments generated using hidden Markov models. The most recent version, Pfam 31.0, was released in March 2017 and contains 16,712 families.

InterPro is a database of protein families, domains and functional sites in which identifiable features found in known proteins can be applied to new protein sequences in order to functionally characterise them.