TMTC4

Last updated
TMTC4
Identifiers
Aliases TMTC4 , transmembrane and tetratricopeptide repeat containing 4, transmembrane O-mannosyltransferase targeting cadherins 4
External IDs OMIM: 618203 MGI: 1921050 HomoloGene: 32796 GeneCards: TMTC4
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_028651
NM_001360598
NM_001360599

RefSeq (protein)

NP_082927
NP_001347527
NP_001347528

Location (UCSC) Chr 13: 100.6 – 100.68 Mb Chr 14: 122.92 – 122.98 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Transmembrane and Tetratricopeptide repeat containing 4 is a protein that in humans is encoded by the TMTC4 gene. [5] This protein crosses the plasma membrane 10 times, and resides in the ER lumen and cytosol. The predicted structure of the TMTC4 protein is a series of alpha-helices.

Contents

Gene

TMTC4 is located on chromosome 13 at 13q32.3. The gene is flanked by ADP ribosylation factor 4 pseudogene 3 (ARF4P3) on the left, and ribosomal protein S26 pseudogene 47 (RPS26P47) on the right. TMTC4 spans 4043 bp and has a total of 23 exons. [5]

mRNA

TMTC4 has seven isoform variants, the most common being isoform 1 at 4043 bp. [5]

IsoformLength (bp)
14043
23833
33500
44217
54120
64037
73827

The 5’ UTR for TMTC4 is short and in many of the shorter isoforms, portions of this untranslated region are cut. In comparison, the 3’ UTR is long and is often complete across the seven isoforms.

Protein

Physical properties

The molecular weight for TMTC4 is 85.0 kdal, and there are no positive, negative, or neutral clusters of amino acids or charge runs exceeding the normal lengths. When looking at a distant ortholog (purple sea urchin) the molecular weight of TMTC4 is 85.5 kdal and there, again, are no charge runs, positive, negative or neutral clusters, or unusual spacings. There are strong similarities in protein composition across species. The isoelectric point for the domain of unknown function (DUF 1736) is lower than that of the protein overall.

Schematic illustration of known and predicted domains and motifs for TMTC4 Schematic Illustration.png
Schematic illustration of known and predicted domains and motifs for TMTC4
DomainAmino AcidsMolecular Weight (kdal)Isoelectric Point
Human TMTC476085.09.135
DUF 1736758.64.123
TPR repeats23426.79.509

Domains

Predicted secondary structure of TMTC4. Starting at blue and going to red, the diagram illustrates the n-terminus through the c-terminus. Secondary Structure for TMTC4.png
Predicted secondary structure of TMTC4. Starting at blue and going to red, the diagram illustrates the n-terminus through the c-terminus.

TMTC4 has ten transmembrane regions, all of them spaced within the first half of the protein. [6]

TMTC4 is layered with tetratricopeptide (TPR) repeat sequences that are a part of the TPR superfamily of proteins. DUF1736 is present upstream of the TPR region. A seven residue repeat (SRR) is located toward the end of the protein, and it is thought to encode a coiled-coil structure. [7] Another member of the TPR family, PFTA (protein prenyltransferases alpha subunit repeat), is located within the protein's TPR region and is believed to be involved in signal transduction and vesicular traffic regulation. [8] LSPR coagulation factor V, also a repeat motif, is located within the TPR region, and is thought to be a central regulator of hemostasis. [9]

Secondary structure

TMTC4 takes on a series of alpha-helix structures, especially within the TPR region, though there are a minimal amount of beta-strand structures spaced throughout the beginning half of the protein. [10]

Post-translational modifications

Predicted post-translational modifications of TMTC4 protein TMTC4 image.png
Predicted post-translational modifications of TMTC4 protein

There are four predicted nuclear localization signals, each tagging the protein for nuclear import. [6] At the very end of the protein, however, there is a predicted ER retention signal which would prevent the protein from leaving the ER. The protein has three predicted N-glycosylation sites, potentially altering its structure and function and there are ten predicted phosphorylation sites, each a possible activation site for a regulatory mechanism. [6]

Expression

TMTC4 is expressed in all human tissues. The gene, however, is most highly expressed in the brain and in the spinal cord. [11]

Protein abundance seems to be lower than normal for TMTC4.

Regulation

There is one possible promoter for the TMTC4 gene, located in the 5’ UTR but before the start of the coding sequence.

Function

Currently the function of TMTC4 has not been characterized.

Interacting proteins

Possible interacting proteins are NRG1, PEX19, HERC3, TXNDC15, and COL1A1 . All of these were detected through affinity chromatography. [12]

Protein NameKnown functionLocation
Neuregulin 1 [NRG1]mediates cell to cell signaling [13] membrane glycoprotein [13]
Peroxisomal Biogenesis Factor 19 [PEX19]cytosolic chaperone [14] membrane receptor protein [14]
ECT And RLD Domain Containing E3 Ubiquitin Protein Ligase 3 [HERC3]member of the ubiquitin ligase family [15] cytosol [15]
Thioredoxin Domain Containing 15 [TXNDC15]Not knownNot known
Collagen Type I Alpha 1 Chain [COL1A1]triple helix collagen protein [16] extracellular [16]

Homology

Orthologs

Ortholog space for TMTC4 spans a large portion of evolutionary time. TMTC4 is present in mammals, reptiles, amphibians, birds, fish, and invertebrates. It is not present in plants, bacteria, archaea, or fungi. [17]

Sequence NumberGenus and SpeciesCommon NameAccession # (protein)IdentityDate of Divergence (MYA)
1Heterocephalus glaber Naked mole rat EHB03258.188%94
2Rattus norvegicus Brown rat NP_001127886.190%94
3Myotis brandtii Brandt's bat EPQ01527.190%94
4Pteropus alecto Black flying fox XP_006909447.193%88
5Erinaceus europaeus European hedgehog XP_016040457.185%94
6Sorex araneus Common shrew XP_004614101.186%94
7Sus scrofa Wild boar NP_001239134.191%94
8Lipotes vexillifer Baiji XP_007461591.190%88
9Ailuropoda melanoleuca Giant panda XP_019650336.190%94
10Acinonyx jubatus Cheetah XP_014931490.193%94
11Tyto alba Barn owl KFV56414.185%320
12Charadrius vociferus Killdeer KGL87053.184%320
13Python bivittatus Burmese python XP_007425712.181%320
14Anolis carolinensis Carolina anole XP_008105174.182%320
15Xenopus tropicalis Western clawed frog NP_001121486.138%353
16Nanorana parkeri Nanorana parkeri XP_018432106.173%353
17Callorhinchus milii Australian ghostshark XP_007885231.168%465
18Crassostrea gigas Pacific oyster XP_011422949.150%758
19Strongylocentrotus purpuratus Purple sea urchin XP_011670776.149%627

Paralogs

Paralog space for TMTC4 spans the gene family TMTC. There are four genes in this gene family: TMTC1, TMTC2, TMTC3, and TMTC4. TMTC1 and TMTC3 split from TMTC4 about 1200 million years ago, while TMTC2 split from TMTC4 1400 million years ago. Both of these events happened somewhere between invertebrates and plants.

Related Research Articles

<span class="mw-page-title-main">SGTA</span> Protein-coding gene in the species Homo sapiens

Small glutamine-rich tetratricopeptide repeat-containing protein alpha is a protein that in humans is encoded by the SGTA gene. SGTA orthologs have also been identified in several mammals for which complete genome data are available.

<span class="mw-page-title-main">Tetratricopeptide repeat</span> Protein tandem repeat

The tetratricopeptide repeat (TPR) is a structural motif. It consists of a degenerate 34 amino acid tandem repeat identified in a wide variety of proteins. It is found in tandem arrays of 3–16 motifs, which form scaffolds to mediate protein–protein interactions and often the assembly of multiprotein complexes. These alpha-helix pair repeats usually fold together to produce a single, linear solenoid domain called a TPR domain. Proteins with such domains include the anaphase-promoting complex (APC) subunits cdc16, cdc23 and cdc27, the NADPH oxidase subunit p67-phox, hsp90-binding immunophilins, transcription factors, the protein kinase R (PKR), the major receptor for peroxisomal matrix protein import PEX5, protein arginine methyltransferase 9 (PRMT9), and mitochondrial import proteins.

<span class="mw-page-title-main">Tetratricopeptide repeat 39A</span> Protein-coding gene in the species Homo sapiens

Tetratricopeptide repeat 39A is a human protein encoded by the TTC39A gene. TTC39A is also known as DEME-6, KIAA0452, and c1orf34. The function of TTC39A is currently not well understood. The main feature within tetratricopeptide repeat 39A is the domain of unknown function 3808 (DUF3808), spanning almost the entire protein. KIAA0452 can also be seen as an isoform of TTC39A because of differences in genome sequence, but overlap in DUF domain.

<span class="mw-page-title-main">Tetratricopeptide repeat protein 39B</span> Protein-coding gene in the species Homo sapiens

Tetratricopeptide repeat protein 39B is a protein that in humans is encoded by the TTC39B gene. TTC39B is also known as C9orf52 or FLJ33868. The main feature within tetratricopeptide repeat 39B is the domain of unknown function 3808 (DUF3808), spanning the majority of the protein.

<span class="mw-page-title-main">EVI5L</span> Protein-coding gene in the species Homo sapiens

EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.

<span class="mw-page-title-main">TMCO6</span> Protein-coding gene in the species Homo sapiens

Transmembrane and coiled-coil domain 6, TMCO6, is a protein that in humans is encoded by the TMCO6 gene with aliases of PRO1580, HQ1580 or FLJ39769.1.

CCDC116, also called coiled-coil domain containing 116, is a gene that is patented for experimentation on the possibility of being a cancer marker for prostate cancer.

<span class="mw-page-title-main">Transmembrane protein 255A</span> Mammalian protein found in Homo sapiens

Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.

<span class="mw-page-title-main">Proline-rich protein 30</span>

Proline-rich protein 30 is a protein in humans that is encoded for by the PRR30 gene. PRR30 is a member in the family of Proline-rich proteins characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk for neurofibromatosis.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">Tetratricopeptide repeat domain 16 isoform 1</span> Protein-coding gene in the species Homo sapiens

Tetratricopeptide repeat domain 16 (TTC16) is an uncharacterized protein that in humans is encoded by the gene TTC16. Another alias for this gene is TPR repeat protein 16, but this is not commonly used. TTC16 is one of many proteins that contain tetratricopeptide repeat motifs as a supersecondary structure.

<span class="mw-page-title-main">TMEM171</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.

<span class="mw-page-title-main">CXorf38 Isoform 1</span> Human protein

Chromosome X Open Reading Frame 38 (CXorf38) is a protein which, in humans, is encoded by the CXorf38 gene. CXorf38 appears in multiple studies regarding the escape of X chromosome inactivation.

<span class="mw-page-title-main">TMEM128</span>

TMEM128, also known as Transmembrane Protein 128, is a protein that in humans is encoded by the TMEM128 gene. TMEM128 has three variants, varying in 5' UTR's and start codon location. TMEM128 contains four transmembrane domains and is localized in the Endoplasmic Reticulum membrane. TMEM128 contains a variety of regulation at the gene, transcript, and protein level. While the function of TMEM128 is poorly understood, it interacts with several proteins associated with the cell cycle, signal transduction, and memory.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">C6orf136</span> Protein-coding gene in the species Homo sapiens

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

<span class="mw-page-title-main">OCEL1</span> Protein-coding gene in the species Homo sapiens

OCEL1, also called Occludin//ELL Domain Containing 1, is a protein encoding gene located at chromosome 19p13.11 in the human genome. Other aliases for the gene include FLJ22709, FWP009, and S863-9. The function of OCEL1 has not yet been identified.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">MROH9</span> Mammalian gene

Maestro heat-like repeat-containing protein family member 9 (MROH9) is a protein which in humans is encoded by the MROH9 gene. The word ‘maestro’ itself is an acronym, standing for male-specific transcription in the developing reproductive organs (MRO). MRO genes belong to the MROH family, which includes MROH9.

<span class="mw-page-title-main">LRRC74A</span> Protein-coding gene

Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000125247 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000041594 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 "TMTC4 transmembrane and tetratricopeptide repeat containing 4". Entrez Gene.
  6. 1 2 3 "Motif Scan". Swiss Institute of Bioinformatics. Retrieved 2017-05-04.
  7. Grigoryan G, Keating AE (2008). "Structural specificity in coiled-coil interactions". Current Opinion in Structural Biology. 18 (4): 477–83. doi:10.1016/j.sbi.2008.04.008. PMC   2567808 . PMID   18555680.
  8. Zhang H, Grishin NV (August 1999). "The alpha-subunit of protein prenyltransferases is a member of the tetratricopeptide repeat family". Protein Science. 8 (8): 1658–67. doi:10.1110/ps.8.8.1658. PMC   2144414 . PMID   10452610.
  9. "Coagulation factor V, LSPD (IPR009271)". InterPro. EMBL-EBI. Retrieved 2017-04-27.
  10. "I-TASSER results". I-TASSER. University of Michigan. Retrieved 2017-04-27.
  11. "2906582". GEO Profiles. NCBI. Retrieved 2017-04-27.
  12. Huttlin EL, Ting L, Bruckner RJ, Gebreab F, Gygi MP, Szpyt J, et al. (July 2015). "The BioPlex Network: A Systematic Exploration of the Human Interactome". Cell. 162 (2): 425–40. doi:10.1016/j.cell.2015.06.043. PMC   4617211 . PMID   26186194.
  13. 1 2 "NRG1 Gene". GeneCards. Retrieved 2017-04-27.
  14. 1 2 "PEX19 Gene". GeneCards. Retrieved 2017-04-27.
  15. 1 2 "HERC3 Gene". GeneCards. Retrieved 2017-04-27.
  16. 1 2 "COL1A1 Gene". GeneCards. Retrieved 2017-04-27.
  17. "BLAST: Basic Local Alignment Search Tool". NCBI. Retrieved 2017-04-27.

Further reading