C14orf80

Last updated
TEDC1
Identifiers
Aliases TEDC1 , C14orf80, chromosome 14 open reading frame 80, tubulin epsilon and delta complex 1
External IDs MGI: 2144738 HomoloGene: 77129 GeneCards: TEDC1
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_134041

RefSeq (protein)

NP_001128347
NP_001128348
NP_001128349
NP_001185912
NP_001354107

Contents

NP_598802

Location (UCSC) Chr 14: 105.49 – 105.5 Mb Chr 12: 113.12 – 113.13 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Uncharacterized protein C14orf80 is a protein which in humans is encoded by the chromosome 14 open reading frame 80, C14orf80, gene.

Gene

Location

C14orf80 is located on chromosome 14 (14q32.33) starting at 105,489,855bp and ending at 105,499,248bp. C14orf80 is 9,393 base pairs long and contains 11 exons that can be alternatively spliced to form different mRNA variants. [5]


Variants

Transcription of C14orf80 can produce 19 mRNA splice variants. Only six of these nineteen variants are predicted to not encode for a protein. [6] Of the mRNA variants that have been found experimentally, the longest is 1,719 base pairs and produces a protein with 426 amino acids. [7]

Expression

C14orf80 has been determined to be expressed in 77 types of tissues and 100 developmental stages. [8] It has also been determined to have a higher level of expression in a few cases of pancreatic and prostate cancer cells compared to normal tissue. [9]

Expression of C14orf80 in a variety of tissues Normal Tissue Expression of C14orf80.png
Expression of C14orf80 in a variety of tissues

Homology

Paralogs

There are no paralogs of C14orf80. [10]

Orthologs

Using the BLAST program from NCBI, the orthologs of C14orf80 were found to range from primates to invertebrates. Below is a table that contains a variety of these orthologs. [11]

SpeciesCommon nameAccession numberDate of divergenceSequence length (AA)Sequence similarity
Homo sapiens Human NP_0011283470 mya 426100%
Chlorocebus sabaeus Green monkey XP_00798624729 mya42494%
Ictidomys tridecemlineatus 13-lined ground squirrel XP_00533468092.3 mya42680%
Bos taurus Cow XP_00358502694.2 mya41978%
Rattus norvegicus Brown rat XP_00272682792.3 mya42076%
Zonotrichia albicollis White-throated sparrow XP_005493655296 mya45453%
Pelodiscus sinensis Chinese softshell turtle XP_006137260296 mya40451%
Xenopus tropicalis Tropical clawed frog XP_002935771371.2 mya43750%
Danio rerio Zebra fish XP_706561400.1 mya45241%
Camponotus floridanus Florida carpenter ant XP_011255960782.7 mya37438%

Evolution rate

When compared to the slow-evolving cytochrome C gene and the fast-evolving fibrinogen gene, gene C14orf80 is also fast-evolving. [11]

Shows how fast three different genes evolved over many millions of years. Rate of Divergence C14orf80.png
Shows how fast three different genes evolved over many millions of years.

Protein

General properties

Uncharacterized protein C14orf80 is 426 amino acids long with a molecular weight of 47 kDa. [12] Its isoelectric point is 8.9. [13]

C14orf80 amino acid sequence C14orf80 Amino Acid Sequence.png
C14orf80 amino acid sequence

Composition

The amino acid composition of the uncharacterized protein C14orf80. Amino Acid Composition of C14orf80.png
The amino acid composition of the uncharacterized protein C14orf80.

Secondary structure

Uncharacterized protein C14orf80 is predicted to be entirely composed of alpha helices. [14] Using the program SOUSI-signal, it was predicted that uncharacterized protein C14orf80 does not contain a signal peptide and is a soluble protein. [15]

Function

Domains

Uncharacterized protein C14orf80 has two functional domains. The first domain is the domain of unknown function 4509 and the second is the domain of unknown function 4510. As their naming states the functions of these domains are still unknown. [10]

DUF4509 is located at amino acid 45 to amino acid 228. In this domain of unknown function there is a conserved WLL sequence motif. [16]

DUF4510 is located at amino acid 263 to amino acid 425. In this domain of unknown function there are two conserved sequence motifs: LEA and WMD. [17]

Post-translational modification

Uncharacterized protein C14orf80 is predicted to have glycation and phosphorylation sites for post-translational modification. Of these sites three are for glycation, eight are for serine phosphorylation and one site is for threonine phosphorylation. [18] [19]

Subcellular location

Uncharacterized protein C14orf80 is not predicted to be a transmembrane protein. It is mainly localized to the golgi apparatus but has been found in the nucleus and cytoplasm also. [20]

Interactions

Currently, there are 21 proteins that are predicted to interact with uncharacterized protein C14orf80. These 21 proteins were found using the databases Mentha, [21] BioGRID, [22] STRING, [23] GeneCards [24] and IntAct. [25] Below is a table of a variety of these 21 proteins.

Interacting proteinFull protein nameFunctionCitation
DDIT3 DNA-damage inducible transcript 3Induces cell cycle arrest and apoptosis when ER stress [26]
CEBPZ CCAAT enhancer binding proteinStimulates transcription from HSP70 promoter [26]
UBC Ubiquitin CUdeshi ND, Mani DR, Eisenhaure T, Mertins P, Jaffe JD, Clauser KR, Hacohen N, Carr SA (May 2012). "Methods for quantification of in vivo changes in protein ubiquitination following proteasome and deubiquitinase inhibition". Molecular & Cellular Proteomics. 11 (5): 148–59. doi:10.1074/mcp.M111.016857. PMC   3418844 . PMID   22505724.
FKBP5 FK506 binding proteinImmunophilin protein with PPIaseTaipale M, Tucker G, Peng J, Krykbaeva I, Lin ZY, Larsen B, Choi H, Berger B, Gingras AC, Lindquist S (July 2014). "A quantitative chaperone interaction network reveals the architecture of cellular protein homeostasis pathways". Cell. 158 (2): 434–448. doi:10.1016/j.cell.2014.05.039. PMC   4104544 . PMID   25036637.
DEF107ABeta-defensin 107Anti-bacterial activity [27]
XAGE1D Cancer/testis antigen family 12 member 1D [28]
TRAK2 Trafficking protein kinesin binding 2May regulate endosome to lysosome trafficking of membrane cargo [29]
NRF1 Nuclear respiratory factor 1Transcription factor on nuclear genes encoding respiratory subunits and components of the mitochondrial transcription and replication machinery [30]

Clinical significance

Uncharacterized protein C14orf80 has been associated with tumors in the breast, CNS, endometrium, large intestine, lung, skin, and stomach. [31]

Related Research Articles

<span class="mw-page-title-main">ABHD18</span> Protein-coding gene in the species Homo sapiens

ABHD18 is a protein that in Homo sapiens is encoded by the ABHD18 gene.

<span class="mw-page-title-main">Glutamate-rich protein 3</span>

Glutamate-rich protein 3, also known as Uncharacterized Protein C1orf173, is a protein encoded by the ERICH3 gene. ERICH3 was named “chromosome 1 open reading frame 173 (C1orf173)” based on its map location in the human genome. It was subsequently renamed “E-rich 3” as a result of the high content of glutamate (E) in its encoded amino acid sequence. Single-nucleotide polymorphisms (SNPs) in the ERICH3 gene has been identified as one of the "top" signals in a genome-wide association study (GWAS) for plasma serotonin concentrations which were themselves associated with selective serotonin reuptake inhibitor (SSRI) response in major depressive disorder (MDD) patients. The same ERICH3 SNP was later demonstrated that was significantly associated with SSRI treatment outcomes in three independent MDD trials, including STAR*D, ISPC and PReDICT. ERICH3 is most highly expressed in a variety of regions of the human brain, including the nucleus accumbens and frontal cortex based on the GTEx RNA-seq data. The single-cell RNA-seq data for human brain samples revealed that ERICH3 is predominantly expressed in neurons rather than other CNS cell types. ERICH3 was found interacts with proteins function in vesicle biogenesis and may play a significant role in vesicular function in serotonergic and other neuronal cell types, which might help explain its association with antidepressant treatment response. ERICH3 protein was also found abundant in blood platelets and cilia based on the proteomic studies. Its function in platelet was thought related to plasma serotonin storage because more than 99% of blood serotonin was stored in platelet and ERICH3 SNPs has been associated with plasma serotonin concentration in MDD patients. ERICH3 in primary cilia might regulates cilium formation and the localizations of ciliary transport.

<span class="mw-page-title-main">C1orf131</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C1orf131 is a protein that in humans is encoded by the gene C1orf131. The first ortholog of this protein was discovered in humans. Subsequently, through the use of algorithms and bioinformatics, homologs of C1orf131 have been discovered in numerous species, and as a result, the name of the majority of the proteins in this protein family is Uncharacterized protein C1orf131 homolog.

<span class="mw-page-title-main">C12orf40</span> Protein-coding gene in the species Homo sapiens

C12orf40, also known as Chromosome 12 Open Reading Frame 40, HEL-206, and Epididymis Luminal Protein 206 is a protein that in humans is encoded by the C12orf40 gene.

<span class="mw-page-title-main">C6orf201</span>

Chromosome 6 open reading frame 201, C6orf201, is a protein that in humans is encoded by the C6orf201 gene. In humans this gene encodes for a nuclear protein that is primarily expressed in the testis.

C6orf222 is a protein that in humans is encoded by the C6orf222 gene (6p21.31). C6orf222 is conserved in mammals, birds and reptiles with the most distant ortholog being the green sea turtle, Chelonia mydas. The C6orf222 protein contains one mammalian conserved domain: DUF3293. The protein is also predicted to contain a BH3 domain, which has predicted conservation in distant orthologs from the clade Aves.

<span class="mw-page-title-main">C6orf62</span>

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C8orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

C11orf42 is an uncharacterized protein in homo sapiens that is encoded by the C11orf42 gene. It is also known as chromosome 11 open reading frame 42 and uncharacterized protein C11orf42, with no other aliases. The gene is mostly conserved in mammals, but it has also been found in rodents, reptiles, fish and worms.

<span class="mw-page-title-main">TEDC2</span> Protein-coding gene in the species Homo sapiens

Tubulin epsilon and delta complex 2 (TEDC2), also known as Chromosome 16 open reading frame 59 (C16orf59), is a protein that in humans is encoded by the TEDC2 gene. Its NCBI accession number is NP_079384.2.

<span class="mw-page-title-main">C7orf50</span>

C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.

<span class="mw-page-title-main">C14orf119</span> Protein-coding gene in the species Homo sapiens

C14orf119 is a protein that in humans is encoded by the c14orf119 gene. The c14orf119 protein is predicted to be localized in the nucleus. Additionally, c14orf119 expression is decreased in individuals with systemic lupus erythematosus (SLE) when compared with healthy individual and is increased in individuals with various types of lymphomas when compared to healthy individuals.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">C6orf136</span>

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000185347 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000037466 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. https://www.ncbi.nlm.nih.gov/gene/283643>
  6. "Summary - Homo sapiens - Ensembl genome browser 97".
  7. "Homo sapiens tubulin epsilon and delta complex 1 (TEDC1), transcript variant 1, mRNA". 2018-12-04.
  8. "Bgee - Gene: C14orf80 - ENSG00000185347 - Homo sapiens (human)".
  9. "Home - GEO Profiles - NCBI".
  10. 1 2 "TEDC1 Gene - GeneCards | TEDC1 Protein | TEDC1 Antibody".
  11. 1 2 Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (October 1990). "Basic local alignment search tool". Journal of Molecular Biology. 215 (3): 403–410. doi:10.1006/jmbi.1990.9999. PMID   2231712.
  12. Subramaniam S (July 1998). "The Biology Workbench--a seamless database and analysis environment for the biologist". Proteins. 32 (1): 1–2. doi:10.1002/(sici)1097-0134(19980701)32:1<1::aid-prot1>3.0.co;2-q. PMID   9672036. S2CID   1412129.
  13. Bjellqvist B, Basse B, Olsen E, Celis JE (1994). "Reference points for comparisons of two-dimensional maps of proteins from different human cell types defined in a pH scale where isoelectric points correlate with polypeptide compositions". Electrophoresis. 15 (3–4): 529–539. doi:10.1002/elps.1150150171. PMID   8055880. S2CID   25560231.
  14. Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America. 89 (6): 2002–2006. Bibcode:1992PNAS...89.2002B. doi: 10.1073/pnas.89.6.2002 . PMC   48584 . PMID   1549558.
  15. Gomi, M.; Sonoyama, M.; Mitaku, S. (2004). "High performance system for signal peptide prediction: SOSUIsignal". Chem-Bio Informatics Journal. 4 (4): 142–147. doi: 10.1273/cbij.4.142 .
  16. "Pfam: Family: DUF4509 (PF14970)".
  17. "Pfam: Family: DUF4510 (PF14971)".
  18. "NetGlycate 1.0 Server".
  19. "NetPhos 3.1 Server".
  20. "Cell atlas - C14orf80 - the Human Protein Atlas".
  21. Calderone A, Castagnoli L, Cesareni G (August 2013). "mentha: a resource for browsing integrated protein-interaction networks". Nature Methods. 10 (8): 690–691. doi:10.1038/nmeth.2561. PMID   23900247. S2CID   9733108.
  22. Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M (January 2006). "BioGRID: a general repository for interaction datasets". Nucleic Acids Research. 34 (Database issue): D535–D539. doi:10.1093/nar/gkj109. PMC   1347471 . PMID   16381927.
  23. Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, et al. (January 2015). "STRING v10: protein-protein interaction networks, integrated over the tree of life". Nucleic Acids Research. 43 (Database issue): D447–D452. doi:10.1093/nar/gku1003. PMC   4383874 . PMID   25352553.
  24. Safran M, Dalah I, Alexander J, Rosen N, Iny Stein T, Shmoish M, et al. (August 2010). "GeneCards Version 3: the human gene integrator". Database. 2010: baq020. doi:10.1093/database/baq020. PMC   2938269 . PMID   20689021.
  25. Orchard S, Ammari M, Aranda B, Breuza L, Briganti L, Broackes-Carter F, et al. (January 2014). "The MIntAct project--IntAct as a common curation platform for 11 molecular interaction databases". Nucleic Acids Research. 42 (Database issue): D358–D363. doi:10.1093/nar/gkt1115. PMC   3965093 . PMID   24234451.
  26. 1 2 Behrends C, Sowa ME, Gygi SP, Harper JW (July 2010). "Network organization of the human autophagy system". Nature. 466 (7302): 68–76. Bibcode:2010Natur.466...68B. doi:10.1038/nature09204. PMC   2901998 . PMID   20562859.
  27. Bekhouche I, Finetti P, Adelaïde J, Ferrari A, Tarpin C, Charafe-Jauffret E, et al. (February 2011). "High-resolution comparative genomic hybridization of inflammatory breast cancer and identification of candidate genes". PLOS ONE. 6 (2): e16950. Bibcode:2011PLoSO...616950B. doi: 10.1371/journal.pone.0016950 . PMC   3037286 . PMID   21339811.
  28. DeGrado-Warren J, Dufford M, Chen J, Bartel PL, Shattuck D, Frech GC (February 2008). "Construction and characterization of a normalized yeast two-hybrid library derived from a human protein-coding clone collection". BioTechniques. 44 (2): 265–73. doi: 10.2144/000112674 . PMID   18330356.
  29. Huttlin EL, Ting L, Bruckner RJ, Paulo JA, Gygi MP, Rad R, et al. High-Throughput Proteomic Mapping of Human Interaction Networks via Affinity-Purification Mass Spectrometry (Report).
  30. Satoh J, Kawana N, Yamamoto Y (2013). "Pathway Analysis of ChIP-Seq-Based NRF1 Target Genes Suggests a Logical Hypothesis of their Involvement in the Pathogenesis of Neurodegenerative Diseases". Gene Regulation and Systems Biology. 7: 139–52. doi:10.4137/GRSB.S13204. PMC   3825669 . PMID   24250222.
  31. "Phenotypes - Homo sapiens - Ensembl genome browser 97".