C19orf18

Last updated

Chromosome 19 open reading frame 18 (c19orf18) is a protein which in humans is encoded by the c19orf18 gene. The gene is exclusive to mammals and the protein is predicted to have a transmembrane domain and a coiled coil stretch. [1] This protein has a function that is not yet fully understood by the scientific community.

Contents

Gene

Location of c19orf18 on chromosome 19 Location of c19orf18 on chromosome 19.png
Location of c19orf18 on chromosome 19

Aliases of this gene include MGC41906 and LOC147685. [1] The gene is located on chromosome 19 at 19q13.43. [2] The gene spans from 58,485,905 bp to 58,469,805 bp on the minus strand and contains 6 exons and 5 introns. [1] Transcription of this gene produces one spliced mRNA which codes for the protein c19orf18.

Expression

Expression levels of c19orf18 in various human tissues C19orf18 - Various Normal Tissues.png
Expression levels of c19orf18 in various human tissues

C19orf18 is ubiquitously expressed at moderate levels. [1] In humans, there is higher expression in the testis, prostate, lung, liver, pancreas, uterus, heart, and other connective tissues. [3] [4]

Homology

Paralogs

There are no known paralogs of this gene in the human genome. [5]

Orthologs

The gene is exclusive to mammals. [1] The transmembrane domain is the most conserved region among close orthologs and distant homologs. The following table presents some of the orthologs found using searches in BLAST. [6] This list does not contain all of the orthologs for c19orf18. It is meant to display the diversity of species for which orthologs are found. They are sorted by date of divergence and then protein similarity.

SpeciesDate of Divergence (MYA)Accession NumberSequence length (aa)IdentitySimilarity
Homo sapiens (Humans)0NP_689687.1215100%100%
Pongo abelii (Orangutan)15.2XP_002829939.121692%94%
Rhinopithecus roxellana (Golden snub-nosed Monkey)28.1XP_010385277.121684%90%
Carlito syrichta (Philippine tarsier)66.7XP_008066887.121770%81%
Otolemur garnettii (Galago)73XP_012663984.118350%62%
Mus musculus (Mouse)88XP_017167821.118346%63%
Oryctolagus cuniculus (European rabbit)88XP_008247222.124249%62%
Rhinolophus sinicus (Horseshoe bat)94XP_019567114.128470%82%
Vicugna pacos (Alpaca)94XP_015107013.121465%80%
Canis lupus familiaris (Dog)94XP_005616108.122349%61%
Bos taurus (Cow)94XP_015313970.125044%53%
Ornithorhynchus anatinus (Platypus)169XP_007664656.130834%57%

Protein

The coding sequence contains 215 amino acids. The molecular weight of c19orf18 is 24.151 kdal and the isoelectric point for the unphosphorylated state is 9.06. [7] The protein sequence is rich in leucine and is deficient in tryptophan, cysteine, and tyrosine. There is a negative charge cluster from amino acid 149 to 172. [8]

Structure

Predicted protein structure of c19orf18 C19orf18 structure.png
Predicted protein structure of c19orf18

There is a cross-program consensus between GOR4, CFSSP, and PHYRE2 that the protein structure contains mostly coiled regions and alpha helices. [9] [10] [11]

Topology

The protein sequence is predicted to contain a signal peptide (1 aa to 24 aa), an extracellular domain (25 aa to 100 aa), a transmembrane domain (101 aa to 121 aa), and a cytoplasmic domain (122 aa to 215 aa). [12]

Subcellular localization

PSORTII and CELLO predicted that the human protein would localize to the plasma membrane and part of it would be in the extracellular region. [13] [14] Immunofluorescent staining of human cell line U-2 OS shows localization to the Golgi apparatus. [15]

Function

Protein interactions

C19orf18 protein has been predicted to interact with several proteins listed in the table below. The interactions have been identified and verified through affinity capture-MS. [16]

Predicted interacting protein nameScoreExperimental verification
Nedd4 family interacting protein 10.9165Affinity capture-MS
Activin A receptor, type IIA0.7829Affinity capture-MS
Syntaxin 60.9679Affinity capture-MS
Bone morphogenetic protein receptor type 1A0.8914Affinity capture-MS
Fibroblast growth factor receptor 20.8789Affinity capture-MS
Microfibrillar-associated protein 30.8756Affinity capture-MS

C19orf18 protein interacts with Nedd4 family interacting protein 1 (NDFIP1) which promotes pancreatic beta cell death reduces insulin secretion. [17] Activin A receptor type 2A (ACVR2A) is a transmembrane receptor that is involved in ligand-binding and mediates the functions of activins. [18] Syntaxin 6 functions in trans-Golgi network vesicle trafficking, perhaps targeting to endosomes in mammalian cells. [19] Bone morphogenetic protein receptor type 1A(BMPR1A) is expressed almost exclusively in skeletal muscle and is a transcriptional regulator. [20] Fibroblast growth factor receptor 2 (FGFR2) plays an essential role in the regulation of osteoblast differentiation, proliferation and apoptosis, and is required for normal skeleton development. [21] Microfibrillar-associated protein 3 (MFAP3) has a function that is not fully understood but may be involved in nuclear signaling and may play a role in metastasis. [22]

Clinical Significance

Disease association

The c19orf18 protein is down-regulated in pancreatic cancer [23] and contains CpG sites found to be replicated for association with epithelial ovarian cancer risk. [24] The gene also decreases in expression in teratozoospermia [25] and increases in expression in polycystic ovary syndrome. [26] The gene may also be involved in prostate cancer and various tumors [3]

Related Research Articles

<span class="mw-page-title-main">KIAA1958</span> Protein-coding gene in the species Homo sapiens

Protein KIAA1958 is a protein that in humans is encoded by the KIAA1958 gene. Orthologs of KIAA1958 go as far back in evolution to chordates, although, it is closer in homology to primates than any other orthologs. KIAA1958 has no known paralogs.

<span class="mw-page-title-main">FAM214A</span> Protein-coding gene in the species Homo sapiens

Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.

Transmembrane protein 33 is a protein that in humans, is encoded by the TMEM33 gene, also known as SHINC3. Another name for the TMEM33 protein is DB83.

TMEM143 is a protein that in humans is encoded by TMEM143 gene. TMEM143, a dual-pass protein, is predicted to reside in the mitochondria and high expression has been found in both human skeletal muscle and the heart. Interaction with other proteins indicate that TMEM143 could potentially play a role in tumor suppression/expression and cancer regulation.

C6orf222 is a protein that in humans is encoded by the C6orf222 gene (6p21.31). C6orf222 is conserved in mammals, birds and reptiles with the most distant ortholog being the green sea turtle, Chelonia mydas. The C6orf222 protein contains one mammalian conserved domain: DUF3293. The protein is also predicted to contain a BH3 domain, which has predicted conservation in distant orthologs from the clade Aves.

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

OCC-1 is a protein, which in humans is encoded by the gene C12orf75. The gene is approximately 40,882 bp long and encodes 63 amino acids. OCC-1 is ubiquitously expressed throughout the human body. OCC-1 has shown to be overexpressed in various colon carcinomas. Novel splice variant of this gene was also detected in various human cancer types; in addition to encoding a novel smaller protein, OCC-1 gene produces a non-protein coding RNA splice variant lncRNA.

<span class="mw-page-title-main">C14orf93</span> Protein-coding gene in the species Homo sapiens

C14orf93 is a protein that is encoded in humans by the C14orf93 gene. It is a globular protein with a conserved C-terminus that is localized to the nucleus. While expressed relatively highly in all tissues except nervous tissue, it is expressed particularly highly in T cells and other immune tissues.

<span class="mw-page-title-main">TMEM176B</span> Protein-coding gene in the species Homo sapiens

Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.

Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.

<span class="mw-page-title-main">TMCO4</span> Protein-coding gene in the species Homo sapiens

Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.

<span class="mw-page-title-main">CFAP157</span> Protein

Cilia and flagella associated protein 157 (CFAP157) also known as chromosome 9 open reading frame 117 (c9orf117) is a protein that in humans is encoded by the CFAP157 gene.

<span class="mw-page-title-main">GOLGA8H</span>

Golgin subfamily A member 8H, also known as GOLGA8H, is a protein that in Homo sapiens is encoded by the GOLGA8H gene. Function of the GOLGA8H involves a process that is carried out at the cellular level which results in the assembly, arrangement of constituent parts, or disassembly of the Golgi apparatus.

<span class="mw-page-title-main">TMEM125</span> Protein

Transmembrane protein 125 is a protein that, in humans, is encoded by the TMEM125 gene. It has 4 transmembrane domains and is expressed in the lungs, thyroid, pancreas, intestines, spinal cord, and brain. Though its function is currently poorly understood by the scientific community, research indicates it may be involved in colorectal and lung cancer networks. Additionally, it was identified as a cell adhesion molecule in oligodendrocytes, suggesting it may play a role in neuron myelination.

<span class="mw-page-title-main">TMEM101</span>

Transmembrane protein 101 (TMEM101) is a protein that in humans is encoded by the TMEM101 gene. The TMEM101 protein has been demonstrated to activate the NF-κB signaling pathway. High levels of expression of TMEM101 have been linked to breast cancer.

<span class="mw-page-title-main">C4orf36</span> Draft for page on C4orf36 gene/protein

C4orf36 is a protein that in humans is encoded by the c4orf36 gene.

Chromosome 20 open reading frame 85, or most commonly known as C20orf85 is a gene that encodes for the C20orf85 Protein. This gene is not yet well understood by the scientific community.

<span class="mw-page-title-main">C1orf159</span> Protein encoded on a gene

C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1. This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer.

<span class="mw-page-title-main">TMEM144</span> Transmembrane Protein 144

Transmembrane Protein 144 (TMEM144) is a protein in humans encoded by the TMEM144 gene.

<span class="mw-page-title-main">TMEM248</span> Transmembrane protein 248/TMEM248 gene

Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.

References

  1. 1 2 3 4 5 Thierry-Mieg, Danielle; Thierry-Mieg, Jean. "AceView: Gene:C19orf18, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2018-02-26.
  2. "C19orf18 chromosome 19 open reading frame 18 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-02-26.
  3. 1 2 "EST Profile - Hs.134209". EST Profiles.
  4. "GDS3113 - GEO Profiles - NCBI". GEO profiles.
  5. "C19orf18 Gene - GeneCards". GeneCards - Human Gene Database.
  6. "Protein BLAST: search protein databases using a protein query". NIH Basic Local Alignment Search Tool.
  7. "C19orf18 (human)". PhosphoSitePlus.
  8. "SAPS Results". EMBL-EBI.
  9. "GOR IV Secondary Structure Prediction". PRABI Rhone-Alpes Bioinformatics Center.
  10. Kumar, Ashok T. "CFSSP: Chou & Fasman Secondary Structure Prediction Server". www.biogem.org.
  11. Kelley, Lawrence. "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk.
  12. "C19orf18 - Uncharacterized protein C19orf18 precursor - Homo sapiens (Human) - C19orf18 gene & protein". UniProt.
  13. "PSORT II Prediction". PSORTII.
  14. "CELLO:Subcellular Localization Predictive System". Molecular Bioinformatics Center. Archived from the original on 2016-03-04.
  15. "C19orf18". The Human Protein Atlas.
  16. Tyers, Mike. "C19orf18 Result Summary". BioGRID.
  17. "NDFIP1 - NEDD4 family-interacting protein 1 - Homo sapiens (Human) - NDFIP1 gene & protein". UniProt.
  18. "ACVR2A activin A receptor type 2A [Homo sapiens (human)] - Gene - NCBI". Gene.
  19. Bock, J B; Klumperman, J; Davanger, S; Scheller, R H (July 1997). "Syntaxin 6 functions in trans-Golgi network vesicle trafficking". Molecular Biology of the Cell. 8 (7): 1261–1271. doi:10.1091/mbc.8.7.1261. ISSN   1059-1524. PMC   276151 . PMID   9243506.
  20. "BMPR1A Gene - GeneCards | BMR1A Protein | BMR1A Antibody". GeneCards Human Gene.
  21. "FGFR2 - Fibroblast growth factor receptor 2 precursor - Homo sapiens (Human) - FGFR2 gene & protein". UniProt.
  22. "MFAP3L - Microfibrillar-associated protein 3-like precursor - Homo sapiens (Human) - MFAP3L gene & protein". UniProt.
  23. Makler, Amy; Narayanan, Ramaswamy (2017-05-01). "Mining Exosomal Genes for Pancreatic Cancer Targets". Cancer Genomics & Proteomics. 14 (3): 161–172. doi:10.21873/cgp.20028. ISSN   1109-6535. PMC   5420817 . PMID   28446531.
  24. Fridley, Brooke L.; Armasu, Sebastian M.; Cicek, Mine S.; Larson, Melissa C.; Wang, Chen; Winham, Stacey J.; Kalli, Kimberly R.; Koestler, Devin C.; Rider, David N. (2014-04-28). "Methylation of leukocyte DNA and ovarian cancer: relationships with disease status and outcome". BMC Medical Genomics. 7: 21. doi: 10.1186/1755-8794-7-21 . ISSN   1755-8794. PMC   4102255 . PMID   24774302.
  25. "GDS2697 - GEO Profiles - NCBI". GEO profiles.
  26. "GDS4399 - GEO Profiles - NCBI". GEO profiles.