C19orf38

Last updated

Highly Expressed In Immature Dendritic Cell Transcript 1 (HIDE1) is a protein encoded by chromosome 19 open reading frame 38 (C19orf38) gene in humans. [1] There are no other aliases used for the gene. C19orf38 is only expressed in white blood cells, of the innate immune system. HIDE1 protein has been found to play a role in immune escape of tumors and diet induced obesity.

Contents

Gene

C19orf38 location on chromosome 19, gene neighborhood, and gene overview. C19orf38 gene location-01.jpg
C19orf38 location on chromosome 19, gene neighborhood, and gene overview.

Location

C19orf38 is cytogenetically banded at 19p13.2 spanning 33,216 base pairs oriented on the forward strand (chr19:+10,947,251 to +10,980,466). C19orf38 is located near the following genes: TMED1, HSALNG0123968, ZNF627, and KRIT1.

Risk-associated variants

There are five risk associated variants found within the c19orf38 gene. Three of which lead to a significant increase in low density lipoprotein cholesterol. [2] [3] One variant is associated with prevalence of coronary artery disease. [4] And the fifth identified risk variant is associated with increased reporting of Idiopathic knee osteoarthritis. [5]

mRNA Transcripts

Isoforms

C19orf38 can be alternatively spliced to form three distinct mRNA products. Both isoform's 1 and 2 differ only via the 5' UTR. Isoform 3 has a different protein product in that the mRNA transcript does not contain exon 2 or exon 3, however, isoform 3 is not expressed in humans.

Tissue Localization

C19orf38 transcript is found at the highest amount in bone marrow, with less than a fifth of the transcript amount in the spleen, testis, appendix, and lymph nodes, with little to no transcript in other tissue types. Tissues with the transcript have a high leukocyte presence. [6] It is exclusively present in the following cell types: monocytes, peripheral blood mononuclear cells, eosinophils and basophil's, so any expression in tissues comes from innate immune cells, or granulocytes. [7] Transcript is not present in neutrophils. C19orf38 transcript is not found in macrophages, despite, classical monocyte expression. [8]

Regulation of Transcription

The promoter region of C19orf38 contains two transcription factor binding domains that are particularly important for innate immune system development: Spi-C Transcription Factor (SPIC) and E74 Like ETS Transcription Factor 3 (ELF3). Both are transcription factors are only present in leukocytes are involved in the negative transcription of genes for the development of macrophages, which coincides with cellular localization of C19orf38. [9]

Protein

Structure

HIDE1 protein three-dimensional structure. C19orf38 Hide1 Protein Annotated -02.jpg
HIDE1 protein three-dimensional structure.

HIDE1 is a 230 amino acid transmembrane protein, anchored via ɑ-helix transmembrane region. F-box only protein 2 (FBXO2) binds in an extracellular region to glycosylated arginine amino acids found at positions 48 and 97. [11] The extracellular region also contains a highly conserved signal peptide sequence, which leads the protein to the membrane space. Additionally, HIDE1 protein contains a disordered region in its intracellular region. TNPO3 and XPO-4 are known to interact with HIDE1. [12]

Sub-cellular localization

Human HIDE1 protein is largely confirmed to be a signal protein existing either embedded within the cellular membrane or in a secreted form. Deeploc signal analysis predicts a signal peptide region at the start of its translation. [13] Furthermore, PSORT2 k-NN prediction finds the protein to be localized extracellularly 34.8% of the time, 30.4% in the plasma membrane, 21.7% in the endoplasmic reticulum, and 13.0% in the golgi bodies. [14]

Binding motifs

HIDE1 protein contains an ig-like domain and signal peptide in its extracellular region as well as multiple lipidification sites to assist with membrane association. [15] [16] Additionally, N-linked glycosylation sites can be found in the luminal side. The intracellular/cytoplasmic region contains multiple phosphorylation sites and calpain cleavage locations. [17]

Human Protein HIDE1 domain, motif, and post translational modification diagram. C19orf38PTMs-01.jpg
Human Protein HIDE1 domain, motif, and post translational modification diagram.

Homology

Orthologs

Orthologs are found in the following taxon classes: Mammalia, Reptilia, Aves, and Amphibia. There are no orthologs found in either class insecta or actinopterygii. C19orf38 is only present in jawed vertebrates which coincides with the divergence of adaptive immune systems 550 MYA between jawed and jawless vertebrates.

Table of C19orf38 transcript orthologs and related properties. Data is organized by median date of divergence (MYA), and then sequence identity to Homo sapien (Hsa) protein. C19orf38 Ortholog Table-01.jpg
Table of C19orf38 transcript orthologs and related properties. Data is organized by median date of divergence (MYA), and then sequence identity to Homo sapien (Hsa) protein.
human C19orf38 unrooted evolutionary tree. C19orf38-01 Phylogenetic Tree.jpg
human C19orf38 unrooted evolutionary tree.

Evolutionary rate

C19orf38 mutation rate is found to be less than that of fibrinogen alpha, but is high in comparison to other human proteins, especially, immune proteins which are highly conserved in jawed vertebrates. [20]

Clinical significance

Theodros reveals protein HIDE1 role in host homeostasis, in addition to interactions with myeloid cells which often interact with tumor development. [21] Lastly, Theodoros shows that HIDE1 accomplishes both through acting as an upstream signal to induce Trem2 myeloid signature, a unique transcriptional portrayal. Trem2 activation allows for immune escape by the tumor, thus acting as a crucial regulator in antitumor immune response. Blocking transcription of C19orf38 or preventing HIDE1 activation may be a novel approach to avoid immune escape by various tumors.

Despite these findings, HIDE1 shows no significant association with any cancer. [22]

Xiaoxu’s dissertation examines HIDE1’s interaction with diet induced obesity. It was found that HIDE1 plays a critical role in weight gain, in that, deletion of c19orf38 resulted in a gain of weight. [11] This allows for potential use for drug treatment therapies to block activity of HIDE1 to control weight gain.

Related Research Articles

CD16, also known as FcγRIII, is a cluster of differentiation molecule found on the surface of natural killer cells, neutrophils, monocytes, macrophages, and certain T cells. CD16 has been identified as Fc receptors FcγRIIIa (CD16a) and FcγRIIIb (CD16b), which participate in signal transduction. The most well-researched membrane receptor implicated in triggering lysis by NK cells, CD16 is a molecule of the immunoglobulin superfamily (IgSF) involved in antibody-dependent cellular cytotoxicity (ADCC). It can be used to isolate populations of specific immune cells through fluorescent-activated cell sorting (FACS) or magnetic-activated cell sorting, using antibodies directed towards CD16.

<span class="mw-page-title-main">Granulocyte-macrophage colony-stimulating factor receptor</span> Protein-coding gene in humans

The granulocyte-macrophage colony-stimulating factor receptor, also known as CD116, is a receptor for granulocyte-macrophage colony-stimulating factor, which stimulates the production of white blood cells. In contrast to M-CSF and G-CSF which are lineage specific, GM-CSF and its receptor play a role in earlier stages of development. The receptor is primarily located on neutrophils, eosinophils and monocytes/macrophages, it is also on CD34+ progenitor cells (myeloblasts) and precursors for erythroid and megakaryocytic lineages, but only in the beginning of their development.

<span class="mw-page-title-main">EMR3</span> Protein-coding gene in the species Homo sapiens

EGF-like module-containing mucin-like hormone receptor-like 3 is a protein encoded by the ADGRE3 gene. EMR3 is a member of the adhesion GPCR family. Adhesion GPCRs are characterized by an extended extracellular region often possessing N-terminal protein modules that is linked to a TM7 region via a domain known as the GPCR-Autoproteolysis INducing (GAIN) domain.

<span class="mw-page-title-main">PD-L1</span> Mammalian protein found in Homo sapiens

Programmed death-ligand 1 (PD-L1) also known as cluster of differentiation 274 (CD274) or B7 homolog 1 (B7-H1) is a protein that in humans is encoded by the CD274 gene.

<span class="mw-page-title-main">SPI1</span> Protein-coding gene in the species Homo sapiens

Transcription factor PU.1 is a protein that in humans is encoded by the SPI1 gene.

<span class="mw-page-title-main">Colony stimulating factor 1 receptor</span> Protein found in humans

Colony stimulating factor 1 receptor (CSF1R), also known as macrophage colony-stimulating factor receptor (M-CSFR), and CD115, is a cell-surface protein encoded by the human CSF1R gene. CSF1R is a receptor that can be activated by two ligands: colony stimulating factor 1 (CSF-1) and interleukin-34 (IL-34). CSF1R is highly expressed in myeloid cells, and CSF1R signaling is necessary for the survival, proliferation, and differentiation of many myeloid cell types in vivo and in vitro. CSF1R signaling is involved in many diseases and is targeted in therapies for cancer, neurodegeneration, and inflammatory bone diseases.

<span class="mw-page-title-main">CD47</span> Protein-coding gene in humans

CD47 also known as integrin associated protein (IAP) is a transmembrane protein that in humans is encoded by the CD47 gene. CD47 belongs to the immunoglobulin superfamily and partners with membrane integrins and also binds the ligands thrombospondin-1 (TSP-1) and signal-regulatory protein alpha (SIRPα). CD-47 acts as a don't eat me signal to macrophages of the immune system which has made it a potential therapeutic target in some cancers, and more recently, for the treatment of pulmonary fibrosis.

<span class="mw-page-title-main">CD244</span> Protein found in humans

CD244 also known as 2B4 or SLAMF4 is a protein that in humans is encoded by the CD244 gene.

<span class="mw-page-title-main">NT5E</span> Convert AMP to adenosine.

5′-nucleotidase (5′-NT), also known as ecto-5′-nucleotidase or CD73, is an enzyme that in humans is encoded by the NT5E gene. CD73 commonly serves to convert AMP to adenosine.

<span class="mw-page-title-main">CD200</span> Protein-coding gene in the species Homo sapiens

OX-2 membrane glycoprotein, also named CD200 is a human protein encoded by the CD200 gene. CD200 gene is in human located on chromosome 3 in proximity to genes encoding other B7 proteins CD80/CD86. In mice CD200 gene is on chromosome 16.

<span class="mw-page-title-main">TREM2</span> Protein-coding gene in the species Homo sapiens

Triggering receptor expressed on myeloid cells 2(TREM2) is a protein that in humans is encoded by the TREM2 gene. TREM2 is expressed on macrophages, immature monocyte-derived dendritic cells, osteoclasts, and microglia, which are immune cells in the central nervous system. In the liver, TREM2 is expressed by several cell types, including macrophages, that respond to injury. In the intestine, TREM2 is expressed by myeloid-derived dendritic cells and macrophage. TREM2 is overexpressed in many tumor types and has anti-inflammatory activities. It might therefore be a good therapeutic target.

<span class="mw-page-title-main">Interleukin-1 family</span> Group of cytokines playing a key role in the regulation of immune and inflammatory responses

The Interleukin-1 family is a group of 11 cytokines that plays a central role in the regulation of immune and inflammatory responses to infections or sterile insults.

<span class="mw-page-title-main">DMAC1</span> Protein-coding gene in the species Mus musculus

Transmembrane protein 261 is a protein that in humans is encoded by the TMEM261 gene located on chromosome 9. TMEM261 is also known as C9ORF123 and DMAC1, Chromosome 9 Open Reading Frame 123 and Transmembrane Protein C9orf123 and Distal membrane-arm assembly complex protein 1.

Chromosome 19 open reading frame 18 (c19orf18) is a protein which in humans is encoded by the c19orf18 gene. The gene is exclusive to mammals and the protein is predicted to have a transmembrane domain and a coiled coil stretch. This protein has a function that is not yet fully understood by the scientific community.

<span class="mw-page-title-main">RESF1</span> Protein-coding gene in the species Homo sapiens

Retroelement silencing factor 1 is a protein that in humans is encoded by the RESF1 gene. RESF1 is broadly expressed in the lymph nodes, ovaries, appendix and spleen. RESF1 shows characteristics of being a minor histocompatibility antigen, as well as tumor suppressor capabilities. The high expression in the lymph nodes and spleen indicate function in the immune system.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">SNAP47</span>

Synaptosome-associated protein, 47 kDal (SNAP47) is a human protein encoded by the SNAP47 gene. Other aliases of this gene are SVAP1, HEL170, ESFI5812, and HEL-S-290. SNAP47 is a synaptosome protein which is associated with the protein coding in multiple diseases, including non small cell lung cancer and schizophrenia. SNAP47 is a member of the SNAP protein family. SNAP proteins are t-snare proteins that are a component of SNARE complex. The SNARE complex mediates vesicle fusion by creating tight complex that brings vesicle and membrane together. This protein causes ubiquitous expression in testis, ovary, and many other tissues

<span class="mw-page-title-main">C12orf29</span> Protein-coding gene in humans

C12orf29 is a protein that in humans is encoded by chromosome 12 open reading frame 29. The gene is ubiquitously expressed in various tissues. The protein has 325 amino acids. The biological process of C12orf29 has been annotated as hematopoietic progenitor cell differentiation. The molecular and cellular functions of C12orf29 gene have not yet well understood by the scientific community.

Human protein 53 intron 1 (Hp53int1) is a protein encoded by the Hp53int1 gene in humans.

SERTM2, also known as the Serine Rich And Transmembrane Domain Containing 2, is a protein which in humans is encoded by the SERTM2 gene. The SERTM2 protein is a transmembrane protein located in the intracellular membrane and active in membrane-bound organelles. SERTM2 expression has been linked to metastatic prostate tumors, prostate carcinomas and renal cell carcinomas.

References

  1. "C19orf38 chromosome 19 open reading frame 38 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-10-01.
  2. Sinnott-Armstrong, Nasa; Tanigawa, Yosuke; Amar, David; Mars, Nina; Benner, Christian; Aguirre, Matthew; Venkataraman, Guhan Ram; Wainberg, Michael; Ollila, Hanna M.; Kiiskinen, Tuomo; Havulinna, Aki S.; Pirruccello, James P.; Qian, Junyang; Shcherbina, Anna; FinnGen (February 2021). "Genetics of 35 blood and urine biomarkers in the UK Biobank". Nature Genetics. 53 (2): 185–194. doi:10.1038/s41588-020-00757-z. ISSN   1546-1718. PMC   7867639 . PMID   33462484.
  3. Lee, Sung-Bum; Choi, Ja-Eun; Park, Byoungjin; Cha, Mi-Yeon; Hong, Kyung-Won; Jung, Dong-Hyuk (2022-07-06). "Dyslipidaemia-Genotype Interactions with Nutrient Intake and Cerebro-Cardiovascular Disease". Biomedicines. 10 (7): 1615. doi: 10.3390/biomedicines10071615 . ISSN   2227-9059. PMC   9312854 . PMID   35884923.
  4. Patrick, Matthew T.; Li, Qinmengge; Wasikowski, Rachael; Mehta, Nehal; Gudjonsson, Johann E.; Elder, James T.; Zhou, Xiang; Tsoi, Lam C. (2022-11-02). "Shared genetic risk factors and causal association between psoriasis and coronary artery disease". Nature Communications. 13 (1): 6565. Bibcode:2022NatCo..13.6565P. doi:10.1038/s41467-022-34323-4. ISSN   2041-1723. PMC   9630428 . PMID   36323703.
  5. Hollis, B.; Chatzigeorgiou, C.; Southam, L.; Hatzikotoulas, K.; Kluzek, S.; Williams, A.; Genetics of Osteoarthritis Consortium; Zeggini, E.; Jostins-Dean, L.; Watt, F. E. (October 2023). "Lifetime risk and genetic predisposition to post-traumatic OA of the knee in the UK Biobank". Osteoarthritis and Cartilage. 31 (10): 1377–1387. doi: 10.1016/j.joca.2023.05.012 . hdl: 10044/1/104501 . ISSN   1522-9653. PMID   37247657.
  6. "C19orf38 Gene Expression - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-07.
  7. "Immune cell - C19orf38 - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2023-12-07.
  8. Maouche, Seraya; Poirier, Odette; Godefroy, Tiphaine; Olaso, Robert; Gut, Ivo; Collet, Jean-Phillipe; Montalescot, Gilles; Cambien, François (2008-06-25). "Performance comparison of two microarray platforms to assess differential gene expression in human monocyte and macrophage cells". BMC Genomics. 9: 302. doi: 10.1186/1471-2164-9-302 . ISSN   1471-2164. PMC   2464609 . PMID   18578872.
  9. Carlsson, Robert; Hjalmarsson, Anna; Liberg, David; Persson, Christine; Leanderson, Tomas (2002-10-16). "Genomic structure of mouse SPI-C and genomic structure and expression pattern of human SPI-C". Gene. 299 (1–2): 271–278. doi:10.1016/s0378-1119(02)01078-8. ISSN   0378-1119. PMID   12459275.
  10. "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2023-12-17.
  11. 1 2 Wang, Xiaoxu (2023-03-27). CGEN009/JHU001 IS A NOVEL REGULATOR OF OBESITY AND METABOLISM (Thesis). Johns Hopkins University.
  12. Diaz, Connor; G. Thankam, Finosh; K. Agrawal, Devendra (2023). "Karyopherins in the Remodeling of Extracellular Matrix: Implications in Tendon Injury". Journal of Orthopaedics and Sports Medicine. 05 (3): 357–374. doi:10.26502/josm.511500122. PMC   10569131 . PMID   37829147.
  13. "DeepLoc 2.0 - DTU Health Tech - Bioinformatic Services". services.healthtech.dtu.dk. Retrieved 2023-12-16.
  14. "PSORT II Prediction". psort.hgc.jp. Retrieved 2023-12-16.
  15. "Motif Scan". myhits.sib.swiss. Retrieved 2023-12-17.
  16. "ELM - Search the ELM resource". elm.eu.org. Retrieved 2023-12-17.
  17. "PhosphoSitePlus". www.phosphosite.org. Retrieved 2023-12-17.
  18. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2023-12-17.
  19. "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2023-12-17.
  20. Cooper, Max D.; Alder, Matthew N. (February 2006). "The Evolution of Adaptive Immune Systems". Cell. 124 (4): 815–822. doi: 10.1016/j.cell.2006.02.001 . ISSN   0092-8674. PMID   16497590. S2CID   16590222.
  21. Theodros, Debebe; Murter, Benjamin M.; Sidhom, John-William; Nirschl, Thomas R.; Clark, David J.; Chen, LiJun; Tam, Ada J.; Blosser, Richard L.; Schwen, Zeyad R.; Johnson, Michael H.; Pierorazio, Phillip M.; Zhang, Hui; Ganguly, Sudipto; Pardoll, Drew M.; Zarif, Jelani C. (November 2020). "High-dimensional Cytometry (ExCYT) and Mass Spectrometry of Myeloid Infiltrate in Clinically Localized Clear Cell Renal Cell Carcinoma Identifies Novel Potential Myeloid Targets for Immunotherapy". Molecular & Cellular Proteomics. 19 (11): 1850–1859. doi: 10.1074/mcp.ra120.002049 . ISSN   1535-9476. PMC   7664124 . PMID   32737216.
  22. "Expression of C19orf38 in cancer - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2023-12-17.