C19orf38

Last updated

Highly Expressed In Immature Dendritic Cell Transcript 1 (HIDE1) is a protein encoded by chromosome 19 open reading frame 38 (C19orf38) gene in humans. [1] There are no other aliases used for the gene. C19orf38 is only expressed in white blood cells, of the innate immune system. HIDE1 protein has been found to play a role in immune escape of tumors and diet induced obesity.

Contents

Gene

C19orf38 location on chromosome 19, gene neighborhood, and gene overview. C19orf38 gene location-01.jpg
C19orf38 location on chromosome 19, gene neighborhood, and gene overview.

Risk-associated variants

There are five risk associated variants found within the c19orf38 gene. Three of which lead to a significant increase in low density lipoprotein cholesterol. [2] [3] One variant is associated with prevalence of coronary artery disease. [4] And the fifth identified risk variant is associated with increased reporting of Idiopathic knee osteoarthritis. [5]

mRNA Transcripts

Isoforms

C19orf38 can be alternatively spliced to form three distinct mRNA products. Both isoform's 1 and 2 differ only via the 5' UTR. Isoform 3 has a different protein product in that the mRNA transcript does not contain exon 2 or exon 3, however, isoform 3 is not expressed in humans.

Tissue Localization

C19orf38 transcript is found at the highest amount in bone marrow, with less than a fifth of the transcript amount in the spleen, testis, appendix, and lymph nodes, with little to no transcript in other tissue types. Tissues with the transcript have a high leukocyte presence. [6] It is exclusively present in the following cell types: monocytes, peripheral blood mononuclear cells, eosinophils and basophil's, so any expression in tissues comes from innate immune cells, or granulocytes. [7] Transcript is not present in neutrophils. C19orf38 transcript is not found in macrophages, despite, classical monocyte expression. [8]

Regulation of Transcription

The promoter region of C19orf38 contains two transcription factor binding domains that are particularly important for innate immune system development: Spi-C Transcription Factor (SPIC) and E74 Like ETS Transcription Factor 3 (ELF3). Both are transcription factors are only present in leukocytes are involved in the negative transcription of genes for the development of macrophages, which coincides with cellular localization of C19orf38. [9]

Protein

Structure

HIDE1 protein three-dimensional structure. C19orf38 Hide1 Protein Annotated -02.jpg
HIDE1 protein three-dimensional structure.

HIDE1 is a 230 amino acid transmembrane protein, anchored via ɑ-helix transmembrane region. F-box only protein 2 (FBXO2) binds in an extracellular region to glycosylated arginine amino acids found at positions 48 and 97. [11] The extracellular region also contains a highly conserved signal peptide sequence, which leads the protein to the membrane space. Additionally, HIDE1 protein contains a disordered region in its intracellular region. TNPO3 and XPO-4 are known to interact with HIDE1. [12]

Sub-cellular localization

Human HIDE1 protein is largely confirmed to be a signal protein existing either embedded within the cellular membrane or in a secreted form. Deeploc signal analysis predicts a signal peptide region at the start of its translation. [13] Furthermore, PSORT2 k-NN prediction finds the protein to be localized extracellularly 34.8% of the time, 30.4% in the plasma membrane, 21.7% in the endoplasmic reticulum, and 13.0% in the golgi bodies. [14]

Binding motifs

HIDE1 protein contains an ig-like domain and signal peptide in its extracellular region as well as multiple lipidification sites to assist with membrane association. [15] [16] Additionally, N-linked glycosylation sites can be found in the luminal side. The intracellular/cytoplasmic region contains multiple phosphorylation sites and calpain cleavage locations. [17]

Human Protein HIDE1 domain, motif, and post translational modification diagram. C19orf38PTMs-01.jpg
Human Protein HIDE1 domain, motif, and post translational modification diagram.

Homology

Orthologs

Orthologs are found in the following taxon classes: Mammalia, Reptilia, Aves, and Amphibia. There are no orthologs found in either class Insecta or Actinopterygii. C19orf38 is only present in jawed vertebrates which coincides with the divergence of adaptive immune systems 550 MYA between jawed and jawless vertebrates.

Table of C19orf38 transcript orthologs and related properties. Data is organized by median date of divergence (MYA), and then sequence identity to Homo sapien (Hsa) protein. C19orf38 Ortholog Table-01.jpg
Table of C19orf38 transcript orthologs and related properties. Data is organized by median date of divergence (MYA), and then sequence identity to Homo sapien (Hsa) protein.
human C19orf38 unrooted evolutionary tree. C19orf38-01 Phylogenetic Tree.jpg
human C19orf38 unrooted evolutionary tree.

Evolutionary rate

C19orf38 mutation rate is found to be less than that of fibrinogen alpha, but is high in comparison to other human proteins, especially, immune proteins which are highly conserved in jawed vertebrates. [20]

Clinical significance

HIDE1 shows no significant association with any cancer. [21]

Related Research Articles

CD58, or lymphocyte function-associated antigen 3 (LFA-3), is a cell adhesion molecule expressed on Antigen Presenting Cells (APCs), particularly macrophages, and other tissue cells.

<span class="mw-page-title-main">Granulocyte-macrophage colony-stimulating factor receptor</span> Protein-coding gene in humans

The granulocyte-macrophage colony-stimulating factor receptor, also known as CD116, is a receptor for granulocyte-macrophage colony-stimulating factor, which stimulates the production of white blood cells. In contrast to M-CSF and G-CSF which are lineage specific, GM-CSF and its receptor play a role in earlier stages of development. The receptor is primarily located on neutrophils, eosinophils and monocytes/macrophages, it is also on CD34+ progenitor cells (myeloblasts) and precursors for erythroid and megakaryocytic lineages, but only in the beginning of their development.

<span class="mw-page-title-main">CEBPB</span> Protein-coding gene in humans

CCAAT/enhancer-binding protein beta is a protein that in humans is encoded by the CEBPB gene.

<span class="mw-page-title-main">SLAMF1</span> Protein-coding gene in humans

Signaling lymphocytic activation molecule 1 is a protein that in humans is encoded by the SLAMF1 gene. Recently SLAMF1 has also been designated CD150.

<span class="mw-page-title-main">CLEC7A</span> Protein-coding gene in humans

C-type lectin domain family 7 member A or Dectin-1 is a protein that in humans is encoded by the CLEC7A gene. CLEC7A is a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. The encoded glycoprotein is a small type II membrane receptor with an extracellular C-type lectin-like domain fold and a cytoplasmic domain with a partial immunoreceptor tyrosine-based activation motif. It functions as a pattern-recognition receptor for a variety of β-1,3-linked and β-1,6-linked glucans from fungi and plants, and in this way plays a role in innate immune response. Expression is found on myeloid dendritic cells, monocytes, macrophages and B cells. Alternate transcriptional splice variants, encoding different isoforms, have been characterized. This gene is closely linked to other CTL/CTLD superfamily members on chromosome 12p13 in the natural killer gene complex region.

<span class="mw-page-title-main">ATP6V1G2</span>

V-type proton ATPase subunit G 2 is an enzyme that in humans is encoded by the ATP6V1G2 gene.

<span class="mw-page-title-main">FCGR2B</span> Antibody fragment receptor in humoral immunity

Fc fragment of IgG receptor IIb is a low affinity inhibitory receptor for the Fc region of immunoglobulin gamma (IgG). FCGR2B participates in the phagocytosis of immune complexes and in the regulation of antibody production by B lymphocytes.

<span class="mw-page-title-main">TREM2</span> Protein-coding gene in the species Homo sapiens

Triggering receptor expressed on myeloid cells 2(TREM2) is a protein that in humans is encoded by the TREM2 gene. TREM2 is expressed on macrophages, immature monocyte-derived dendritic cells, osteoclasts, and microglia, which are immune cells in the central nervous system. In the liver, TREM2 is expressed by several cell types, including macrophages, that respond to injury. In the intestine, TREM2 is expressed by myeloid-derived dendritic cells and macrophage. TREM2 is overexpressed in many tumor types and has anti-inflammatory activities. It might therefore be a good therapeutic target.

<span class="mw-page-title-main">Interleukin-1 family</span> Group of cytokines playing a key role in the regulation of immune and inflammatory responses

The Interleukin-1 family is a group of 11 cytokines that plays a central role in the regulation of immune and inflammatory responses to infections or sterile insults.

<span class="mw-page-title-main">SLC46A3</span> Protein-coding gene in the species Homo sapiens

Solute carrier family 46 member 3 (SLC46A3) is a protein that in humans is encoded by the SLC46A3 gene. Also referred to as FKSG16, the protein belongs to the major facilitator superfamily (MFS) and SLC46A family. Most commonly found in the plasma membrane and endoplasmic reticulum (ER), SLC46A3 is a multi-pass membrane protein with 11 α-helical transmembrane domains. It is mainly involved in the transport of small molecules across the membrane through the substrate translocation pores featured in the MFS domain. The protein is associated with breast and prostate cancer, hepatocellular carcinoma (HCC), papilloma, glioma, obesity, and SARS-CoV. Based on the differential expression of SLC46A3 in antibody-drug conjugate (ADC)-resistant cells and certain cancer cells, current research is focused on the potential of SLC46A3 as a prognostic biomarker and therapeutic target for cancer. While protein abundance is relatively low in humans, high expression has been detected particularly in the liver, small intestine, and kidney.

Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">FAM155B</span> Protein-coding gene in humans

Family with Sequence Similarity 155 Member B is a protein in humans that is encoded by the FAM155B gene. It belongs to a family of proteins whose function is not yet well understood by the scientific community. It is a transmembrane protein that is highly expressed in the heart, thyroid, and brain.

Transmembrane protein 39B (TMEM39B) is a protein that in humans is encoded by the gene TMEM39B. TMEM39B is a multi-pass membrane protein with eight transmembrane domains. The protein localizes to the plasma membrane and vesicles. The precise function of TMEM39B is not yet well-understood by the scientific community, but differential expression is associated with survival of B cell lymphoma, and knockdown of TMEM39B is associated with decreased autophagy in cells infected with the Sindbis virus. Furthermore, the TMEM39B protein been found to interact with the SARS-CoV-2 ORF9C protein. TMEM39B is expressed at moderate levels in most tissues, with higher expression in the testis, placenta, white blood cells, adrenal gland, thymus, and fetal brain.

C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">GPATCH2L</span> It is Wikipedia article of unknown gene called "GPATCH2L".

GPATCH2L is a protein that is encoded by the GPATCH2L human gene located at 14q24.3. In humans, the length of mRNA in GPATCH2L (NM_017926) is 14,021 base pairs and the gene spans bases is 62,422 nt between chr14: 76,151,922 - 76,214,343. GPATCH2L is on the positive strand. IFT43 is the gene directly before GPATCH2L on the positive strand and LOC105370575 is the uncharacterized gene on the negative strand, which is approximately one and a half the size of GPATCH2L. Known aliases for GPATCH2L contain C14orf118, FLJ20689, FLJ10033, and KIAA1152. GPATCH2L produces 28 distinct introns, 17 different mRNAs, 14 alternatively spliced variants, and 3 unspliced forms. It has 5 probable alternative promoters, 7 validated polyadenylation sites, and 6 predicted promoters of varying lengths.

<span class="mw-page-title-main">TEKTIP1</span> Gene

TEKTIP1, also known as tektin-bundle interacting protein 1, is a protein that in humans is encoded by the TEKTIP1 gene.

<span class="mw-page-title-main">TMEM61</span> Protein and coding gene in humans

Transmembrane protein 61 (TMEM61) is a protein that is encoded by the TMEM61 gene in humans. It is located on the first chromosome in humans and is highly expressed in the intestinal regions predominantly the kidney, adrenal gland and pituitary tissues. The protein, unlike other transmembrane protein in the region does not promote cancer growth. However, the TMEM61 protein when inhibited by secondary factors restricts normal activity in the kidney. The human protein shares many Orthologs and has been prevalent on Earth for millions of years.

<span class="mw-page-title-main">C6orf118</span> Protein-coding gene in the species Homo sapiens

C6orf118 is a protein in humans, which is encoded by the C6orf118 gene. The protein domain, translin-associated factor X-interacting N-terminus (TRAX), is involved in RNA binding and RNA nuclease activity and in the regulation of mitochondrial function and cellular homeostasis. TRAX interacts with translin, a DNA-binding protein that binds to consensus sequences at breakpoint junctions of chromosomal translocation. TRAX in general contains bipartite nuclear targeting sequences, which may provide nuclear transport for translin, as translin lacks any nuclear targeting motifs. This protein is localized to the mitochondria.

References

  1. "C19orf38 chromosome 19 open reading frame 38 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-10-01.
  2. Sinnott-Armstrong, Nasa; Tanigawa, Yosuke; Amar, David; Mars, Nina; Benner, Christian; Aguirre, Matthew; Venkataraman, Guhan Ram; Wainberg, Michael; Ollila, Hanna M.; Kiiskinen, Tuomo; Havulinna, Aki S.; Pirruccello, James P.; Qian, Junyang; Shcherbina, Anna; FinnGen (February 2021). "Genetics of 35 blood and urine biomarkers in the UK Biobank". Nature Genetics. 53 (2): 185–194. doi:10.1038/s41588-020-00757-z. ISSN   1546-1718. PMC   7867639 . PMID   33462484.
  3. Lee, Sung-Bum; Choi, Ja-Eun; Park, Byoungjin; Cha, Mi-Yeon; Hong, Kyung-Won; Jung, Dong-Hyuk (2022-07-06). "Dyslipidaemia-Genotype Interactions with Nutrient Intake and Cerebro-Cardiovascular Disease". Biomedicines. 10 (7): 1615. doi: 10.3390/biomedicines10071615 . ISSN   2227-9059. PMC   9312854 . PMID   35884923.
  4. Patrick, Matthew T.; Li, Qinmengge; Wasikowski, Rachael; Mehta, Nehal; Gudjonsson, Johann E.; Elder, James T.; Zhou, Xiang; Tsoi, Lam C. (2022-11-02). "Shared genetic risk factors and causal association between psoriasis and coronary artery disease". Nature Communications. 13 (1): 6565. Bibcode:2022NatCo..13.6565P. doi:10.1038/s41467-022-34323-4. ISSN   2041-1723. PMC   9630428 . PMID   36323703.
  5. Hollis, B.; Chatzigeorgiou, C.; Southam, L.; Hatzikotoulas, K.; Kluzek, S.; Williams, A.; Genetics of Osteoarthritis Consortium; Zeggini, E.; Jostins-Dean, L.; Watt, F. E. (October 2023). "Lifetime risk and genetic predisposition to post-traumatic OA of the knee in the UK Biobank". Osteoarthritis and Cartilage. 31 (10): 1377–1387. doi: 10.1016/j.joca.2023.05.012 . hdl: 10044/1/104501 . ISSN   1522-9653. PMID   37247657.
  6. "C19orf38 Gene Expression - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-07.
  7. "Immune cell - C19orf38 - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2023-12-07.
  8. Maouche, Seraya; Poirier, Odette; Godefroy, Tiphaine; Olaso, Robert; Gut, Ivo; Collet, Jean-Phillipe; Montalescot, Gilles; Cambien, François (2008-06-25). "Performance comparison of two microarray platforms to assess differential gene expression in human monocyte and macrophage cells". BMC Genomics. 9: 302. doi: 10.1186/1471-2164-9-302 . ISSN   1471-2164. PMC   2464609 . PMID   18578872.
  9. Carlsson, Robert; Hjalmarsson, Anna; Liberg, David; Persson, Christine; Leanderson, Tomas (2002-10-16). "Genomic structure of mouse SPI-C and genomic structure and expression pattern of human SPI-C". Gene. 299 (1–2): 271–278. doi:10.1016/s0378-1119(02)01078-8. ISSN   0378-1119. PMID   12459275.
  10. "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2023-12-17.
  11. Wang, Xiaoxu (2023-03-27). CGEN009/JHU001 IS A NOVEL REGULATOR OF OBESITY AND METABOLISM (Thesis). Johns Hopkins University.
  12. Diaz, Connor; G. Thankam, Finosh; K. Agrawal, Devendra (2023). "Karyopherins in the Remodeling of Extracellular Matrix: Implications in Tendon Injury". Journal of Orthopaedics and Sports Medicine. 05 (3): 357–374. doi:10.26502/josm.511500122. PMC   10569131 . PMID   37829147.
  13. "DeepLoc 2.0 - DTU Health Tech - Bioinformatic Services". services.healthtech.dtu.dk. Retrieved 2023-12-16.
  14. "PSORT II Prediction". psort.hgc.jp. Retrieved 2023-12-16.
  15. "Motif Scan". myhits.sib.swiss. Retrieved 2023-12-17.
  16. "ELM - Search the ELM resource". elm.eu.org. Retrieved 2023-12-17.
  17. "PhosphoSitePlus". www.phosphosite.org. Retrieved 2023-12-17.
  18. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2023-12-17.
  19. "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2023-12-17.
  20. Cooper, Max D.; Alder, Matthew N. (February 2006). "The Evolution of Adaptive Immune Systems". Cell. 124 (4): 815–822. doi: 10.1016/j.cell.2006.02.001 . ISSN   0092-8674. PMID   16497590. S2CID   16590222.
  21. "Expression of C19orf38 in cancer - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2023-12-17.