IGSF6

Last updated
IGSF6
Identifiers
Aliases IGSF6 , DORA, immunoglobulin superfamily member 6
External IDs OMIM: 606222 MGI: 1891393 HomoloGene: 36189 GeneCards: IGSF6
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_005849

NM_030691

RefSeq (protein)

NP_005840

NP_109616

Location (UCSC) Chr 16: 21.64 – 21.65 Mb Chr 7: 120.66 – 120.67 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

IGSF6 is a protein that in humans is encoded by the IGSF6 gene. [5] [6]

Contents

Predicted IGSF6 Model from Phyre Viewed with iCN3D. IGSF6 Phyre Structure.png
Predicted IGSF6 Model from Phyre Viewed with iCN3D.

Overview

In humans, the immunoglobulin superfamily member 6 (IGSF6) gene with alias DORA encodes CD8 protein IGSF6 (24 kDA) with orthologs in mammals, birds, reptiles, and bony fishes. [7] IGSF6 is located on the complement strand of chromosome 16 (16p12.2) spanning 13059 base pairs and is located entirely within an intron of the gene METTL9. [8] IGSF6 is predicted to be an integral component of the plasma membrane and contribute to immune response. [9] It is also predicted to be involved in cell surface receptor signaling and enable transmembrane signaling receptor activity. IGSF6 gene was localized to a locus associated with inflammatory bowel disease (IBD). However, there was no association with single nucleotide polymorphisms (SNPs) and IBD in patients with the disease. [10]

Gene

A common alias for IGSF6 is downregulated by activation (DORA). The cytogenic location is on chromosome 16 (16p12.2). IGSF6 has 6 exons total. [11] The span of the gene is 13059 base pairs. [12]

Proteins

The theoretical isoelectric point (pI) and molecular weight (mw) for the IGSF6 protein are 8.9 and 27 kDa, respectively, before any modification. The pI of the protein is not consistent throughout, as the N-terminal half has a lower pI than the C-terminal half. IGSF6 is neutral at 8.93 and would be negative around 7. [13]

Eukaryotic Linear Motif (ELM) was used to find protein Motifs. The list ELM provided was after globular domain filtering, structural filtering, and context filtering. The four Motifs shown are organized by probability and are conserved in mammalian orthologs. [14]

ELM Binding Motifs for IGSF6. ELM Motifs.png
ELM Binding Motifs for IGSF6.

Structure

The secondary structure of IGSF6 is predicted to have regions of coils, strands, and alpha helices. [15] The most pronounced helix regions occur from amino acids 149-178 and amino acids 197-218. [16] [17] [18] [19]

I-TASSER predicted three-dimensional structure of IGSF6. I-Tasser IGSF6.png
I-TASSER predicted three-dimensional structure of IGSF6.

IGSF6 contains a transmembrane domain from amino acids 154 to 176. [20] The predicted disulfide bonds were found using DiANNA. [21]

Predicted disulfide bonding patterns from DiANNA. DiANNA disulfide bonds.png
Predicted disulfide bonding patterns from DiANNA.

Gene Level Regulation

Expression Pattern

IGSF6 is highly expressed in white blood cells and secondary lymphoid organs including the lymph nodes and spleen. [22] The mRNA abundance across 20 human tissues is low. [23] The micro-array assessed tissue expression patterns showed high expression in ganglia, monocytes, and myeloid tissue. [24] In situ hybridization showed that the regulation of IGSF6 was low and ubiquitously expressed in the mouse brain. [25] Proteins are localized in the human testis and thyroid. [26]

Promotor and Transcription Factors

The promotor region and transcription factors are shown in the promotor diagram. The transcription factors shown were highly conserved in animal orthologs of IGSF6.

IGSF6 Promotor Diagram. IGSF6 Promotor Diagram.png
IGSF6 Promotor Diagram.

Protein Level Regulation

The IGSF6 protein is predicted to be in the plasma membrane. [27] [28] IGSF6 has a signal peptide from amino acids 17 to 32. [29] IGSF6 has post-translational modifications including phosphorylation sites and lysine acetylation sites. The phosphorylation sites at amino acid positions 3, 5, 91, 193, 198, 222, and 236, and these sites are important in enzymatic function. [30] The lysine acetylation sites are at amino acids 187, 195, 196, 213, and 224, and they are important in gene expression, protein–protein interactions, and protein processing and degradation. [31] [32] IGSF6 has a SUMOylation site at amino acid 190. [33]

Homology and Evolution

Paralogs

The only paralog of IGSF6 is T cell receptor beta variable 28 (TCRBV28). [34] Birds are the most distant organism that TRBV28 is found in, so the gene duplication to create the paralog occurred about 320 million years ago. [35] TRBV28 is a quickly evolving gene, as it evolves similarly to fibrinogen alpha.

IGSF6 Paralog Table. IGSF6 Paralog Table.png
IGSF6 Paralog Table.

Orthologs

The orthologs of IGSF6 were found through NCBI protein and sorted by median date of divergence and sequence identity to the human protein. [36] The IGSF6 protein is found only in vertebrates with the H. sapiens IGSF6 protein being most distantly related to the fish IGSF6 protein. The human IGSF6 protein is most closely related to the IGSF6 protein of other mammals. Aves, reptiles, amphibians, and fish proteins have an average sequence similarity to the human protein of 52%, 50%, 50%, and 45% respectively. [37] IGSF6 is a fast-evolving gene because it evolves similarly to fibrinogen alpha.

IGSF6 Ortholog Table. IGSF6 Orthologs.png
IGSF6 Ortholog Table.
Graph of IGSF6 Showing Rate of Evolution Compared to Other Genes and Proteins. IGSF6 Evolution Rate Graph.png
Graph of IGSF6 Showing Rate of Evolution Compared to Other Genes and Proteins.

Interacting Proteins

The most likely protein to interact with IGSF6 is methyltransferase-like protein 9 (METTL9) because IGSF6 is in an intron of METTL9. Most of the proteins that IGSF6 interacts with have immunological functions. [38]

Proteins that IGSF6 Interacts With. IGSF6 Interacting Proteins.png
Proteins that IGSF6 Interacts With.

Clinical Significance

IGSF6 is predicted to be involved in immunological response. Its high expression in white blood cells and secondary lymphoid organs support this. IGSF6 has been associated with several diseases and conditions.

Inflammatory Bowel Disease

The human IGSF6 gene was localized to a locus associated with inflammatory bowel disease. IGSF6 has been researched as a possible indicator of inflammatory bowel disease (IBD) susceptibility. [39] However, there was no association with single nucleotide polymorphisms (SNPs) and IBD in patients with the disease. [40]

Esophageal Squamous Cell Carcinoma

The combined expression of IGSF6 and nine other genes was significantly related to the overall and disease-free survival in patients with esophageal squamous cell carcinoma. [41]

Multiple Sclerosis

IGSF6 was found to be upregulated in the myeloid cells function pathway in patients with multiple sclerosis, a chronic autoimmune demyelinating disease of the central nervous system. [42]

Related Research Articles

<span class="mw-page-title-main">TMEM8B</span> Protein-coding gene in humans

Transmembrane protein 8B is a protein that in humans is encoded by the TMEM8B gene. It encodes for a transmembrane protein that is 338 amino acids long, and is located on human chromosome 9. Aliases associated with this gene include C9orf127, NAG-5, and NGX61.

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

<span class="mw-page-title-main">C9orf135</span> Mammalian protein found in Homo sapiens

C9orf135 is a gene that encodes a 229 amino acid protein. It is located on Chromosome 9 of the Homo sapiens genome at 9q12.21. The protein has a transmembrane domain from amino acids 124-140 and a glycosylation site at amino acid 75. C9orf135 is part of the GRCh37 gene on Chromosome 9 and is contained within the domain of unknown function superfamily 4572. Also, c9orf135 is known by the name of LOC138255 which is a description of the gene location on Chromosome 9.1.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">C2orf73</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.

<span class="mw-page-title-main">Coiled-coil domain containing 74a</span> Protein found in humans

Coiled-coil domain containing 74A is a protein that in humans is encoded by the CCDC74A gene. The protein is most highly expressed in the testis and may play a role in developmental pathways. The gene has undergone duplication in the primate lineage within the last 9 million years, and its only true ortholog is found in Pan troglodytes.

<span class="mw-page-title-main">C15orf39</span>

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

<span class="mw-page-title-main">PROB1</span> Protein-coding gene in the species Homo sapiens

Proline-rich basic protein 1(PROB1) is a protein encoded by the PROB1 gene located on human chromosome 5, open reading frame 65. PROB1 is also known as C5orf65 and weakly similar to basic proline-rich protein.

<span class="mw-page-title-main">TMEM171</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

<span class="mw-page-title-main">C2orf16</span> Protein-coding gene in the species Homo sapiens

C2orf16 is a protein that in humans is encoded by the C2orf16 gene. Isoform 2 of this protein is 1,984 amino acids long. The gene contains 1 exon and is located at 2p23.3. Aliases for C2orf16 include Open Reading Frame 16 on Chromosome 2 and P-S-E-R-S-H-H-S Repeats Containing Sequence.

<span class="mw-page-title-main">Fam89A</span> Human protein and gene

ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.

<span class="mw-page-title-main">SAAL1</span> Protein-coding gene in the species Homo sapiens

Serum amyloid A-like 1 is a protein in humans encoded by the SAAL1 gene.

<span class="mw-page-title-main">C6orf136</span> Protein-coding gene in the species Homo sapiens

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

<span class="mw-page-title-main">ZNF548</span> Protein-coding gene in the species Homo sapiens

Zinc Finger Protein 548 (ZNF548) is a human protein encoded by the ZNF548 gene which is located on chromosome 19. It is found in the nucleus and is hypothesized to play a role in the regulation of transcription by RNA Polymerase II. It belongs to the Krüppel C2H2-type zinc-finger protein family as it contains many zinc-finger repeats.

<span class="mw-page-title-main">C13orf42</span> C13orf42 gene page

C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">C10orf53</span> Human gene

C10orf53 is a protein that in humans is encoded by the C10orf53 gene. The gene is located on the positive strand of the DNA and is 30,611 nucleotides in length. The protein is 157 amino acids and the gene has 3 exons. C10orf53 orthologs are found in mammals, birds, reptiles, amphibians, fish, and invertebrates. It is primarily expressed in the testes and at very low levels in the cerebellum, liver, placenta, and trachea.

<span class="mw-page-title-main">UBALD1</span> Human Gene/Protein

UBALD1 is a protein encoded by the UBALD1 gene, located on chromosome 16 in humans. UBALD1 has high ubiquitous tissue expression and localizes in the nucleus and cytoplasm. UBALD1 is conserved in animals, including invertebrates. An alias for UBALD1 is FAM100A.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000140749 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000035004 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Phyre Predicted Structure of IGSF6
  6. iCN3D .
  7. NCBI (National Library of Medicine) .
  8. NCBI (National Library of Medicine IGSF6 .
  9. NCBI (National Library of Medicine) Gene Entry on IGSF6 .
  10. King, K., Moody, A., Fisher, S. A., Mirza, M. M., Cuthbert, A. P., Hampe, J., Sutherland-Craggs, A., Sanderson, J., MacPherson, A. J., Forbes, A., Mansfield, J., Schreiber, S., Lewis, C. M., & Mathew, C. G. (2003). Genetic variation in the IGSF6 gene and lack of association with inflammatory bowel disease. European journal of immunogenetics : official journal of the British Society for Histocompatibility and Immunogenetics, 30(3), 187–190. https://doi.org/10.1046/j.1365-2370.2003.00387.x
  11. NCBI IGSF6 mRNA .
  12. UCSC Genome Browser IGSF6 .
  13. ExPASy pI/Mw for IGSF6 [ permanent dead link ].
  14. ELM Prediction of IGSF6 Protein .
  15. I-Tasser Results for IGSF6 [ permanent dead link ].
  16. I-Tasser Results for IGSF6 [ permanent dead link ].
  17. Wei Zheng, Chengxin Zhang, Yang Li, Robin Pearce, Eric W. Bell, Yang Zhang. Folding non-homology proteins by coupling deep-learning contact maps with I-TASSER assembly simulations. Cell Reports Methods, 1: 100014 (2021).
  18. Chengxin Zhang, Peter L. Freddolino, and Yang Zhang. COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information. Nucleic Acids Research, 45: W291-299 (2017).
  19. Jianyi Yang, Yang Zhang. I-TASSER server: new development for protein structure and function predictions, Nucleic Acids Research, 43: W174-W181, 2015.
  20. SAPS analysis of IGSF6 .
  21. DiANNA predicted disulfide bonds for IGSF6 Archived 2022-07-24 at the Wayback Machine .
  22. NCBI Gene Entry on IGSF6 .
  23. NCBI Gene Entry on IGSF6 .
  24. NCBI GeoProfiles of IGSF6 .
  25. Allen Brain Atlas .
  26. ThermoFischer Scientific IGSF6 Antibody .
  27. PSORT II Prediction of IGSF6 .
  28. DeepLoc IGSF6 Prediction .
  29. NCBI (National Library of Medicine) .
  30. GPS Phosphorylation Sites .
  31. GPS-Pail Prediction of Acetylation .
  32. Zencheck, W. D., Xiao, H., & Weiss, L. M. (2012). Lysine post-translational modifications and the cytoskeleton. Essays in biochemistry, 52, 135–145. https://doi.org/10.1042/bse0520135
  33. GPS-SUMO Prediction of SUMOylation sites Archived 2013-05-10 at the Wayback Machine .
  34. NCBI Blast .
  35. TimeTree Divergence of Humans and Birds .
  36. NCBI Protein .
  37. Emboss Needle Alignment .
  38. String DB view of IGSF6 Protein interactions .
  39. Bates, E. E., Kissenpfennig, A., Péronne, C., Mattei, M. G., Fossiez, F., Malissen, B., & Lebecque, S. (2000). The mouse and human IGSF6 (DORA) genes map to the inflammatory bowel disease 1 locus and are embedded in an intron of a gene of unknown function. Immunogenetics, 52(1-2), 112–120. https://doi.org/10.1007/s002510000259
  40. King, K., Moody, A., Fisher, S. A., Mirza, M. M., Cuthbert, A. P., Hampe, J., Sutherland-Craggs, A., Sanderson, J., MacPherson, A. J., Forbes, A., Mansfield, J., Schreiber, S., Lewis, C. M., & Mathew, C. G. (2003). Genetic variation in the IGSF6 gene and lack of association with inflammatory bowel disease. European journal of immunogenetics : official journal of the British Society for Histocompatibility and Immunogenetics, 30(3), 187–190. https://doi.org/10.1046/j.1365-2370.2003.00387.x
  41. Guo, J., Liu, T., Shi, X., Mao, B., Zhang, J., Zhang, H., & Xian, L. (2020). Transcriptional profile of immune microenvironment and their prediction role for the prognosis of esophageal squamous cell carcinoma. Journal of Clinical Oncology, 38(4), 416–416. https://doi.org/10.1200/jco.2020.38.4_suppl.416
  42. Ivanova, M., Voronkova, A., Sukhorukov, V., & Zakharova, M. (2021). Different neuroinflammatory gene expression profiles in highly active and benign multiple sclerosis. Journal of Neuroimmunology, 358, 577650. https://doi.org/10.1016/j.jneuroim.2021.577650