SHOC1

Last updated
SHOC1
Identifiers
Aliases SHOC1 , chromosome 9 open reading frame 84, ZIP2H, ZIP2, shortage in chiasmata 1, C9orf84, MZIP2
External IDs OMIM: 618038 MGI: 2140313 HomoloGene: 79783 GeneCards: SHOC1
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001080551
NM_173521
NM_001378211
NM_001378212

NM_001033200
NM_001370843

RefSeq (protein)

NP_001074020
NP_775792
NP_001365140
NP_001365141

NP_001357772

Location (UCSC) Chr 9: 111.69 – 111.8 Mb Chr 4: 59.04 – 59.14 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Shortage In Chiasmata 1, also known as SHOC1, is a protein that in humans is encoded by the SHOC1 gene.

Contents

Gene

The chromosomal locus of SHOC1 is 9q31.3, which it shares with at least 115 other protein encoding genes, and it is located on the negative strand. [5] [6] In humans it contains 34 exons, and it is 108,834 base pairs long, including introns and exons. C9orf84 is located between the protein encoding genes GNG10 and UGCG . When this gene is transcribed in humans, it most often forms a mRNA which is 4,721 base pairs long and contains 26 exons. There are at least 13 alternate splice forms of C9orf84, with more predicted. [7]

Protein

SHOC1 in humans has at least 6 alternate isoforms, with at least 10 more predicted. [8] The primarily used sequence in humans is C9orf84 Isoform 1. This isoform is 1444 aa long, contains 26 exons, has a predicted molecular weight of 165.190 kDa, and a predicted pI of 5.10. [9]

SHOC1 has been shown to undergo phosphorylation. [10] It is predicted that C9orf84 undergoes several other post-translational modifications, including glycosylation and o-linked glycosylation, and it contains leucine-rich nuclear export signals. [11] [12] [13] Compared to the generic reference set swp23s.q, the primary structure of the protein is deficient in the amino acid grouping AGP (alanine, glycine, proline), and contains more acidic amino acids (glutamate, aspartate) than basic amino acids (lysine, arginine). [14] This is true for the protein in all vertebrates. In the human Isoform 1, there have been 220 identified single nucleotide polymorphisms detected in the coding region, but none have currently been linked to human disease. [15] The secondary structure of this protein is predicted to be mainly alpha-helices in roughly the first two thirds of the protein, and coils in the last third. [16] It is predicted that this protein is localized in the nucleus. [17]

Expression

SHOC1 is ubiquitously expressed in most tissues with higher than average expression in the testes, the kidney, the thymus, and the adrenal gland. [18] [19]

The promoter for SHOC1 Isoform 1 in humans is 639 bp long and overlaps with the 5’ untranslated region of the gene. There are four alternate promoters that promote different transcript variants. [20]

Interactions

SHOC1 has been experimentally determined, through a two hybrid pooling approach, to interact with methionine aminopeptidase, a protein encoded by the maP3 gene in Bacillus anthracis . [21]

Several of the most common and most conserved transcription factor binding sites families that are predicted to be found in C9orf84's promoter region are ETS1 factors, Ccaat/Enhancer Binding Proteins, and Lymphoid enhancer-binding factor 1. [22] ETS1, Ccaat-enhancer-binding proteins, and Lymphoid enhancer-binding factor 1 are all related to immunity.

Evolutionary history

This gene is found in all vertebrates, and some invertebrates. The most distant ortholog detectable by NCBI BLAST is in Nematostella vectensis (starlet sea anemone). [23] The closest plant ortholog to C9orf84 is the SHOC1 protein in Arabidopsis thaliana . [24] C9orf84 is not very well conserved even among mammals.

Clinical significance

SHOC1 is highly upregulated in psoriasis patients with lesional skin as opposed to psoriasis patients with non-lesional skin and non-psoriasis patients. [25]

Related Research Articles

<span class="mw-page-title-main">YIF1A</span> Protein-coding gene in the species Homo sapiens

Protein YIF1A is a Yip1 domain family proteins that in humans is encoded by the YIF1A gene.

<span class="mw-page-title-main">SUHW4</span> Protein-coding gene in the species Homo sapiens

Zinc finger protein 280D, also known as Suppressor Of Hairy Wing Homolog 4, SUWH4, Zinc Finger Protein 634, ZNF634, or KIAA1584, is a protein that in humans is encoded by the ZNF280D gene located on chromosome 15q21.3.

<span class="mw-page-title-main">EVI5L</span> Protein-coding gene in the species Homo sapiens

EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.

<span class="mw-page-title-main">Proser2</span> Protein-coding gene in the species Homo sapiens

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">ERICH2</span> Protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.

BEND2 is a protein that in humans is encoded by the BEND2 gene. It is also found in other vertebrates, including mammals, birds, and reptiles. The expression of BEND2 in Homo sapiens is regulated and occurs at high levels in the skeletal muscle tissue of the male testis and in the bone marrow. The presence of the BEN domains in the BEND2 protein indicates that this protein may be involved in chromatin modification and regulation.

<span class="mw-page-title-main">C2orf73</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.

<span class="mw-page-title-main">Proline-rich protein 30</span>

Proline-rich protein 30 is a protein in humans that is encoded for by the PRR30 gene. PRR30 is a member in the family of Proline-rich proteins characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk for neurofibromatosis.

<span class="mw-page-title-main">C1orf112</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 112, is a protein that in humans is encoded by the C1orf112 gene, and is located at position 1q24.2. C1orf112 encodes for seventeen variants of mRNA, fifteen of which are functional proteins. C1orf112 has a determined precursor molecular weight of 96.6 kDa and an isoelectric point of 5.62. C1orf112 has been experimentally determined to localize to the mitochondria, although it does not contain a mitochondrial targeting sequence.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

<span class="mw-page-title-main">C5orf46</span> Protein-coding gene in the species Homo sapiens

C5orf46 is a protein coding gene located on chromosome 5 in humans. It is also known as sssp1, or skin and saliva secreted protein 1. There are two known isoforms known in humans, with isoform 2 being the longer of the two. The protein encoded is predicted to have one transmembrane domain, and has a predicted molecular weight of 9,692 Da, and a basal isoelectric point of 4.67.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">C7orf50</span> Mammalian protein found in Homo sapiens

C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.

<span class="mw-page-title-main">SAAL1</span> Protein-coding gene in the species Homo sapiens

Serum amyloid A-like 1 is a protein in humans encoded by the SAAL1 gene.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">C6orf136</span> Protein-coding gene in the species Homo sapiens

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

<span class="mw-page-title-main">C5orf22</span> Protein-coding gene in the species Homo sapiens

Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).

<span class="mw-page-title-main">THAP3</span> Protein in Humans

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000165181 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000038598 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "1) C9orf84 chromosome 9 open reading frame 84 [ Homo sapiens (human) ]". National Center for Biotechnology Information.
  6. "Homo sapiens (human) Map Location: 9q31". GenScript. Archived from the original on 2015-09-24. Retrieved 2015-05-09.
  7. "Homo sapiens complex locus C9orf84, encoding chromosome 9 open reading frame 84". AceView.
  8. "Protein Search C9orf84". National Center for Biotechnology Information.
  9. "Compute pI/MW". ExPASy.
  10. Wu X, Tian L, Li J, Zhang Y, Han V, Li Y, Xu X, Li H, Chen X, Chen J, Jin W, Xie Y, Han J, Zhong CQ (December 2012). "Investigation of receptor interacting protein (RIP3)-dependent protein phosphorylation by quantitative phosphoproteomics". Molecular & Cellular Proteomics. 11 (12): 1640–51. doi: 10.1074/mcp.M112.019091 . PMC   3518118 . PMID   22942356.
  11. "NetGlycate". Center for Biological Sequence Analysis.
  12. "NetOGlyc". Center for Biological Sequence Analysis.
  13. "NetNES". Center for Biological Sequence Analysis.
  14. "SAPS". SDSC Biology WorkBench.[ permanent dead link ]
  15. "SNP linked to Gene (geneID:158401) Via Contig Annotation". NCBI dbSNP.
  16. "PELE". SDSC Biology WorkBench.[ permanent dead link ]
  17. "PSORTII". PSORT.
  18. Dezso Z, Nikolsky Y, Sviridov E, Shi W, Serebriyskaya T, Dosymbekov D, Bugrim A, Rakhmatulin E, Brennan RJ, Guryanov A, Li K, Blake J, Samaha RR, Nikolskaya T (November 2008). "A comprehensive functional analysis of tissue specificity of human gene expression". BMC Biology. 6: 49. doi: 10.1186/1741-7007-6-49 . PMC   2645369 . PMID   19014478.
  19. Shyamsundar R, Kim YH, Higgins JP, Montgomery K, Jorden M, Sethuraman A, van de Rijn M, Botstein D, Brown PO, Pollack JR (2005). "A DNA microarray survey of gene expression in normal human tissues". Genome Biology. 6 (3): R22. doi: 10.1186/gb-2005-6-3-r22 . PMC   1088941 . PMID   15774023.
  20. "ElDorado". Genomatix.[ permanent dead link ]
  21. Dyer MD, Neff C, Dufford M, Rivera CG, Shattuck D, Bassaganya-Riera J, Murali TM, Sobral BW (August 2010). "The human-bacterial pathogen protein interaction networks of Bacillus anthracis, Francisella tularensis, and Yersinia pestis". PLOS ONE. 5 (8): e12089. Bibcode:2010PLoSO...512089D. doi: 10.1371/journal.pone.0012089 . PMC   2918508 . PMID   20711500.
  22. "MatInspector". Genomatix.[ permanent dead link ]
  23. "BLAST". National Center for Biotechnology Information.
  24. Macaisne N, Novatchkova M, Peirera L, Vezon D, Jolivet S, Froger N, Chelysheva L, Grelon M, Mercier R (September 2008). "SHOC1, an XPF endonuclease-related protein, is essential for the formation of class I meiotic crossovers". Current Biology. 18 (18): 1432–7. Bibcode:2008CBio...18.1432M. doi: 10.1016/j.cub.2008.08.041 . PMID   18812090. S2CID   16418136.
  25. Nair RP, Duffin KC, Helms C, Ding J, Stuart PE, Goldgar D, Gudjonsson JE, Li Y, Tejasvi T, Feng BJ, Ruether A, Schreiber S, Weichenthal M, Gladman D, Rahman P, Schrodi SJ, Prahalad S, Guthery SL, Fischer J, Liao W, Kwok PY, Menter A, Lathrop GM, Wise CA, Begovich AB, Voorhees JJ, Elder JT, Krueger GG, Bowcock AM, Abecasis GR (February 2009). "Genome-wide scan reveals association of psoriasis with IL-23 and NF-kappaB pathways". Nature Genetics. 41 (2): 199–204. doi:10.1038/ng.311. PMC   2745122 . PMID   19169254.