C9orf152

Last updated
C9orf152
Identifiers
Aliases C9orf152 , bA470J20.2, chromosome 9 open reading frame 152
External IDs MGI: 2442889 HomoloGene: 52276 GeneCards: C9orf152
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001012993

NM_178727

RefSeq (protein)

NP_001013011
NP_001013011.2

NP_848842

Location (UCSC) Chr 9: 110.19 – 110.21 Mb Chr 4: 57.91 – 57.92 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Chromosome 9 open reading frame 152 is a protein that in humans is encoded by the C9orf152 gene. [5] [6] The exact function of the protein is not completely understood.

Contents

Gene

The human gene C9orf152 is located on the long (q) arm of Chromosome 9. [7] Its cytogenetic location is 9q31.1. It has one known alias: bA470J20.2. [8]

The DNA sequence encoding C9orf152 contains a single intron. [7] The final mRNA consists of 2698 base pairs. Nucleotides 66-68 encode an upstream in frame stop codon. [5]

The exact location of C9orf152 alongside the closest genetic neighbors. Location of C9orf152 with neighbors on chromosome 9.jpg
The exact location of C9orf152 alongside the closest genetic neighbors.

Evolution

C9orf152 has orthologs in mammals, birds, reptiles and amphibians. No orthologs have been detected in bony fish or in any invertebrates. [7] [9] The following table lists a subset of conserved orthologs.

Scientific nameCommon nameAccession numberSequence length (aa)Percent identityPercent similarity
Homo sapiens HumanNP_001013011.2239--
Pan troglodytes ChimpanzeeXP_0011451872399898
Tarsius syrichta Philippine tarsierXP_0080643672377885
Ceratotherium simum simum RhinocerosXP_0044237842397882
Sus scrofa Wild boarXP_0031221172397483
Equus caballus HorseXP_0014916972397480
Tursiops truncatus Bottlenose dolphinXP_0043290842347381
Heterocephalus glaber Naked mole ratXP_0049038162397484
Orcinus orca Killer whaleXP_0042694442317279
Mus musculus MouseNP_8488422366272
Rattus norvegicus RatXP_0037540802346270
Chelonia mydas Green sea turtleXP_0070594912673349
Nestor notabilis KeaXP_0100095252653449
Python bivittatus Burmese pythonXP_0074284152343044
Meleagris gallopavo Wild turkeyXP_0107106602672943
Pelodiscus sinensis Chinese softshell turtleXP_0061206152682943
Haliaeetus albicilla White tailed eagleXP_0099114012663348
Xenopus tropicalis Western clawed frogXP_0049155652263145

Differences among shown orthologs suggest a slow rate of evolution. [10]

Protein

Chromosome 9 open reading frame 152 contains 239 amino acids. The molecular weight is 26.3 kilodaltons. The protein has a high chance of existing nuclear region of cells. [11] There are likely no transmembrane regions. [12] One isoform exists, containing 194 amino acids. [9] [13]

Within the coding sequence, there are two sumoylation sites [14] [15] [16] and a single serine phosphorylation site. [17]

There are three regions predicted to form alpha helices on the final protein. [18] [19]

Expression

Expression of C9orf152 in the brain of a mouse via Allen Brain Atlas. The only area of high expression is the dark purple on the left, which is located in the olfactory bulb. C9orf152 Brain Expression.jpg
Expression of C9orf152 in the brain of a mouse via Allen Brain Atlas. The only area of high expression is the dark purple on the left, which is located in the olfactory bulb.

C9orf152 is expressed in the bladder, intestine, mammary gland, and trachea and in smaller amounts in the lungs, liver, prostate, uterus, and brain. [20] Within the brain, expression of C9orf152 is limited to the olfactory bulb. [21] Gene expression was found to increase in the presence of stress, including disease and heat stress. [22]

A wide variety of transcription factors interact with the promoter of C9orf152, most notably two olfactory related factors (specifically, a neuron-specific olfactory factor and an olfactory associated zinc finger protein) and a negative glucocorticoid response element. [23]

Related Research Articles

<span class="mw-page-title-main">C11orf49</span> Protein-coding gene in the species Homo sapiens

C11orf49 is a protein coding gene that in humans encodes for the C11orf49 protein. It is heavily expressed in brain tissue and peripheral blood mononuclear cells, with the latter being an important component of the immune system. It is predicted that the C11orf49 protein acts as a kinase, and has been shown to interact with HTT and APOE2.

<span class="mw-page-title-main">C11orf16</span>

Gene C11orf16, chromosome 11 open reading frame 16, is a protein in humans that is encoded by the C11orf16 gene. It has 7 exons, and the size of 467 amino acids.

<span class="mw-page-title-main">C1orf21</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C1orf21, also known as Proliferation-Inducing Protein 13, is a protein that in humans is encoded by the C1orf21 gene. C1orf21 is an intracellular protein that flows between the nucleus and the cytoplasm in the cell. It has been linked with cell growth and reproduction and there has been strong links with various types of cancers. There are no paralogs for this gene, however, many conserved orthologs have been found in all invertebrates. C1orf21 has low to moderate level of expression in most tissues in humans, however, it has the most expression in the skin, lung and prostate.

<span class="mw-page-title-main">C9orf64</span> Protein-coding gene in the species Homo sapiens

C9orf64 is a gene located on chromosome 9, that in humans encodes the protein queuosine salvage protein. The function and biological process of the queuosine salvage protein is not well understood by the scientific community, but some evidence from orthologs indicates it may be involved in tRNA processing. The most common mRNA contains 4 coding exons, and it has 2 additional alternatively spliced exons. C9orf64 has been found in 5 different splice variants.

<span class="mw-page-title-main">OSER1</span>

Chromosome 20 open reading frame 111, or C20orf111, is the hypothetical protein that in humans is encoded by the C20orf111 gene. C20orf111 is also known as Perit1, HSPC207, and dJ1183I21.1. It was originally located using genomic sequencing of chromosome 20. The National Center for Biotechnology Information, or NCBI, shows that it is located at q13.11 on chromosome 20, however the genome browser at the University of California-Santa Cruz (UCSC) website shows that it is at location q13.12, and within a million base pairs of the adenosine deaminase locus. It was also found to have an increase in expression in cells undergoing hydrogen peroxide(H
2
O
2
)-induced apoptosis. After analyzing the amino acid content of C20orf111, it was found to be rich in serine residues.

<span class="mw-page-title-main">SHOC1</span>

Shortage In Chiasmata 1, also known as SHOC1, is a protein that in humans is encoded by the SHOC1 gene.

<span class="mw-page-title-main">C2orf73</span>

Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.

<span class="mw-page-title-main">C8orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

<span class="mw-page-title-main">C17orf50</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C17orf50 is a protein which in humans is encoded by the C17orf50 gene.

<span class="mw-page-title-main">C18orf63</span> Protein-coding gene in the species Homo sapiens

Chromosome 18 open reading frame 63 is a protein which in humans is encoded by the C18orf63 gene. This protein is not yet well understood by the scientific community. Research has been conducted suggesting that C18orf63 could be a potential biomarker for early stage pancreatic cancer and breast cancer.

<span class="mw-page-title-main">C3orf67</span> Human gene

Chromosome 3 open reading frame 67 or C3orf67 is a protein that in humans is encoded by the gene C3orf67. The function of C3orf67 is not yet fully understood.

<span class="mw-page-title-main">Chromosome 9 open reading frame 43</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

<span class="mw-page-title-main">C9orf25</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">C9orf50</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

<span class="mw-page-title-main">C22orf23</span>

C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">C11orf98</span>

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000188959 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000052117 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 "NCBI Gene". National Center of Biotechnology Information.
  6. "Symbol Report: C9orf152". HUGO Gene Nomenclature Committee.
  7. 1 2 3 "UCSC Genome Browser on Human Feb. 2009 (GRCh37/hg19) Assembly". Human BLAT Search. University of California Santa Cruz.
  8. "Chromosome 9 Open Reading Frame 152". GeneCards.
  9. 1 2 "BLAST: Basic Local Alignment Search Tool". National Center for Biotechnology Information.
  10. Hedges SB, Dudley J, Kumar S (Dec 2006). "TimeTree: a public knowledge-base of divergence times among organisms". Bioinformatics. 22 (23): 2971–2. doi: 10.1093/bioinformatics/btl505 . PMID   17021158.
  11. "PSORTII". GenScript. Retrieved 26 April 2015.
  12. "SOSUI". Classification and Secondary Structure Prediction of Membrane Proteins.
  13. "PREDICTED: uncharacterized protein C9orf152 isoform X1 [Homo sapiens]". National Center of Biotechnology Information.
  14. "SUMOplot". ExPASy: SIB Bioinformatics Resource Portal. Retrieved 26 April 2015.
  15. Zhao Q, Xie Y, Zheng Y, Jiang S, Liu W, Mu W, Liu Z, Zhao Y, Xue Y, Ren J (Jul 2014). "GPS-SUMO: a tool for the prediction of sumoylation sites and SUMO-interaction motifs". Nucleic Acids Research. 42 (Web Server issue): W325–30. doi:10.1093/nar/gku383. PMC   4086084 . PMID   24880689.
  16. Ren J, Gao X, Jin C, Zhu M, Wang X, Shaw A, Wen L, Yao X, Xue Y (Jun 2009). "Systematic study of protein sumoylation: Development of a site-specific predictor of SUMOsp 2.0". Proteomics. 9 (12): 3409–3412. doi:10.1002/pmic.200800646. PMID   19504496. S2CID   4900031.
  17. "NetPhos 2.0 Server". ExPASy: SIB Bioinformatics Resource Portal. Retrieved 26 April 2015.
  18. "PELE- Protein Structure Prediction". SDSC Biology WorkBench. Retrieved 26 April 2015.
  19. Subramaniam S (Jul 1998). "The Biology Workbench--a seamless database and analysis environment for the biologist". Proteins. 32 (1): 1–2. doi:10.1002/(sici)1097-0134(19980701)32:1<1::aid-prot1>3.0.co;2-q. PMID   9672036. S2CID   1412129.
  20. "Chromosome 9 open reading frame 152 (C9orf152)". National Center for Biotechnology Information. Retrieved 26 April 2015.
  21. "D630039A03Rik - RP_040920_02_E06 - sagittal". Allen Brain Atlas.
  22. "C9or152 - GEO Profiles". National Center of Biotechnology Information. Retrieved 26 April 2015.
  23. "Genomatix - NGS Data Analysis & Personalized Medicine". Genomatix. Retrieved 26 April 2015.