C6orf201

Identifiers
Aliases C6orf201, dJ1013A10.5, chromosome 6 open reading frame 201, protein MGC87625
External IDs MGI: 1914011; HomoloGene: 23555; GeneCards: C6orf201
Orthologs
Species Human, Mouse
Entrez
Ensembl
UniProt
RefSeq (mRNA) Human: NM_001085401, NM_206834; Mouse: NM_025750
RefSeq (protein) Human: NP_001078870; Mouse: NP_080026
Location (UCSC) Human: Chr 6: 4.08 – 4.13 Mb; Mouse: Chr 13: 35.11 – 35.14 Mb
PubMed search [3] [4]

Chromosome 6 open reading frame 201 (C6orf201) is a protein that in humans is encoded by the C6orf201 gene. [5] The gene encodes a nuclear protein that is primarily expressed in the testis. [6] [7]

Gene

In humans, the gene is 51,558 base pairs long. The longest protein, 140 amino acids, is translated from an mRNA that has six exons and is 1,664 nucleotides long. [8] [9]
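The lengths quoted above can be related with some simple arithmetic. The sketch below is only illustrative: it assumes the 140-residue count includes the initiator methionine and that the coding sequence ends with a single stop codon, neither of which is stated in the sources. The function and variable names are made up for this example.

```python
# Figures quoted in the text; everything derived from them is an estimate.
GENE_LENGTH_BP = 51_558      # genomic span of the human gene
MRNA_LENGTH_NT = 1_664       # transcript encoding the longest protein
PROTEIN_LENGTH_AA = 140      # longest protein variant


def coding_length_nt(protein_length_aa: int) -> int:
    """Nucleotides needed to encode a protein: three per residue plus one stop codon (assumed)."""
    return 3 * (protein_length_aa + 1)


cds_nt = coding_length_nt(PROTEIN_LENGTH_AA)
untranslated_nt = MRNA_LENGTH_NT - cds_nt          # 5' and 3' untranslated regions combined
transcript_share = MRNA_LENGTH_NT / GENE_LENGTH_BP  # fraction of the gene retained in the mRNA

print(f"coding sequence: {cds_nt} nt")
print(f"untranslated regions: {untranslated_nt} nt")
print(f"transcript length / gene length: {transcript_share:.1%}")
```

Under these assumptions, roughly a quarter of the transcript is coding sequence, and the transcript as a whole corresponds to only a few percent of the genomic span, the remainder being intronic.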

Aliases

Chromosome 6 open reading frame 201 (C6orf201) is also referred to as: dJ1013A10.5, MGC87625, RP5-1013A10.5, and LOC404220. [10]

Locus

C6orf201 is located on chromosome 6 at position 6p25.2 and is encoded on the plus strand. [11] It lies near the FAM217A and ECI2 genes. [12]

Conservation

C6orf201 is highly conserved in primates, moderately conserved in other mammals, and also conserved in a few reptiles. [13] Sequence similarity to Mycoplasma gallinarum is high enough to suggest that a horizontal gene transfer event may have occurred at some point in the evolutionary history of C6orf201. There are no known paralogs or gene duplication events for C6orf201.

Orthologs

Figure: list of C6orf201 homologs (Homolog List.png).

Homologous domains

C6orf201 belongs to DUF4523 (Pfam PF15023), a functionally uncharacterized protein family found in mammals. [14]

Protein

Names

Less common names of the C6orf201 protein are: protein MGC87625, hypothetical protein LOC404220, OTTHUMP00000213693, and OTTHUMP00000213725. [10]

General properties/features

In humans, the longest protein variant is 140 amino acids long, [15] has a molecular weight of 16.2 kDa, [16] and has an isoelectric point of 10.88. [16] C6orf201 is predicted to be a nuclear protein. [7]
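As a rough sanity check on the reported molecular weight, the length can be multiplied by an average residue mass. The sketch below assumes the common rule-of-thumb value of about 110 Da per amino acid residue; this is not the method used by the cited tool, which computes the mass from the actual sequence, and the names in the code are illustrative only.

```python
# Rule-of-thumb average mass of one amino acid residue in a protein (an assumption).
AVERAGE_RESIDUE_MASS_DA = 110.0

PROTEIN_LENGTH_AA = 140     # longest human variant, from the text
REPORTED_MASS_KDA = 16.2    # value reported in the text


def estimate_mass_kda(length_aa: int, residue_mass_da: float = AVERAGE_RESIDUE_MASS_DA) -> float:
    """Approximate protein mass in kDa from chain length alone."""
    return length_aa * residue_mass_da / 1000.0


estimated_kda = estimate_mass_kda(PROTEIN_LENGTH_AA)
relative_gap = abs(REPORTED_MASS_KDA - estimated_kda) / REPORTED_MASS_KDA

print(f"estimated mass: {estimated_kda:.1f} kDa")
print(f"difference from reported value: {relative_gap:.1%}")
```

The length-based estimate lands within a few percent of the reported 16.2 kDa, so the two figures are consistent with each other. A sequence-based calculation would be needed for anything more precise.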

Modification and Structure

C6orf201 has multiple predicted PKC and CKII phosphorylation sites in humans. [17] The protein also has a nuclear localization signal. [7] It contains a conserved alpha helix and a conserved beta strand. [16]

Protein interactions

C6orf201 interacts with SRPK1, [18] TMEM106B, [19] and APP. [20]

Expression

C6orf201 is primarily expressed in the testis in humans and is also expressed in the testis of adult mice and rats. [6] GEO microarray data also support expression of C6orf201 in the testis of humans and mice. [21] [22]

Figure: Expression of C6orf201 in various human tissues, showing highest expression in the testis (human GEO profile).
Figure: Expression of C6orf201 in various mouse tissues, showing highest expression in the testis (mouse GEO profile).

Clinical relevance

Research Results

A toxicology study found that C6orf201 was among the 20 most deregulated genes in monkeys that had been exposed to welding fumes for an extended period of time. [23] Another study examined gene expression after treatment with a methylation inhibitor; C6orf201 was one of many genes examined. [24]

References

  1. GRCh38: Ensembl release 89: ENSG00000185689 - Ensembl, May 2017
  2. GRCm38: Ensembl release 89: ENSMUSG00000021415 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Mungall AJ, Palmer SA, Sims SK, Edwards CA, Ashurst JL, Wilming L, et al. (October 2003). "The DNA sequence and analysis of human chromosome 6". Nature. 425 (6960): 805–11. Bibcode:2003Natur.425..805M. doi:10.1038/nature02055. PMID 14574404.
  6. "EST Profile". National Center for Biotechnology Information. Retrieved 9 May 2015.
  7. Nakai, Kenta. "PSORT II prediction". GenScript. Archived from the original on 6 September 2021. Retrieved 9 May 2015.
  8. "GeneCards: The Human Gene Compendium". Retrieved 8 February 2015.
  9. "Homo sapiens chromosome 6 open reading frame 201 (C6orf201) mRNA". National Center for Biotechnology Information. Retrieved 9 May 2015.
  10. "Homo sapiens gene C6orf201, encoding chromosome 6 open reading frame 201". NCBI: AceView. Retrieved 9 May 2015.
  11. "Genomatix". Retrieved 9 May 2015.
  12. "NCBI gene: C6orf201 Gene". Retrieved 8 February 2015.
  13. "Basic Local Alignment Search Tool". NCBI BLAST.
  14. "Family: DUF4523 (PF15023)". The European Bioinformatics Institute. Retrieved 26 April 2015.
  15. "uncharacterized protein C6orf201 [Homo sapiens]". National Center for Biotechnology Information. Retrieved 9 May 2015.
  16. "The Biology WorkBench". San Diego Super Computer. Retrieved 9 May 2015.
  17. "Motif Scan". ExPASy: SIB Bioinformatics Resource Portal. Retrieved 9 May 2015.
  18. Varjosalo M, Keskitalo S, Van Drogen A, Nurkkala H, Vichalkovski A, Aebersold R, Gstaiger M (April 2013). "The protein interaction landscape of the human CMGC kinase group". Cell Reports. 3 (4): 1306–20. doi:10.1016/j.celrep.2013.03.027. PMID 23602568.
  19. Rolland T, Taşan M, Charloteaux B, Pevzner SJ, Zhong Q, Sahni N, et al. (November 2014). "A proteome-scale map of the human interactome network". Cell. 159 (5): 1212–1226. doi:10.1016/j.cell.2014.10.050. PMC 4266588. PMID 25416956.
  20. Oláh J, Vincze O, Virók D, Simon D, Bozsó Z, Tõkési N, Horváth I, Hlavanda E, Kovács J, Magyar A, Szũcs M, Orosz F, Penke B, Ovádi J (September 2011). "Interactions of pathological hallmark proteins: tubulin polymerization promoting protein/p25, beta-amyloid, and alpha-synuclein". The Journal of Biological Chemistry. 286 (39): 34088–100. doi:10.1074/jbc.M111.243907. PMC 3190826. PMID 21832049.
  21. "C6orf201 Small cell lung cancers - Homo sapiens". National Center for Biotechnology Information Search database. Retrieved 9 May 2015.
  22. "C6orf201 Various tissues - Mus musculus". National Center for Biotechnology Information. Retrieved 9 May 2015.
  23. Heo JD, Oh JH, Lee K, Kim CY, Song CW, Yoon S, Han JS, Yu IJ (March 2010). "Gene expression profiling in the lung tissue of cynomolgus monkeys in response to repeated exposure to welding fumes". Archives of Toxicology. 84 (3): 191–203. doi:10.1007/s00204-009-0486-z. PMC 2820669. PMID 19936710.
  24. Park GT, Han J, Park SG, Kim S, Kim TY (April 2014). "DNA methylation analysis of CD4+ T cells in patients with psoriasis". Archives of Dermatological Research. 306 (3): 259–68. doi:10.1007/s00403-013-1432-8. PMID 24323136. S2CID 25754439.