C1orf127

Last updated
C1orf127
Identifiers
Aliases C1orf127 , chromosome 1 open reading frame 127
External IDs MGI: 2685418 HomoloGene: 52134 GeneCards: C1orf127
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001170754
NM_173507
NM_001366227

NM_001085505
NM_001368835

RefSeq (protein)

NP_001164225
NP_001353156

NP_001078974
NP_001355764

Location (UCSC) Chr 1: 10.95 – 10.98 Mb Chr 4: 148.73 – 148.76 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Uncharactarized protein C1orf127 is a protein that in humans is encoded by the C1orf127 gene, the structure and function of which is poorly understood by the scientific community. C1orf127 is targeted for extracellular secretion in humans.

Contents

Gene

C1orf127 is located on the short arm of Chromosome 1 (1p36.22), spanning 35,566 base pairs from 10946471 to 10982037. It is oriented on the minus strand of the chromosome.

mRNA

The primary assembly has 13 exons, and yields an 823 amino acid protein product. There are two known isoforms caused by alternative splicing. [5]

Protein

C1orf127's protein product is a member of the Ensembl protein family TF607005. [6] The primary assembly weighs 89 kDa with an isoelectric point of 5.54, making it both longer and heavier than the average protein. [7]

Domains and Motifs

C1orf127 is contains two protein domains: DUF4556 and PHA03247, a domain in the Atrophin-1 superfamily. [8] The functions of both domains are unknown. The protein also appears to have a cleavable signal peptide from Met1 to Pro18. [9]

Subcellular Localization

The protein C1orf127 is suggested to be localized to the extracellular matrix in humans. [9]

Post-Translational Modifications

C1orf127 undergoes N and O-linked glycosylation, and contains a number of potential phosphorylation sites.

Protein-Protein Interactions

C1orf127 is suggested to interact with two different proteins, CCT3, a molecular chaperone, and CCT6B, also a molecular chaperone found in the testis. Because these interacting proteins are both molecular chaperones, it is possible that C1orf127 must undergo chaperone-assisted folding or unfolding.

Expression

C1orf127 is not constitutively expressed, but it is expressed at low to medium levels in a variety of tissues. Greatest expression is observed in the stomach and pancreas. [10] It is also thought to be expressed in certain areas of both the developing and adult brain, such as the cerebellum, as well as skeletal muscle tissue, the testis, cardiac muscle, and throughout the digestive system.

Little else is known about this gene's expression, however a 2012 paper published in the World Journal of Gastroenterology suggested that its mis-expression could be used as a diagnostic marker locus in the detection of cancer [11]

Evolutionary History

DUF4556
Identifiers
SymbolDUF4556
Pfam PF15094
InterPro IPR027956
Available protein structures:
Pfam   structures / ECOD  
PDB RCSB PDB; PDBe; PDBj
PDBsum structure summary

C1orf127 has no paralogs within the human genome, however a number of orthologs have been identified, ranging across the jawed vertebrates, including a number of other mammals, marsupials, amphibians, and fish. One of the most distant ortholog identified is found in Danio rerio. Thus, the ancestor of C1orf127 likely arose around 435 MYA.

SpeciesNCBI Accession NumberSequence LengthIdentity to Human
Papio anubis XP_021791537.176989%
Saimiri boliviensis boliviensis XP_010344835.181779%
Octodon degus XP_023555153.151455%
Jaculus jaculus XP_004657440.182053%
Heterocephalus glaber XP_021099206.181155%
Echinops telfairi XP_012860770.176665%
Chrysochloris asiatica XP_006866497.151358%
Oryctolagus cuniculus XP_017195816.169658%
Chinchilla lanigera XP_005404362.177855%
Loxodonta africana XP_023408259.1112952%
Sarcophilus harrisii XP_023344649.1108856%
Phascolarctos cinereus XP_020835267.181853%
Xenopus laevis XP_018081142.169046%
Haplochromis burtoni XP_00591528.156434%
Lates calcarifer XP_018521386.136039%
Lepisosteus oculatus XP_015192693.182040%
Acanthochromis polyacanthus XP_022062388.139736%
Oncorhynchus mykiss CDQ71724.149635%
Danio rerio XP_021325672.132840%
Astyanax mexicanus XP_022532665.166236%

Related Research Articles

<span class="mw-page-title-main">CFAP206</span> Protein-coding gene in the species Homo sapiens

Cilia And Flagella Associated Protein 206 (CFAP206) is a gene that in humans encodes a protein “DUF3508”. This protein has a function that is not currently very well understood. Other known aliases are “dJ382I10.1, UPF0704 Protein C6orf165.” In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs, and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and more C6orf165 is rarely expressed in humans, with relatively high expression in brain, lungs (trachea) and testis. The molecular weight of UPF0704 is 71,193 Da and the PI is 6.38

<span class="mw-page-title-main">C16orf96</span> Protein-coding gene in the species Homo sapiens

C16orf96, or chromosome 16 open reading frame 96, is a protein in humans that is encoded by C16orf96 that is found on the 16th chromosome. In Homo sapiens, the protein is 1141 amino acids in length

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

<span class="mw-page-title-main">C6orf62</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

<span class="mw-page-title-main">C9orf50</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

<span class="mw-page-title-main">TEX55</span> Protein-coding gene in the species Homo sapiens

Testis expressed 55 (TEX55) is a human protein that is encoded by the C3orf30 gene located on the forward strand of human chromosome three, open reading frame 30 (3q13.32). TEX55 is also known as Testis-specific conserved, cAMP-dependent type II PK anchoring protein (TSCPA), and uncharacterized protein C3orf30.

<span class="mw-page-title-main">SMIM11</span>

Small integral membrane protein 11 is a protein which in humans is encoded by the SMIM11 gene.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">C12orf24</span> Protein-coding gene in the species Homo sapiens

C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">C12orf50</span> Protein encoding gene C12orf50

Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.

<span class="mw-page-title-main">C3orf38</span> An article about the uncharacterized gene C3orf38.

Chromosome 3 open reading frame 38 (C3orf38) is a protein which in humans is encoded by the C3orf38 gene.

<span class="mw-page-title-main">C5orf22</span> Protein-coding gene in the species Homo sapiens

Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).

<span class="mw-page-title-main">C20orf144</span> Human protein-encoding gene

Chromosome 20 open reading frame 144 (c20orf144) is a human protein-encoding gene. The human c20orf144 protein consists of 153 amino acids, with the first 150 amino acids being characterized as part of the Bcl-2 like protein of testis (Bclt) family.

<span class="mw-page-title-main">C10orf53</span> Human gene

C10orf53 is a protein that in humans is encoded by the C10orf53 gene. The gene is located on the positive strand of the DNA and is 30,611 nucleotides in length. The protein is 157 amino acids and the gene has 3 exons. C10orf53 orthologs are found in mammals, birds, reptiles, amphibians, fish, and invertebrates. It is primarily expressed in the testes and at very low levels in the cerebellum, liver, placenta, and trachea.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

<span class="mw-page-title-main">LRRC74A</span> Protein-coding gene

Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000175262 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000070577 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "C1orf127 chromosome 1 open reading frame 127 [ Homo sapiens (human) ]". National Center for Biotechnology Information. Retrieved 18 February 2018.
  6. "Gene: C1orf127". Ensembl. Retrieved 19 February 2018.
  7. Lodish H, Berk A, Matsudaira P, Kaiser CA, Krieger M, Scott MP, Zipurksy SL, Darnell J (2004). Molecular Cell Biology (5th ed.). New York, New York: WH Freeman and Company.
  8. "Conserved domains on uncharacterized protein precursor C1orf127". National Center for Biotechnology Information.
  9. 1 2 "PSORT II". PSORT II Prediction. Retrieved 6 May 2018.
  10. "C1orf127 chromosome open reading frame 127 [Homo sapiens (human)]" . Retrieved 19 February 2018.
  11. Liu YY, Chen HY, Zhang ML, Tian D, Li S, Lee JY (September 2012). "Loss of fragile histidine triad and amplification of 1p36.22 and 11p15.5 in primary gastric adenocarcinomas". World Journal of Gastroenterology. 18 (33): 4522–32. doi:10.3748/wjg.v18.i33.4522. PMC   3435777 . PMID   22969225.

C1orf127