C1orf159

Last updated
C1orf159
Identifiers
Aliases C1orf159 , chromosome 1 open reading frame 159
External IDs MGI: 2444364; HomoloGene: 51678; GeneCards: C1orf159; OMA:C1orf159 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_017891
NM_001330306
NM_001363525

NM_145557
NM_177205

RefSeq (protein)

NP_001317235
NP_060361
NP_001350454

n/a

Location (UCSC) Chr 1: 1.08 – 1.12 Mb Chr 4: 156.19 – 156.21 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1. [5] [6] This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer. [7]

Contents

Gene

The Homo sapiensC1orf159 gene (UniProt ID: Q96HA4) is a gene located on the short arm of chromosome 1 at locus 1p36.33. [6] The gene is 34,247 base pairs in length, located at Chromosome 1 position 1,081,818 to 1,116,089 on the reverse strand. [8]

Transcript

The longest variant of human C1orf159 gene encodes an mRNA that is 2,432 nucleotides in length with 12 exons. [9] A promoter region was predicted using UCSC Genome Browser, [10] which is 762 nucleotides long, including a 434 nucleotide upstream of the transcriptional start site, exon 1, and a 298 nucleotide region of intron 1.[ citation needed ]

Protein

Isoforms

Alternative splicing of the gene creates 5 protein isoforms. [5] The longest isoform is 380 amino acids in length with a molecular mass of 40.382 kDa. [5]

Isoforms of human C1orf159 protein
IsoformUniProt [11] IDLength (aa)
1 Q96HA4-1 380
2 Q96HA4-2 185
3 Q96HA4-3 189
4 Q96HA4-4 198
5 Q96HA4-5 254

Composition

C1orf159 protein is a proline- and arginine-rich, and a lysine- and glutamic acid- poor protein. The isoelectric point of the human C1orf159 protein is 10.07, [12] which is more basic than the average human proteomic protein pI of 7.36. [13]

Domain

The human C1orf159 protein contains a domain of unknown function DUF4501. [5] Although the exact function of the domain is not clear, it is thought to be a single pass-membrane protein with highly conserved cysteine residues.[ citation needed ]

The protein also contains a transmembrane domain at positions 144-169 [11] and a signal peptide at positions 1-18. [9] [11]

Structure

Alphafold predicts the structure of human C1orf159 protein to be mainly composed of alpha-helices. [14]

Post-translational modification

The predicted post-translational modifications of the C1orf159 protein includes N-linked glycosylation on asparagine at positions 104, 111, and 128. [5] [15]

Homology/evolution

Orthologs

Orthologs of human C1orf159 are found in vertebrates including mammals, birds, reptiles, amphibians, and fish [16] with the most distantly related group of organisms being cartilaginous fish, with a date of divergence of approximately 450 million years ago. [17] Orthologs are not found in jawless fish or invertebrates. [16]

Orthologs of C1orf159 Protein
SpeciesGroupTaxonomic

Group

NCBI Protein Accession NumberProtein Sequence

Similarity

(% Relative to Human Protein)

Human MammalsPrimates NP_001317235.1 100.0
Chimpanzee Primates XP_024204744.1 98.4
Bonobo Primates XP_008975653.2 88.9
House Mouse Rodentia NP_796179.1 40.9
Cattle Artiodactyla NP_001026925.1 36.6
Sunda Flying Lemur Dermoptera XP_008567908.1 39.4
Chinese Tree Shrew Scandentia XP_027622332.1 35.8
Cougar Carnivora XP_025768111.1 41.7
Chicken BirdsGalliformes XP_024998437.2 32.8
Rock Pigeon Columbiformes XP_013226562.2 35.7
Hooded Crow Passeriformes XP_039420032.1 29.9
Golden-collared Manakin Passeriformes XP_017934783.1 36.5
Gharial ReptilesCrocodilia XP_019367354.1 36.8
Leatherback Sea Turtle Testudines XP_027584571.1 35.9
Chinese Softshell Turtle Testudines XP_006127168.1 35.2
Western Clawed Frog Anura NP_001039047.1 34.6
Two-lined Caecilian AmphibiansGymnophiona XP_029433955.1 33.9
Asiatic Toad Anura XP_044137731.1 31.6
Zebrafish FishCypriniformes NP_001313355.1 26.4
Sterlet Acipenseriformes XP_034760226.1 32.8
Reedfish Polypteriformes XP_028663678.1 32.9
Small-spotted Catshark Carcharhiniformes XP_038629468.1 28.0
Whale Shark Orectolobiformes XP_020381962.1 32.9
Unrooted phylogenetic tree of C1orf159 orthologs generated by Phylogeny.fr. Phylo tree c1orf159.jpg
Unrooted phylogenetic tree of C1orf159 orthologs generated by Phylogeny.fr.

Evolutionary History

When compared with the evolution rate with cytochrome c and fibrinogen alpha, the C1orf159 protein has a similar evolutionary rate of change to the fast-evolving fibrinogen alpha protein, C1orf159 protein has a relatively fast evolution rate.[ citation needed ]

Evolutionary change of C1orf159 protein compared to the change of Cytochrome C and Fibrinogen Alpha. m in the vertical axis is defined as the total number of amino acid changes occurred in a 100-amino acid segment of a protein. Evolutionary change C1orf159.png
Evolutionary change of C1orf159 protein compared to the change of Cytochrome C and Fibrinogen Alpha. m in the vertical axis is defined as the total number of amino acid changes occurred in a 100-amino acid segment of a protein.

Clinical Significance

The Human Protein Atlas shows that C1orf159 is an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer, indicating that a high expression of C1orf159 is associated with a lower survival probability for patients with renal and liver cancer, and is associated with a higher survival probability for patients with urothelial cancer. [7]

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000131591 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000059939 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 4 5 "C1orf159 Gene". GeneCards. Retrieved 2022-06-20.
  6. 1 2 "Gene symbol report". HUGO Gene Nomenclature Committee. Retrieved 2022-06-20.
  7. 1 2 "Expression of C1orf159 in cancer - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-06-20.
  8. "Gene: C1orf159 (ENSG00000131591) - Summary - Homo_sapiens - Ensembl genome browser 107". uswest.ensembl.org. Retrieved 2022-07-27.
  9. 1 2 "Homo sapiens chromosome 1 open reading frame 159 (C1orf159), transcript variant 1, mRNA". 2022-04-17.
  10. "UCSC Genome Browser Home". genome.ucsc.edu. Retrieved 2022-07-28.
  11. 1 2 3 "UniProt". www.uniprot.org. Retrieved 2022-07-28.
  12. "SIB Swiss Institute of Bioinformatics | Expasy". www.expasy.org. Retrieved 2022-07-28.
  13. Kurotani A, Tokmakov AA, Sato KI, Stefanov VE, Yamada Y, Sakurai T (August 2019). "Localization-specific distributions of protein pI in human proteome are governed by local pH and membrane charge". BMC Molecular and Cell Biology. 20 (1): 36. doi: 10.1186/s12860-019-0221-4 . PMC   6701068 . PMID   31429701.
  14. "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2022-07-28.
  15. "C1orf159 - Proteomics". www.nextprot.org. Retrieved 2022-07-30.
  16. 1 2 "C1orf159 orthologs". NCBI. Retrieved 2022-07-28.
  17. Redmond AK, Macqueen DJ, Dooley H (November 2018). "Phylotranscriptomics suggests the jawed vertebrate ancestor could generate diverse helper and regulatory T cell subsets". BMC Evolutionary Biology. 18 (1): 169. Bibcode:2018BMCEE..18..169R. doi: 10.1186/s12862-018-1290-2 . PMC   6238376 . PMID   30442091.
  18. "Phylogeny.fr: "One Click" Mode". www.phylogeny.fr. Retrieved 2022-07-28.