C17orf107

C17orf107
Identifiers
Aliases C17orf107, chromosome 17 open reading frame 107
External IDs MGI: 2148639; HomoloGene: 109413; GeneCards: C17orf107; OMA:C17orf107 - orthologs
Orthologs
Species: Human, Mouse
Entrez
Ensembl
UniProt
RefSeq (mRNA) NM_001145536 (human); NM_001145537 (mouse)
RefSeq (protein) NP_001139008 (human); n/a (mouse)
Location (UCSC) Chr 17: 4.9 – 4.9 Mb Chr 11: 70.51 – 70.51 Mb
PubMed search [3] [4]
Wikidata

Chromosome 17 Open Reading Frame 107 is a protein encoded by the C17orf107 gene in Homo sapiens. C17orf107 is located intracellularly and has a predicted secondary structure of five alpha helices.


Gene

Location

Human C17orf107 is located on the short arm of chromosome 17 at 17p13.2. Including introns, the gene spans nucleotides 4,899,536 to 4,906,715 on the plus strand. The spliced transcript is 3,201 nucleotides long and contains three exons. [5]
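The genomic span implied by these coordinates can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the source does not state the convention) and that the spliced transcript covers the full quoted region; the function name `genomic_span` is illustrative only.

```python
def genomic_span(start: int, end: int) -> int:
    """Length of a region given 1-based, inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates quoted above for human C17orf107 (introns included).
GENE_START = 4_899_536
GENE_END = 4_906_715
SPLICED_LENGTH = 3_201  # spliced length quoted above, in nucleotides

span = genomic_span(GENE_START, GENE_END)
# Assumption: everything not in the spliced product is intronic sequence.
intronic = span - SPLICED_LENGTH

print(f"Genomic span: {span:,} nt")
print(f"Not retained after splicing: {intronic:,} nt")
```

Under these assumptions the gene covers 7,180 nucleotides, of which a little under half is retained in the spliced product.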

Gene Level Regulation

Human C17orf107 is expressed ubiquitously across tissues. Expression is elevated in heart and brain and is highest in endocrine and salivary gland tissues. [5] [6]

Protein

I-TASSER predicted secondary structure of the human C17orf107 protein, with five alpha helices in magenta and coils in blue.

The human C17orf107 protein is 190 amino acids long. It has a molecular mass of approximately 20 kDa and an isoelectric point (pI) of about 7. [7] The protein contains the conserved domain of unknown function DUF5536 (pfam17688), a member of superfamily cl39220. [8]
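As a rough sanity check, the quoted mass is in line with what the chain length alone would suggest. The sketch below assumes the common rule of thumb of about 110 Da per amino acid residue; it is not how the cited mass was derived, and the constant and function names are illustrative.

```python
# Rule-of-thumb average residue mass in daltons (an assumption, not from the source).
AVERAGE_RESIDUE_MASS_DA = 110.0


def approximate_mass_kda(n_residues: int,
                         avg_residue_mass: float = AVERAGE_RESIDUE_MASS_DA) -> float:
    """Estimate protein mass in kDa from chain length alone."""
    if n_residues < 0:
        raise ValueError("n_residues must be non-negative")
    return n_residues * avg_residue_mass / 1000.0


# 190 residues is the protein length quoted above.
estimate = approximate_mass_kda(190)
print(f"Estimated mass: {estimate:.1f} kDa")
```

An exact value would depend on the actual residue composition, which this estimate ignores.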

Protein Level Regulation

The human C17orf107 protein is localized intracellularly and has no transmembrane domains. There is evidence that it is present in the nucleoplasm. [6] The protein contains no asparagine residues and therefore has no N-glycosylation sites. [9] The C17orf107 protein has no post-translational modifications, disulfide bonds, or signal peptides. [10]
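The reasoning about N-glycosylation follows from the consensus sequon N-X-S/T (X being any residue except proline): a sequence without asparagine cannot contain it. The sketch below is a plain motif scan, not the NetNGlyc prediction method, and the two sequences are made-up toy strings rather than the C17orf107 sequence.

```python
import re

# N-X-S/T sequon, where X is any residue except proline.
# The lookahead lets overlapping sequons be reported individually.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(protein: str) -> list[tuple[int, str]]:
    """Return (1-based position, motif) for each N-X-S/T sequon found."""
    seq = protein.upper()
    return [(m.start() + 1, m.group(1)) for m in SEQUON.finditer(seq)]


# Hypothetical toy sequences for illustration only.
toy_with_asn = "MKNGSLLNPTANAT"     # NGS and NAT match; NPT is excluded (X = P)
toy_without_asn = "MKAGSLLQPTAQAT"  # no asparagine, so no sequon is possible

print(find_sequons(toy_with_asn))
print(find_sequons(toy_without_asn))
```

A sequon is necessary but not sufficient for glycosylation, so a scan like this can rule sites out but cannot confirm them.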

Evolution

Orthologs

Orthologs of human C17orf107 are found in all mammalian groups except Monotremata, Hyracoidea, Tubulidentata, Cingulata, Peramelemorphia, Paucituberculata, Sirenia, and Notoryctemorphia. [11] The gene appears to have first arisen in mammals: the most distant orthologs found are in marsupials, which diverged from the human lineage about 160 million years ago. [12] The table below lists protein orthologs of human C17orf107, sorted by estimated date of divergence, then by taxonomic group, and lastly by sequence identity.

Orthologs

| Taxonomic Group | Common Name | Genus and Species | Accession Number | Date of Divergence (MYA) | Sequence Length (aa) | Sequence Identity (%) | Sequence Similarity (%) | Sequence Gaps (%) |
|---|---|---|---|---|---|---|---|---|
| Primates | Human | Homo sapiens | NP_001139008.1 [13] | | 190 | | | |
| Primates | Western Lowland Gorilla | Gorilla gorilla gorilla | XP_018868465.1 [14] | 9 | 189 | 82 | 83 | 15 |
| Lagomorpha | European Rabbit | Oryctolagus cuniculus | XP_008269053.1 [15] | 87 | 204 | 64 | 72 | 13 |
| Rodentia | House Mouse | Mus musculus | EDL12584.1 [16] | 87 | 128 | 33 | 37 | 55 |
| Artiodactyla | East African Hippopotamus | Hippopotamus amphibius kiboko | XP_057569928.1 [17] | 94 | 183 | 78 | 84 | 4 |
| Artiodactyla | Sheep | Ovis aries | KAG5203555.1 [18] | 94 | 233 | 51 | 55 | 34 |
| Artiodactyla | Wild Bactrian Camel | Camelus ferus | EPY72335.1 [19] | 94 | 183 | 43 | 50 | 32 |
| Carnivora | Sea Otter | Enhydra lutris kenyoni | XP_022380488.1 [20] | 94 | 216 | 63 | 69 | 19 |
| Carnivora | Giant Panda | Ailuropoda melanoleuca | XP_034502549.1 [21] | 94 | 247 | 58 | 63 | 25 |
| Carnivora | Clouded Leopard | Neofelis nebulosa | XP_058561310.1 [22] | 94 | 90 | 40 | 44 | 53 |
| Cetacea | Harbor Porpoise | Phocoena phocoena | XP_065753064.1 [23] | 94 | 179 | 70 | 79 | 11 |
| Cetacea | Narwhal | Monodon monoceros | TKC42450.1 [24] | 94 | 263 | 26 | 30 | 51 |
| Chiroptera | Nathusius' Pipistrelle Bat | Pipistrellus nathusii | CAK6437783.1 [25] | 94 | 135 | 39 | 45 | 42 |
| Perissodactyla | Horse | Equus caballus | XP_005597777.1 [26] | 94 | 212 | 69 | 75 | 16 |
| Perissodactyla | South-central Black Rhinoceros | Diceros bicornis minor | XP_058415933.1 [27] | 94 | 218 | 67 | 75 | 13 |
| Pholidota | Sunda Pangolin | Manis javanica | XP_017503787.1 [28] | 94 | 197 | 65 | 73 | 8 |
| Proboscidea | Indian Elephant | Elephas maximus indicus | XP_049717565.1 [29] | 99 | 248 | 56 | 62 | 26 |
| Marsupial | Agile Gracile Opossum | Gracilinanus agilis | XP_044529018.1 [30] | 160 | 179 | 50 | 52 | 22 |
| Marsupial | Tasmanian Devil | Sarcophilus harrisii | XP_012404229.1 [31] | 160 | 189 | 48 | 59 | 22 |
| Marsupial | Monito del Monte | Dromiciops gliroides | XP_043856787.1 [32] | 160 | 198 | 41 | 52 | 20 |
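Identity and gap percentages of the kind listed for the orthologs are usually derived from a pairwise alignment. The sketch below shows one common way to compute them, assuming both figures are taken over the full alignment length with "-" marking a gap; the source does not say which tool or convention was used, and the aligned strings are toy examples, not real C17orf107 alignments. Similarity is omitted because it depends on a substitution matrix.

```python
def alignment_stats(aligned_a: str, aligned_b: str) -> dict[str, float]:
    """Percent identity and percent gaps over the full alignment length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment must not be empty")

    length = len(aligned_a)
    columns = list(zip(aligned_a.upper(), aligned_b.upper()))
    # A column counts as identical only if both residues match and neither is a gap.
    identical = sum(1 for a, b in columns if a == b and a != "-")
    # A column counts as gapped if either sequence has a gap there.
    gapped = sum(1 for a, b in columns if a == "-" or b == "-")

    return {
        "identity": 100.0 * identical / length,
        "gaps": 100.0 * gapped / length,
    }


# Hypothetical toy alignment for illustration only.
stats = alignment_stats("MKT-LLAV", "MKSALL-V")
print(stats)
```

Because gapped columns count toward the alignment length, a short or partial ortholog record can show low identity mainly through a high gap percentage.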

References

  1. GRCh38: Ensembl release 89: ENSG00000205710. Ensembl, May 2017.
  2. GRCm38: Ensembl release 89: ENSMUSG00000087279. Ensembl, May 2017.
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "C17orf107 chromosome 17 open reading frame 107". National Library of Medicine. Retrieved 4 December 2024.
  6. "C17orf107". The Human Protein Atlas. Retrieved 11 December 2024.
  7. "Uncharacterized protein C17orf107". PhosphoSitePlus. Cell Signaling Technology. Retrieved 4 December 2024.
  8. "Conserved Protein Domain Family DUF5536". NCBI. Retrieved 11 December 2024.
  9. "NetNGlyc". DTU Health Tech. Retrieved 11 December 2024.
  10. "Protter". ETH Zürich. Retrieved 11 December 2024.
  11. "NCBI Blast". National Library of Medicine. Retrieved 11 December 2024.
  12. "Divergence Time". TimeTree5. Retrieved 11 December 2024.
  13. "uncharacterized protein C17orf107 [Homo sapiens]". National Library of Medicine. Retrieved 11 December 2024.
  14. "uncharacterized protein C17orf107 homolog [Gorilla gorilla gorilla]". National Library of Medicine. Retrieved 11 December 2024.
  15. "uncharacterized protein C17orf107 homolog [Oryctolagus cuniculus]". National Library of Medicine. Retrieved 11 December 2024.
  16. "mCG21181, isoform CRA_a, partial [Mus musculus]". National Library of Medicine. Retrieved 11 December 2024.
  17. "uncharacterized protein C17orf107 homolog [Hippopotamus amphibius kiboko]". National Library of Medicine. Retrieved 11 December 2024.
  18. "hypothetical protein JEQ12_003138 [Ovis aries]". National Library of Medicine. Retrieved 11 December 2024.
  19. "hypothetical protein CB1_083302001 [Camelus ferus]". National Library of Medicine. Retrieved 11 December 2024.
  20. "uncharacterized protein C17orf107 homolog [Enhydra lutris kenyoni]". National Library of Medicine. Retrieved 11 December 2024.
  21. "uncharacterized protein C17orf107 homolog [Ailuropoda melanoleuca]". National Library of Medicine. Retrieved 11 December 2024.
  22. "uncharacterized protein C17orf107 homolog [Neofelis nebulosa]". National Library of Medicine. Retrieved 11 December 2024.
  23. "uncharacterized protein C17orf107 homolog, partial [Phocoena phocoena]". National Library of Medicine. Retrieved 11 December 2024.
  24. "hypothetical protein EI555_006246, partial [Monodon monoceros]". National Library of Medicine. Retrieved 11 December 2024.
  25. "unnamed protein product [Pipistrellus nathusii]". National Library of Medicine. Retrieved 11 December 2024.
  26. "uncharacterized protein C17orf107 homolog [Equus caballus]". National Library of Medicine. Retrieved 11 December 2024.
  27. "uncharacterized protein C17orf107 homolog isoform X1 [Diceros bicornis minor]". National Library of Medicine. Retrieved 11 December 2024.
  28. "LOW QUALITY PROTEIN: uncharacterized protein C17orf107 homolog [Manis javanica]". National Library of Medicine. Retrieved 11 December 2024.
  29. "uncharacterized protein C17orf107 homolog [Elephas maximus indicus]". National Library of Medicine. Retrieved 11 December 2024.
  30. "uncharacterized protein C17orf107 homolog [Gracilinanus agilis]". National Library of Medicine. Retrieved 11 December 2024.
  31. "uncharacterized protein C17orf107 homolog [Sarcophilus harrisii]". National Library of Medicine. Retrieved 11 December 2024.
  32. "LOW QUALITY PROTEIN: uncharacterized protein C17orf107 homolog [Dromiciops gliroides]". National Library of Medicine. Retrieved 11 December 2024.