C8orf58

Last updated
C8orf58
Identifiers
Aliases C8orf58 , chromosome 8 open reading frame 58
External IDs MGI: 2145726 HomoloGene: 19540 GeneCards: C8orf58
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_173686
NM_001013842
NM_001198827

NM_001004155
NM_001112735

RefSeq (protein)

NP_001013864
NP_001185756
NP_775957

NP_001004155
NP_001106206

Location (UCSC) Chr 8: 22.6 – 22.6 Mb Chr 14: 70.39 – 70.4 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. [5] The protein is predicted to be localized in the nucleus.

Contents

Gene

The C8orf58 gene is located on chromosome 8 at position 8p21.3. It spans a total of 4,550 base pairs and has seven exons. C8orf58 is flanked by the genes PDLIM2 and CCAR2. [6] There are no aliases. It is defined as a protein coding gene. [7]

mRNA

C8orf58 produces three transcript splice variants. The transcript of variant 1 represents the longest transcript and encodes the largest protein. It is 2,062 base pairs and contains seven exons. There are two other splice variants, produced by alternative splice sites. [8]

IsoformExonsLength (base pairs)Features
Transcript Variant 11, 2, 3, 4, 5, 6, 72062One upstream in-frame stop codon.
Transcript Variant 21, 2, 3, 4, 5, 6, 72038Alternate in-frame splice site in the 3' coding region.
Transcript Variant 31, 2, 3, 4, 5, 61955Lacks an alternate exon, results in a frameshift in the 3' coding region.

C8orf58 has a relatively short 5’ region and a moderate 3’ region. Both the 5’ and 3’ regions contain stem loops. [9] There is one predicted miRNA binding site that found in the 3’UTR of C8orf58. [10]

Protein

C8orf58 protein Isoform 1 is 365 amino acids long. Isoform 2 and Isoform 3 are 357 and 300 amino acids respectively. There is a kozak consensus sequence present, which confirms it is a protein coding sequence. [11]

C8orf58 Isoform 1 has a molecular weight of 39.7 kDa and an isoelectric point of 8.29. It is proline and arginine rich and isoleucine, asparagine, phenylalanine, and tyrosine poor. [12]

The predicted secondary structure of the C8orf58 protein include multiple alpha helices and one beta strands. [12] [13]

IsoformFrom mRNA VariantLength (amino acids)Molecular Weight (kDa)Isoelectric Point
1136539.78.30
2235738.68.30
3330032.05.82

Evolutionary history

It is part of the DUF4657 family, a family of proteins found in eukaryotes. Proteins in this family are typically between 305 and 370 amino acids in length. [14] The Domain of Unknown Function (DUF) of C8orf58 is located between amino acids 73 to 364.

Expression

According to the NCBI GEO profiles, C8orf58 is a narrowly expressed protein found in spleen, lung, thymus, prostate, and spinal cord tissue. It is constitutively expressed in these tissues. [15]

Post-translational modification

The bioinformatic tools on Expasy were used to determine potential post translational modification sites for the C8orf58 protein. There are two predicted phosphorylation sites and one predicted sumoylation site. [16]

Subcellular localization

According to PSORT II, C8orf58 is located in the nucleus. This is supported by the presence of a sumoylation site, which is involved in nucleic cytoplasmic transport.

Interacting proteins

Two proteins have been found to interact with protein C8orf58, CENPH and metG1, which were found using two hybrid assay and the two hybrid pooling approach respectively. [17] CENPH (Centromere Protein H) plays a critical role in centromere structure, kinetochore formation, and sister chromatid separation. [18] MetG1 (Methionine—tRNA ligase) is required for elongation of protein synthesis and the initiation of all mRNA translation through initiator tRNA(fMet) aminoacylation. [19]

Homology

An important paralog of this gene is ENSG00000248235. [20] Orthologs of the human gene C8orf58 are limited to vertebrates of the animal kingdom.

Scientific NameCommon NameNCBI Accession NumberLength (Amino Acids)Date of Divergence (MYA)Identity (%)Similarity (%)
Homo sapiens HumanNP_001013864.1365---
Gorilla gorilla GorillaXP_004046807.14399.069679.50
Marmota marmota Alpine MarmotXP_015354979.1369906875.7
Oryctolagus cuniculus European RabbitXP_008248092.1371906672
Nannospalax galili SpalaxXP_008848689.1362906574.7
Ceratotherium simum simum White RhinocerosXP_014652157.1381966672.7
Odobenus rosmarus divergens Pacific walrusXP_012418498.1388966574.7
Sus scrofa Wild BoarXP_005670472.1382966573.3
Hipposideros armiger Great Roundleaf BatXP_019487131.1387966271
Eptesicus fuscus Big Brown BatXP_008149784.1377966270.1
Loxodonta africana African Bush ElephantXP_003412428.13721057177.2
Orycteropus afer afer AardvarkXP_007949039.13701056571.7
Parus major Great TitXP_015504136.13203123235.6
Anolis carolinensis Carolina AnoleXP_008118367.14533122838.9

Related Research Articles

<span class="mw-page-title-main">C20orf27</span>

UPF0687 protein C20orf27 is a protein that in humans is encoded by the C20orf27 gene. It is expressed in the majority of the human tissues. One study on this protein revealed its role in regulating cell cycle, apoptosis, and tumorigenesis via promoting the activation of NFĸB pathway.

Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">C2orf73</span>

Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.

<span class="mw-page-title-main">C6orf62</span>

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">C9orf25</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">C9orf50</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

<span class="mw-page-title-main">C1orf122</span> Protein-coding gene in the species Homo sapiens

C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.

<span class="mw-page-title-main">C7orf50</span>

C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

<span class="mw-page-title-main">C5orf22</span> Protein-coding gene in the species Homo sapiens

Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).

<span class="mw-page-title-main">C4orf36</span> Draft for page on C4orf36 gene/protein

C4orf36 is a protein that in humans is encoded by the c4orf36 gene.

<span class="mw-page-title-main">Chromosome 12 open reading frame 71</span> Protein encoded in humans by c12orf71 gene

Chromosome 12 open reading frame 71 (c12orf71) is a protein which in humans is encoded by c12orf71 gene. The protein is also known by the alias LOC728858.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000241852 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000044551 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "Entrez Gene: Chromosome 8 open reading frame 58" . Retrieved 2017-11-22.
  6. NCBI Nucleotide. Homo sapiens chromosome 8 open reading frame 58 (C8orf58), transcript variant 1, mRNA.
  7. GeneCard. C8orf58 Gene(Protein Coding) Chromosome 8 Open Reading Frame 58.
  8. NCBI Gene. C8orf58 chromosome 8 open reading frame 58 [Homo sapiens (human)].
  9. RNA Folding Form
  10. TargetScan Human
  11. NCBI Protein. Uncharacterized protein C8orf58 isoform 1 [Homo sapiens].
  12. 1 2 SDSC Biology Workbench
  13. Chou-Fasman Secondary Structure Prediction Server
  14. UniProtKB - Q8NAV2 (CH058_HUMAN). UniProt
  15. NCBI GEO Profiles
  16. Expasy Bioinformatics Resource Portal
  17. IntAct Molecular Interaction Database
  18. Centromere protein H
  19. Methionine--tRNA ligase
  20. GeneCard. 8orf58 Gene(Protein Coding) Chromosome 8 Open Reading Frame 58. .