C16orf96

Identifiers
Aliases: C16orf96, chromosome 16 open reading frame 96
External IDs: MGI: 1926059; HomoloGene: 53527; GeneCards: C16orf96

Orthologs (human; mouse)
Ensembl: ENSG00000205832; ENSMUSG00000022518
RefSeq (mRNA): NM_001145011, NM_001387219; NM_001252142, NM_030192
RefSeq (protein): NP_001138483; NP_084468
Location (UCSC): Chr 16: 4.56 – 4.6 Mb; Chr 16: 4.65 – 4.69 Mb
PubMed search: [3]; [4]

C16orf96, or chromosome 16 open reading frame 96, is a protein that in humans is encoded by the C16orf96 gene, located on chromosome 16. [5] In Homo sapiens, the protein is 1141 amino acids long. [6]

Protein

The structure of human C16orf96 protein with important regions labeled. The dark grey lines within the proline-rich region and the domain of unknown function are nuclear import and export signals, respectively.

Characteristics

The molecular weight of the processed C16orf96 protein in humans is 125 kDa, with an isoelectric point of 6.58. [7] About 9% of the amino acids in C16orf96 are proline, which is significantly higher than in an average human protein. [8]
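As a rough illustration of how a composition figure such as the proline percentage can be obtained, the following sketch counts one residue type in a protein sequence and divides by the sequence length. The function name `residue_fraction` and the toy sequence are hypothetical and chosen only for this example; the toy sequence is not the C16orf96 sequence, and the cited SAPS tool may compute its statistics differently.

```python
from collections import Counter


def residue_fraction(sequence: str, residue: str) -> float:
    """Return the fraction of positions in `sequence` occupied by `residue`."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return Counter(seq)[residue.upper()] / len(seq)


# Toy sequence for illustration only; this is NOT the C16orf96 sequence.
toy_sequence = "MPPSAPLPGTPEKRPAQLSP"
proline_fraction = residue_fraction(toy_sequence, "P")
print(f"Proline fraction of toy sequence: {proline_fraction:.1%}")

# Approximate proline count implied by the figures in the text
# (about 9% of a 1141-residue protein).
approx_prolines = round(0.09 * 1141)
print(f"Approximate proline residues: {approx_prolines}")
```

Applied to the real protein sequence (for example, the one retrieved under NCBI accession NP_001138483), the same calculation would be expected to give a value near the 9% reported above.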

Post-Translational Modification

Human C16orf96 has a large number of predicted phosphorylation sites on serines throughout the protein. [9] C16orf96 has also been found, in primates and other mammals, to have both a nuclear import signal and a nuclear export signal. [10]

Domains

The protein contains several known domains: an RNA recognition motif, a merozoite surface protein (MSP-1) domain, a pentapeptide repeat MXKDX, and a domain of unknown function (DUF1387). These domains may hint at the function of the gene. [6]

Structure

Secondary Structure

The majority of the secondary structure of C16orf96 consists of alpha helices, with coils being the second most abundant structure. [11]

Tertiary Structure

At this time there is no known tertiary structure for C16orf96.

Cellular Localization

The C16orf96 protein is predicted to localize to the nucleus with a score of 82%, and to the cytosol and the mitochondria with scores of 4.5% each. [12]

Expression

Expression of C16orf96 when different hypoxia-inducible factors are reduced

C16orf96 expression is generally low in cells. In situ hybridization experiments suggest that C16orf96 RNA is expressed only in the testis, while the EST profile for C16orf96 shows low expression, and only in testis and skin. [13] [14] Expression of the C16orf96 gene changes when both hypoxia-inducible factors 1α and 2α (HIF1α and HIF2α) are depleted. When only one of the two factors is depleted, expression does not change, suggesting that the two HIFs act redundantly. [15]

Homology

Strict orthologs of this gene exist only in mammals. However, a portion of DUF1387 is found in more distantly related species, as far back as reptiles. [16] No orthologs of this gene can be found in plants, fungi, or bacteria, suggesting that the gene is relatively new and evolves quickly.

Function

The function of C16orf96 is currently unknown. The gene has been cited, along with many other genes, as a possible candidate affecting childhood obesity. [17]

References

  1. GRCh38: Ensembl release 89: ENSG00000205832 - Ensembl, May 2017
  2. GRCm38: Ensembl release 89: ENSMUSG00000022518 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Martin J, Han C, Gordon LA, Terry A, Prabhakar S, She X, et al. (December 2004). "The sequence and analysis of duplication-rich human chromosome 16". Nature. 432 (7020): 988–94. Bibcode:2004Natur.432..988M. doi:10.1038/nature03187. PMID 15616553.
  6. NCBI Protein NP_001138483
  7. GeneCards C16orf96
  8. Biology Workbench: SAPS Tool [permanent dead link]
  9. NetPhos 2.0
  10. NetNES 1.1
  11. Biology Workbench: PELE Tool
  12. "PSORT II Server". Archived from the original on 2021-07-09. Retrieved 2015-04-30.
  13. The Human Protein Atlas of C16orf96
  14. EST profile Hs.684212
  15. GeoProfiles ID:40057343
  16. NCBI BLAST
  17. Comuzzie AG, Cole SA, Laston SL, Voruganti VS, Haack K, Gibbs RA, Butte NF (2012). "Novel genetic loci identified for the pathophysiology of childhood obesity in the Hispanic population". PLOS ONE. 7 (12): e51954. doi:10.1371/journal.pone.0051954. PMC 3522587. PMID 23251661.