C16orf46

Last updated
C16orf46 protein
Identifiers
Aliases
External IDs GeneCards:
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

n/a

n/a

RefSeq (protein)

n/a

n/a

Location (UCSC)n/an/a
PubMed searchn/an/a
Wikidata
View/Edit Human
3D Rendering of C16orf46. C16orf46.png
3D Rendering of C16orf46.

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus. [2]

Contents

Gene

An alternative name for this gene is FLJ32702, however it is most commonly referred to as C16orf46. [3]

Location

The C16orf26 gene is found on chromosome 16q23.2 negative strand. [4] The promoter region is 1152 base pairs long. [5] It has three exons, one from 1-380 bp, the second from 381-1254 bp, and the third from 1255-1982 bp. [2]

Expression

C16orf46 is broadly expressed in the testis and thyroid as well as 18 other tissues. [4] These tissue expression patterns are found to be low to moderate (25-50%). [6] When looking at tissue profiles, the highest expression is in the adult mammalian kidney, liver, prefrontal cortex, cerebellum, heart, and brain. [7]

Protein

Immunofluorescence for C16orf46 in a rabbit. Immunofluorescence of C16orf46.png
Immunofluorescence for C16orf46 in a rabbit.

Protein Analysis

The full C16orf46 protein is 417 amino acids long. [9] It has no isoforms, and its most distant ortholog, Rhincodon typus (whale shark), also has no known isoforms. [10] The molecular weight was found to be 45.8 kdal. [11] The isoelectric point is 7.4, average for all proteins, and C16orf46 is electrically neutral. [12]

C16orf46 is predicted to be found in the nucleus by all orthologs. [13]

The secondary structure of C16orf46 has alternating alpha helices and beta sheets. [14]

Protein Level Regulation

In C16orf46, there is N-linked glycosylation, O-linked glycosylation, and SUMOylation. [15] [16]

There are phosphorylation sites found with the kinases CKII, CKI, PKC, and cdc2. [17]

A coronavirus cleavage site is predicted at the 235 amino acid position. [18] There are also tyrosine motif locations between amino acids 42-45 and 251-252. [19]

Transcript Level Regulation

mRNA folding on the 5' UTR predicts a stem loop twice in the area between base pairs 47-90. [20]

Homologs

Orthologs

C16orf46 has over 50 orthologs ranging from primate to chordate. [21] The table below shows a representation of the diversity of C16orf46 by listing a selection of orthologs found using NCBI. When C16orf46 Homo sapiens was run through a multiple alignment sequence program, Clustal Omega, against 20 true orthologs and 16 distant orthologs, Trp74 and Pro212 were found to be conserved in all. [22]

SpeciesCommon NameDivergence (MYA)Accession NumberIdentity
Homo sapiensHumans--- XP_016878405.1 100.0%
Ochotona princeps American Pika90 XP_004584265.1 52.7%
Octodon degus Common Degu90 XP_003434773.2 47.8%
Ursus maritimus Polar Bear96 XP_008687958.1 67.5%
Leptonychotes weddellii Weddell Seal96 XP_006748170.1 67.2%
Canis lupus Gray Wolf96 XP_003434773.2 65.8%
Pteropus vampyrus Large Flying Fox96 XP_011354946.1 63.5%
Sus scrofa Wild Boar96 XP_020952705.1 61.5%
Bos indicus Zebu96 XP_019835282.1 60.2%
Erinaceus europaeus European Hedgehog96 XP_007516703.1 56.7%
Loxodonta africana African Bush Elephant105 XP_010596137.1 60.9%
Sarcophilus harrisii Tasmanian Devil159 XP_003757901.1 43.1%
Apteryx australis Southern Brown Kiwi312 XP_013796688.1 18.5%
Aptenodytes forsteri Emperor Penguin312 XP_019327074.1 17.4%
Chelonia mydas Green Sea Turtle312 XP_007059324.1 29.7%
Gekko japonicus Gekko Japonicus312 XP_015261305.1 25.3%
Nanorana parkeri High Himalaya Frog352 XP_018410908.1 22.4%
Pygocentrus nattereri Red Bellied Piranha435 XP_017578196.1 21.2%
Lepisosteus oculatus Spotted Gar435 XP_015223705.1 20.6%
Callorhinchus milii Australian Ghost Shark473 XP_007887408.1 22.7%

Paralogs

C16orf46 has no known paralogs. [21]

Mutations

C16orf46 has been compared against Fibrinogen, a protein which mutates rapidly, and Cytochrome C, a protein which mutates slowly.

As can be seen below, when multiple species of the three proteins were plotted, C16orf46 more closely resembled that of Fibrinogen than Cytochrome C, suggesting a possible rapid mutation. [21]

The trend of C16orf46, as compared to Fibrinogen and Cytochrome C, suggests faster mutation rates as it diverges from Homo sapiens. Divergence vs Number of Mutations in C16orf46.png
The trend of C16orf46, as compared to Fibrinogen and Cytochrome C, suggests faster mutation rates as it diverges from Homo sapiens.

Interacting Proteins

C16orf46 interacts with FAT3 which has been linked to neurite interactions during development. [23] C16orf46 is thought to have coexpression with the PLAC8L1 and CFAP43 gene, both of unknown function. [24]

Clinical Significance

There are higher levels of C16orf46 expression in pancreatic adenocarcinoma tumor epithelia tissue compared to the control. [25] There is also higher gene expression in patients with small-cell carcinoma compared to the control. [26]

Related Research Articles

Interferon-inducible GTPase 5

Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.

ANKRD24

Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.

Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.

PRR29

PRR29 is a protein located on human chromosome 17 that in humans is encoded by the PRR29 gene.

Glutamate rich 5

Glutamate Rich Protein 5 is a protein in humans encoded by the ERICH5 gene, also known as Chromosome 8 open reading frame 47 (C8orf47).

C17orf53

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

C21orf58

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

Chromosome 9 open reading frame 43

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

C9orf25

Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.

C19orf44

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

C4orf51

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

SMCO3

Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.

Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.

C1orf185

Chromosome 1 open reading frame 185, also known as C1orf185, is a protein that in humans is encoded by the C1orf185 gene. In humans, C1orf185 is a lowly expressed protein that has been found to be occasionally expressed in the circulatory system.

C16orf90

C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.

C20orf202

C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.

LSMEM2

Leucine rich single-pass membrane protein 2 is a protein that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, aves, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart and skeletal muscle tissue.

C3orf56 is a protein encoding gene found on chromosome 3. Although, the structure and function of the protein is not well understood, it is known that the C3orf56 protein is exclusively expressed in metaphase II of oocytes and degrades as the oocyte develops towards the blastocyst stage. Degradation of the C3orf56 protein suggests that this gene plays a role in the progression from maternal to embryonic genome and in embryonic genome activation.

References

  1. "I-TASSER results". zhanglab.ccmb.med.umich.edu. Retrieved 2018-05-07.[ permanent dead link ]
  2. 1 2 "Gene: C16orf46 (OTTHUMG00000137629) - Summary - Homo sapiens - Vega Genome Browser 68". vega.archive.ensembl.org. Retrieved 2018-05-07.
  3. Database, GeneCards Human Gene. "C16orf46 Gene - GeneCards | CP046 Protein | CP046 Antibody". www.genecards.org. Retrieved 2018-05-07.
  4. 1 2 "C16orf46 Symbol Report | HUGO Gene Nomenclature Committee". www.genenames.org. Retrieved 2018-05-01.
  5. "Genomatix - NGS Data Analysis & Personalized Medicine". www.genomatix.de. Retrieved 2018-05-07.
  6. geo. "Home - GEO - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  7. "Gene: C16orf46 - ENSG00000166455". bgee.org. Retrieved 2018-05-07.
  8. "Anti-C16orf46 antibody produced in rabbit HPA041136". Immunohistochemistry, Immunofluorescence. Retrieved 2018-05-07.
  9. "uncharacterized protein C16orf46 isoform X1 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  10. "uncharacterized protein C16orf46 homolog [Rhincodon typus] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  11. Kozlowski, Lukasz P. "CALCULATION OF PROTEIN ISOELECTRIC POINT". isoelectric.org. Retrieved 2018-05-07.
  12. EMBL-EBI. "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2018-05-06.
  13. "PSORT WWW Server". psort.hgc.jp. Retrieved 2018-05-07.
  14. "Bioinformatics Toolkit". toolkit.tuebingen.mpg.de. Retrieved 2018-05-07.
  15. "NetNGlyc 1.0 Server". www.cbs.dtu.dk. Retrieved 2018-05-07.
  16. "NetOGlyc 4.0 Server". www.cbs.dtu.dk. Retrieved 2018-05-07.
  17. "NetPhos 3.1 Server". www.cbs.dtu.dk. Retrieved 2018-05-07.
  18. "NetCorona 1.0 Server". www.cbs.dtu.dk. Retrieved 2018-05-07.
  19. "Human Protein Reference Database". www.hprd.org. Archived from the original on 2006-04-24. Retrieved 2018-05-07.
  20. "The Mfold Web Server | mfold.rit.albany.edu". unafold.rna.albany.edu. Retrieved 2018-05-07.
  21. 1 2 3 "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  22. EMBL-EBI. "Clustal Omega < Multiple Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2018-05-07.
  23. Lab, Mike Tyers. "BioGRID | Database of Protein, Chemical, and Genetic Interactions". thebiogrid.org. Retrieved 2018-05-07.
  24. "C16orf46 protein (human) - STRING interaction network". string-db.org. Retrieved 2018-05-07.
  25. "GDS4103 / 230281_at". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  26. "GDS4794 / 230281_at". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.