C16orf46 protein | |||||||
---|---|---|---|---|---|---|---|
Identifiers | |||||||
Aliases | |||||||
External IDs | GeneCards: | ||||||
Orthologs | |||||||
Species | Human | Mouse | |||||
Entrez |
|
| |||||
Ensembl |
|
| |||||
UniProt |
|
| |||||
RefSeq (mRNA) |
|
| |||||
RefSeq (protein) |
|
| |||||
Location (UCSC) | n/a | n/a | |||||
PubMed search | n/a | n/a | |||||
Wikidata | |||||||
|
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus. [2]
An alternative name for this gene is FLJ32702, however it is most commonly referred to as C16orf46. [3]
The C16orf26 gene is found on chromosome 16q23.2 negative strand. [4] The promoter region is 1152 base pairs long. [5] It has three exons, one from 1-380 bp, the second from 381-1254 bp, and the third from 1255-1982 bp. [2]
C16orf46 is broadly expressed in the testis and thyroid as well as 18 other tissues. [4] These tissue expression patterns are found to be low to moderate (25-50%). [6] When looking at tissue profiles, the highest expression is in the adult mammalian kidney, liver, prefrontal cortex, cerebellum, heart, and brain. [7]
The full C16orf46 protein is 417 amino acids long. [9] It has no isoforms, and its most distant ortholog, Rhincodon typus (whale shark), also has no known isoforms. [10] The molecular weight was found to be 45.8 kdal. [11] The isoelectric point is 7.4, average for all proteins, and C16orf46 is electrically neutral. [12]
C16orf46 is predicted to be found in the nucleus by all orthologs. [13]
The secondary structure of C16orf46 has alternating alpha helices and beta sheets. [14]
In C16orf46, there is N-linked glycosylation, O-linked glycosylation, and SUMOylation. [15] [16]
There are phosphorylation sites found with the kinases CKII, CKI, PKC, and cdc2. [17]
A coronavirus cleavage site is predicted at the 235 amino acid position. [18] There are also tyrosine motif locations between amino acids 42-45 and 251-252. [19]
mRNA folding on the 5' UTR predicts a stem loop twice in the area between base pairs 47-90. [20]
C16orf46 has over 50 orthologs ranging from primate to chordate. [21] The table below shows a representation of the diversity of C16orf46 by listing a selection of orthologs found using NCBI. When C16orf46 Homo sapiens was run through a multiple alignment sequence program, Clustal Omega, against 20 true orthologs and 16 distant orthologs, Trp74 and Pro212 were found to be conserved in all. [22]
C16orf46 has been compared against Fibrinogen, a protein which mutates rapidly, and Cytochrome C, a protein which mutates slowly.
As can be seen below, when multiple species of the three proteins were plotted, C16orf46 more closely resembled that of Fibrinogen than Cytochrome C, suggesting a possible rapid mutation. [21]
C16orf46 interacts with FAT3 which has been linked to neurite interactions during development. [23] C16orf46 is thought to have coexpression with the PLAC8L1 and CFAP43 gene, both of unknown function. [24]
There are higher levels of C16orf46 expression in pancreatic adenocarcinoma tumor epithelia tissue compared to the control. [25] There is also higher gene expression in patients with small-cell carcinoma compared to the control. [26]
Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.
Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.
Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
PRR29 is a protein located on human chromosome 17 that in humans is encoded by the PRR29 gene.
Glutamate Rich Protein 5 is a protein in humans encoded by the ERICH5 gene, also known as Chromosome 8 open reading frame 47 (C8orf47).
C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.
Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.
Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development
Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.
Chromosome 1 open reading frame 185, also known as C1orf185, is a protein that in humans is encoded by the C1orf185 gene. In humans, C1orf185 is a lowly expressed protein that has been found to be occasionally expressed in the circulatory system.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.
Leucine rich single-pass membrane protein 2 is a protein that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, aves, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart and skeletal muscle tissue.
C3orf56 is a protein encoding gene found on chromosome 3. Although, the structure and function of the protein is not well understood, it is known that the C3orf56 protein is exclusively expressed in metaphase II of oocytes and degrades as the oocyte develops towards the blastocyst stage. Degradation of the C3orf56 protein suggests that this gene plays a role in the progression from maternal to embryonic genome and in embryonic genome activation.