C16orf82 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C16orf82 , TNT, chromosome 16 open reading frame 82 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | HomoloGene: 82387 GeneCards: C16orf82 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
C16orf82 is a protein that, in humans, is encoded by the C16orf82 gene. [3] C16orf82 encodes a 2285 nucleotide mRNA transcript which is translated into a 154 amino acid protein using a non-AUG (CUG) start codon. The gene has been shown to be largely expressed in the testis, tibial nerve, and the pituitary gland, although expression has been seen throughout a majority of tissue types. [4] [5] [6] The function of C16orf82 is not fully understood by the scientific community. [7]
C16orf82 is located in humans at locus 16p12.1 on the positive strand.
The gene encodes for a 2285 nucleotide mRNA transcript that is intronless. Human intronless genes represent a unique subset of the genome that are often involved in signaling, sperm formation, immune responses, or development. [8] C16orf82 being such a gene indicates it may play a role in one of these processes. Translation of C16orf82 initiates at a non-AUG (CUG) start codon. The presence of the non-canonical start codon suggests possible increased regulation of C16orf82 translation and/or possibly could allow for the translation of protein products that start with leucine instead of methionine as seen in proteins coded for by some genes present in the major histocompatibility complex. [9] [10]
The C16orf82 promoter region has been predicted to contain a number of transcription factor binding sites including binding sites for transcription factors within the SOX family. [11] The presence of the SOX family transcription binding sites suggests that C16orf82 may play a role in sex determination. [12] Actual transcription factor functional studies show binding of the C16ORF82 promoter by ARNT, ELF5, SMAD4, and STAT3. [13]
C16orf82 expression in humans has been observed in major organ systems including the heart, liver, brain, and kidney at a constant level. [14] The tissue in which C16orf82 has been seen to be most highly expressed has been the testis, both by microarray experiments as well as RNA-seq. [4] [5] C16orf82 expression is also highly variable between individuals, with some expressing the gene in large amounts while others barely express the gene within the same tissue type. [6] [15] Micro RNA (miR-483) over expression has been shown to knock down C16orf82 expression. [16]
The C16orf82 protein is 154 amino acids in length with an approximate molecular weight of 16.46 kDa with a predicted isoelectric point of 6.06. [17] There are no known variants or isoforms of C16orf82.
C16orf82 contains one domain, DUF4694, which currently has a function that is uncharacterized. The domain spans from amino acid 8 to amino acid 153. [18] DUF4694 contains a SSGY (serine-serine-glycine-tyrosine) sequence motif that is found in a majority of the protein's orthologs. [19] [20] There is no presence of a transmembrane domain thus the protein is not a transmembrane protein. [21]
The localization of C16orf82 within a cell has been predicted to be nuclear. [21] A bipartite nuclear localization signal can be found starting at Arg107.
The human C16orf82 protein has been predicted to be phosphorylated at a number of serine residues. [26] O-linked glycosylation has also been predicted to happen at a number of sites, including some that overlap with the aforementioned phosphorylation sites. [27] The sites of overlap between the two types of post-translational modifications could play important regulatory roles in the activity and lifespan of the human C16orf82 protein. [28]
The secondary structure of the human C16orf82 protein has been predicted to be largely disordered by a number of modeling programs. [29] [30] [31] [32]
No paralogs of C16orf82 exist within humans. [20]
C16orf82 has over 100 predicted orthologs, which all reside in the class mammalia and more precisely the subclass eutheria. [33] [20] All of the orthologs contained the domain DUF4964. [33] The most distant ortholog detected was within the nine-banded armadillo (Dasypus novemcinctus) within the order Cingluata. Below is a table of 20 orthologs from various orders within the subclass eutheria with the sequence identity and time since divergence in relation to humans.
C16orf82's rate of evolution was determined to be relatively fast even in comparison to fibrinogen, a gene that has been shown to evolve quickly. [36]
C16orf82 has been associated with Schizophrenia through a genome-wide association study and autism based on copy number variation analysis. [37] [38] Currently, research has not shown if C16orf82 plays any direct role in either of these disorders.
DGLUCY is a protein that in humans is encoded by the DGLUCY gene.
C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).
C6orf222 is a protein that in humans is encoded by the C6orf222 gene (6p21.31). C6orf222 is conserved in mammals, birds and reptiles with the most distant ortholog being the green sea turtle, Chelonia mydas. The C6orf222 protein contains one mammalian conserved domain: DUF3293. The protein is also predicted to contain a BH3 domain, which has predicted conservation in distant orthologs from the clade Aves.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.
Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.
UPF0575 protein C19orf67 is a protein which in humans is encoded by the C19orf67 gene. Orthologs of C19orf67 are found in many mammals, some reptiles, and most jawed fish. The protein is expressed at low levels throughout the body with the exception of the testis and breast tissue. Where it is expressed, the protein is predicted to be localized in the nucleus to carry out a function. The highly conserved and slowly evolving DUFF3314 region is predicted to form numerous alpha helices and may be vital to the function of the protein.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.
Chromosome 1 open reading frame 198 (C1orf198) is a protein that in humans is encoded by the C1orf198 gene. This particular gene does not have any paralogs in Homo sapiens, but many orthologs have been found throughout the Eukarya domain. C1orf198 has high levels of expression in all tissues throughout the human body, but is most highly expressed in lung, brain, and spinal cord tissues. Its function is most likely involved in lung development and hypoxia-associated events in the mitochondria, which are major consumers of oxygen in cells and are severely affected by decreases in available cellular oxygen.
C2orf16 is a protein that in humans is encoded by the C2orf16 gene. Isoform 2 of this protein is 1,984 amino acids long. The gene contains 1 exon and is located at 2p23.3. Aliases for C2orf16 include Open Reading Frame 16 on Chromosome 2 and P-S-E-R-S-H-H-S Repeats Containing Sequence.
C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.
C4orf19 is a protein which in humans is encoded by the C4orf19 gene.
C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.
{{cite book}}
: CS1 maint: location missing publisher (link)