C11orf86 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C11orf86 , chromosome 11 open reading frame 86 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1917111 HomoloGene: 52832 GeneCards: C11orf86 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. [5] It encodes for a protein known as uncharacterized protein C11orf86, [6] which is predicted to be a nuclear protein. The function of this protein is currently unknown.
C11orf86 is located on the long arm of chromosome 11 at 11q13.2. It consists of 1732 base pairs, and is found on the plus strand. Gene neighbors of C11orf86 include uncharacterized LOC105369355, microRNA 6860, microRNA 3163, synaptotagmin 12, ras homolog family member D, and pyruvate carboxylase. [7]
The program ElDorado, by Genomatix, identified the promoter region of C11orf86 on the positive strand from 66974707 to 66975464, for a total length of 758 base pairs. [8]
C11orf86 appears to be primarily expressed in the gastrointestinal tract. [9] Expression occurs in ascites, the intestine, the stomach, gastrointestinal tumors, and non-neoplasia. [10]
According to AceView, transcription of the gene produces three different mRNAs, two of which are alternatively spliced variants, while the third is an unspliced form. [11] All three variants could possibly code for functional proteins. The transcript used for this article is made up of two exons, amounting to 1185 base pairs, and has the reference number NM_001136485.1. [12]
mRNA variant | Exon | Exon Size (bp) | Intron | Intron Size (bp) |
---|---|---|---|---|
a | 1 | 362 | 1 | 541 |
2 | 829 | |||
b | 1 | 249 | 1 | 538 |
2 | 251 | |||
c | 1 | 551 | 0 |
C11orf86 protein is 115 amino acids in length. [6] The molecular weight of C11orf86 is 13.2 kdal. [13] Its isoelectric point is predicted to be 11.9. [14]
1 MGTGLRSQSL REPRPSYGKL QEPWGRPQEG QLRRALSLRQ GQEKSRSQGL ERGTEGPDAT 61 AQERVPGSLG DTEQLIQAQR RGSRWWLRRY QQVRRRWESF VAIFPSVTLS QPASP
The majority of the C11orf86 protein is composed of arginine (15.7%), glutamine (12.2%), serine (10.4%), glycine (10.4%), and leucine (9.6%). No cysteine, histidine, or asparagine residues are found in this protein. [13]
A : 6( 5.2%); C : 0( 0.0%); D : 2( 1.7%); E : 9( 7.8%); F : 2( 1.7%) G : 12(10.4%); H : 0( 0.0%); I : 2( 1.7%); K : 2( 1.7%); L : 11( 9.6%) M : 1( 0.9%); N : 0( 0.0%); P : 9( 7.8%); Q : 14(12.2%); R : 18(15.7%) S : 12(10.4%); T : 5( 4.3%); V : 4( 3.5%); W : 4( 3.5%); Y : 2( 1.7%)
C11orf86 has no positive, negative, or mixed charge clusters. However, there is a higher presence of arginine, which is positively charged.
1 00000+0000 +-0+0000+0 0-000+00-0 00++0000+0 00-+0+0000 -+00-00-00 61 00-+000000 -0-000000+ +00+000++0 000+++0-00 0000000000 00000
This protein is a part of the DUF4633 superfamily. Proteins that belong to this family are often between 94 and 123 amino acids in length. [15] This domain is found in bacteria, viruses, fungi, plants, insects, reptiles, birds, and mammals. [16]
C11orf86 is predicted to have nine possible phosphorylation sites, of which eight are serine, and one is threonine. [17] It is also predicted to have ten O-linked glycosylation sites. [18]
C11orf86 is primarily composed of random coil and alpha helices. [19] [20]
This protein is predicted to be a nuclear protein. [21] There appears to be a bipartite nuclear localization sequence beginning at position 80. [22]
The C11orf86 protein is conserved in mammals, and orthologs can easily be traced back to marsupials, monotremes, and reptiles. No orthologs of C11orf86 appear to be present in plants, fungi, fish, amphibians, or birds. There are no paralogs of C11orf86. The table below shows some orthologs that were found using BLAST. [23] Dates of divergence were found from TimeTree, using the median molecular time estimate. [24]
Genus and Species | Common Name | Accession Number | Protein Length (aa) | Percent Identity (%) | Date of Divergence (mya) |
---|---|---|---|---|---|
Homo sapiens | Human | NP_001129957.1 | 115 | 100 | |
Macaca mulatta | Rhesus macaque | XP_001115236.1 | 115 | 89 | 27.3 |
Rattus norvegicus | Brown rat | NP_001102991.1 | 123 | 74 | 90.1 |
Bos taurus | Cattle | NP_001070558.1 | 122 | 74 | 95.0 |
Equus caballus | Horse | XP_014585069.1 | 113 | 71 | 95.0 |
Mus musculus | Mouse | NP_081513.1 | 124 | 71 | 90.1 |
Felis catus | Cat | XP_003993784.1 | 121 | 70 | 95.0 |
Canis lupus familiaris | Dog | XP_003639729.1 | 121 | 69 | 95.0 |
Sarcophilus harrisii | Tasmanian devil | XP_012408455.1 | 120 | 40 | 162.4 |
Ornithorhynchus anatinus | Platypus | XP_007659213.1 | 119 | 40 | 169.7 |
Chrysemys picta bellii | Painted turtle | XP_008177865.1 | 135 | 35 | 320.5 |
A bipolar disorder association study identified C11orf86 as one of many genes found in a region of linkage disequilibrium on chromosome 11. Despite evidence of some association, C11orf86 was not found to be in an area of particular significance. [25] C11orf86 is down-regulated from non-neoplastic mucosa to adenomas and carcinomas, [26] down-regulated in renal cell carcinoma, [27] and harbors chromosomal gains that are significantly associated with pure mucinous subtypes in mucinous carcinoma. [28]
HIKESHI is a protein important in lung and multicellular organismal development that, in humans, is encoded by the HIKESHI gene. HIKESHI is found on chromosome 11 in humans and chromosome 7 in mice. Similar sequences (orthologs) are found in most animal and fungal species. The mouse homolog, lethal gene on chromosome 7 Rinchik 6 protein is encoded by the l7Rn6 gene.
METTL26, previously designated C16orf13, is a protein-coding gene for Methyltransferase Like 26, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.
WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. It contains multiple WD40 repeat domains and one domain of unknown function. This protein is conserved all the way back to invertebrates. Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis.
C20orf96 is a protein-coding gene in humans. It codes for an unknown protein known as uncharacterized protein C20orf96, predicted to be a nuclear protein. The function and biological processes of the gene is not well understood by the scientific community yet.
CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).
Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
OCC-1 is a protein, which in humans is encoded by the gene C12orf75. The gene is approximately 40,882 bp long and encodes 63 amino acids. OCC-1 is ubiquitously expressed throughout the human body. OCC-1 has shown to be overexpressed in various colon carcinomas. Novel splice variant of this gene was also detected in various human cancer types; in addition to encoding a novel smaller protein, OCC-1 gene produces a non-protein coding RNA splice variant lncRNA.
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.
C21orf62 is a protein that, in humans, is encoded by the C21orf62 gene. C21orf62 is found on human chromosome 21, and it is thought to be expressed in tissues of the brain and reproductive organs. Additionally, C21orf62 is highly expressed in ovarian surface epithelial cells during normal regulation, but is not expressed in cancerous ovarian surface epithelial cells.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.
C11orf42 is an uncharacterized protein in homo sapiens that is encoded by the C11orf42 gene. It is also known as chromosome 11 open reading frame 42 and uncharacterized protein C11orf42, with no other aliases. The gene is mostly conserved in mammals, but it has also been found in rodents, reptiles, fish and worms.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.