FAM216A | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | FAM216A , C12orf24, HSU79274, family with sequence similarity 216 member A | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1916198 HomoloGene: 8328 GeneCards: FAM216A | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
C12Orf24 | |
---|---|
Alliases | FAM216A, HSU79274 |
External IDs | NCBI: NM_013300.3 |
C12orf24 (Chromosome 12, open reading frame 24) is a gene in humans ( Homo sapiens ) that encodes a protein known as FAM216A. [5] This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. [5] [6] FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. [7] The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids. [8]
C12orf24 is a protein-encoding gene found on the forward strand of chromosome 12 at the locus 12q24.11. C12orf24 is located on the long arm of chromosome 12. [9]
FAM216A has 2 isoforms X1 and X2 that are both shorter than the main protein, which is known as FAM216A. Isoform X2 has 2 variants which are the same length and isoform X1 only has 1 variant. [5]
Name | Transcript ID | Base pair length | Protein length |
---|---|---|---|
FAM216A | NP_037432.2 | 1,101 | 273 |
FAM216A_X1 | XP_011536548.1 | 1,148 | 234 |
FAM216A_X2 | XP_024304727.1 | 1,142 | 179 |
FAM216A_X2 | XP_005253932.1 | 1,040 | 179 |
Chromosome 12 is a medium-sized chromosome, on which C12orf24 spans from 110,468,415 to 110,490,387 which is 21,973 bases long. The resulting mRNA transcript is 1,101 bases, 822 of which is the coding sequence. The resulting protein is 273 amino acids long. [10] [5]
Exon | Start | End | Length in base pairs |
---|---|---|---|
ENSE00001474815 | 110,468,845 | 110,469,018 | 174 |
ENSE00003514548 | 110,473,078 | 110,473,077 | 41 |
ENSE00003459833 | 110,485,078 | 110,485,199 | 122 |
ENSE00003639607 | 110,486,325 | 110,486,454 | 130 |
ENSE00003553453 | 110,486,534 | 110,486,717 | 184 |
ENSE00003571489 | 110,487,861 | 110,487,943 | 83 |
ENSE00003568139 | 110,490,019 | 110,490,385 | 367 |
C12orf24 is primarily expressed in the brain, spinal cord, and testis of humans. [6] [11] Within the testis C12orf24 is expressed in Sertoli cells. [8] Within the brain C12orf24 is expressed within neuropils. [12]
According to the UCSC Genome Browser and Genomatix Eldorado there are two promoters of C12orf24 and no enhancers or other regulatory elements. [13] [14] Only one of the two predicted promoters binds transcription factors. [14]
There are many transcription factors predicted to bind to the promoter region of C12orf24 by Genomatix and the UCSC Genome Browser. [14] [13]
Name | Function | Binding sequence |
---|---|---|
ZKSCAN3 | C2H2 zinc finger transcription factors 2 | ccctcCCCCaccgtaactccggg |
SPZ1 | Testis-specific bHLH-Zip transcription factors | aGGAGggaaat |
WT1 | Wilms tumor suppressor | cggtgggGGAGgggcagga |
E2F7 | Cell cycle regulator | cacaggaGGGAaatata |
MZF1 | Myeloid zinc finger 1 factors | gcGGGGagcag |
ETSF | Human and murine ETS1 factors | gtttgacaGGAAggtggctca |
ZNF7 | C2H2 zinc finger transcription factors 18 | cgggaGGCTgaggca |
NKX2 | NKX homeodomain factors | gccctcAAGTgagaggcgg |
MYOD | Myoblast determining factor | ctgggaCAGCtgctccc |
According to the PSORT program of Genescript, C12orf24 is 69.6% likely to be a nuclear protein. [7] NCBI Gene predicts that C12orf24 is an intracellular protein. [5]
Protein FAM261A has 2 charge runs, a positive run from amino acids 200-229 and a negative charge run from amino acids 238–268. [15] Methionine, histidine, and serine are all seen at a higher than expected rate in FAM216A while valine is seen at a significantly lower than expected rate. [15]
The CFSSP (Chou and Fassman Secondary Structure Prediction Server) predicts a secondary structure for FAM216A that has multiple alpha helices with a few large beta pleated sheets. [16] [17] I-TASSER structure prediction program shows a 3D structure of FAM216A that has many alpha helices and a few coil turns with no beta pleated sheets. [18]
There are 8 predicted sites of sumoylation on FAM216A, with only 2 of them having a high probability of occurrence. [19] There are 7 predicted sites of glycation on FAM216A. [20] There is no predicted signal peptide for FAM216A, which may prevent the protein from being glycosylated. [21] There is 42 predicted sites for phosphorylation, however given the predicted structure of the protein not all 42 sites will be accessible for phosphorylation. [22]
There are only a few known proteins that are predicted to interact with FAM216A. [23] [24] [25]
Protein | Function |
---|---|
E6 | HPV type 8 protein that prevents apoptosis in infected cells |
MAGEA10 | Possible roles in embryonal development, tumor transformation, and tumor development |
DKC1 | Required for ribosome biogenesis and telomere maintenance |
RCF4 | Replication factor C subunit 4. Possible required for elongation of multiprimed DNA |
CCDC34 | Coiled-coil domain containing 34. Involved in the cell cycle |
NIFK | Nucleolar protein interacting with the FHA domain of MKI67 |
PSMA4 | Protease responsible for degrading most intracellular proteins |
The only paralog for FAM216A is FAM216B. [26]
According to the NCBI Gene page for C12orf24, there are at least 182 organisms with an ortholog of C12orf24. [27] The farthest back known orthologs are in sea corals which diverged from humans 824 million years ago. [28] [29]
Genus and species | Common name | Taxonomic group | Date of divergence (MYA) | Accession number | Sequence length | Sequence identity | Sequence similarity |
---|---|---|---|---|---|---|---|
Pan paniscus | Bonobo | Primate | 6.7 | XP_003832531 | 281 | 95.7 | 96.1 |
Pan troglodytes | Chimpanzee | Primate | 6.7 | PNI64720 | 256 | 92.7 | 93 |
Gulo gulo | Wolverine | Carnivore | 96 | VCW50183 | 254 | 79.9 | 86.6 |
Orcinus orca | Killer whale | Odontoceti | 96 | XP_004276778 | 256 | 78.8 | 84.2 |
Delphinapteruss leucas | Beluga whale | Odonoceti | 96 | XP_022453741 | 256 | 78.4 | 84.2 |
Mondon monoceros | Narwhal | Odontoceti | 96 | XP_029084711 | 256 | 78.4 | 84.2 |
Tursiops truncatus | Bottlenose dolphin | Odontoceti | 96 | XP_019806066 | 256 | 78.4 | 84.2 |
Callhorhinus ursinus | Northern fur seal | Carnivore | 96 | XP_025745569 | 253 | 77.7 | 85 |
Felis catus | House cat | Carnivore | 96 | XP_003994705 | 253 | 75.5 | 84.2 |
Puma concolor | Puma | Carnivore | 96 | XP_025789304 | 253 | 75.5 | 84.2 |
Acinonyx jubatus | Cheetah | Carnivore | 96 | XP_026899980 | 253 | 75.1 | 84.2 |
Equus asinus | Donkey | Equidae | 96 | XP_014714174 | 253 | 74.7 | 80.6 |
Camelus ferus | Wild Bactrian camel | Artidoctyla | 96 | XP_006178132 | 256 | 73.6 | 82.8 |
Loxodonta africana | African bush elephant | Proboscidea | 105 | XP_010598175 | 254 | 76.2 | 83.2 |
Ciona intestinalis | Sea vase | Tunicata | 676 | XP_002124486 | 268 | 16.9 | 28.9 |
Acanthaster planci | Crown-of-thorns starfish | Echinodermata | 684 | XP_022109834 | 368 | 17 | 29.7 |
Actinia tenebrosa | Australian red Waratah Sea anemone | Cnidaria | 824 | XP_031556553 | 433 | 16.1 | 24.8 |
Pocillopora damicornis | Cauliflower coral | Cnidaria | 824 | XP_027047953 | 415 | 16 | 23.7 |
Orbicella faveolata | Mountainous star coral | Cnidaria | 824 | XP_020623330 | 435 | 15.4 | 23.2 |
TSBP1 is a protein that in humans is encoded by the TSBP1 gene. C6orf10 is an open reading frame on chromosome 6 containing a protein that is ubiquitously expressed at low levels in the adult genome and may play a role during fetal development. C6orf10 has been found to be linked to both neurodegenerative and autoimmune diseases in adults. Expression of this gene is highest in the testis but is also seen in other tissue types such as the brain, lens of the eye and the medulla. TSBP1 was previously known as C6orf10.
Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.
C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).
Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.
Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.
Testis-expressed protein 9 is a protein that in humans is encoded the TEX9 gene. TEX9 that encodes a 391-long amino acid protein containing two coiled-coil regions. The gene is conserved in many species and encodes orthologous proteins in eukarya, archaea, and one species of bacteria. The function of TEX9 is not yet fully understood, but it is suggested to have ATP-binding capabilities.
Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
Transmembrane protein 155 is a protein that in humans is encoded by the TMEM155 gene. It is located on human chromosome 4, spanning 6,497 bases. It is also referred to as FLJ30834 and LOC132332. This protein is known to be expressed mainly in the brain, placenta, and lymph nodes and is conserved throughout most placental mammals. The function and structure of this protein is still not well understood, but its level of expression has been studied pertaining to various pathologies.
C5orf46 is a protein coding gene located on chromosome 5 in humans. It is also known as sssp1, or skin and saliva secreted protein 1. There are two known isoforms known in humans, with isoform 2 being the longer of the two. The protein encoded is predicted to have one transmembrane domain, and has a predicted molecular weight of 9,692 Da, and a basal isoelectric point of 4.67.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.
ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.
C4orf19 is a protein which in humans is encoded by the C4orf19 gene.
Zinc Finger Protein 62, also known as "ZNF62," "ZNF755," or "ZET," is a protein that in humans is encoded by the ZFP62 gene. ZFP62 is part of the C2H2 Zinc Finger family of genes.