C2orf72 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C2orf72 , chromosome 2 open reading frame 72, Chromosome 2, Open Reading Frame 72 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1920042 HomoloGene: 54780 GeneCards: C2orf72 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
C2orf72 (Chromosome 2, Open Reading Frame 72) is a gene in humans ( Homo sapiens) that encodes a protein currently named after its gene, C2orf72. [5] It is also designated LOC257407 [5] and can be found under GenBank accession code NM_001144994.2. [6] The protein can be found under UniProt accession code A6NCS6. [7]
This gene is primarily expressed in the liver, brain, placental, and small intestine tissues. [8] C2orf72 is an intracellular protein that has been predicted to reside within the nucleus, cytosol, and plasma membrane of cells. [5] The function of C2orf72 is unknown, but it is predicted to be involved in very-low-density lipoprotein particle assembly and also involved in the regulation of cholesterol esterification. [9] This prediction also matches with the fact that both estradiol [10] and testosterone [11] have been reported to upregulate expression of C2orf72. [12]
C2orf72 is a protein-coding gene found on the forward (+) strand of chromosome 2 at the locus 2q37.1, on the long arm of the chromosome. [5]
C2orf72's mRNA transcript is reported to be about 3,629 base pairs long. [6] It appears to have two polyadenylation sites near the 5′ end of the mRNA transcript, each preceded by their respective regulatory sequences, such as ATTAAA or AATAAA. [6]
There are three predicted exons reported for human C2orf72. [6]
C2orf72 is preferentially expressed in brain, liver, placenta, colon, small intestine, gallbladder, stomach, and prostate, and to a lesser extent in adrenal gland, appendix, pancreas, lung, kidney, testis, and urinary bladder. [8]
It is predicted via Archs4 [13] (July 16, 2022) that the function of this gene may be related to very-low-density lipoprotein particle assembly [14] and also involved in the regulation of cholesterol esterification. [9]
In a study of embryonic liver samples lacking hepatocyte nuclear factor 4 alpha (HNF4α), the expression of C2orf72 was downregulated. [15]
Both estradiol [10] and testosterone [11] upregulate expression of C2orf72. [12]
C2orf72 mRNA and protein products are found preferentially in the liver, kidney, and placenta. [16] The protein is localized to the cell membrane and cytoplasm in liver, brain, and placental tissues. [16]
miR-1271-5p is a microRNA that could bind to the 3′ untranslated region of the C2orf72 mRNA transcript at 5′-...GUGCCAA...-3′. [6] [17] [18]
There are at least two predicted phosphorylation sites for the human C2orf72 protein, one at threonine-286 and the other at serine-294. [7]
The predicted molecular weight of C2orf72 is 30.5 kDa, [19] and it has a predicted isoelectric point (pI) of pH 8.7. [20]
There are eight cysteine residues, for a potential of four disulfide bonds. [21] Most of the cysteine residues are positioned next to a polar amino acid (uncharged or positively or negatively charged). [21]
At physiological pH, there are 33 positively charged amino acid residues, including histidine, most of which are arginines. [21] Likewise, there are 33 negatively charged amino acid residues, most of which are glutamates. [21]
There are 14 hydroxyl-containing residues (tyrosine, threonine or serine) that could serve as typical phosphorylation sites; most of these are serines. [21]
These proteins have been reported to interact with human C2orf72: RASN (GTPase NRas), [23] RASK (GTPase KRas), [23] and CD81. [24] [25]
There are at least 203 organisms with an ortholog of C2orf72. [26] The most evolutionarily distant reported ortholog of C2orf72 is in the Australian ghost shark (Callorhincus milii);, [27] [28] [29] and it is broadly conserved from Actinopterygii (bony fish) to Mammalia.
Genus and species | Common name | Order | Date of divergence from human (million years ago) | GenBank accession code | Sequence length | Sequence identity (%) | Sequence similarity (%) |
---|---|---|---|---|---|---|---|
Pan troglodytes | Chimpanzee | Primates | 6.7 | XP_516141.5 | 295 | 98.6 | 98.6 |
Pongo abelii | Sumatran orangutan | Primates | 15.76 | XP_024099683.1 | 295 | 95.3 | 96.9 |
Castor canadensis | American beaver | Rodentia | 90 | XP_020011841.1 | 282 | 77.6 | 82.4 |
Oryx dammah | Scimitar-horned oryx | Artiodactyla | 96 | XP_040084064.1 | 285 | 74.6 | 79.3 |
Sus scrofa | Wild boar | Artiodactyla | 96 | XP_005657646.1 | 282 | 75.3 | 80.7 |
Tursiops truncatus | Common bottlenose dolphin | Cetacea | 96 | XP_033715450.1 | 285 | 76.9 | 80.7 |
Felis catus | Domestic cat | Carnivora | 96 | XP_023115562.1 | 286 | 80.1 | 83.1 |
Eptesicus fuscus | Big brown bat | Chiroptera | 96 | XP_027993078.1 | 151 | 36.1 | 38.9 |
Corapipo altera | White-ruffed manakin | Passeriformes | 312 | XP_027503457.1 | 181 | 26.7 | 34.0 |
Pipra filicauda | Wire-tailed manakin | Passeriformes | 312 | XP_027606890.1 | 243 | 34.7 | 45.2 |
Taeniopygia guttata | Zebra finch | Passeriformes | 312 | XP_030136117.3 | 255 | 35.1 | 45.4 |
Corvus cornix cornix | Hooded crow | Passeriformes | 312 | XP_039412719.1 | 245 | 36.0 | 45.3 |
Hirundo rustica | Barn swallow | Passeriformes | 312 | XP_039930397.1 | 243 | 37.0 | 46.7 |
Aythya fuligula | Tufted duck | Anseriformes | 312 | XP_032049188 | 251 | 36.3 | 46.7 |
Anas platyrhynchos | Mallard | Anseriformes | 312 | XP_038039556.1 | 251 | 36.3 | 46.7 |
Protobothrops mucrosquamatus | Brown-spotted pit viper | Squamata | 312 | XP_029139335.1 | 278 | 22.9 | 34.5 |
Python bivittatus | Burmese python | Squamata | 312 | XP_025023716.1 | 279 | 23.3 | 35.9 |
Pseudonaja textilis | Eastern brown snake | Squamata | 312 | XP_026577460.1 | 272 | 31.6 | 41.0 |
Pantherophis guttatus | Corn snake | Squamata | 312 | XP_034263860.1 | 252 | 33.0 | 42.5 |
Pogona vitticeps | Central bearded dragon | Squamata | 312 | XP_020657305.1 | 295 | 24.1 | 34.0 |
Zootoca vivipara | Common lizard | Squamata | 312 | XP_034989711.1 | 285 | 37.9 | 48.6 |
Lacerta agilis | Sand lizard | Squamata | 312 | XP_033004091.1 | 289 | 38.0 | 49.5 |
Podarcis muralis | Common wall lizard | Squamata | 312 | XP_028587763.1 | 272 | 38.7 | 50.8 |
Gopherus evgoodei | Goode's thornscrub tortoise | Testudines | 312 | XP_030431493.1 | 481 | 24.2 | 31.1 |
Terrapene carolina triunguis | Three-toed box turtle | Testudines | 312 | XP_029766982.1 | 262 | 35.1 | 43.2 |
Chrysemys picta bellii | Painted turtle | Testudines | 312 | XP_023966073.1 | 306 | 36.6 | 47.4 |
Dermochelys coriacea | Leatherback sea turtle | Testudines | 312 | XP_038272534.1 | 271 | 38.1 | 48.1 |
Mauremys reevesii | Reeves' turtle | Testudines | 312 | XP_039344659.1 | 277 | 39.5 | 51.4 |
Nanorana parkeri | High Himalaya frog | Anura | 351.8 | XP_018432004.1 | 304 | 27.3 | 40.1 |
Xenopus tropicalis | Tropical clawed frog | Anura | 351.8 | XP_002937397.3 | 289 | 30.7 | 42.4 |
Rhinatrema bivittatum | Two-lined caecilian | Gymnophiona | 351.8 | XP_029473197.1 | 358 | 30.3 | 36.1 |
Geotrypetes seraphini | Gaboon caecilian | Gymnophiona | 351.8 | XP_033814148.1 | 233 | 33.9 | 44.2 |
Parambassis ranga | Indian glass fish | Perciformes | 435 | XP_028260036.1 | 334 | 19.7 | 34.5 |
Acanthochromis polyacanthus | Spiny chromis | Perciformes | 435 | XP_022050415.1 | 317 | 21.8 | 35.6 |
Acanthopagrus latus | Yellowfin seabream | Perciformes | 435 | XP_036971960.1 | 309 | 22.0 | 35.5 |
Cyprinodon tularosa | White Sands pupfish | Cyprinodontiformes | 435 | XP_038147473.1 | 296 | 20.1 | 33.1 |
Esox lucius | Northern pike | Esociformes | 435 | XP_012990404.1 | 332 | 20.6 | 33.1 |
Thunnus maccoyii | Southern bluefin tuna | Scombriformes | 435 | XP_042273029.1 | 329 | 20.2 | 34.0 |
Syngnathus acus | Greater pipefish | Syngnathiformes | 435 | XP_037106050.1 | 274 | 19.5 | 34.9 |
Callorhinchus milii | Australian ghost shark | Chimaeriformes | 473 | XP_007887618.1 | 413 | 17.6 | 26.5 |
Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.
Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.
CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
C14orf180 is found on chromosome 14 in humans: 14q32.33. It consists of 1832 bp and 160 amino acids post translation. There is a total number of 6 exons. C14orf180 is also known as NRAC, C14orf77, and Chromosome 14 Open Reading Frame 180.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.
Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.
KIAA2013, also known as Q8IYS2 or MGC33867, is a single-pass transmembrane protein encoded by the KIAA2013 gene in humans. The complete function of KIAA2013 has not yet been fully elucidated.
Chromosome 20 open reading frame 85, or most commonly known as C20orf85 is a gene that encodes for the C20orf85 Protein. This gene is not yet well understood by the scientific community.
NADP-dependent oxidoreductase domain-containing protein 1 is a protein that in humans is encoded by the NOXRED1 gene. An alias of this gene is Chromosome 14 Open Reading Frame 148 (c14orf148). This gene is located on chromosome 14, at 14q24.3. NOXRED1 is predicted to be involved in pyrroline-5-carboxylate reductase activity as part of the L-proline biosynthetic pathway. It is expressed in a wide variety of tissues at a relatively low level, including the testes, thyroid, skin, small intestine, brain, kidney, colon, and more.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Transmembrane protein 271, or TMEM271 is a protein in Homo sapiens encoded by the TMEM271 gene, located at 4p16.3 on the minus strand. The protein is located on the plasma membrane of cells and highly expressed in several regions of the brain.