C6orf62 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C6orf62 , Nbla00237, XTP12, dJ30M3.2, chromosome 6 open reading frame 62 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 2441726; HomoloGene: 11551; GeneCards: C6orf62; OMA:C6orf62 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), [5] is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol. [6]
In the DNA, C6orf62 is 12,529 base pairs long and is located at 6q22.3. [7] It is located on chromosome 6 on position 22.3 (6q22.3). The mature mRNA sequence is 2498 base-pairs long with 5 exons and 4 intronic regions that translates a protein that is 229 amino acids long and two predicted isoforms of 160 amino acids and 200 amino acids. [8] [9]
The main transcript is 229 amino acids long and is encoded from 5 exonic regions. There exists two transcript variants that are 200 amino acids and 160 amino acids long. There is a domain of unknown function (DUF4566) present in all three variants and spans positions 1–226 on the main transcript. [10] The molecular weight of C6orf62 is 27.1 kDa and its isoelectric point is at a pH of 9.24. [11] It is located subcellularly localized throughout the cytosol. [6]
Protein | |
---|---|
ELAVL1 | predicted through affinity capture-RNA. [12] |
PVRL4 | predicted through affinity capture-mass spectroscopy. [12] |
NADH | predicted through yeast two-hybrid screening. [13] |
C6orf62 is broadly expressed within the human body, however, its protein abundance is not high. [14] It is more heavily expressed in the gallbladder and testis, but it is not predicted to be expressed in the smooth muscle, lymph nodes, the spleen, ovaries, adipose tissue, and soft tissue.
C6orf62 is highly conserved among vertebrates and has orthologs found in invertebrates.
Genus | Species | Common Name | Date of divergence (MYA) | RefSeq | AA Length | % Identity | % Similarity |
---|---|---|---|---|---|---|---|
Pongo | abelii | Sumatran Orangutan | 15.2 | NP_001126883.1 | 232 | 99 | 99 |
Ictidomys | tridecemlineatus | Ground Squirrel | 88 | XP_005337081.1 | 231 | 99 | 98 |
Tupaia | chinensis | Treeshrew | 85 | XP_006162754.1 | 267 | 84 | 98 |
Rattus | norvegicus | Brown Rat | 88 | NP_001017510.1 | 232 | 99 | 98 |
Genus | Species | Common Name | Date of divergence (MYA) | RefSeq | AA Length | % Identity | % Similarity |
---|---|---|---|---|---|---|---|
Salmo | salar | Salmon | 435 | XP_014034327.1 | 233 | 82 | 93 |
Sinocyclocheilus | rhinoceros | Golden-lined Barbell | 435 | XP_016388718.1 | 230 | 87 | 94 |
Hippocampus | comes | Seahorse | 435 | XP_019729847.1 | 234 | 80 | 91 |
Genus | Species | Common Name | Date of divergence (MYA) | RefSeq | AA Length | % Identity | % Similarity |
---|---|---|---|---|---|---|---|
Nanorana | parkeri | High Himalaya Frog | 353 | XP_018411819.1 | 232 | 95 | 97 |
Xenopus | trpocalis | African Clawed Frog | 353 | NP_001120278.1 | 232 | 93 | 98 |
Genus | Species | Common Name | Date of divergence (MYA) | RefSeq | AA Length | % Identity | % Similarity |
---|---|---|---|---|---|---|---|
Chrysemys | picta belli | Painted Turtle | 320 | XP_005281462.1 | 233 | 97 | 98 |
Python | vbivittatus | Burmese Python | 320 | XP_007521998.1 | 233 | 96 | 99 |
Chelonia | mydas | Green Sea Turtle | 320 | XP_007060449.1 | 231 | 97 | 98 |
Alligator | sinensis | Chinese Alligator | 320 | XP_006034488.1 | 232 | 98 | 99 |
Genus | Species | Common Name | Date of divergence (MYA) | RefSeq | AA Length | % Identity | % Similarity |
---|---|---|---|---|---|---|---|
Tauraco | erythrolophus | Red-crested Tauraco | 320 | XP_009991398.1 | 232 | 97 | |
Taeniopygia | guttata | Zebra Finch | 320 | XP_002194166.1 | 232 | 98 | |
Manacus | vitellinus | Gold Marked Manakin | 320 | XP_017925511.1 | 241 | 98 |
Orthologs in Select Invertebrates [15]
Genus | Species | Common Name | Date of divergence (MYA) | RefSeq | AA Length | % Identity | % Similarity |
---|---|---|---|---|---|---|---|
Ciona | intestinalis | Sea Squirt | 677 | XP_002119736.1 | 233 | 38 | 62 |
Crassostrea | gigas | Pacific Oyster | 758 | XP_011438527.1 | 231 | 62 | 80 |
Hellobdella | robusta | Glossiphoniidae Leech | 758 | XP_009016977.1 | 264 | 35 | 51 |
Octopus | bimaculoides | Two-Spot Octopus | 758 | XP_014780369.1 | 197 | 60 | 79 |
Saccoglossus | kowalevskii | Acorn Worm | 627 | XP_002741666.1 | 230 | 66 | 83 |
TSR3, or TSR3 Ribosome Maturation Factor, is a hypothetical human protein found on chromosome 16. Its protein is 312 amino acids long and its cDNA has 1214 base pairs. It was previously designated C16orf42.
C2CD4D, or C2 calcium-dependent domain-containing protein 4D is a protein product of the human genome. The gene that codes for this protein is found on chromosome 1, from 150,076,963 to 150,079,657. The gene contains 2 exons and encodes 353 amino acids. Synonyms for C2CD4D are "FAM148D" and NP_001129475. C2CD4D contains a conserved metal binding domain that is a known as Protein kinase C conserved region 2, subgroup 1. This motif is known to be a member of the C2 superfamily, which is present in phospholipases, protein kinases C, and synaptotagmins. The amino acid sequence of C2CD4D can be accessed at Prior to any post translational modification, C2CD4D has a molecular weight of 37.6 kdal. Although scientists have not yet determined where C2CD4D functions within the cell, C2CD4D has a predicted isoelectric point of 11.636 which severely limits the places in which it can be effective. In addition, C2CD4D does not contain any predicted transmembrane domains or any predicted signal peptides.
METTL26, previously designated C16orf13, is a protein-coding gene for Methyltransferase Like 26, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.
Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.
CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).
Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.
Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
The coiled-coil domain containing 142 (CCDC142) is a gene which in humans encodes the CCDC142 protein. The CCDC142 gene is located on chromosome 2, spans 4339 base pairs and contains 9 exons. The gene codes for the coiled-coil domain containing protein 142 (CCDC142), whose function is not yet well understood. There are two known isoforms of CCDC142. CCDC142 proteins produced from these transcripts range in size from 743 to 665 amino acids and contain signals suggesting protein movement between the cytosol and nucleus. Homologous CCDC142 genes are found in many animals including vertebrates and invertebrates but not fungus, plants, protists, archea, or bacteria. Although the function of this protein is not well understood, it contains a coiled-coil domain and a RINT1_TIP1 motif located within the coiled-coil domain.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.
C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
LOC101059915 is a protein, which in humans is encoded by the LOC101059915 gene. It is located on the X chromosome and has restricted expression in the testis.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.
Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.