C1orf122 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C1orf122 , ALAESM, chromosome 1 open reading frame 122 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | HomoloGene: 82588 GeneCards: C1orf122 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
C1orf122 (Chromosome 1 open reading frame 122) is a gene in the human genome that encodes the cytosolic protein ALAESM.. [3] ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. [4] This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization. [5]
C1orf122 is located on chromosome 1 at 1p34.3. The gene is 1,665 nucleotides long, covering 37,808,405 to 37,809,454. [6] It contains three exons [7] with boundaries between amino acids 12 and 13, and amino acids 79 and 80.
C1orf122 has two isoforms. Variant one contains 1,329 nucleotides with three exons. [8] Variant two contains 1,226 nucleotides with three exons. [9] Variant two lacks an in-frame portion of the 5' coding region, resulting in a shorter N-terminus. [10]
ALAESM has a molecular weight of 1100 kDa and an isoelectric point of 6.29. [11] It is a cytosolic protein without a transmembrane domain.
There are few predicted kinase phosphorylation sites in this protein. Position 7 is predicted to be phosphorylated by CK1, VRK, and VRK2. Position 10 is predicted to be phosphorylated by CRK1, VRK, PKC, PLK, and AGC. Position 82 has a possible phosphorylation by TKL and MLK. Position 94 is predicted to be phosphorylated by PKC, AGC, MAPK, NEK, CMGC and IKK. [12]
ALAESM does have a few predicted reactive sites. It is predicted to be palmitoylated at position 10, allowing the covalent attachment of fatty acids. [13] It is predicted to undergo glycation at positions 21 and 101 which attaches a sugar molecule to the amino acid. [14] It is predicted to have a nuclear export signal strand from position 55-64 which signals the protein to leave the nucleus. [15] It likely can be glycosylated at position 82 and 94 which attaches a carbohydrate to the amino acid. [16] It is predicted to be phosphorylated by an unspecified actor at position 10, 82, and 94 in the nucleus. [17]
The secondary structure of ALAESM is predicted to be structured as 55% random coil, 35% alpha helix and 9% extended strand. There are two alpha helices between positions 11-18 and 36-68. There are three 2 amino acid sections after position 80 and one 4 amino acid section at position 20 of extended strand. [18] The rest of the protein is random coil. There is no transmembrane domain within ALAESM [19]
ALAESM is expressed throughout all tissue cells in the body. [20] It is also expressed up to 2.5 times higher than its average level in the brain, spinal cord, adrenal gland and kidneys. The protein is expressed in the cytoplasm and since it is predicted to have a nuclear export signal, it is kept in the cytoplasm even in telophase when the nuclear envelope disassembles.
Human C1orf122 does not have any paralogs, however it has multiple orthologs amongst placental mammals. These species range from cats, horses, rabbits, alpacas, and elephants. [21] The sequence across these species are highly conserved.
Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.
c7orf26 is a gene in humans that encodes a protein known as c7orf26. Based on properties of c7orf26 and its conservation over a long period of time, its suggested function is targeted for the cytoplasm and it is predicted to play a role in regulating transcription.
Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
C14orf180 is found on chromosome 14 in humans: 14q32.33. It consists of 1832 bp and 160 amino acids post translation. There is a total number of 6 exons. C14orf180 is also known as NRAC, C14orf77, and Chromosome 14 Open Reading Frame 180.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.
Chromosome 4 open reading frame 50 is a protein that in humans is encoded by the C4orf50 gene. The protein localizes in the nucleus. C4orf50 has orthologs in vertebrates but not invertebrates
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.
C12orf54 is a protein in humans that is encoded by the C12orf54 gene.