C1orf122

Last updated
C1orf122
Identifiers
Aliases C1orf122 , ALAESM, chromosome 1 open reading frame 122
External IDs HomoloGene: 82588 GeneCards: C1orf122
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_198446
NM_001142726

n/a

RefSeq (protein)

NP_001136198
NP_940848

n/a

Location (UCSC) Chr 1: 37.81 – 37.81 Mb n/a
PubMed search [2] n/a
Wikidata
View/Edit Human

C1orf122 (Chromosome 1 open reading frame 122) is a gene in the human genome that encodes the cytosolic protein ALAESM.. [3] ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. [4] This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization. [5]

Contents

Gene

C1orf122 is located on chromosome 1 at 1p34.3. The gene is 1,665 nucleotides long, covering 37,808,405 to 37,809,454. [6] It contains three exons [7] with boundaries between amino acids 12 and 13, and amino acids 79 and 80.

mRNA

C1orf122 has two isoforms. Variant one contains 1,329 nucleotides with three exons. [8] Variant two contains 1,226 nucleotides with three exons. [9] Variant two lacks an in-frame portion of the 5' coding region, resulting in a shorter N-terminus. [10]

Protein

ALAESM has a molecular weight of 1100 kDa and an isoelectric point of 6.29. [11] It is a cytosolic protein without a transmembrane domain.

This conceptual translation shows various points of interest in the C1orf122 gene. Hsu C1orf122 Conceptual Translation.png
This conceptual translation shows various points of interest in the C1orf122 gene.

Predicted post-translational modifications

There are few predicted kinase phosphorylation sites in this protein. Position 7 is predicted to be phosphorylated by CK1, VRK, and VRK2. Position 10 is predicted to be phosphorylated by CRK1, VRK, PKC, PLK, and AGC. Position 82 has a possible phosphorylation by TKL and MLK. Position 94 is predicted to be phosphorylated by PKC, AGC, MAPK, NEK, CMGC and IKK. [12]

ALAESM does have a few predicted reactive sites. It is predicted to be palmitoylated at position 10, allowing the covalent attachment of fatty acids. [13] It is predicted to undergo glycation at positions 21 and 101 which attaches a sugar molecule to the amino acid. [14] It is predicted to have a nuclear export signal strand from position 55-64 which signals the protein to leave the nucleus. [15] It likely can be glycosylated at position 82 and 94 which attaches a carbohydrate to the amino acid. [16] It is predicted to be phosphorylated by an unspecified actor at position 10, 82, and 94 in the nucleus. [17]

Structure

The secondary structure of ALAESM is predicted to be structured as 55% random coil, 35% alpha helix and 9% extended strand. There are two alpha helices between positions 11-18 and 36-68. There are three 2 amino acid sections after position 80 and one 4 amino acid section at position 20 of extended strand. [18] The rest of the protein is random coil. There is no transmembrane domain within ALAESM [19]

Expression

ALAESM is expressed throughout all tissue cells in the body. [20] It is also expressed up to 2.5 times higher than its average level in the brain, spinal cord, adrenal gland and kidneys. The protein is expressed in the cytoplasm and since it is predicted to have a nuclear export signal, it is kept in the cytoplasm even in telophase when the nuclear envelope disassembles.

Homology

This graph shows the date of divergence, sequence length, and sequence identity for the orthologs of the human gene C1orf122. Hsu C1orf122 Orthologs.png
This graph shows the date of divergence, sequence length, and sequence identity for the orthologs of the human gene C1orf122.

Human C1orf122 does not have any paralogs, however it has multiple orthologs amongst placental mammals. These species range from cats, horses, rabbits, alpacas, and elephants. [21] The sequence across these species are highly conserved.

Related Research Articles

Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">C6orf62</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

<span class="mw-page-title-main">C15orf39</span>

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">C7orf26</span> Human protein-encoding gene on chromosome 7

c7orf26 is a gene in humans that encodes a protein known as c7orf26. Based on properties of c7orf26 and its conservation over a long period of time, its suggested function is targeted for the cytoplasm and it is predicted to play a role in regulating transcription.

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

<span class="mw-page-title-main">C16orf90</span> Protein-coding gene in the species Homo sapiens

C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.

<span class="mw-page-title-main">C20orf202</span>

C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">C12orf24</span> Protein-coding gene in the species Homo sapiens

C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.

<span class="mw-page-title-main">LSMEM2</span> Protein-coding gene in the species Homo sapiens

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

<span class="mw-page-title-main">C14orf180</span> Protein-coding gene in the species Homo sapiens

C14orf180 is found on chromosome 14 in humans: 14q32.33. It consists of 1832 bp and 160 amino acids post translation. There is a total number of 6 exons. C14orf180 is also known as NRAC, C14orf77, and Chromosome 14 Open Reading Frame 180.

<span class="mw-page-title-main">C11orf98</span> Protein-coding gene in the species Homo sapiens

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

Chromosome 4 open reading frame 50 is a protein that in humans is encoded by the C4orf50 gene. The protein localizes in the nucleus. C4orf50 has orthologs in vertebrates but not invertebrates

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

<span class="mw-page-title-main">C12orf54</span> Protein-coding gene in the species Homo sapiens

C12orf54 is a protein in humans that is encoded by the C12orf54 gene.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000197982 - Ensembl, May 2017
  2. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. "DAS-TMfilter server". mendel.imp.ac.at. Archived from the original on 2018-02-05. Retrieved 2020-04-30.
  4. "C1orf122 chromosome 1 open reading frame 122 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-04-30.
  5. "AceView: Gene:C1orf122, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2020-05-03.
  6. "Genome Data Viewer". www.ncbi.nlm.nih.gov. Retrieved 2020-05-03.
  7. "C1orf122 chromosome 1 open reading frame 122 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-04-27.
  8. "Homo sapiens chromosome 1 open reading frame 122 (C1orf122), transcript variant 1, mRNA". Int. J. Mol. Med. 24 (2): 233–246. 2009 via NCBI.
  9. "Homo sapiens chromosome 1 open reading frame 122 (C1orf122), transcript variant 2, mRNA". Int. J. Mol. Med. 24 (2): 233–246. 2009 via NCBI.
  10. "C1orf122 chromosome 1 open reading frame 122 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-04-30.
  11. "ExPASy: SIB Bioinformatics Resource Portal - Categories". www.expasy.org. Retrieved 2020-04-30.
  12. Wall D (2004). "HTTP in PHP". Multi-Tier Application Programming with PHP. Elsevier. pp. 21–43. doi:10.1016/b978-012732350-3/50003-x. ISBN   978-0-12-732350-3.
  13. "CSS-Palm - Palmitoylation Site Prediction". csspalm.biocuckoo.org. Retrieved 2020-05-03.
  14. "NetGlycate 1.0 Server". www.cbs.dtu.dk. Retrieved 2020-05-03.
  15. "NetNES 1.1 Server". www.cbs.dtu.dk. Retrieved 2020-05-03.
  16. "DictyOGlyc 1.1 Server". www.cbs.dtu.dk. Retrieved 2020-05-03.
  17. "NetPhosK 1.0 Server". www.cbs.dtu.dk. Archived from the original on 2021-07-09. Retrieved 2020-05-03.
  18. "NPS@ : GOR4 secondary structure prediction". npsa-prabi.ibcp.fr. Retrieved 2020-05-03.
  19. "TMpred Server". embnet.vital-it.ch. Retrieved 2020-05-03.
  20. "AceView: Gene:C1orf122, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2020-04-30.
  21. "C1orf122 chromosome 1 open reading frame 122 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-05-03.