C20orf144

Last updated
C20orf144
Identifiers
Aliases C20orf144 , dJ63M2.6, chromosome 20 open reading frame 144
External IDs HomoloGene: 76810 GeneCards: C20orf144
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_080825

n/a

RefSeq (protein)

NP_543015

n/a

Location (UCSC) Chr 20: 33.66 – 33.67 Mb n/a
PubMed search [2] n/a
Wikidata
View/Edit Human

Chromosome 20 open reading frame 144 (c20orf144) is a human protein-encoding gene. [3] The human c20orf144 protein consists of 153 amino acids, with the first 150 amino acids being characterized as part of the Bcl-2 like protein of testis (Bclt) family (pfam 15318). [4]

Contents

Gene

The c20orf144 gene is located on the plus strand at 20q11.22 and spans 3,293 base pairs. [5] The gene contains two exons. [3] Of the plus strand, 572 nucleotides are antisense to parts of the human genes PXMP4 and NECAB3. [6] Other gene neighbors include ACTL10 and CBFA2T2. [7]

Transcript

The encoded mRNA is 522 nucleotides in length (Accession: NM_080825) and there are no identified alternative splicings. [8] Human c20orf144 mRNA expression is enriched in the testis, specifically in the early and late spermatids. [9]

Protein

The human c20orf144 gene encodes a protein of 153 amino acids in length, and there are three disordered regions (Accession: NP_543015.1). [4] Amino acids 1-150 are a part of the Bclt protein family which is predicted to be involved in apoptosis. [10] The molecular weight is 17.2kDa and the theoretical isoelectric point is 11.47. [11] There are 21 more lysines and arginines, which are positively charged, than there are aspartates and glutamates, which are negatively charged.

Annotated conceptual translation of human C20orf144 mRNA and protein. Conceptual Translation of Human C20orf144.jpg
Annotated conceptual translation of human C20orf144 mRNA and protein.

The tertiary protein structure, produced by AlphaFold, [12] predicts the presence of 3 α helices, and the absence of β sheets in human c20orf144.

AlphaFold structure prediction of human C20orf144. The predicted alpha helices are shown in spherical form. Wiki Structure prediction.jpg
AlphaFold structure prediction of human C20orf144. The predicted alpha helices are shown in spherical form.

Cellular localization

Analysis of the localization of human c20orf144 and many mammalian orthologs predicts localization of c20orf144 in the nucleus, with 78.3% confidence for the human protein. [13]

Post translational modifications

Table 1. Predicted Post Translation Modifications of Human C20orf144
ModificationModification Site in Human C20orf144
N-Myristoylation [13] [14] 2G
Protein Kinase C Phosphorylation [15] 6S
Casein Kinase 2 Phosphorylation [15] 87S
Non-Specific Phosphorylation [15] 117S
O-Glycosylation [16] 117S
Protein Kinase C Phosphorylation [15] 123S

Evolution and orthologs

The rate of evolution of C20orf144 in comparison to the rates of evolution of Cytochrome C and Fibrinogen Alpha Chain. Wiki Rate of Evolution.jpg
The rate of evolution of C20orf144 in comparison to the rates of evolution of Cytochrome C and Fibrinogen Alpha Chain.

The evolutionary rate of C20orf144 is comparable to the high rate of evolution of fibrinogen alpha chain, suggesting the protein is evolving quickly.

Orthologs of the c20orf144 gene in Homo sapiens are found in many mammals excluding monotremes. [17] As shown in Table 2, marsupials are the most distantly related organisms to humans in which proteins encoded by human c20orf144 gene orthologs are found, suggesting that C20orf144 first appeared approximately 160 million years ago.

Table 2. Proteins encoded by the orthologs of the c20orf144 gene in humans.
Genus and SpeciesCommon NameOrderProtein Accession #Median Date of Divergence (MYA) [18] Sequence LengthSequence Identity (%)Sequence Similarity (%)
Homo sapiens Human Primata NP_543015.10153100100
Macaca mulatta Rhesus Monkey PrimataXP_001105397.128.915386.390.8
Piliocolobus tephrosceles Ugandan Red Colobus PrimataXP_023076213.128.914163.766.1
Jaculus jaculus Lesser Egyptian Jerboa Rodentia XP_045011648.18717646.455.8
Myodes glareolus Bank Vole RodentiaXP_048287479.18719742.151.8
Mus musculus House Mouse RodentiaNP_083581.18719741.449.8
Camelus ferus Wild Bactrian Camel Artiodactyla XP_032318023.1941745464.4
Equus caballus Domestic Horse Perissodactyla XP_023482143.19417845.756
Monodon monoceros Narwhal ArtiodactylaXP_029075207.19418142.950.5
Physeter catodon Sperm Whale ArtiodactylaXP_023984368.19414840.848.4
Prionailurus bengalensis Leopard Cat Carnivora XP_043458511.1941795260.9
Ursus arctos Brown Bear CarnivoraXP_026358671.19418451.661.4
Eumetopias jubatus Steller Sea Lion CarnivoraXP_027974622.19418447.358.1
Rousettus aegyptiacus Egyptian Fruit Bat ChiropteraXP_016017694.29417551.462.7
Rhinolophus ferrumenquinum Greater Horseshoe Bat ChiropteraXP_032951343.19419140.251.5
Pteropus vampyrus Large Flying Fox ChiropteraXP_023377960.1942094050.5
Choloepus didactylus Southern Two-Toed Sloth Pilosa XP_037668100.19918847.957.4
Gracilinanus agilis Agile Gracile Mouse Opossum Didelphimorphia XP_044517537.116016937.949.7
Dromiciops gliroides Monito del Monte Microbiotheria XP_043845608.11601703750.8
Sarcophilus harrisii Tasmanian Devil Dasyuromorphia XP_031809718.116016036.450

Clinical significance

In a study of 28 breast cancer patients, missense mutations in c20orf144 were found in approximately 33% of patients, suggesting a potential role for c20orf144 in the development of breast cancer. [19] Furthermore, c20orf144 is listed in primary renal proximal tubule epithelial cells as a top candidate hit in an siRNA screen, which silences targeted genes. [20] The silencing of c20orf144 in cells exposed to Shiga toxin resulted in metabolic activity that was greater than or equal to 90% of that in a typical cell.

Related Research Articles

CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).

Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">ERICH2</span> Protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.

<span class="mw-page-title-main">C8orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

<span class="mw-page-title-main">C17orf98</span> Protein-coding gene in the species Homo sapiens

C17orf98 is a protein which in humans is coded by the gene c17orf98. The protein is derived from Homo sapiens chromosome 17. The C17orf98 gene consists of a 6,302 base sequence. Its mRNA has three exons and no alternative splice sites. The protein has 154 amino acids, with no abnormal amino acid levels. C17orf98 has a domain of unknown function (DUF4542) and is 17.6kDa in weight. C17orf98 does not belong to any other families nor does it have any isoforms. The protein has orthologs with high percent similarity in mammals and reptiles. The protein has additional distantly related orthologs across the metazoan kingdom, culminating with the sponge family.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">LOC101059915</span> Protein-coding gene in the species Homo sapiens

LOC101059915 is a protein, which in humans is encoded by the LOC101059915 gene. It is located on the X chromosome and has restricted expression in the testis.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

<span class="mw-page-title-main">SMIM11</span>

Small integral membrane protein 11 is a protein which in humans is encoded by the SMIM11 gene.

<span class="mw-page-title-main">C1orf122</span> Protein-coding gene in the species Homo sapiens

C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.

<span class="mw-page-title-main">C12orf24</span> Protein-coding gene in humans

C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.

Chromosome 4 open reading frame 54 is a protein that in humans is coded by the c4orf54 gene. This gene is also known as FOPV and LOC285556. This protein is mostly expressed in the nucleus of muscle cells. Orthologs are found in vertebrates but not invertebrates.

Chromosome 4 open reading frame 50 is a protein that in humans is encoded by the C4orf50 gene. The protein localizes in the nucleus. C4orf50 has orthologs in vertebrates but not invertebrates

<span class="mw-page-title-main">C13orf42</span> C13orf42 gene page

C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000149609 Ensembl, May 2017
  2. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. 1 2 "C20orf144 chromosome 20 open reading frame 144 [Homo sapiens (human)] - Gene - NCBI" . Retrieved 19 September 2022.
  4. 1 2 3 "RecName: Full=Uncharacterized protein C20orf144; AltName: Full=Bcl-2-like protein from testis; Short=Bclt - Gene - NCBI" . Retrieved 19 September 2022.
  5. "C20orf144 Gene - Chromosome 20 Open Reading Frame 144". GeneCards. Retrieved 29 September 2022.
  6. "Gene C20orf144". AceView. Retrieved 29 September 2022.
  7. "Entry on C20orf144". UCSC Genome Browser. Retrieved 5 December 2022.
  8. "Homo sapiens chromosome 20 open reading frame 144 (C20orf144), mRNA - Gene - NCBI". 24 June 2021. Retrieved 19 September 2022.
  9. "Human Protein Atlas C20orf144 entry". The Human Protein Atlas. Retrieved 8 December 2022.
  10. "NCBI Entry on Bclt". National Center for Biotechnology Information. Retrieved 15 December 2022.
  11. "Compute pI/MW". Expasy Swiss Bioinformatics Resource Portal. Retrieved 15 December 2022.
  12. 1 2 "AlphaFold Protein Structure Database entry on Human C20orf144". alphafold.com. Retrieved 2022-12-15.
  13. 1 2 "PSORT II Prediction". PSORT WWW Server. Retrieved 7 December 2022.
  14. "Myristoylator". Expasy Swiss Bioinformatics Resource Portal. Retrieved 15 December 2022.
  15. 1 2 3 4 "Phosphorylation Sites in Eukaryotic Proteins". NetPhos-3.1. DTU Health Tech. Retrieved 15 December 2022.
  16. "O-(beta)-GlcNAc glycosylation and Yin-Yang sites". YinOYang-1.2. DTU Health Tech. Retrieved 15 December 2022.
  17. "C20orf144 Entry". Protein BLAST. Retrieved 23 October 2022.
  18. Kumar S, Suleski M, Craig JM, Kasprowicz AE, Sanderford M, Li M, et al. (August 2022). "TimeTree 5: An Expanded Resource for Species Divergence Times". Molecular Biology and Evolution. 39 (8). doi:10.1093/molbev/msac174. PMC   9400175 . PMID   35932227.
  19. Aravind Kumar M, Singh V, Naushad SM, Shanker U, Lakshmi Narasu M (May 2018). "Microarray-based SNP genotyping to identify genetic risk factors of triple-negative breast cancer (TNBC) in South Indian population". Molecular and Cellular Biochemistry. 442 (1–2): 1–10. doi:10.1007/s11010-017-3187-6. PMID   28918577. S2CID   254795241.
  20. MacMaster K. "Characterization of cellular pathways and potency of Shiga toxin on endothelial cells" (PDF). University of Cincinnati. Retrieved 15 December 2022.