C10orf53

C10orf53
Identifiers
Aliases	C10orf53 , chromosome 10 open reading frame 53
External IDs	MGI: 1914335 HomoloGene: 28053 GeneCards: C10orf53
Gene location (Human)
Chr.	Chromosome 10 (human)
End	49,710,261 bp
Gene location (Mouse)
Chr.	Chromosome 14 (mouse)
End	32,110,348 bp
RNA expression pattern
	Top expressed in
	sperm; ; right uterine tube; ; face; ; hypothalamus; ; hippocampal formation; ; temporal lobe; ; renal cortex; ; hippocampus proper; ; human musculoskeletal system; ; caudate nucleus;
	Top expressed in
	spermatid; ; testicle; ; ovary; ; hypothalamus; ; spermatocyte; ; pharynx; ; nasopharynx; ; hippocampus proper; ; cerebellum; ; cerebellar cortex;
	More reference expression data
	n/a
Orthologs
	282966
	67085
	ENSG00000178645
	ENSMUSG00000072473
	Q8N6V4
	Q3KNL4
	NM_182554 ; NM_001042427
	NM_001034037
	NP_001035892 ; NP_872360
	NP_001029209
	Wikidata
View/Edit Human	View/Edit Mouse

Last updated March 31, 2024

C10orf53 is a protein that in humans is encoded by the C10orf53 gene.^[6] The gene is located on the positive strand of the DNA and is 30,611 nucleotides in length.^[8] The protein is 157 amino acids and the gene has 3 exons.^[6] C10orf53 orthologs are found in mammals, birds, reptiles, amphibians, fish, and invertebrates.^[9] It is primarily expressed in the testes and at very low levels in the cerebellum, liver, placenta, and trachea.^[6]

Gene

Chromosome 10 open reading frame 53 (10orf53), also known as uncharacterized protein family 0728 (UPF0728), in humans, is encoded by Chromosome 10 (10q11.23),^[6] spanning 30,611 nucleotides.^[8] The gene is located on the positive strand^[8] with 3 identified exons.^[6]

Transcript

Isoform	Accession #^[6]	Length (amino acids)^[6]	Length (nucleotides)^[6]	Exons translated^[6]	Additional comments^[6]
a	NM_182554.4 NP_872360.2	157	30,611	3	Less common isoform
b	NM_001042427.3 NP_001035892.1	93	17,882	3	More common isoform – alternative 3’ exon and distinct C-terminus

The table outlines the two identified isoforms of C10orf53. The most common isoform has 93 amino acids which is a shorter amino acid sequence due to an alternative 3’ terminal exon and distinct C-terminus.^[6] Isoform A has a higher frequency of analysis due to it being the longer isoform.^[8] The analysis in this article is focused on isoform A.

Structure

The molecular weight predicted was 17.6 kDa.^[10] The isoelectric point for C10orf53 is estimated to be 6.36 pl^[10]

Secondary

MPKNAVVILRYGPYSAAGLPVEHHTFRLQGLQAVLAIDGHEVILEKIEDWNVVELMVNEEVIFHCNI         67

CCCCCSSSSSSCCCHHCCSSSSSCHHHHHHHHHHHHHCCCSSSSSSSCCCCSSSSSSCCCSSSSSCC         67

KDLEFGKLTPSSDKRTTSSSRLTFHQLSSPCRMKVSPLQQFPQKTQDLTCTVLAQIGSCIHFQTNLC         134

CCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHCCCHHHHCCCCCCCSSSSSHHHHCCSSSSSCCCC         134

DLGWPGLDHMLISGLEKRGTQPY                                                     157

CCCCCCCHHHHHHHHHHCCCCCC                                                     157

The secondary structure above illustrates the estimated secondary structure for Isoform A of C10orf53.^[11] The C that are italicized indicate that the amino acid is located within a coil, the bolded S is referring to the amino acid being in a strand, and the underlined H shows the amino acid is in a helix.^[11] The strand and helix structures can then be translated into the predicted tertiary structure done through I-TASSER.

Tertiary

The predicted tertiary structure of C10orf53 is shown. The tertiary structure contains the seven helix and 8 strand structures predicted through I-TASSER.^[11]

Regulation

Gene

C10orf53 is primarily expressed in the testes, but also has very low levels of expression in the cerebellum, liver, placenta, and trachea.^[6] It is tissue-specific to the testes due to the low expression in other tissues compared to the testes. C10orf53 majorly is secreted in the cytoplasm of cells and has moderate levels in the nucleus and mitochondria.^[12]

Protein

C10orf53 was found to have two phosphorylation sites, two SUMOylation sites, and one lysine acetylation site.^[5] All of these regions are shown in the conceptual translation of C10orf53.

Evolution

C10orf53 is predicted to have a slower evolution compared to the gene, fibrinogen alpha, but it also has a quicker evolution compared to cytochrome c. Fibrinogen alpha is considered as a gene that had evolved rather quickly when examining the gene in different organisms. When two organisms diverged into different taxa, the gene went through alterations that made it significantly different from other organisms, causing it to have a quick rate of evolution. In contrast, Cytochrome C is relatively conserved throughout different organisms, which shows that it has a slow rate of evolution. C10orf53 has a rate of evolution that is smaller than fibrinogen alpha, but larger than cytochrome c.

Homology

A group of distantly and closely related orthologs were chosen and categorized by their date of divergence from humans.^[13] The percent similarity and percent identity in relation to humans showed the predicted conservation between C10orf53 in humans compared to their orthologs.

Interacting Proteins

Protein^[14]	Protein Name^[14]	Identification^[14]	Function^[15]
PIGR	Polymeric Immunoglobulin Receptor	Affinity chromatography	Aids in the movement of polymeric IgA and IgM across mucosal epithelial cells
DSCC1	DNA Replication And Sister Chromatid Cohesion 1	Affinity chromatography	Loads PCNA onto primed templates to manipulate replication forks
UXT	Ubiquitously Expressed Prefoldin Like Chaperone	Affinity chromatography	Regulates androgen receptor transcription
ZG16B	Zymogen Granule Protein 16B	Affinity chromatography	Located in the extracellular exosome and enables carbohydrate binding activity
SPECC1L	Sperm Antigen With Calponin Homology And Coiled-Coil Domains 1 Like	Affinity chromatography	Helps with spindle organization and cytokinesis
MNAT1	CDK Activating Kinase	Affinity chromatography	Creates the CDK-activating kinase enzyme complex
SPTB	Spectrin Beta, Erythrocytic	Affinity chromatography	Composes the cytoskeletal network in erythrocyte membranes

The table contains all predicted proteins that were found to interact with C10orf53.^[14] It includes the protein acronym and name, the means that identification occurred, and the function of each protein. Due to the related function of some of the proteins, this provides evidence that these interactions coincide with the predicted location.

Clinical Significance

A study was conducted that compared the relative spermatogenesis in humans to the relative expression of RNAs correlated to teratozoospermia. In non-afflicted humans, there is a relatively high expression of C10orf53 across the RNAs tested. However, when a human had teratozoospermia, those levels dropped to almost zero.^[16] Another study examined the expression of C10orf53 in spermatogenesis and testis development in mice during development. C10orf53 is only highly expressed from day 30-56 of mice development, with the expression decreasing slightly on each five-day period until 56 days were reached.^[17] The final study looked at research done within the past five years has correlated African American prostate cancer patients with the presence of C10orf535. When examining the exosome found in Caucasian populations associated with prostate cancer (PCC) against the African American exosome (PAA), C10orf53 was unique only to PAA.^[18]

Related Research Articles

GPATCH11 is a protein that in humans is encoded by the G-patch domain containing protein 11 gene. The gene has four transcript variants encoding two functional protein isoforms and is expressed in most human tissues. The protein has been found to interact with several other proteins, including two from a splicing pathway. In addition, GPATCH11 has orthologs in all taxa of the eukarya domain.

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

FAM71F2 or Family with Sequence Similarity 71 member F2 is a protein that in humans is encoded by the Family with Sequence Similarity 71 member F2 gene. This gene is highly active in the reproductive tissues, specifically the testis, and may serve as a potential biomarker for determining metastatic testicular cancer.

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

Transmembrane protein 101 (TMEM101) is a protein that in humans is encoded by the TMEM101 gene. The TMEM101 protein has been demonstrated to activate the NF-κB signaling pathway. High levels of expression of TMEM101 have been linked to breast cancer.

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1. This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer.

IGSF6 is a protein that in humans is encoded by the IGSF6 gene.

Human uncharacterized protein CXorf65 is encoded by the gene CXorf65, which is located on the minus strand of chromosome X. Its transcript is 834 nucleotides long and consists of 6 exons. The translated protein is 183 amino acids in length. with a molecular weight of 21.3 kDa

KIAA1143 is an uncharacterized protein in humans that is encoded by the KIAA1143 gene. it may play a role in cell growth mechanisms and regulation/creation of cytoskeletal structure. This gene is located on chromosome 3 on the minus strand

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.

Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.

Transmembrane protein 61 (TMEM61) is a protein that is encoded by the TMEM61 gene in humans. TMEM61 is located on the first chromosome in humans.TMEM61 is highly expressed in the intestinal regions predominantly the kidney, adrenal gland and pituitary tissues. The protein, unlike other transmembrane protein in the region does not promote cancer growth. However, the TMEM61 protein when inhibited by secondary factors restricts normal activity in the kidney. The human protein shares many Orthologs and has been prevalent on Earth for millions of years.

NKAPD1 is a protein, which in humans, is encoded by the gene NKAPD1. This protein is also commonly referred to as C11ORF57.

References

1 2 3 GRCh38: Ensembl release 89: ENSG00000178645 - Ensembl, May 2017
1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000072473 - Ensembl, May 2017
↑ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
↑ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
1 2 "Kinase-specific Phosphorylation Site Prediction". Group-based Prediction System (GPS) 5.0. Wuhan, Hubei, China: Huazhong University of Science and Technology. Retrieved 2022-12-16.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 "C10orf53 chromosome 10 open reading frame 53 [Homo sapiens (human)] - Gene". National Center for Biotechnology Information (NCBI). U.S. National Library of Medicine. Retrieved 2022-12-16.
↑ "ELM - unknown". Eukaryotic Linear Motif (ELM). Retrieved 2022-12-17.
1 2 3 4 "C10orf53 Gene". GeneCards. Retrieved 2022-12-16.
1 2 3 4 5 "BLAST: Basic Local Alignment Search Tool". National Center for Biotechnology Information (NCBI). U.S. Library of Medicine. Retrieved 2022-12-16.
1 2 "Statistical Analysis of Protein Sequences (SAPS)". European Bioinformatics Institute (EMBL-EBI). European Molecular Biology Laboratory (EMBL). Retrieved 2022-12-16.
1 2 3 4 "I-TASSER server for protein structure and function prediction". Yang Zhang's Research Group. University of Michigan. Retrieved 2022-12-16.
↑ "PSORT II Prediction". psort.hgc.jp. Retrieved 2022-12-16. Program to predict the subcellular localization sites of proteins from their amino acid sequences.
1 2 "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2022-12-17.
1 2 3 4 "PSICQUIC View". www.ebi.ac.uk. Retrieved 2022-12-16.
↑ "GeneCards - Human Genes | Gene Database | Gene Search". www.genecards.org. Retrieved 2022-12-16.
↑ Platts AE, Dix DJ, Chemes HE, Thompson KE, Goodrich R, Rockett JC, et al. (April 2007). "Success and failure in human spermatogenesis as revealed by teratozoospermic RNAs". Human Molecular Genetics. 16 (7): 763–773. doi: 10.1093/hmg/ddm012 . PMID 17327269.
↑ Shima JE, McLean DJ, McCarrey JR, Griswold MD (July 2004). "The murine testicular transcriptome: characterizing gene expression in the testis during the progression of spermatogenesis". Biology of Reproduction. 71 (1): 319–330. doi: 10.1095/biolreprod.103.026880 . PMID 15028632. S2CID 16839090.
↑ Panigrahi GK, Praharaj PP, Kittaka H, Mridha AR, Black OM, Singh R, et al. (March 2019). "Exosome proteomic analyses identify inflammatory phenotype and novel biomarkers in African American prostate cancer patients". Cancer Medicine. 8 (3): 1110–1123. doi:10.1002/cam4.1885. PMC 6434210 . PMID 30623593.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[refGRCh38Ensembl-1] 1 2 3 GRCh38: Ensembl release 89: ENSG00000178645 - Ensembl, May 2017

[refGRCm38Ensembl-2] 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000072473 - Ensembl, May 2017

[3] "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.

[4] "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.

[:6-5] 1 2 "Kinase-specific Phosphorylation Site Prediction". Group-based Prediction System (GPS) 5.0. Wuhan, Hubei, China: Huazhong University of Science and Technology. Retrieved 2022-12-16.

[NCBI_C10orf53-6] 1 2 3 4 5 6 7 8 9 10 11 12 13 14 "C10orf53 chromosome 10 open reading frame 53 [Homo sapiens (human)] - Gene". National Center for Biotechnology Information (NCBI). U.S. National Library of Medicine. Retrieved 2022-12-16.

[7] "ELM - unknown". Eukaryotic Linear Motif (ELM). Retrieved 2022-12-17.

[GeneCards_C10orf53-8] 1 2 3 4 "C10orf53 Gene". GeneCards. Retrieved 2022-12-16.

[:2-9] 1 2 3 4 5 "BLAST: Basic Local Alignment Search Tool". National Center for Biotechnology Information (NCBI). U.S. Library of Medicine. Retrieved 2022-12-16.

[:3-10] 1 2 "Statistical Analysis of Protein Sequences (SAPS)". European Bioinformatics Institute (EMBL-EBI). European Molecular Biology Laboratory (EMBL). Retrieved 2022-12-16.

[:4-11] 1 2 3 4 "I-TASSER server for protein structure and function prediction". Yang Zhang's Research Group. University of Michigan. Retrieved 2022-12-16.

[12] "PSORT II Prediction". psort.hgc.jp. Retrieved 2022-12-16. Program to predict the subcellular localization sites of proteins from their amino acid sequences.

[:7-13] 1 2 "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2022-12-17.

[:5-14] 1 2 3 4 "PSICQUIC View". www.ebi.ac.uk. Retrieved 2022-12-16.

[15] "GeneCards - Human Genes | Gene Database | Gene Search". www.genecards.org. Retrieved 2022-12-16.

[16] Platts AE, Dix DJ, Chemes HE, Thompson KE, Goodrich R, Rockett JC, et al. (April 2007). "Success and failure in human spermatogenesis as revealed by teratozoospermic RNAs". Human Molecular Genetics. 16 (7): 763–773. doi: 10.1093/hmg/ddm012 . PMID 17327269.

[17] Shima JE, McLean DJ, McCarrey JR, Griswold MD (July 2004). "The murine testicular transcriptome: characterizing gene expression in the testis during the progression of spermatogenesis". Biology of Reproduction. 71 (1): 319–330. doi: 10.1095/biolreprod.103.026880 . PMID 15028632. S2CID 16839090.

[18] Panigrahi GK, Praharaj PP, Kittaka H, Mridha AR, Black OM, Singh R, et al. (March 2019). "Exosome proteomic analyses identify inflammatory phenotype and novel biomarkers in African American prostate cancer patients". Cancer Medicine. 8 (3): 1110–1123. doi:10.1002/cam4.1885. PMC 6434210 . PMID 30623593.

[1]

[2]

[3]

[4]

[6]

[8]

[9]

[10]

[11]

[12]

[5]

[13]

[14]

[15]

[16]

[17]

[18]

C10orf53

Contents

Gene

Transcript

Structure

Secondary

Tertiary

Regulation

Gene

Protein

Evolution

Homology

Interacting Proteins

Clinical Significance

Related Research Articles

References