C10orf53 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C10orf53 , chromosome 10 open reading frame 53 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1914335 HomoloGene: 28053 GeneCards: C10orf53 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
C10orf53 is a protein that in humans is encoded by the C10orf53 gene. [6] The gene is located on the positive strand of the DNA and is 30,611 nucleotides in length. [8] The protein is 157 amino acids and the gene has 3 exons. [6] C10orf53 orthologs are found in mammals, birds, reptiles, amphibians, fish, and invertebrates. [9] It is primarily expressed in the testes and at very low levels in the cerebellum, liver, placenta, and trachea. [6]
Chromosome 10 open reading frame 53 (10orf53), also known as uncharacterized protein family 0728 (UPF0728), in humans, is encoded by Chromosome 10 (10q11.23), [6] spanning 30,611 nucleotides. [8] The gene is located on the positive strand [8] with 3 identified exons. [6]
Isoform | Accession # [6] | Length (amino acids) [6] | Length (nucleotides) [6] | Exons translated [6] | Additional comments [6] |
a | NM_182554.4 | 157 | 30,611 | 3 | Less common isoform |
b | NM_001042427.3 | 93 | 17,882 | 3 | More common isoform – alternative 3’ exon and distinct C-terminus |
The table outlines the two identified isoforms of C10orf53. The most common isoform has 93 amino acids which is a shorter amino acid sequence due to an alternative 3’ terminal exon and distinct C-terminus. [6] Isoform A has a higher frequency of analysis due to it being the longer isoform. [8] The analysis in this article is focused on isoform A.
The molecular weight predicted was 17.6 kDa. [10] The isoelectric point for C10orf53 is estimated to be 6.36 pl [10]
MPKNAVVILRYGPYSAAGLPVEHHTFRLQGLQAVLAIDGHEVILEKIEDWNVVELMVNEEVIFHCNI 67
CCCCCSSSSSSCCCHHCCSSSSSCHHHHHHHHHHHHHCCCSSSSSSSCCCCSSSSSSCCCSSSSSCC 67
KDLEFGKLTPSSDKRTTSSSRLTFHQLSSPCRMKVSPLQQFPQKTQDLTCTVLAQIGSCIHFQTNLC 134
CCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHCCCHHHHCCCCCCCSSSSSHHHHCCSSSSSCCCC 134
DLGWPGLDHMLISGLEKRGTQPY 157
CCCCCCCHHHHHHHHHHCCCCCC 157
The secondary structure above illustrates the estimated secondary structure for Isoform A of C10orf53. [11] The C that are italicized indicate that the amino acid is located within a coil, the bolded S is referring to the amino acid being in a strand, and the underlined H shows the amino acid is in a helix. [11] The strand and helix structures can then be translated into the predicted tertiary structure done through I-TASSER.
The predicted tertiary structure of C10orf53 is shown. The tertiary structure contains the seven helix and 8 strand structures predicted through I-TASSER. [11]
C10orf53 is primarily expressed in the testes, but also has very low levels of expression in the cerebellum, liver, placenta, and trachea. [6] It is tissue-specific to the testes due to the low expression in other tissues compared to the testes. C10orf53 majorly is secreted in the cytoplasm of cells and has moderate levels in the nucleus and mitochondria. [12]
C10orf53 was found to have two phosphorylation sites, two SUMOylation sites, and one lysine acetylation site. [5] All of these regions are shown in the conceptual translation of C10orf53.
C10orf53 is predicted to have a slower evolution compared to the gene, fibrinogen alpha, but it also has a quicker evolution compared to cytochrome c. Fibrinogen alpha is considered as a gene that had evolved rather quickly when examining the gene in different organisms. When two organisms diverged into different taxa, the gene went through alterations that made it significantly different from other organisms, causing it to have a quick rate of evolution. In contrast, Cytochrome C is relatively conserved throughout different organisms, which shows that it has a slow rate of evolution. C10orf53 has a rate of evolution that is smaller than fibrinogen alpha, but larger than cytochrome c.
A group of distantly and closely related orthologs were chosen and categorized by their date of divergence from humans. [13] The percent similarity and percent identity in relation to humans showed the predicted conservation between C10orf53 in humans compared to their orthologs.
Protein [14] | Protein Name [14] | Identification [14] | Function [15] |
PIGR | Polymeric Immunoglobulin Receptor | Affinity chromatography | Aids in the movement of polymeric IgA and IgM across mucosal epithelial cells |
DSCC1 | DNA Replication And Sister Chromatid Cohesion 1 | Affinity chromatography | Loads PCNA onto primed templates to manipulate replication forks |
UXT | Ubiquitously Expressed Prefoldin Like Chaperone | Affinity chromatography | Regulates androgen receptor transcription |
ZG16B | Zymogen Granule Protein 16B | Affinity chromatography | Located in the extracellular exosome and enables carbohydrate binding activity |
SPECC1L | Sperm Antigen With Calponin Homology And Coiled-Coil Domains 1 Like | Affinity chromatography | Helps with spindle organization and cytokinesis |
MNAT1 | CDK Activating Kinase | Affinity chromatography | Creates the CDK-activating kinase enzyme complex |
SPTB | Spectrin Beta, Erythrocytic | Affinity chromatography | Composes the cytoskeletal network in erythrocyte membranes |
The table contains all predicted proteins that were found to interact with C10orf53. [14] It includes the protein acronym and name, the means that identification occurred, and the function of each protein. Due to the related function of some of the proteins, this provides evidence that these interactions coincide with the predicted location.
A study was conducted that compared the relative spermatogenesis in humans to the relative expression of RNAs correlated to teratozoospermia. In non-afflicted humans, there is a relatively high expression of C10orf53 across the RNAs tested. However, when a human had teratozoospermia, those levels dropped to almost zero. [16] Another study examined the expression of C10orf53 in spermatogenesis and testis development in mice during development. C10orf53 is only highly expressed from day 30-56 of mice development, with the expression decreasing slightly on each five-day period until 56 days were reached. [17] The final study looked at research done within the past five years has correlated African American prostate cancer patients with the presence of C10orf535. When examining the exosome found in Caucasian populations associated with prostate cancer (PCC) against the African American exosome (PAA), C10orf53 was unique only to PAA. [18]
GPATCH11 is a protein that in humans is encoded by the G-patch domain containing protein 11 gene. The gene has four transcript variants encoding two functional protein isoforms and is expressed in most human tissues. The protein has been found to interact with several other proteins, including two from a splicing pathway. In addition, GPATCH11 has orthologs in all taxa of the eukarya domain.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
FAM71F2 or Family with Sequence Similarity 71 member F2 is a protein that in humans is encoded by the Family with Sequence Similarity 71 member F2 gene. This gene is highly active in the reproductive tissues, specifically the testis, and may serve as a potential biomarker for determining metastatic testicular cancer.
Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.
Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.
C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
Transmembrane protein 101 (TMEM101) is a protein that in humans is encoded by the TMEM101 gene. The TMEM101 protein has been demonstrated to activate the NF-κB signaling pathway. High levels of expression of TMEM101 have been linked to breast cancer.
Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.
C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1. This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer.
IGSF6 is a protein that in humans is encoded by the IGSF6 gene.
Human uncharacterized protein CXorf65 is encoded by the gene CXorf65, which is located on the minus strand of chromosome X. Its transcript is 834 nucleotides long and consists of 6 exons. The translated protein is 183 amino acids in length. with a molecular weight of 21.3 kDa
KIAA1143 is an uncharacterized protein in humans that is encoded by the KIAA1143 gene. it may play a role in cell growth mechanisms and regulation/creation of cytoskeletal structure. This gene is located on chromosome 3 on the minus strand
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.
Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.
Transmembrane protein 61 (TMEM61) is a protein that is encoded by the TMEM61 gene in humans. TMEM61 is located on the first chromosome in humans.TMEM61 is highly expressed in the intestinal regions predominantly the kidney, adrenal gland and pituitary tissues. The protein, unlike other transmembrane protein in the region does not promote cancer growth. However, the TMEM61 protein when inhibited by secondary factors restricts normal activity in the kidney. The human protein shares many Orthologs and has been prevalent on Earth for millions of years.
NKAPD1 is a protein, which in humans, is encoded by the gene NKAPD1. This protein is also commonly referred to as C11ORF57.
Program to predict the subcellular localization sites of proteins from their amino acid sequences.