Proline-rich protein 23C is a protein that in humans is encoded by the proline-rich 23C (PRR23C) gene.
PRR23C Homo sapiens is located on the long arm of chromosome 3, (3q23) on the antisense strand. [1] When pertaining to the mRNA of PRR23C Homo sapiens, it is 2,791 bp in length. [2] PRR23C Homo sapiens has one exon covering the entire length of mRNA (1-2,791 bp). [3] PRR23C Homo sapiens has a clone name of FLJ46210. [4]
PRR23C Homo sapiens is expressed in the testis. [5] Ottolini et al. (2014) discussed the PRR23 family to which they revealed that through RNA sequencing data, that PRR23A, PRR23B and PRR23C are testis-specific genes. [6] Ottolini et al. (2014) believes that this family may be a crucial part for the male reproductive system given their RNA-seq data findings. [7]
Proline-rich protein 23C Homo sapiens is 262 amino acids long [8] with a calculated molecular weight of 27,674 Da. [9] Proline-rich protein 23C Homo sapiens has a domain of unknown function (DUF2476) that spans the majority of the protein (1-259 aa) which is a conserved domain. [10] DUF2476 belongs to pfam10630 which is a part of superfamily c|11241. [11] DUF2476 is a family of proteins that are rich in proline residues and have unknown function. [12] Proline-rich protein 23C is the preferred name but other aliases include proline-rich protein 23A. [13]
Proline is predicted to be the most abundant amino acid in proline-rich protein 23C Homo sapiens. [14] In comparison to the prevalence of amino acids in other human proteins, it is predicted that proline-rich protein 23C Homo sapiens has a higher abundance of proline along with very low abundances of asparagine, threonine, and lysine. [15] Orthologs for this protein are predicted to also have a high abundance of proline. [16]
The basal isoelectric point for PRR23C Homo sapiens was 4.48 (pH) according to phoshosite.org. [17]
Proline-rich protein 23C is predicted to localize to the nucleus for the human protein and its orthologs. [18] There are predicted nuclear localization signals seen in both the human proline-rich protein 23C and its orthologs. [19]
PRR23C Homo sapiens is strictly conserved in mammals. [20] The table below lists mammalian orthologs for PRR23C Homo sapiens . [21]
Genus & Species | Common Name | Accession Number [22] | Seq. Length [23] | Seq. Identity [24] | Seq. Similarity [25] |
---|---|---|---|---|---|
Homo sapiens | Human | NP_001128129.1 | 262 aa | 100% | 100% |
Pan troglodytes | Chimpanzee | XP_003310067.1 | 263 aa | 98.50% | 98% |
Gorilla gorilla gorilla | Western lowland gorilla | XP_004037787.1 | 263 aa | 96.60% | 98% |
Pongo pygmaeus | Bornean orangutan | XP_002814147.1 | 263 aa | 94.70% | 95% |
Nomascus leucogenys | Northern white-cheeked gibbon | XP_003265349.1 | 262 aa | 94.70% | 95% |
Chlorocebus sabaeus | Green monkey | XP_008007123.1 | 263 aa | 92% | 94% |
Rhinopithecus roxellana | Golden snub-nosed monkey | XP_010365056.1 | 263 aa | 91.60% | 93% |
Papio anubis | Olive baboon | XP_003895087.1 | 263 aa | 94.70% | 93% |
Callithrix jacchus | Common marmoset | XP_002759594.1 | 248 aa | 72.90% | 77% |
Otolemur garnettii | Northern greater galago | XP_003789481.1 | 267 aa | 66.50% | 71% |
Ceratotherium simum simum | White rhinoceros | XP_004419371.1 | 263 aa | 63.70% | 75% |
Pteropus alecto | Black flying fox | XP_006907344.1 | 268 aa | 50.90% | 68% |
Myotis lucifugus | Little brown bat | XP_006083994.1 | 257 aa | 61.20% | 64% |
Bubalus bubalis | Water buffalo | XP_006070045.1 | 258 aa | 55.40% | 62% |
Bison bison bison | American bison | XP_010856601.1 | 258 aa | 51.90% | 61% |
Bos mutus | Yak | XP_005909007.1 | 258 aa | 51.50% | 61% |
Lipotes vexillifer | Baiji | XP_007458979.1 | 260 aa | 51.50% | 64% |
Tursiops truncatus | Common bottlenose dolphin | XP_004330081.1 | 261 aa | 54.50% | 61% |
Balaenoptera acutorostrata scammoni | Minke whale | XP_007170446.1 | 255 aa | 53.60% | 61% |
Physeter catodon | Sperm whale | XP_007117538.1 | 260 aa | 53.80% | 64% |
Peromyscus maniculatus bairdii | Prairie deer mouse | XP_006975251.1 | 266 aa | 50.90% | 60% |
There were two paralogs found for PRR23C Homo sapiens: PRR23B and PRR23A. Both have similar sequence identities with PRR23B having 86% identity and PRR23A having 85% identity. [26]
TSR3, or TSR3 Ribosome Maturation Factor, is a hypothetical human protein found on chromosome 16. Its protein is 312 amino acids long and its cDNA has 1214 base pairs. It was previously designated C16orf42.
C2CD4D, or C2 calcium-dependent domain-containing protein 4D is a protein product of the human genome. The gene that codes for this protein is found on chromosome 1, from 150,076,963 to 150,079,657. The gene contains 2 exons and encodes 353 amino acids. Synonyms for C2CD4D are "FAM148D" and NP_001129475. C2CD4D contains a conserved metal binding domain that is a known as Protein kinase C conserved region 2, subgroup 1. This motif is known to be a member of the C2 superfamily, which is present in phospholipases, protein kinases C, and synaptotagmins. The amino acid sequence of C2CD4D can be accessed at Prior to any post translational modification, C2CD4D has a molecular weight of 37.6 kdal. Although scientists have not yet determined where C2CD4D functions within the cell, C2CD4D has a predicted isoelectric point of 11.636 which severely limits the places in which it can be effective. In addition, C2CD4D does not contain any predicted transmembrane domains or any predicted signal peptides.
METTL26, previously designated C16orf13, is a protein-coding gene for Methyltransferase Like 26, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.
Proline-rich 12 (PRR12) is a protein of unknown function encoded by the gene PRR12.
Family with Sequence Similarity 78-Member B (FAM78B) is a protein of unknown function in humans that is encoded by the FAM78B gene (1q24.1). It has orthologous genes and predicted proteins in vertebrates and several invertebrates, but not in arthropods. It has a nuclear localization signal in the protein sequence and a miRNA target region in the mRNA sequence.
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. It contains multiple WD40 repeat domains and one domain of unknown function. This protein is conserved all the way back to invertebrates. Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis.
C6orf222 is a protein that in humans is encoded by the C6orf222 gene (6p21.31). C6orf222 is conserved in mammals, birds and reptiles with the most distant ortholog being the green sea turtle, Chelonia mydas. The C6orf222 protein contains one mammalian conserved domain: DUF3293. The protein is also predicted to contain a BH3 domain, which has predicted conservation in distant orthologs from the clade Aves.
CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).
Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.
Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
C14orf93 is a protein that is encoded in humans by the C14orf93 gene. It is a globular protein with a conserved C-terminus that is localized to the nucleus. While expressed relatively highly in all tissues except nervous tissue, it is expressed particularly highly in T cells and other immune tissues.
Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.
Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Proline-Rich Protein 23A is a protein that is encoded by the Proline-Rich 23A (PRR23A) gene.
{{cite journal}}
: Cite journal requires |journal=
(help){{cite journal}}
: Cite journal requires |journal=
(help)