Proser1 | |||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||
Aliases | 2810046L04Rik9330161F11proline and serine rich 1 | ||||||||||||||||||||||||
External IDs | HomoloGene: 13463 GeneCards: | ||||||||||||||||||||||||
| |||||||||||||||||||||||||
| |||||||||||||||||||||||||
Orthologs | |||||||||||||||||||||||||
Species | Human | Mouse | |||||||||||||||||||||||
Entrez |
| ||||||||||||||||||||||||
Ensembl |
| ||||||||||||||||||||||||
UniProt |
| ||||||||||||||||||||||||
RefSeq (mRNA) |
| ||||||||||||||||||||||||
RefSeq (protein) |
| ||||||||||||||||||||||||
Location (UCSC) | Chr 3: 53.46 – 53.48 Mb | n/a | |||||||||||||||||||||||
PubMed search | [1] | n/a | |||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||
|
PROSER1 is a protein that in humans is encoded by the PROSER1 gene. [2]
Proteins are large biomolecules, or macromolecules, consisting of one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, responding to stimuli, providing structure to cells and organisms, and transporting molecules from one location to another. Proteins differ from one another primarily in their sequence of amino acids, which is dictated by the nucleotide sequence of their genes, and which usually results in protein folding into a specific three-dimensional structure that determines its activity.
In biology, a gene is a sequence of nucleotides in DNA or RNA that codes for a molecule that has a function. During gene expression, the DNA is first copied into RNA. The RNA can be directly functional or be the intermediate template for a protein that performs a function. The transmission of genes to an organism's offspring is the basis of the inheritance of phenotypic trait. These genes make up different DNA sequences called genotypes. Genotypes along with environmental and developmental factors determine what the phenotypes will be. Most biological traits are under the influence of polygenes as well as gene–environment interactions. Some genetic traits are instantly visible, such as eye color or number of limbs, and some are not, such as blood type, risk for specific diseases, or the thousands of basic biochemical processes that constitute life.
PROSER1 has several aliases: C13orf23, KIAA2032, and proline and serine-rich protein 1. [3] [4]
PROSER1 is located on the negative (reverse) strand of chromosome 13 at position 13q13.3. It spans from 39,009,865 base pairs from the pter to 39,038,095 bp from the pter, with a size of 28,231 bases. [5] PROSER1 has a total of 13 exons in its primary unspliced transcript mRNA of 5,185 bp. There are 2 isoforms of PROSER1, both within the 5,000 bp range. [6]
Chromosome 13 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 13 spans about 114 million base pairs and represents between 3.5 and 4% of the total DNA in cells.
Genes STOML3 and NHLRC3 neighbor PROSER1 on chromosome 13. [7]
Expressed Sequence Tag mapping of PROSER1 expression shows that it has particularly high expression in lymph, embryonic tissue, thymus, and uterus sites. It has moderate expression in testis, larynx, nerve, blood, and adipose tissue sites. [8] According to the Human Protein Atlas, PROSER1 has general cytoplasmic expression and is expressed in all RNA tissue categories. [9]
PROSER1 has no paralogs. [10]
PROSER1 is highly conserved among mammals. It is less highly conserved, though has been found, in fish, birds, and some invertebrates. It is not expressed in bacteria, plants, or fungi. [11] Show below is table of orthologs compiled from NCBI.
Latin name | Common name | Date of divergence (from H. sapiens) | Accession number | Protein sequence length | Sequence identity | Sequence similarity |
---|---|---|---|---|---|---|
Homo sapiens | Human | - | NP_079414.3 | 944 amino acids | - | - |
Gorilla gorilla | Gorilla | 8.9 million years ago | XP_004054459 | 994 aa | 99% | 99% |
Colobus angolensis palliatus | Angola colobus | 29.1 MYA | XP_011798372 | 947 aa | 97% | 97% |
Nannospalax galili | Spalax | 90.9 MYA | XP_008847583 | 916 aa | 80% | 85% |
Cricetulus griseus | Chinese Hamster | 90.9 MYA | XP_007626898 | 912 aa | 79% | 84% |
Panthera tigris altaica | Siberian Tiger | 97.5 MYA | XP_007097793 | 665 aa | 85% | 89% |
Acinonyx jubatus | Cheetah | 97.5 MYA | XP_014926409 | 770 aa | 79% | 83% |
Tursiops truncatus | Common Bottlenose Dolphin | 97.5 MYA | XP_004332367 | 1064 aa | 77% | 83% |
Ornithorhynchus anatinus | Platypus | 179.2 MYA | XP_007666895 | 888 aa | 69% | 74% |
Chelonia mydas | Green Sea Turtle | 320.5 MYA | XP_007070970 | 936 aa | 75% | 83% |
Pygoscelis adeliae | Adelie Penguin | 320.5 MYA | XP_009318201 | 938 aa | 75% | 83% |
Calidris (philomachus) pugnax | Ruff | 320.5 MYA | XP_014821534 | 899 aa | 74% | 83% |
Chrysemys picta bellii | Painted Turtle | 320.5 MYA | XP_008175998 | 825 aa | 73% | 82% |
Aquila chrysaetos Canadensis | Golden Deagle | 320.5 MYA | XP_011579121 | 916 aa | 73% | 81% |
Alligator sinensis | Chinese Alligator | 320.5 MYA | XP_014376376 | 940 aa | 72% | 81% |
Gekko japonicus | Japanese Gecko | 320.5 MYA | XP_015281837 | 1053 aa | 65% | 75% |
Python bivittatus | Burmese Python | 320.5 MYA | XP_007438154 | 924 aa | 65% | 75% |
Anolis carolinensis | Carolina Anole | 320.5 MYA | XP_008124125 | 920 aa | 64% | 75% |
Xenopus tropicalis | Western Clawed Frog | 355.7 MYA | XP_012813331 | 944 aa | 51% | 63% |
Lepisosteus oculatus | Spotted Gar | 429.6 MYA | XP_015197497 | 885 aa | 47% | 58% |
Callorhinchus milii | Australian Ghost Shark | 482.9 MYA | XP_007889503 | 965 aa | 64% | 75% |
The translated PROSER1 protein is 944 amino acids long. Its predicted molecular weight is 95.7 kdal. [12] PROSER1 has an isoelectric point of 9. [13] It is predicted to be localized to the nucleus. [14]
The sequence is rich in proline and serine and not particularly low in any other amino acids.
PROSER1 contains one domain of unknown function, DUF 4476, part of pfam14771. The DUF spans from amino acids 26 to 121. The molecular weight of DUF 4476 is 11.1 kdal.
PROSER1 is composed primarily of alpha helices, beta sheets, and coils. The protein is largely coiled. The DUF is composed mainly of alpha helices and coils. It has slightly fewer beta sheets compared to the protein as a whole. [15]
Proline-rich 12 (PRR12) is a protein of unknown function encoded by the gene PRR12.
Glutamine Serine Rich Protein 1 or QSER1 is a protein encoded by the QSER1 gene.
Coiled-coil domain-containing protein 144A is a protein that in humans is encoded by the CCDC144A gene. An alias of this gene is called KIAA0565. There are four members of the CCDC family: CCDC 144A, 144B, 144C and putative CCDC 144 N-terminal like proteins.
Family with Sequence Similarity 203, Member B (FAM203B) is a protein encoded by the FAM203B gene (8q24.3) in humans. While FAM203B is only found in humans and possibly non-human primates, its paralog, FAM203A, is highly conserved. The FAM203B protein contains two conserved domains of unknown function, DUF383 and DUF384, and no transmembrane domains. This protein has no known function yet, although the homolog of FAM203A in Caenorhabditis elegans (Y54H5A.2) is thought to help regulate the actin cytoskeleton.
Chromosome 6 open reading frame 165 (C6orf165) is a gene that in humans encodes a protein “DUF3508”. This protein has a function that is not currently very well understood. Other known aliases are “dJ382I10.1, UPF0704 Protein C6orf165.” In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs, and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and more C6orf165 is rarely expressed in humans, with relatively high expression in brain, lungs(trachea) and testis. The molecular weight of UPF0704 is 71,193 Da and the PI is 6.38
Coiled-coil domain 47 (CCDC47) is a gene located on human chromosome 17, specifically locus 17q23.3 which encodes for the protein CCDC47. The gene has several aliases including GK001 and MSTP041. The protein itself contains coiled-coil domains, the SEEEED superfamily, a domain of unknown function (DUF1682) and a transmembrane domain. The function of the protein is unknown, but it has been proposed that CCDC47 is involved in calcium ion homeostasis and the endoplasmic reticulum overload response.
Family with sequence similarity 167, member A is a protein in humans that is encoded by the FAM167A gene located on chromosome 8. FAM167A and its paralogs are protein encoding genes containing the conserved domain DUF3259, a protein of unknown function. FAM167A has many orthologs in which the domain of unknown function is highly conserved.
KIAA1841 is a gene in humans that encodes a protein known as KIAA1841. KIAA1841 is targeted for the nucleus and it predicted to play a role in regulating transcription.
Family with sequence similarity 63, member A is a protein that, in humans, is encoded by the FAM63A gene. It is located on the minus strand of chromosome 1 at locus 1q21.3.
C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).
C16orf96, or chromosome 16 open reading frame 96, is a protein in humans that is encoded by C16orf96 that is found on the 16th chromosome. In Homo sapiens, the protein is 1141 amino acids in length
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
GPATCH11 is a protein that in humans is encoded by the G-patch domain containing protein 11 gene. The gene has four transcript variants encoding two functional protein isoforms and is expressed in most human tissues. The protein has been found to interact with several other proteins, including two from a splicing pathway. In addition, GPATCH11 has orthologs in all taxa of the eukarya domain.
PRR29 is a protein located on human chromosome 17 that in humans is encoded by the PRR29 gene.
Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.
C14orf93 is a protein that is encoded in humans by the C14orf93 gene. It is a globular protein with a conserved C-terminus that is localized to the nucleus. While expressed relatively highly in all tissues except nervous tissue, it is expressed particularly highly in T cells and other immune tissues.
FAM231B, or family with sequence similarity 231B, is a protein found in humans and is encoded by FAM231B gene. Orthologs of FAM231B are only found back to primates.
Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
LOC101059915 is a protein, which in humans is encoded by the LOC101059915 gene. It is located on the X chromosome and has restricted expression in the testis.