Proser1

Proser1
Identifiers
Aliases	2810046L04Rik9330161F11proline and serine rich 1
External IDs	HomoloGene: 13463 GeneCards:
Gene location (Human)
Chr.	Chromosome 3 (human)
End	53,481,755 bp
Gene ontology
Molecular function	• molecular function ;
Cellular component	• cellular component ;
Biological process	• biological process ;
	Sources:Amigo / QuickGO
Orthologs
	212127
	n/a
	ENSMUSG00000049504
	n/a
	Q5PRE5
	n/a
	NM_173382
	n/a
	NP_775558
	n/a
	Wikidata
View/Edit Human

Last updated May 03, 2019

PROSER1 is a protein that in humans is encoded by the PROSER1 gene.^[2]

Protein biological molecule consisting of chains of amino acid residues

Proteins are large biomolecules, or macromolecules, consisting of one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, responding to stimuli, providing structure to cells and organisms, and transporting molecules from one location to another. Proteins differ from one another primarily in their sequence of amino acids, which is dictated by the nucleotide sequence of their genes, and which usually results in protein folding into a specific three-dimensional structure that determines its activity.

In biology, a gene is a sequence of nucleotides in DNA or RNA that codes for a molecule that has a function. During gene expression, the DNA is first copied into RNA. The RNA can be directly functional or be the intermediate template for a protein that performs a function. The transmission of genes to an organism's offspring is the basis of the inheritance of phenotypic trait. These genes make up different DNA sequences called genotypes. Genotypes along with environmental and developmental factors determine what the phenotypes will be. Most biological traits are under the influence of polygenes as well as gene–environment interactions. Some genetic traits are instantly visible, such as eye color or number of limbs, and some are not, such as blood type, risk for specific diseases, or the thousands of basic biochemical processes that constitute life.

Nomenclature

PROSER1 has several aliases: C13orf23, KIAA2032, and proline and serine-rich protein 1.^[3]^[4]

Gene

Location

PROSER1 is located on the negative (reverse) strand of chromosome 13 at position 13q13.3. It spans from 39,009,865 base pairs from the pter to 39,038,095 bp from the pter, with a size of 28,231 bases.^[5] PROSER1 has a total of 13 exons in its primary unspliced transcript mRNA of 5,185 bp. There are 2 isoforms of PROSER1, both within the 5,000 bp range.^[6]

Chromosome 13 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 13 spans about 114 million base pairs and represents between 3.5 and 4% of the total DNA in cells.

Gene neighborhood

Genes STOML3 and NHLRC3 neighbor PROSER1 on chromosome 13.^[7]

Tissue distribution

Expressed Sequence Tag mapping of PROSER1 expression shows that it has particularly high expression in lymph, embryonic tissue, thymus, and uterus sites. It has moderate expression in testis, larynx, nerve, blood, and adipose tissue sites.^[8] According to the Human Protein Atlas, PROSER1 has general cytoplasmic expression and is expressed in all RNA tissue categories.^[9]

Homology

Paralogs

PROSER1 has no paralogs.^[10]

Orthologs

PROSER1 is highly conserved among mammals. It is less highly conserved, though has been found, in fish, birds, and some invertebrates. It is not expressed in bacteria, plants, or fungi.^[11] Show below is table of orthologs compiled from NCBI.

Latin name	Common name	Date of divergence (from H. sapiens)	Accession number	Protein sequence length	Sequence identity	Sequence similarity
Homo sapiens	Human	-	NP_079414.3	944 amino acids	-	-
Gorilla gorilla	Gorilla	8.9 million years ago	XP_004054459	994 aa	99%	99%
Colobus angolensis palliatus	Angola colobus	29.1 MYA	XP_011798372	947 aa	97%	97%
Nannospalax galili	Spalax	90.9 MYA	XP_008847583	916 aa	80%	85%
Cricetulus griseus	Chinese Hamster	90.9 MYA	XP_007626898	912 aa	79%	84%
Panthera tigris altaica	Siberian Tiger	97.5 MYA	XP_007097793	665 aa	85%	89%
Acinonyx jubatus	Cheetah	97.5 MYA	XP_014926409	770 aa	79%	83%
Tursiops truncatus	Common Bottlenose Dolphin	97.5 MYA	XP_004332367	1064 aa	77%	83%
Ornithorhynchus anatinus	Platypus	179.2 MYA	XP_007666895	888 aa	69%	74%
Chelonia mydas	Green Sea Turtle	320.5 MYA	XP_007070970	936 aa	75%	83%
Pygoscelis adeliae	Adelie Penguin	320.5 MYA	XP_009318201	938 aa	75%	83%
Calidris (philomachus) pugnax	Ruff	320.5 MYA	XP_014821534	899 aa	74%	83%
Chrysemys picta bellii	Painted Turtle	320.5 MYA	XP_008175998	825 aa	73%	82%
Aquila chrysaetos Canadensis	Golden Deagle	320.5 MYA	XP_011579121	916 aa	73%	81%
Alligator sinensis	Chinese Alligator	320.5 MYA	XP_014376376	940 aa	72%	81%
Gekko japonicus	Japanese Gecko	320.5 MYA	XP_015281837	1053 aa	65%	75%
Python bivittatus	Burmese Python	320.5 MYA	XP_007438154	924 aa	65%	75%
Anolis carolinensis	Carolina Anole	320.5 MYA	XP_008124125	920 aa	64%	75%
Xenopus tropicalis	Western Clawed Frog	355.7 MYA	XP_012813331	944 aa	51%	63%
Lepisosteus oculatus	Spotted Gar	429.6 MYA	XP_015197497	885 aa	47%	58%
Callorhinchus milii	Australian Ghost Shark	482.9 MYA	XP_007889503	965 aa	64%	75%

Protein

General properties

The translated PROSER1 protein is 944 amino acids long. Its predicted molecular weight is 95.7 kdal.^[12] PROSER1 has an isoelectric point of 9.^[13] It is predicted to be localized to the nucleus.^[14]

Composition

The sequence is rich in proline and serine and not particularly low in any other amino acids.

Domains

PROSER1 contains one domain of unknown function, DUF 4476, part of pfam14771. The DUF spans from amino acids 26 to 121. The molecular weight of DUF 4476 is 11.1 kdal.

Detailed location of DUF 4476 on PROSER1.

Secondary structure

PROSER1 is composed primarily of alpha helices, beta sheets, and coils. The protein is largely coiled. The DUF is composed mainly of alpha helices and coils. It has slightly fewer beta sheets compared to the protein as a whole.^[15]

Related Research Articles

Proline-rich 12 (PRR12) is a protein of unknown function encoded by the gene PRR12.

Glutamine Serine Rich Protein 1 or QSER1 is a protein encoded by the QSER1 gene.

Coiled-coil domain-containing protein 144A is a protein that in humans is encoded by the CCDC144A gene. An alias of this gene is called KIAA0565. There are four members of the CCDC family: CCDC 144A, 144B, 144C and putative CCDC 144 N-terminal like proteins.

Family with Sequence Similarity 203, Member B (FAM203B) is a protein encoded by the FAM203B gene (8q24.3) in humans. While FAM203B is only found in humans and possibly non-human primates, its paralog, FAM203A, is highly conserved. The FAM203B protein contains two conserved domains of unknown function, DUF383 and DUF384, and no transmembrane domains. This protein has no known function yet, although the homolog of FAM203A in Caenorhabditis elegans (Y54H5A.2) is thought to help regulate the actin cytoskeleton.

Chromosome 6 open reading frame 165 (C6orf165) is a gene that in humans encodes a protein “DUF3508”. This protein has a function that is not currently very well understood. Other known aliases are “dJ382I10.1, UPF0704 Protein C6orf165.” In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs, and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and more C6orf165 is rarely expressed in humans, with relatively high expression in brain, lungs(trachea) and testis. The molecular weight of UPF0704 is 71,193 Da and the PI is 6.38

Coiled-coil domain 47 (CCDC47) is a gene located on human chromosome 17, specifically locus 17q23.3 which encodes for the protein CCDC47. The gene has several aliases including GK001 and MSTP041. The protein itself contains coiled-coil domains, the SEEEED superfamily, a domain of unknown function (DUF1682) and a transmembrane domain. The function of the protein is unknown, but it has been proposed that CCDC47 is involved in calcium ion homeostasis and the endoplasmic reticulum overload response.

Family with sequence similarity 167, member A is a protein in humans that is encoded by the FAM167A gene located on chromosome 8. FAM167A and its paralogs are protein encoding genes containing the conserved domain DUF3259, a protein of unknown function. FAM167A has many orthologs in which the domain of unknown function is highly conserved.

KIAA1841 is a gene in humans that encodes a protein known as KIAA1841. KIAA1841 is targeted for the nucleus and it predicted to play a role in regulating transcription.

Family with sequence similarity 63, member A is a protein that, in humans, is encoded by the FAM63A gene. It is located on the minus strand of chromosome 1 at locus 1q21.3.

C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).

C16orf96, or chromosome 16 open reading frame 96, is a protein in humans that is encoded by C16orf96 that is found on the 16th chromosome. In Homo sapiens, the protein is 1141 amino acids in length

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

GPATCH11 is a protein that in humans is encoded by the G-patch domain containing protein 11 gene. The gene has four transcript variants encoding two functional protein isoforms and is expressed in most human tissues. The protein has been found to interact with several other proteins, including two from a splicing pathway. In addition, GPATCH11 has orthologs in all taxa of the eukarya domain.

PRR29 is a protein located on human chromosome 17 that in humans is encoded by the PRR29 gene.

Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.

C14orf93 is a protein that is encoded in humans by the C14orf93 gene. It is a globular protein with a conserved C-terminus that is localized to the nucleus. While expressed relatively highly in all tissues except nervous tissue, it is expressed particularly highly in T cells and other immune tissues.

FAM231B, or family with sequence similarity 231B, is a protein found in humans and is encoded by FAM231B gene. Orthologs of FAM231B are only found back to primates.

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

LOC101059915 is a protein, which in humans is encoded by the LOC101059915 gene. It is located on the X chromosome and has restricted expression in the testis.

References

↑ "Human PubMed Reference:".
↑ "GeneCards". Human Gene Database.
↑ "BioCompare". BioCompare.
↑ "Ensembl". Ensembl.
↑ "GeneCards". Human Gene Database.
↑ "NCBI Gene". NCBI.
↑ "NCBI Gene". NCBI.
↑ "Uniprot". Uniprot.
↑ "Tissue Atlas". The Human Protein Atlas.
↑ "Ensembl". Ensembl.
↑ "Ensembl". Ensembl.
↑ "Uniprot". Uniprot.
↑ "Expasy". Expasy.
↑ "PSORT II Prediction". PSORT II.
↑ "SDSC Biology Workbench". SDSC.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] "Human PubMed Reference:".

[2] "GeneCards". Human Gene Database.

[3] "BioCompare". BioCompare.

[4] "Ensembl". Ensembl.

[5] "GeneCards". Human Gene Database.

[6] "NCBI Gene". NCBI.

[7] "NCBI Gene". NCBI.

[8] "Uniprot". Uniprot.

[9] "Tissue Atlas". The Human Protein Atlas.

[10] "Ensembl". Ensembl.

[11] "Ensembl". Ensembl.

[12] "Uniprot". Uniprot.

[13] "Expasy". Expasy.

[14] "PSORT II Prediction". PSORT II.

[15] "SDSC Biology Workbench". SDSC.