TSR3

TSR3
Identifiers
Aliases	TSR3 , C16orf42, acp transferase ribosome maturation factor, TSR3 ribosome maturation factor, HsTsr3
External IDs	OMIM: 617058; MGI: 1915577; HomoloGene: 6922; GeneCards: TSR3; OMA:TSR3 - orthologs
Gene location (Human)
Chr.	Chromosome 16 (human)
End	1,351,878 bp
Gene location (Mouse)
Chr.	Chromosome 17 (mouse)
End	25,242,798 bp
RNA expression pattern
	Top expressed in
	tendon of biceps brachii; ; apex of heart; ; internal globus pallidus; ; left ventricle; ; mucosa of transverse colon; ; putamen; ; C1 segment; ; muscle of thigh; ; amygdala; ; gastrocnemius muscle;
	Top expressed in
	otic placode; ; embryo; ; otic vesicle; ; embryo; ; facial motor nucleus; ; ventricular zone; ; right kidney; ; epiblast; ; muscle of thigh; ; saccule;
	More reference expression data
	n/a
Gene ontology
Molecular function	transferase activity ;
Cellular component	cytosol ;
Biological process	rRNA processing ; maturation of SSU-rRNA ; ribosome biogenesis ; rRNA modification ;
	Sources:Amigo / QuickGO
Orthologs
	115939
	68327
	ENSG00000007520
	ENSMUSG00000015126
	Q9UJK0
	Q5HZH2
	NM_001001410
	NM_001163718 ; NM_026676
	NP_001001410
	NP_001157190 ; NP_080952
	Wikidata
View/Edit Human	View/Edit Mouse

Last updated August 22, 2024

TSR3, or TSR3 Ribosome Maturation Factor, is a hypothetical human protein found on chromosome 16.^[5] Its protein is 312 amino acids long^[6] and its cDNA has 1214 base pairs.^[7] It was previously designated C16orf42.

Function

The function of C16orf42 is unknown. It is predicted to be a transmembrane protein, however the cellular or subcellular membrane in which is resides is as well unknown.^[8]

Homology

C16orf42 can also be found in many other organisms, including mammals, and certain fungi and plants.^[9] It is not found in bacteria.^[9] C16orf42 is highly conserved in many of its orthologs, especially its mammalian orthologs, as high as 95% identity in rhesus monkeys.^[9] It also has fairly high conservation in its more distant homologs, 53% identity in corn for example.^[9] It has one potential human paralog, the protein EGFL6.^[10]

Ortholog Analysis:

Genus and species	Common name	NCBI accession	Length	Identity	Similarity
Macaca mulatta	Rhesus monkey	XP_001088384	312 aa	95%	96%
Bos taurus	Cattle	NP_001092381	315 aa	82%	87%
Canis familiaris	Dog	XP_537015	312 aa	79%	85%
Equus caballus	Horse	XP_001915599	324 aa	78%	83%
Rattus norvegicus	Rat	EMD03916	324 aa	73%	80%
Mus musculus	Mouse	NP_080952	323 aa	73%	80%
Gallus gallus	Chicken	XP_414890	311 aa	63%	76%
Danio rerio	Zebra fish	NP_001002370	297 aa	62%	75%
Tetraodon nigroviridis	Green spotted puffer	CAG04371	247 aa	62%	74%
Trichoplax adhaerens	Trichoplax	XP_002118089	284 aa	46%	64%
Ciona intestinalis	Sea squirt	XP_002129931	265 aa	52%	68%
Ornithorhynchus anatinus	Platypus	XP_001517034	285 aa	74%	85%
Hydra magnipapillata	Hydra	XP_002161680	232 aa	55%	71%
Aedes aegypti	Yellow fever mosquito	XP_001654463	269 aa	52%	70%
Drosophila melanogaster	Fruit fly	NP_650441	274 aa	54%	70%
Apis mellifera	Western honey bee	XP_625167	258 aa	50%	69%
Debaryomyces hansenii	Marine yeast	XP_461733	333 aa	49%	64%
Aspergillus oryzae	Mold	XP_001818982	374 aa	50%	69%
Saccharomyces cerevisiae	Yeast	NP_014648	313 aa	52%	68%
Zea mays	Corn	ACG47566	271 aa	53%	69%
Caenorhabditis elegans	C. elegans	NP_741332	261 aa	47%	65%

Expression

C16orf42 is not expressed ubiquitously in humans. It is most highly expressed in the ovary, but not expressed at all in the blood and very little in the brain.^[11] One microarray experiment suggested that malaria causes its expression in the blood,^[11] but further experimentation is needed to support this claim. Its expression in tissues tends to remain constant when the tissue is diseased.^[11] However, a brief analysis of its orthologs show inconsistencies in tissue expression. This could be due to a lack of research of this protein in other species.

Structure

The structure of C16orf42 is unknown. It is predicted to have multiple regions of alpha-helices, and a few short stretches of beta-strands.^[8] It contains a potential metal binding domain between amino acids 60–90.^[6] It has a predicted molecular weight of 33.6 kdal and an isoelectric point of 6.496000, making it slightly acidic.^[8] Compared to other human proteins, C16orf42 is high in the amino acids arginine and alanine, and low in the amino acid threonine.^[8] A brief analysis of its strict orthologs show that they too are generally high in arginine and low in threonine as well as compared to the typical protein in their respective species.^[8]

Related Research Articles

C2CD4D, or C2 calcium-dependent domain-containing protein 4D is a protein product of the human genome. The gene that codes for this protein is found on chromosome 1, from 150,076,963 to 150,079,657. The gene contains 2 exons and encodes 353 amino acids. Synonyms for C2CD4D are "FAM148D" and NP_001129475. C2CD4D contains a conserved metal binding domain that is a known as Protein kinase C conserved region 2, subgroup 1. This motif is known to be a member of the C2 superfamily, which is present in phospholipases, protein kinases C, and synaptotagmins. The amino acid sequence of C2CD4D can be accessed at Prior to any post translational modification, C2CD4D has a molecular weight of 37.6 kdal. Although scientists have not yet determined where C2CD4D functions within the cell, C2CD4D has a predicted isoelectric point of 11.636 which severely limits the places in which it can be effective. In addition, C2CD4D does not contain any predicted transmembrane domains or any predicted signal peptides.

Family with sequence similarity 63, member A is a protein that, is encoded by the FAM63A gene in humans,. It is located on the minus strand of chromosome 1 at locus 1q21.3.

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.

C9orf135 is a gene that encodes a 229 amino acid protein. It is located on Chromosome 9 of the Homo sapiens genome at 9q12.21. The protein has a transmembrane domain from amino acids 124-140 and a glycosylation site at amino acid 75. C9orf135 is part of the GRCh37 gene on Chromosome 9 and is contained within the domain of unknown function superfamily 4572. Also, c9orf135 is known by the name of LOC138255 which is a description of the gene location on Chromosome 9.1.

FAM210B is a gene that which in Homo sapiens encodes the protein FAM210B. It has been conserved throughout evolutionary history, and is highly expressed in multiple tissues within the human body. FAM210B's primary location is the endoplasmic reticulum.

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.

Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.

Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.

Exosomal polycystin-1-interacting protein is a protein that, in humans, is encoded by the EPCIP gene. EPCIP is found on human chromosome 21, and it is thought to be expressed in tissues of the brain and reproductive organs. Additionally, EPCIP is highly expressed in ovarian surface epithelial cells during normal regulation, but is not expressed in cancerous ovarian surface epithelial cells.

CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.

FAM227A is a protein that in humans is encoded by FAM227A gene. Current studies have determined the location of this gene to be in the nuclear region of the cell. FAM227A is most highly expressed in the tissues of the fallopian tube, testis, and pituitary gland. FAM227A is present in species of mammals, birds and reptiles, and gene alignment sequences have shown that FAM227A is a rapidly evolving gene.

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">SPMAP1</span> Protein-coding gene in the species Homo sapiens

Sperm microtubule associated protein 1 is a protein which in humans is encoded by the SPMAP1 gene. The protein is derived from Homo sapiens chromosome 17. The C17orf98 gene consists of a 6,302 base sequence. Its mRNA has three exons and no alternative splice sites. The protein has 154 amino acids, with no abnormal amino acid levels. C17orf98 has a domain of unknown function (DUF4542) and is 17.6kDa in weight. C17orf98 does not belong to any other families nor does it have any isoforms. The protein has orthologs with high percent similarity in mammals and reptiles. The protein has additional distantly related orthologs across the metazoan kingdom, culminating with the sponge family.

Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

C2orf72 is a gene in humans that encodes a protein currently named after its gene, C2orf72. It is also designated LOC257407 and can be found under GenBank accession code NM_001144994.2. The protein can be found under UniProt accession code A6NCS6.

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

References

1 2 3 GRCh38: Ensembl release 89: ENSG00000007520 – Ensembl, May 2017
1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000015126 – Ensembl, May 2017
↑ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
↑ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
↑ Strausberg RL, Feingold EA, Grouse LH, et al. (December 2002). "Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences". Proc. Natl. Acad. Sci. U.S.A. 99 (26): 16899–903. Bibcode:2002PNAS...9916899M. doi: 10.1073/pnas.242603899 . PMC 139241 . PMID 12477932.
1 2 NCBI Protein
↑ NCBI Nucleotide.
1 2 3 4 5 Biology Workbench
1 2 3 4 NCBI BLink
↑ NCBI BLAST
1 2 3 UniGene

External links

Human TSR3 genome location and TSR3 gene details page in the UCSC Genome Browser .