TSR3 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | TSR3 , C16orf42, acp transferase ribosome maturation factor, TSR3 ribosome maturation factor, HsTsr3 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 617058; MGI: 1915577; HomoloGene: 6922; GeneCards: TSR3; OMA:TSR3 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
TSR3, or TSR3 Ribosome Maturation Factor, is a hypothetical human protein found on chromosome 16. [5] Its protein is 312 amino acids long [6] and its cDNA has 1214 base pairs. [7] It was previously designated C16orf42.
The function of C16orf42 is unknown. It is predicted to be a transmembrane protein, however the cellular or subcellular membrane in which is resides is as well unknown. [8]
C16orf42 can also be found in many other organisms, including mammals, and certain fungi and plants. [9] It is not found in bacteria. [9] C16orf42 is highly conserved in many of its orthologs, especially its mammalian orthologs, as high as 95% identity in rhesus monkeys. [9] It also has fairly high conservation in its more distant homologs, 53% identity in corn for example. [9] It has one potential human paralog, the protein EGFL6. [10]
Ortholog Analysis:
Genus and species | Common name | NCBI accession | Length | Identity | Similarity |
---|---|---|---|---|---|
Macaca mulatta | Rhesus monkey | XP_001088384 | 312 aa | 95% | 96% |
Bos taurus | Cattle | NP_001092381 | 315 aa | 82% | 87% |
Canis familiaris | Dog | XP_537015 | 312 aa | 79% | 85% |
Equus caballus | Horse | XP_001915599 | 324 aa | 78% | 83% |
Rattus norvegicus | Rat | EMD03916 | 324 aa | 73% | 80% |
Mus musculus | Mouse | NP_080952 | 323 aa | 73% | 80% |
Gallus gallus | Chicken | XP_414890 | 311 aa | 63% | 76% |
Danio rerio | Zebra fish | NP_001002370 | 297 aa | 62% | 75% |
Tetraodon nigroviridis | Green spotted puffer | CAG04371 | 247 aa | 62% | 74% |
Trichoplax adhaerens | Trichoplax | XP_002118089 | 284 aa | 46% | 64% |
Ciona intestinalis | Sea squirt | XP_002129931 | 265 aa | 52% | 68% |
Ornithorhynchus anatinus | Platypus | XP_001517034 | 285 aa | 74% | 85% |
Hydra magnipapillata | Hydra | XP_002161680 | 232 aa | 55% | 71% |
Aedes aegypti | Yellow fever mosquito | XP_001654463 | 269 aa | 52% | 70% |
Drosophila melanogaster | Fruit fly | NP_650441 | 274 aa | 54% | 70% |
Apis mellifera | Western honey bee | XP_625167 | 258 aa | 50% | 69% |
Debaryomyces hansenii | Marine yeast | XP_461733 | 333 aa | 49% | 64% |
Aspergillus oryzae | Mold | XP_001818982 | 374 aa | 50% | 69% |
Saccharomyces cerevisiae | Yeast | NP_014648 | 313 aa | 52% | 68% |
Zea mays | Corn | ACG47566 | 271 aa | 53% | 69% |
Caenorhabditis elegans | C. elegans | NP_741332 | 261 aa | 47% | 65% |
C16orf42 is not expressed ubiquitously in humans. It is most highly expressed in the ovary, but not expressed at all in the blood and very little in the brain. [11] One microarray experiment suggested that malaria causes its expression in the blood, [11] but further experimentation is needed to support this claim. Its expression in tissues tends to remain constant when the tissue is diseased. [11] However, a brief analysis of its orthologs show inconsistencies in tissue expression. This could be due to a lack of research of this protein in other species.
The structure of C16orf42 is unknown. It is predicted to have multiple regions of alpha-helices, and a few short stretches of beta-strands. [8] It contains a potential metal binding domain between amino acids 60–90. [6] It has a predicted molecular weight of 33.6 kdal and an isoelectric point of 6.496000, making it slightly acidic. [8] Compared to other human proteins, C16orf42 is high in the amino acids arginine and alanine, and low in the amino acid threonine. [8] A brief analysis of its strict orthologs show that they too are generally high in arginine and low in threonine as well as compared to the typical protein in their respective species. [8]
C2CD4D, or C2 calcium-dependent domain-containing protein 4D is a protein product of the human genome. The gene that codes for this protein is found on chromosome 1, from 150,076,963 to 150,079,657. The gene contains 2 exons and encodes 353 amino acids. Synonyms for C2CD4D are "FAM148D" and NP_001129475. C2CD4D contains a conserved metal binding domain that is a known as Protein kinase C conserved region 2, subgroup 1. This motif is known to be a member of the C2 superfamily, which is present in phospholipases, protein kinases C, and synaptotagmins. The amino acid sequence of C2CD4D can be accessed at Prior to any post translational modification, C2CD4D has a molecular weight of 37.6 kdal. Although scientists have not yet determined where C2CD4D functions within the cell, C2CD4D has a predicted isoelectric point of 11.636 which severely limits the places in which it can be effective. In addition, C2CD4D does not contain any predicted transmembrane domains or any predicted signal peptides.
Family with sequence similarity 63, member A is a protein that, is encoded by the FAM63A gene in humans,. It is located on the minus strand of chromosome 1 at locus 1q21.3.
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.
C9orf135 is a gene that encodes a 229 amino acid protein. It is located on Chromosome 9 of the Homo sapiens genome at 9q12.21. The protein has a transmembrane domain from amino acids 124-140 and a glycosylation site at amino acid 75. C9orf135 is part of the GRCh37 gene on Chromosome 9 and is contained within the domain of unknown function superfamily 4572. Also, c9orf135 is known by the name of LOC138255 which is a description of the gene location on Chromosome 9.1.
FAM210B is a gene that which in Homo sapiens encodes the protein FAM210B. It has been conserved throughout evolutionary history, and is highly expressed in multiple tissues within the human body. FAM210B's primary location is the endoplasmic reticulum.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.
Exosomal polycystin-1-interacting protein is a protein that, in humans, is encoded by the EPCIP gene. EPCIP is found on human chromosome 21, and it is thought to be expressed in tissues of the brain and reproductive organs. Additionally, EPCIP is highly expressed in ovarian surface epithelial cells during normal regulation, but is not expressed in cancerous ovarian surface epithelial cells.
CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.
FAM227A is a protein that in humans is encoded by FAM227A gene. Current studies have determined the location of this gene to be in the nuclear region of the cell. FAM227A is most highly expressed in the tissues of the fallopian tube, testis, and pituitary gland. FAM227A is present in species of mammals, birds and reptiles, and gene alignment sequences have shown that FAM227A is a rapidly evolving gene.
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
Sperm microtubule associated protein 1 is a protein which in humans is encoded by the SPMAP1 gene. The protein is derived from Homo sapiens chromosome 17. The C17orf98 gene consists of a 6,302 base sequence. Its mRNA has three exons and no alternative splice sites. The protein has 154 amino acids, with no abnormal amino acid levels. C17orf98 has a domain of unknown function (DUF4542) and is 17.6kDa in weight. C17orf98 does not belong to any other families nor does it have any isoforms. The protein has orthologs with high percent similarity in mammals and reptiles. The protein has additional distantly related orthologs across the metazoan kingdom, culminating with the sponge family.
Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
C2orf72 is a gene in humans that encodes a protein currently named after its gene, C2orf72. It is also designated LOC257407 and can be found under GenBank accession code NM_001144994.2. The protein can be found under UniProt accession code A6NCS6.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.