FAM227B | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | FAM227B , C15orf33, family with sequence similarity 227 member B | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1923073 HomoloGene: 27384 GeneCards: FAM227B | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
FAM227B is a protein that in humans is encoded by FAM227B gene. FAM227B stands for family with sequence similarity 227 member B and encodes protein FAM227B of the same name. Its aliases include C15orf33, MGC57432 and FLJ23800. [5] [6]
FAM227B is located at 15q21.2 and contains 24 exons. The current size determined for FAM227B is 293,961 base pairs (NCBI). Neighbors of FAM227B on chromosome fifteen include: “ribosomal protein L15 pseudogene”, “galactokinase 2”, “RNA, 7SL, cytoplasmic 307, pseudogene”, “signal peptide peptidase like 2A pseudogene”, “fibroblast growth factor 7”, “uncharacterized LOC105370811”, “DTW domain containing 1”, and “ring finger protein, LIM domain interacting pseudogene 3”. [5]
There are 30 isoforms of FAM227B and one paralog, FAM227A. The conserved domains in these isoforms (as well as the paralog) are of various sizes and encode the protein FWWh (pfam14922) of unknown function, which all contain the distinctive motif FWW with a hydrophobic residue h. The main isoform used for analysis of FAM227B is isoform 1 (NM_152647.3). The next most reliable isoform of FAM227B is isoform 2 ( NM_001330293.2). The second isoform is shorter and has a distinct C-terminus. [5]
Below are cartoons depicting the different lengths and cutting patterns of the isoforms*:
*The cartoons do not precisely depict differences between all the isoforms, but instead act as a simple depiction of a larger pattern between the isoforms.
The primary sequence for FAM227B is isoform 1 with accession number: NP_689860.2. It is 508 amino acids long. There are 30 isoforms. The molecular weight is 59.9kD and the isoelectric point is predicted to be high, around 10. [7] Compared to other proteins in humans, FAM227B has high abundance of Phenylalanine and Glycine and low abundance levels of Valine. The protein is predicted to be in the nuclear region of the cell. [8] There is a bipartite nuclear localization signal at RKLERYGEFLKKYHKKK, and three other nuclearization signals at HKKK, KKKK, and PKKTKIK. There is also a vacuolar targeting motif at TLPI. [8] [9] An FWWh region, where h signifies hydrophobic, runs from amino acids 135-296 in Homo sapiens FAM227B isoform 1. The function of this region is still unknown.
The secondary structure is predicted to be made up of alpha helices mainly and coiled coils [10]
Phosphorylation is the main post-translational modification predicted for FAM227B due to its predicted localization to the nucleus. [8] [11] There are many experimentally predicted phosphorylation sites, the most highly rated included in the conceptual translation. [12] Glycosylation sites and SUMOylation sites were also predicted. [13] [14]
FAM227B is most highly expressed in the testis at 1.983 +/- 0.404 RPKM, in the kidney at 1.408 +/- 0.152 RPKM, in the adrenal at 1.177 +/- 0.088 RPKM, and in the thyroid 1.133 +/- 0.165 RPKM. It is also expressed to a lesser degree in the appendix, bone marrow, brain, colon, duodenum, endometrium, esophagus, fat, gall bladder, heart, liver, lung, lymph node, ovary, pancreas, placenta, prostate, salivary gland, skin, small intestine, spleen, stomach, and urinary bladder [5]
Currently, the function of FAM227B has not been characterized [15]
RNF123 was found to be an interacting protein of FAM227B through Affinity Capture – MS. [16] RAB3A was found to be an interacting protein of FAM227B through tandem affinity purification. [17]
Current studies have determined the location of this gene to be in the nuclear region of the cell. [8] [18] [11]
Paralogs: FAM227A
Orthologs: FAM227B is present in Deuterostomia and Protostomia, dating as far back as porifera. FAM227B is not present in choanoflagellates, and gene alignment sequences have shown that FAM227B is a rapidly evolving gene due to its evolution trajectory compared to cytochrome c and fibrinogen alpha. [19]
The location of FAM227B, 15q21.2, was found to be associated with oral cancer. [20] The 15q21.2 locus is mentioned in other literature as well. [21] [22] FGF7 is a neighbour of FAM227B in the 15q21.2 locus (rs10519227), and encodes for the fibroblast growth factor, which is involved in processes such as embryonic development, cell growth, tissue repair, tumor growth, invasion, and morphogenesis. FGF works as a signal for thyroid gland development, and an SNP on intron 2 of FGF7 has been associated with thyroid growth/goiter growth. [21] This association was only significant at the genome level in males. [21] It was found that the abnormal goiter growth is likely due to variant signals that cause increased levels of TSH. [21] [22] FAM227B was found to be related to at least some of the 48 significant DMRs (differentially methylated regions) between HF (high fertile) and LF (low fertile) groups in the genome of spermatozoa from boar animal model. [23] FAM227B was found to be upregulated in LOXL2 knockdown. [24] Knocking down LOXL2 results in lower levels of H3K4ox, resulting in chromatin decompaction, thus continuing activation of DNA damage response. This results in anticancer agents being more effective against cancerous cell lines. [24] FAM227B was found to be a genetic risk variant in breast cancer. [25] FAM227B was differentially expressed in prostrate genes of Esr2 knockout rats compared to wildtype rats. [26] Esr2 is involved in anti-proliferation and differentiation. [26] FAM227B was part of 20 upregulated genes in chorionic girdle during trophoblast development in horses. [27] Protein FAM227B was differentially expressed in cardiovascular disease. [28] FAM227B was found to be a candidate causal gene for lung cancer. [29] FAM227B has a predicted p53 binding site. [30]
Receptor expression-enhancing protein 5 is a protein that in humans is encoded by the REEP5 gene. Receptor Expression Enhancing Protein is a protein encoded for in Humans by the REEP5 gene.
E3 ubiquitin-protein ligase RNF128 is an enzyme that in humans is encoded by the RNF128 gene.
The family with sequence similarity 43 member A (FAM43A) gene, also known as; GCO3P195887, GC03P194406, GC03P191784, and NM_153690.3, codes for a 423 bp protein that is conserved in primates, and orthologs have been found in vertebrate and invertebrate species. Three transcripts have been identified, two protein coding isoforms, and a non-coding transcript (cAug10). Molecular weight of 45.8 kdal in the unphosphorylated state and isoelectric point of 6.1.
INAVA, sometimes referred to as hypothetical protein LOC55765, is a protein of unknown function that in humans is encoded by the INAVA gene. Less common gene aliases include FLJ10901 and MGC125608.
DEP Domain Containing Protein 1B also known as XTP1, XTP8, HBV XAg-Transactivated Protein 8, [formerly referred to as BRCC3] is a human protein encoded by a gene of similar name located on chromosome 5.
C11orf52 is an uncharacterized protein that in homo sapiens is encoded by the C11orf52 gene.
Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.
Coiled-coil domain containing 74A is a protein that in humans is encoded by the CCDC74A gene. The protein is most highly expressed in the testis and may play a role in developmental pathways. The gene has undergone duplication in the primate lineage within the last 9 million years, and its only true ortholog is found in Pan troglodytes.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.
Chromosome X Open Reading Frame 38 (CXorf38) is a protein which, in humans, is encoded by the CXorf38 gene. CXorf38 appears in multiple studies regarding the escape of X chromosome inactivation.
WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.
Synaptosome-associated protein, 47 kDal (SNAP47) is a human protein encoded by the SNAP47 gene. Other aliases of this gene are SVAP1, HEL170, ESFI5812, and HEL-S-290. SNAP47 is a synaptosome protein which is associated with the protein coding in multiple diseases, including non small cell lung cancer and schizophrenia. SNAP47 is a member of the SNAP protein family. SNAP proteins are t-snare proteins that are a component of SNARE complex. The SNARE complex mediates vesicle fusion by creating tight complex that brings vesicle and membrane together. This protein causes ubiquitous expression in testis, ovary, and many other tissues
FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.
Coiled-Coil Domain Containing 190, also known as C1orf110, the Chromosome 1 Open Reading Frame 110, MGC48998 and CCDC190, is found to be a protein coding gene widely expressed in vertebrates. RNA-seq gene expression profile shows that this gene selectively expressed in different organs of human body like lung brain and heart. The expression product of c1orf110 is often called Coiled-coil domain-containing protein 190 with a size of 302 aa. It may get the name because a coiled-coil domain is found from position 14 to 72. At least 6 spliced variants of its mRNA and 3 isoforms of this protein can be identified, which is caused by alternative splicing in human.
Human uncharacterized protein CXorf65 is encoded by the gene CXorf65, which is located on the minus strand of chromosome X. Its transcript is 834 nucleotides long and consists of 6 exons. The translated protein is 183 amino acids in length. with a molecular weight of 21.3 kDa
Maestro heat-like repeat-containing protein family member 9 (MROH9) is a protein which in humans is encoded by the MROH9 gene. The word ‘maestro’ itself is an acronym, standing for male-specific transcription in the developing reproductive organs (MRO). MRO genes belong to the MROH family, which includes MROH9.
Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.
ZNF839 or zinc finger protein 839 is a protein which in humans is encoded by the ZNF839 gene. It is located on the long arm of chromosome 14. Zinc finger protein 839 is speculated to pay a role in humoral immune response to cancer as a renal carcinoma antigen (NY-REN-50). This is because NY-REN-50 was found to be over expressed in cancer patients, especially those with renal carcinoma. Zinc finger protein 839 also plays a role in transcription regulation by metal-ion binding since it binds to DNA via C2H2-type zinc finger repeats.
Coiled-Coil Domain Containing 177 (CCDC177) is a protein, which in humans, is encoded by the gene CCDC177. It is composed of a coiled helical domain that spans half of the protein. CCDC177 deletions are associated with intellectual disability and congenital heart defects.
This article needs additional or more specific categories .(July 2021) |