FAM227B

Last updated
FAM227B
Identifiers
Aliases FAM227B , C15orf33, family with sequence similarity 227 member B
External IDs MGI: 1923073 HomoloGene: 27384 GeneCards: FAM227B
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_152647
NM_001330293

NM_029455

RefSeq (protein)

NP_001317222
NP_689860

NP_083731

Location (UCSC) Chr 15: 49.33 – 49.62 Mb Chr 2: 125.83 – 125.99 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse
I-Tasser Prediction of human FAM277B Isoform 1 Primary Structure.png

FAM227B is a protein that in humans is encoded by FAM227B gene. FAM227B stands for family with sequence similarity 227 member B and encodes protein FAM227B of the same name. Its aliases include C15orf33, MGC57432 and FLJ23800. [5] [6]

Contents

Gene

FAM227B is located at 15q21.2 and contains 24 exons. The current size determined for FAM227B is 293,961 base pairs (NCBI). Neighbors of FAM227B on chromosome fifteen include: “ribosomal protein L15 pseudogene”, “galactokinase 2”, “RNA, 7SL, cytoplasmic 307, pseudogene”, “signal peptide peptidase like 2A pseudogene”, “fibroblast growth factor 7”, “uncharacterized LOC105370811”, “DTW domain containing 1”, and “ring finger protein, LIM domain interacting pseudogene 3”. [5]

Transcript

There are 30 isoforms of FAM227B and one paralog, FAM227A. The conserved domains in these isoforms (as well as the paralog) are of various sizes and encode the protein FWWh (pfam14922) of unknown function, which all contain the distinctive motif FWW with a hydrophobic residue h. The main isoform used for analysis of FAM227B is isoform 1 (NM_152647.3). The next most reliable isoform of FAM227B is isoform 2 ( NM_001330293.2). The second isoform is shorter and has a distinct C-terminus. [5]

Below are cartoons depicting the different lengths and cutting patterns of the isoforms*:

Conceptual translation pg. 1.jpg
Cartoon of FAM227B isoforms comparing different lengths and cutting patterns.jpg
Conceptual Translation pg. 2.jpg
Conceptual Translation pg. 3.jpg

*The cartoons do not precisely depict differences between all the isoforms, but instead act as a simple depiction of a larger pattern between the isoforms.

Protein

The primary sequence for FAM227B is isoform 1 with accession number: NP_689860.2. It is 508 amino acids long. There are 30 isoforms. The molecular weight is 59.9kD and the isoelectric point is predicted to be high, around 10. [7] Compared to other proteins in humans, FAM227B has high abundance of Phenylalanine and Glycine and low abundance levels of Valine. The protein is predicted to be in the nuclear region of the cell. [8] There is a bipartite nuclear localization signal at RKLERYGEFLKKYHKKK, and three other nuclearization signals at HKKK, KKKK, and PKKTKIK. There is also a vacuolar targeting motif at TLPI. [8] [9] An FWWh region, where h signifies hydrophobic, runs from amino acids 135-296 in Homo sapiens FAM227B isoform 1. The function of this region is still unknown.

Multiple Sequence Alignment Human FAM227B and paralog Human FAM227A.jpg
Human FAM227B Promoter.jpg
Human FAM227B promoter key.jpg

Secondary structure

The secondary structure is predicted to be made up of alpha helices mainly and coiled coils [10]

Secondary Structure.jpg

Post translational modifications

Phosphorylation is the main post-translational modification predicted for FAM227B due to its predicted localization to the nucleus. [8] [11] There are many experimentally predicted phosphorylation sites, the most highly rated included in the conceptual translation. [12] Glycosylation sites and SUMOylation sites were also predicted. [13] [14]

Expression

FAM227B is most highly expressed in the testis at 1.983 +/- 0.404 RPKM, in the kidney at 1.408 +/- 0.152 RPKM, in the adrenal at 1.177 +/- 0.088 RPKM, and in the thyroid 1.133 +/- 0.165 RPKM. It is also expressed to a lesser degree in the appendix, bone marrow, brain, colon, duodenum, endometrium, esophagus, fat, gall bladder, heart, liver, lung, lymph node, ovary, pancreas, placenta, prostate, salivary gland, skin, small intestine, spleen, stomach, and urinary bladder [5]

Function

Currently, the function of FAM227B has not been characterized [15]

Protein-protein interactions

RNF123 was found to be an interacting protein of FAM227B through Affinity Capture – MS. [16] RAB3A was found to be an interacting protein of FAM227B through tandem affinity purification. [17]

Subcellular localization

Current studies have determined the location of this gene to be in the nuclear region of the cell. [8] [18] [11]

Homology and evolution

Paralogs: FAM227A

Orthologs: FAM227B is present in Deuterostomia and Protostomia, dating as far back as porifera. FAM227B is not present in choanoflagellates, and gene alignment sequences have shown that FAM227B is a rapidly evolving gene due to its evolution trajectory compared to cytochrome c and fibrinogen alpha. [19]

Evolution of FAM227B.png
Homology of FAM227B.png

Clinical significance

The location of FAM227B, 15q21.2, was found to be associated with oral cancer. [20] The 15q21.2 locus is mentioned in other literature as well. [21] [22] FGF7 is a neighbour of FAM227B in the 15q21.2 locus (rs10519227), and encodes for the fibroblast growth factor, which is involved in processes such as embryonic development, cell growth, tissue repair, tumor growth, invasion, and morphogenesis. FGF works as a signal for thyroid gland development, and an SNP on intron 2 of FGF7 has been associated with thyroid growth/goiter growth. [21] This association was only significant at the genome level in males. [21] It was found that the abnormal goiter growth is likely due to variant signals that cause increased levels of TSH. [21] [22] FAM227B was found to be related to at least some of the 48 significant DMRs (differentially methylated regions) between HF (high fertile) and LF (low fertile) groups in the genome of spermatozoa from boar animal model. [23] FAM227B was found to be upregulated in LOXL2 knockdown. [24] Knocking down LOXL2 results in lower levels of H3K4ox, resulting in chromatin decompaction, thus continuing activation of DNA damage response. This results in anticancer agents being more effective against cancerous cell lines. [24] FAM227B was found to be a genetic risk variant in breast cancer. [25] FAM227B was differentially expressed in prostrate genes of Esr2 knockout rats compared to wildtype rats. [26] Esr2 is involved in anti-proliferation and differentiation. [26] FAM227B was part of 20 upregulated genes in chorionic girdle during trophoblast development in horses. [27] Protein FAM227B was differentially expressed in cardiovascular disease. [28] FAM227B was found to be a candidate causal gene for lung cancer. [29] FAM227B has a predicted p53 binding site. [30]

Related Research Articles

<span class="mw-page-title-main">REEP5</span> Protein-coding gene in the species Homo sapiens

Receptor expression-enhancing protein 5 is a protein that in humans is encoded by the REEP5 gene. Receptor Expression Enhancing Protein is a protein encoded for in Humans by the REEP5 gene.

<span class="mw-page-title-main">RNF128</span> Protein-coding gene in the species Homo sapiens

E3 ubiquitin-protein ligase RNF128 is an enzyme that in humans is encoded by the RNF128 gene.

<span class="mw-page-title-main">FAM43A</span> Protein-coding gene in the species Homo sapiens

The family with sequence similarity 43 member A (FAM43A) gene, also known as; GCO3P195887, GC03P194406, GC03P191784, and NM_153690.3, codes for a 423 bp protein that is conserved in primates, and orthologs have been found in vertebrate and invertebrate species. Three transcripts have been identified, two protein coding isoforms, and a non-coding transcript (cAug10). Molecular weight of 45.8 kdal in the unphosphorylated state and isoelectric point of 6.1.

<span class="mw-page-title-main">INAVA</span> Protein-coding gene in the species Homo sapiens

INAVA, sometimes referred to as hypothetical protein LOC55765, is a protein of unknown function that in humans is encoded by the INAVA gene. Less common gene aliases include FLJ10901 and MGC125608.

<span class="mw-page-title-main">DEPDC1B</span> Protein-coding gene in the species Homo sapiens

DEP Domain Containing Protein 1B also known as XTP1, XTP8, HBV XAg-Transactivated Protein 8, [formerly referred to as BRCC3] is a human protein encoded by a gene of similar name located on chromosome 5.

<span class="mw-page-title-main">C11orf52</span> Protein-coding gene in the species Homo sapiens

C11orf52 is an uncharacterized protein that in homo sapiens is encoded by the C11orf52 gene.

<span class="mw-page-title-main">Transmembrane protein 255A</span> Mammalian protein found in Homo sapiens

Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.

<span class="mw-page-title-main">Coiled-coil domain containing 74a</span> Protein found in humans

Coiled-coil domain containing 74A is a protein that in humans is encoded by the CCDC74A gene. The protein is most highly expressed in the testis and may play a role in developmental pathways. The gene has undergone duplication in the primate lineage within the last 9 million years, and its only true ortholog is found in Pan troglodytes.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">TMEM171</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.

<span class="mw-page-title-main">CXorf38 Isoform 1</span> Human protein

Chromosome X Open Reading Frame 38 (CXorf38) is a protein which, in humans, is encoded by the CXorf38 gene. CXorf38 appears in multiple studies regarding the escape of X chromosome inactivation.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">SNAP47</span>

Synaptosome-associated protein, 47 kDal (SNAP47) is a human protein encoded by the SNAP47 gene. Other aliases of this gene are SVAP1, HEL170, ESFI5812, and HEL-S-290. SNAP47 is a synaptosome protein which is associated with the protein coding in multiple diseases, including non small cell lung cancer and schizophrenia. SNAP47 is a member of the SNAP protein family. SNAP proteins are t-snare proteins that are a component of SNARE complex. The SNARE complex mediates vesicle fusion by creating tight complex that brings vesicle and membrane together. This protein causes ubiquitous expression in testis, ovary, and many other tissues

<span class="mw-page-title-main">FAM120AOS</span> Protein-coding gene in the species Homo sapiens

FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.

<span class="mw-page-title-main">CCDC190</span> Protein found in humans

Coiled-Coil Domain Containing 190, also known as C1orf110, the Chromosome 1 Open Reading Frame 110, MGC48998 and CCDC190, is found to be a protein coding gene widely expressed in vertebrates. RNA-seq gene expression profile shows that this gene selectively expressed in different organs of human body like lung brain and heart. The expression product of c1orf110 is often called Coiled-coil domain-containing protein 190 with a size of 302 aa. It may get the name because a coiled-coil domain is found from position 14 to 72. At least 6 spliced variants of its mRNA and 3 isoforms of this protein can be identified, which is caused by alternative splicing in human.

<span class="mw-page-title-main">CXorf65</span> Gene on human chromosome X

Human uncharacterized protein CXorf65 is encoded by the gene CXorf65, which is located on the minus strand of chromosome X. Its transcript is 834 nucleotides long and consists of 6 exons. The translated protein is 183 amino acids in length. with a molecular weight of 21.3 kDa

<span class="mw-page-title-main">MROH9</span> Mammalian gene

Maestro heat-like repeat-containing protein family member 9 (MROH9) is a protein which in humans is encoded by the MROH9 gene. The word ‘maestro’ itself is an acronym, standing for male-specific transcription in the developing reproductive organs (MRO). MRO genes belong to the MROH family, which includes MROH9.

<span class="mw-page-title-main">SCRN3</span> Protein-coding gene in the species Homo sapiens

Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.

<span class="mw-page-title-main">ZNF839</span>

ZNF839 or zinc finger protein 839 is a protein which in humans is encoded by the ZNF839 gene. It is located on the long arm of chromosome 14. Zinc finger protein 839 is speculated to pay a role in humoral immune response to cancer as a renal carcinoma antigen (NY-REN-50). This is because NY-REN-50 was found to be over expressed in cancer patients, especially those with renal carcinoma. Zinc finger protein 839 also plays a role in transcription regulation by metal-ion binding since it binds to DNA via C2H2-type zinc finger repeats.

<span class="mw-page-title-main">CCDC177</span> Protein-coding gene in Homo sapiens

Coiled-Coil Domain Containing 177 (CCDC177) is a protein, which in humans, is encoded by the gene CCDC177. It is composed of a coiled helical domain that spans half of the protein. CCDC177 deletions are associated with intellectual disability and congenital heart defects.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000166262 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000027209 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 4 "FAM227B family with sequence similarity 227 member B [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-07-31.
  6. "FAM227B Gene". www.genecards.org. Retrieved 2021-07-31.
  7. "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2021-07-31.
  8. 1 2 3 4 "PSORT: Protein Subcellular Localization Prediction Tool". www.genscript.com. Retrieved 2021-07-31.
  9. "Motif Scan". myhits.sib.swiss. Retrieved 2021-07-31.
  10. "The Yang Zhang Lab". zhanggroup.org. Retrieved 2021-07-31.
  11. 1 2 "Tissue expression of FAM227B - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2021-07-31.
  12. "GPS 5.0 - Kinase-specific Phosphorylation Site Prediction". gps.biocuckoo.cn. Retrieved 2021-07-31.
  13. "YinOYang 1.2 Server". www.cbs.dtu.dk. Retrieved 2021-07-31.
  14. "[JASSA] Joined Advanced SUMOylation site and SIM Analyser". www.jassa.fr. Retrieved 2021-07-31.
  15. "Family: FWWh (PF14922)". pfam.xfam.org. Retrieved 2021-07-31.
  16. Khanna R, Krishnamoorthy V, Parnaik VK (June 2018). "E3 ubiquitin ligase RNF123 targets lamin B1 and lamin-binding proteins". The FEBS Journal. 285 (12): 2243–2262. doi: 10.1111/febs.14477 . PMID   29676528.
  17. Li Y, Wang Y, Zou L, Tang X, Yang Y, Ma L, et al. (February 2016). "Analysis of the Rab GTPase Interactome in Dendritic Cells Reveals Anti-microbial Functions of the Rab32 Complex in Bacterial Containment". Immunity. 44 (2): 422–37. doi: 10.1016/j.immuni.2016.01.027 . PMID   26885862.
  18. "PredictProtein - Protein Sequence Analysis, Prediction of Structural and Functional Features". predictprotein.org. Retrieved 2021-07-31.
  19. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2021-07-31.
  20. Lesseur C, Diergaarde B, Olshan AF, Wünsch-Filho V, Ness AR, Liu G, et al. (December 2016). "Genome-wide association analyses identify new susceptibility loci for oral cavity and pharyngeal cancer". Nature Genetics. 48 (12): 1544–1550. doi:10.1038/ng.3685. PMC   5131845 . PMID   27749845.
  21. 1 2 3 4 Porcu E, Medici M, Pistis G, Volpato CB, Wilson SG, Cappola AR, et al. (2013-02-07). "A meta-analysis of thyroid-related traits reveals novel loci and gender-specific differences in the regulation of thyroid function". PLOS Genetics. 9 (2): e1003266. doi: 10.1371/journal.pgen.1003266 . PMC   3567175 . PMID   23408906.
  22. 1 2 Teumer A, Rawal R, Homuth G, Ernst F, Heier M, Evert M, et al. (May 2011). "Genome-wide association study identifies four genetic loci associated with thyroid volume and goiter risk". American Journal of Human Genetics. 88 (5): 664–73. doi:10.1016/j.ajhg.2011.04.015. PMC   3146733 . PMID   21565293.
  23. Pértille F, Alvarez-Rodriguez M, da Silva AN, Barranco I, Roca J, Guerrero-Bosagna C, Rodriguez-Martinez H (March 2021). "Sperm Methylome Profiling Can Discern Fertility Levels in the Porcine Biomedical Model". International Journal of Molecular Sciences. 22 (5): 2679. doi: 10.3390/ijms22052679 . PMC   7961483 . PMID   33800945.
  24. 1 2 Cebrià-Costa JP, Pascual-Reguant L, Gonzalez-Perez A, Serra-Bardenys G, Querol J, Cosín M, et al. (January 2020). "LOXL2-mediated H3K4 oxidation reduces chromatin accessibility in triple-negative breast cancer cells". Oncogene. 39 (1): 79–121. doi:10.1038/s41388-019-0969-1. PMC   6937214 . PMID   31462706.
  25. Rashkin SR, Graff RE, Kachuri L, Thai KK, Alexeeff SE, Blatchins MA, et al. (September 2020). "Pan-cancer study detects genetic risk variants and shared genetic basis in two large cohorts". Nature Communications. 11 (1): 4423. Bibcode:2020NatCo..11.4423R. doi:10.1038/s41467-020-18246-6. PMC   7473862 . PMID   32887889.
  26. 1 2 Khristi V, Ghosh S, Chakravarthi VP, Wolfe MW, Rumi MA (June 2019). "Transcriptome data analyses of prostatic hyperplasia in Esr2 knockout rats". Data in Brief. 24: 103826. doi:10.1016/j.dib.2019.103826. PMC   6475810 . PMID   31016213.
  27. Read JE, Cabrera-Sharp V, Offord V, Mirczuk SM, Allen SP, Fowkes RC, de Mestre AM (October 2018). "Dynamic changes in gene expression and signalling during trophoblast development in the horse". Reproduction. 156 (4): 313–330. doi:10.1530/REP-18-0270. PMC   6170800 . PMID   30306765.
  28. Lygirou V, Latosinska A, Makridakis M, Mullen W, Delles C, Schanstra JP, et al. (April 2018). "Plasma proteomic analysis reveals altered protein abundances in cardiovascular disease". Journal of Translational Medicine. 16 (1): 104. doi: 10.1186/s12967-018-1476-9 . PMC   5905170 . PMID   29665821.
  29. Bossé Y, Li Z, Xia J, Manem V, Carreras-Torres R, Gabriel A, et al. (April 2020). "Transcriptome-wide association study reveals candidate causal genes for lung cancer". International Journal of Cancer. 146 (7): 1862–1878. doi:10.1002/ijc.32771. PMC   7008463 . PMID   31696517.
  30. Hearnes JM, Mays DJ, Schavolt KL, Tang L, Jiang X, Pietenpol JA (November 2005). "Chromatin immunoprecipitation-based screen to identify functional genomic binding sites for sequence-specific transactivators". Molecular and Cellular Biology. 25 (22): 10148–58. doi:10.1128/mcb.25.22.10148-10158.2005. PMC   1280257 . PMID   16260627.