FAM237A

FAM237A is a protein-coding gene that encodes a protein of the same name. [1] In Homo sapiens, FAM237A is believed to be expressed primarily in the brain, with moderate expression in the heart and lower expression in the testes. [1] [2] FAM237A is hypothesized to act as a specific activator of the receptor GPR83. [3]

Gene

FAM237A is alternatively known as HCG1657980 and LOC200726. [2] [4] In Homo sapiens, the gene resides on the plus strand of chromosome 2 and extends from base 207486904 to base 207514174. [2] The unspliced Homo sapiens FAM237A sequence contains 13 exons. [2]
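As a small worked example, the genomic span implied by these coordinates can be computed directly. The sketch below assumes the two coordinates are 1-based and inclusive, which the source does not state; the function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bases of a closed interval [start, end].

    Assumption: both coordinates are 1-based and inclusive.
    """
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# Coordinates quoted in the text for Homo sapiens FAM237A on chromosome 2.
GENE_START = 207_486_904
GENE_END = 207_514_174

gene_length = span_length(GENE_START, GENE_END)
print(f"FAM237A spans {gene_length} bases")  # 27271 under the inclusive assumption
```

If the coordinates were instead half-open, the result would be one base shorter.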

Transcripts

Homo sapiens FAM237A is predicted to produce six unique transcripts, of which four are spliced and two are unspliced. [2]

Alternate Transcripts of Homo sapiens FAM237A [2]
Name | Size (Base Pairs) | Exon Usage
aAug10 | 1797 | 2, 3, 8, 10
bAug10 | 1376 | 4, 6, 13
cAug10 | 1121 | 5, 12
dAug10 (Unspliced) | 798 | 7
eAug10 (Unspliced) | 873 | 9
fAug10 | 706 | 1, 11
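
The transcript table can be cross-checked against the exon and transcript counts given in the text. The following is a minimal sketch that transcribes the table by hand into a dictionary (the name `TRANSCRIPTS` and the layout are illustrative, not an AceView format) and tests whether the 13 exons are each used exactly once and how many transcripts use more than one exon.

```python
from collections import Counter

# Hand-transcribed from the table: name -> (size in base pairs, exons used).
TRANSCRIPTS = {
    "aAug10": (1797, [2, 3, 8, 10]),
    "bAug10": (1376, [4, 6, 13]),
    "cAug10": (1121, [5, 12]),
    "dAug10": (798, [7]),
    "eAug10": (873, [9]),
    "fAug10": (706, [1, 11]),
}

# Count how often each exon number appears across all transcripts.
exon_counts = Counter(
    exon for _, exons in TRANSCRIPTS.values() for exon in exons
)

# True when exons 1..13 each appear exactly once.
all_used_once = (
    sorted(exon_counts) == list(range(1, 14))
    and set(exon_counts.values()) == {1}
)

# Transcripts joining more than one exon are treated as spliced here.
spliced = [name for name, (_, exons) in TRANSCRIPTS.items() if len(exons) > 1]

print("Each of 13 exons used exactly once:", all_used_once)
print("Spliced transcripts:", spliced)
```

Under this reading of the table, the four multi-exon transcripts match the four spliced transcripts mentioned in the text, and the two single-exon entries are the ones labelled unspliced.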

Proteins

Homo sapiens FAM237A is associated with three unnamed protein isoforms. [2] FAM237A's most-researched isoform is 181 amino acids long, and is predicted to contain a transmembrane domain. [2] FAM237A's second protein isoform is predicted to be 417 amino acids long; it contains a transmembrane domain and an upstream open reading frame. [2] The last protein isoform of FAM237A is made up of 158 amino acids and contains a transmembrane domain; this isoform is predicted to localize within the membrane. [2] Several databases, including NCBI, recognize only FAM237A's 181 amino acid isoform. [1] Because most of the available literature concerns this isoform, the remainder of this page discusses only FAM237A's 181 amino acid isoform.

The theoretical molecular weight of this isoform is 20.56 kDa. [5] [6] [7] Its theoretical isoelectric point is 8.96. [5] [6] [7] The amino acid composition of Homo sapiens FAM237A is predicted to be relatively standard. [8] Notably, it contains a repeated LFWD motif, beginning at amino acids 90 and 97. [8]
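
A repeated motif such as LFWD can be located with a simple substring scan. The sketch below is illustrative only: `TOY_SEQUENCE` is a made-up peptide, not the FAM237A sequence (which is not reproduced on this page), and `find_motif` is a hypothetical helper that reports 1-based start positions, the convention assumed for the amino acid numbers above.

```python
def find_motif(sequence: str, motif: str) -> list[int]:
    """Return the 1-based start position of every occurrence of motif.

    Overlapping occurrences are included because the search resumes
    one residue after each hit.
    """
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to 1-based
        start = sequence.find(motif, start + 1)
    return positions


# Hypothetical 15-residue peptide with two LFWD copies seven residues apart,
# mirroring the spacing (90 and 97) reported for FAM237A.
TOY_SEQUENCE = "MKLFWDAGSLFWDKR"

hits = find_motif(TOY_SEQUENCE, "LFWD")
print(hits)
```

Run against the real protein sequence, the same function would be expected to report the two positions quoted in the text, assuming those numbers refer to the first residue of each motif copy.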

FAM237A's transmembrane domain is generally predicted to span amino acids 14–32 of the protein. [8] [9] However, the structure prediction tool Phyre2 instead places the transmembrane domain at amino acids 91–106. [10]

Regulation

Three promoters of Homo sapiens FAM237A are predicted: GXP_8991091, GXP_7539237, and GXP_8991092. [11] Of these, GXP_8991091 has the greatest predicted tissue expression levels. [11]

AceView predicts that Homo sapiens FAM237A localizes to membranes. [2] Other tools disagree: the localization predictor Hum-mPLoc places the protein in the nucleus, whereas PSORT II predicts localization to the endoplasmic reticulum (ER), with lower probabilities for the mitochondria and Golgi apparatus. [12] [13] [14] [15] [16] [17]

Numerous phosphorylation sites are predicted on the Homo sapiens FAM237A sequence. [18] [19] [20] [21] The protein contains two predicted fatty acid addition sites, at amino acids 18 and 26; these sites overlap with one of FAM237A's predicted transmembrane sequences. [22] [23] Homo sapiens FAM237A is additionally predicted to contain two ubiquitination sites, at amino acids 179 and 181. [24] [25] These ubiquitination sites are predicted to coincide exactly with two acetylation sites. [26] [27]
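
The positional claims on this page can be checked with simple interval arithmetic. The sketch below uses only the residue numbers quoted in the text; the dictionary keys and helper names are illustrative labels, and all ranges are assumed to be 1-based and inclusive.

```python
# Predicted transmembrane ranges quoted in the text (inclusive residues).
TM_PREDICTIONS = {
    "general": (14, 32),   # the commonly predicted range
    "Phyre2": (91, 106),   # the alternative Phyre2 prediction
}

# Predicted fatty acid addition sites quoted in the text.
FATTY_ACID_SITES = [18, 26]


def sites_within(sites, region):
    """Return the sites that fall inside the inclusive region (start, end)."""
    start, end = region
    return [site for site in sites if start <= site <= end]


def ranges_overlap(a, b):
    """True when two inclusive ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


in_general = sites_within(FATTY_ACID_SITES, TM_PREDICTIONS["general"])
in_phyre2 = sites_within(FATTY_ACID_SITES, TM_PREDICTIONS["Phyre2"])
tm_agree = ranges_overlap(TM_PREDICTIONS["general"], TM_PREDICTIONS["Phyre2"])

print("Sites inside 14-32:", in_general)
print("Sites inside 91-106:", in_phyre2)
print("The two transmembrane predictions overlap:", tm_agree)
```

Both fatty acid sites fall inside the 14–32 range and neither falls inside the Phyre2 range, which suggests the overlap described above refers to the 14–32 prediction.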

Homology

Homo sapiens FAM237A has one predicted paralog: FAM237B. [4] FAM237B has 21.6% predicted sequence identity with FAM237A. [28]
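
Percent identity is derived from a pairwise alignment, and the exact value depends on the denominator chosen. The sketch below assumes one common definition (identical columns divided by total alignment length, gaps included); the source does not say which definition produced the 21.6% figure. The two aligned strings are toy examples, not the FAM237A and FAM237B sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two rows of a pairwise alignment.

    Assumption: identity = identical non-gap columns / alignment length.
    Gaps are written as '-' and never count as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment: 8 columns, 5 identical residues, 2 gap columns.
TOY_A = "MKT-LLVA"
TOY_B = "MRTALL-A"

identity = percent_identity(TOY_A, TOY_B)
print(f"{identity:.1f}% identity")
```

Dividing by the length of the shorter sequence, or by the number of ungapped columns, would give a different percentage for the same alignment.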

FAM237A has orthologs in a broad range of vertebrates, including other mammals, Reptilia, Actinopterygii, and Aves. [29] Based upon BLAST analysis, FAM237A is not found in invertebrates. [29] BLAST searches also indicate that the only reptiles in which FAM237A is found belong to the suborder Cryptodira. [29]

Function

Information regarding FAM237A's function is limited; however, FAM237A is predicted to be a specific activator of GPR83, which is implicated in energy metabolism, dietary patterns, and reward signaling. [3] [30] GPR83 is additionally suspected to be linked to immune system function. [30]

References

  1. "FAM237A family with sequence similarity 237 member A [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-12-15.
  2. "AceView: Gene:LOC200726, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTs". www.ncbi.nlm.nih.gov. Retrieved 2020-12-15.
  3. Sallee, Nathan A.; Lee, Ernestine; Leffert, Atossa; Ramirez, Silvia; Brace, Arthur D.; Halenbeck, Robert; Kavanaugh, W. Michael; Sullivan, Kathleen M. C. (2020-07-25). "A Pilot Screen of a Novel Peptide Hormone Library Identified Candidate GPR83 Ligands". SLAS Discovery. 25 (9): 1047–1063. doi:10.1177/2472555220934807. ISSN 2472-5552. PMID 32713278. S2CID 220798057.
  4. "FAM237A Gene - GeneCards | F237A Protein | F237A Antibody". www.genecards.org. Retrieved 2020-12-19.
  5. Bjellqvist, Bengt; Hughes, Graham J.; Pasquali, Christian; Paquet, Nicole; Ravier, Florence; Sanchez, Jean-Charles; Frutiger, Séverine; Hochstrasser, Denis (1993). "The focusing positions of polypeptides in immobilized pH gradients can be predicted from their amino acid sequences". Electrophoresis. 14 (1): 1023–1031. doi:10.1002/elps.11501401163. ISSN 0173-0835. PMID 8125050. S2CID 38041111.
  6. Bjellqvist, Bengt; Basse, Bodil; Olsen, Eydfinnur; Celis, Julio E. (1994). "Reference points for comparisons of two-dimensional maps of proteins from different human cell types defined in a pH scale where isoelectric points correlate with polypeptide compositions". Electrophoresis. 15 (1): 529–539. doi:10.1002/elps.1150150171. ISSN 0173-0835. PMID 8055880. S2CID 25560231.
  7. Wilkins, Marc R.; Gasteiger, Elisabeth; Bairoch, Amos; Sanchez, Jean-Charles; Williams, Keith L.; Appel, Ron D.; Hochstrasser, Denis F. (1998), "Protein Identification and Analysis Tools in the ExPASy Server", 2-D Proteome Analysis Protocols, New Jersey: Humana Press, vol. 112, pp. 531–552, doi:10.1385/1-59259-584-7:531, ISBN 1-59259-584-7, PMID 10027275, retrieved 2020-12-19
  8. Madeira, Fábio; Park, Young Mi; Lee, Joon; Buso, Nicola; Gur, Tamer; Madhusoodanan, Nandana; Basutkar, Prasad; Tivey, Adrian R N; Potter, Simon C; Finn, Robert D; Lopez, Rodrigo (30 June 2019). "The EMBL-EBI search and sequence analysis tools APIs in 2019". Nucleic Acids Research. 47 (W1): 636–641. doi:10.1093/nar/gkz268. PMC 6602479. PMID 30976793.
  9. "protein FAM237A [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-12-19.
  10. Kelley, Lawrence A; Mezulis, Stefans; Yates, Christopher M; Wass, Mark N; Sternberg, Michael J E (2015-05-07). "The Phyre2 web portal for protein modeling, prediction and analysis". Nature Protocols. 10 (6): 845–858. doi:10.1038/nprot.2015.053. ISSN 1754-2189. PMC 5298202. PMID 25950237.
  11. "Genomatix - NGS Data Analysis & Personalized Medicine". www.genomatix.de. Retrieved 2020-12-19.
  12. Chou, Kuo-Chen; Shen, Hong-Bin (2008). "Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms". Nature Protocols. 3 (2): 153–162. doi:10.1038/nprot.2007.494. ISSN 1750-2799. PMID 18274516. S2CID 226104.
  13. Shen, Hong-Bin; Chou, Kuo-Chen (2007-04-20). "Hum-mPLoc: An ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites". Biochemical and Biophysical Research Communications. 355 (4): 1006–1011. doi:10.1016/j.bbrc.2007.02.071. ISSN 0006-291X. PMID 17346678.
  14. Chou, K.-C. (2004-08-12). "Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes". Bioinformatics. 21 (1): 10–19. doi:10.1093/bioinformatics/bth466. ISSN 1367-4803. PMID 15308540.
  15. Shen, Hong-Bin; Chou, Kuo-Chen (2006-07-15). "Ensemble classifier for protein fold pattern recognition". Bioinformatics. 22 (14): 1717–1722. doi:10.1093/bioinformatics/btl170. ISSN 1367-4811. PMID 16672258.
  16. Nakai, K; Horton, P (January 1999). "PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization". Trends Biochem. Sci. 24 (1): 34–36. doi:10.1016/s0968-0004(98)01336-x. PMID 10087920.
  17. Nakai, Kenta; Kanehisa, Minoru (December 1992). "A knowledge base for predicting protein localization sites in eukaryotic cells". Genomics. 14 (4): 897–911. doi:10.1016/s0888-7543(05)80111-9. ISSN 0888-7543. PMC 7134799. PMID 1478671.
  18. Xue, Yu; Ren, Jian; Gao, Xinjiao; Jin, Changjiang; Wen, Longping; Yao, Xuebiao (September 2008). "GPS 2.0, a Tool to Predict Kinase-specific Phosphorylation Sites in Hierarchy". Molecular & Cellular Proteomics. 7 (9): 1598–1608. doi:10.1074/mcp.M700574-MCP200. ISSN 1535-9476. PMC 2528073. PMID 18463090.
  19. Xue, Yu; Liu, Zexian; Cao, Jun; Ma, Qian; Gao, Xinjiao; Wang, Qingqi; Jin, Changjiang; Zhou, Yanhong; Wen, Longping; Ren, Jian (March 2011). "GPS 2.1: enhanced prediction of kinase-specific phosphorylation sites with an algorithm of motif length selection". Protein Engineering, Design & Selection. 24 (3): 255–260. doi:10.1093/protein/gzq094. ISSN 1741-0134. PMID 21062758.
  20. Wang, Chenwei; Xu, Haodong; Lin, Shaofeng; Deng, Wankun; Zhou, Jiaqi; Zhang, Ying; Shi, Ying; Peng, Di; Xue, Yu (2020-02-01). "GPS 5.0: An Update on the Prediction of Kinase-specific Phosphorylation Sites in Proteins". Genomics, Proteomics & Bioinformatics. 18 (1): 72–80. doi:10.1016/j.gpb.2020.01.001. ISSN 1672-0229. PMC 7393560. PMID 32200042.
  21. Xue, Yu; Zhou, Fengfeng; Zhu, Minjie; Ahmed, Kashif; Chen, Guoliang; Yao, Xuebiao (2005-07-01). "GPS: a comprehensive www server for phosphorylation sites prediction". Nucleic Acids Research. 33 (Web Server issue): W184–W187. doi:10.1093/nar/gki393. ISSN 0305-1048. PMC 1160154. PMID 15980451.
  22. Xie, Yubin; Zheng, Yueyuan; Li, Hongyu; Luo, Xiaotong; He, Zhihao; Cao, Shuo; Shi, Yi; Zhao, Qi; Xue, Yu; Zuo, Zhixiang; Ren, Jian (2016-06-16). "GPS-Lipid: a robust tool for the prediction of multiple lipid modification sites". Scientific Reports. 6: 28249. Bibcode:2016NatSR...628249X. doi:10.1038/srep28249. ISSN 2045-2322. PMC 4910163. PMID 27306108.
  23. Ren, Jian; Wen, Longping; Gao, Xinjiao; Jin, Changjiang; Xue, Yu; Yao, Xuebiao (November 2008). "CSS-Palm 2.0: an updated software for palmitoylation sites prediction". Protein Engineering, Design & Selection. 21 (11): 639–644. doi:10.1093/protein/gzn039. ISSN 1741-0134. PMC 2569006. PMID 18753194.
  24. Ren, Jian; Gao, Xinjiao; Jin, Changjiang; Zhu, Mei; Wang, Xiwei; Shaw, Andrew; Wen, Longping; Yao, Xuebiao; Xue, Yu (2009). "Systematic study of protein sumoylation: Development of a site-specific predictor of SUMOsp 2.0". Proteomics. 9 (12): 3409–3412. doi:10.1002/pmic.200800646. ISSN 1615-9861. PMID 29658196. S2CID 4900031.
  25. Zhao, Qi; Xie, Yubin; Zheng, Yueyuan; Jiang, Shuai; Liu, Wenzhong; Mu, Weiping; Liu, Zexian; Zhao, Yong; Xue, Yu; Ren, Jian (2014-07-01). "GPS-SUMO: a tool for the prediction of sumoylation sites and SUMO-interaction motifs". Nucleic Acids Research. 42 (Web Server issue): W325–W330. doi:10.1093/nar/gku383. ISSN 0305-1048. PMC 4086084. PMID 24880689.
  26. Deng, Wankun; Wang, Chenwei; Zhang, Ying; Xu, Yang; Zhang, Shuang; Liu, Zexian; Xue, Yu (22 December 2016). "GPS-PAIL: prediction of lysine acetyltransferase-specific modification sites from protein sequences". Scientific Reports. 6 (1): 39787. Bibcode:2016NatSR...639787D. doi:10.1038/srep39787. ISSN 2045-2322. PMC 5177928. PMID 28004786.
  27. Li, Ao; Xue, Yu; Jin, Changjiang; Wang, Minghui; Yao, Xuebiao (1 December 2006). "Prediction of Nε-acetylation on internal lysines implemented in Bayesian Discriminant Method". Biochemical and Biophysical Research Communications. 350 (4): 818–824. doi:10.1016/j.bbrc.2006.08.199. ISSN 0006-291X. PMC 2093955. PMID 17045240.
  28. "Using sequence similarity searching tools at EMBL-EBI: webinar". doi:10.6019/tol.seqsim-w.2015.00001.1.
  29. Boratyn, Grzegorz M; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Busby, Ben; Madden, Thomas L (2018-08-13). "Magic-BLAST, an accurate DNA and RNA-seq aligner for long and short reads". doi:10.1101/390013. S2CID 92268893.
  30. Müller, Timo D.; Müller, Anne; Yi, Chun-Xia; M Habegger, Kirk; Meyer, Carola W.; Gaylinn, Bruce D.; Finan, Brian; Heppner, Kristy; Trivedi, Chitrang; Bielohuby, Maximilian; Abplanalp, William (2013-06-07). "The orphan receptor Gpr83 regulates systemic energy metabolism via ghrelin-dependent and ghrelin-independent mechanisms". Nature Communications. 4 (1): 1968. Bibcode:2013NatCo...4.1968M. doi:10.1038/ncomms2968. ISSN 2041-1723. PMC 3709495. PMID 23744028.