FAM237A is a protein coding gene which encodes a protein of the same name. [1] Within Homo sapiens, FAM237A is believed to be primarily expressed within the brain, with moderate heart and lesser testes expression [1] ,. [2] FAM237A is hypothesized to act as a specific activator of receptor GPR83. [3]
FAM237A is alternatively known as HCG1657980 and LOC200726. [2] [4] Homo sapiens FAM237A's sequence resides on chromosome 2’s + strand, and extends from bases 207486904 to 207514174. [2] Homo sapiens FAM237A sequence contains 13 exons unspliced. [2]
Homo sapiens FAM237A is predicted to produce six unique transcripts, of which four are spliced. [2]
Name | Size (Base Pairs) | Exon Usage |
---|---|---|
aAug10 | 1797 | 2,3,8,10 |
bAug10 | 1376 | 4,6,13 |
cAug10 | 1121 | 5,12 |
dAug10 (Unspliced) | 798 | 7 |
eAug10 (Unspliced) | 873 | 9 |
fAug10 | 706 | 1,11 |
Homo sapiens FAM237A is associated with three unnamed protein isoforms. [2] FAM237A's most-researched isoform is 181 amino acids long, and is predicted to contain a transmembrane domain. [2] FAM237A's second protein isoform is predicted to be 417 amino acids long; it contains a transmembrane domain and an upstream open reading frame. [2] The last protein isoform of FAM237A is made up of 158 amino acids and contains a transmembrane domain; this isoform is predicted to localize within the membrane. [2] Several databases, including NCBI, only recognize FAM237A's 181 amino acid isoform. [1] Given the relative abundance of literature surrounding it, the remainder of this page's findings only discuss FAM237A's 181 amino acid isoform.
The theoretical molecular weight of this isoform is 20.56 kDA. [5] [6] [7] Its theoretical isoelectric point is 8.96. [5] [6] [7] Homo sapiens FAM237A amino acid composition is predicted to be relatively standard. [8] It notably contains a repeat LFWD motif at amino acids 90 and 97. [8]
FAM237A's transmembrane domain is generally predicted to reside on amino acids 14-32 within the protein. [8] [9] However, structure prediction tool Phyre2 predicts that the protein's transmembrane domain resides on amino acids 91–106. [10]
Three promoters of Homo sapiens FAM237A are predicted: GXP_8991091, GXP_7539237, and GXP_8991092. [11] Of these, GXP_8991091 has the greatest predicted tissue expression levels. [11]
AceView predicts that Homo sapiens FAM237A is localized to membranes. [2] However, this is disputed, with protein localization prediction resource Hum-mPLoc predicting that Homo sapiens FAM237A is expressed within the nucleus and resource PSORT II predicting ER localization, with lesser chances of expression within the mitochondria and Golgi apparatus. [12] [13] [14] [15] [16] [17]
An abundance of predicted phosphorylation sites reside on Homo sapiens FAM237A's sequence. [18] [19] [20] [21] Homo sapiens FAM237A contains two predicted fatty acid addition sites at amino acids 18 and 26; these sites overlap with one of the FAM237A's predicted transmembrane sequences. [22] [23] Homo sapiens FAM237A is additionally predicted to contain two sites of ubiquitination at amino acids 179 and 181 on its sequence. [24] [25] These ubiquitination sites are predicted to perfectly overlap two acetylation sites. [26] [27]
Homo sapiens FAM237A has one predicted paralog: FAM237B. [4] FAM237B has 21.6% predicted identity with FAM237A [28]
FAM237A has orthologs in a broad range of vertebrate organisms, including other Mammals, Reptilia, Actinopterygii, and Aves. [29] The gene is not found in invertebrates. Based upon BLAST analysis, FAM237A is not found in invertebrates. [29] The only reptiles which FAM237A is found in are predicted to be of the suborder Cryptodira, based upon BLAST searches. [29]
Information regarding FAM237A's function is limited; however, FAM237A is predicted to be a specific activator of GPR83, which is implicated in energy metabolism, dietary patterns, and reward signaling. [3] [30] GPR83 is additionally suspected to be correlated to immune system function [30]
Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.
Transmembrane protein 33 is a protein that in humans, is encoded by the TMEM33 gene, also known as SHINC3. Another name for the TMEM33 protein is DB83.
TMEM106C is a gene that encodes the transmembrane protein 106C (TMEM106C) in Homo sapiens It has been found to be overexpressed in cancer cells and also is related to distal arthrogryposis, a condition of stiff joints and irregular muscle development. The TMEM106C gene contains a domain of unknown function, DUF1356, that spans most of the protein. Transmembrane protein 106C also goes by the aliases MGC5576 or MGC111210, LOC79022.
EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.
FAM221B is a protein that in humans is encoded by the FAM221B gene . FAM221B is also known by the alias C9orf128, is expressed at low level, and is defined by 17 GenBank accessions . It is predicted to function in transcription regulation as a transcription factor.
Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.
LOC100287387 is a protein that in humans is encoded by the gene LOC100287387. The function of the protein is not yet understood in the scientific community. The gene is located on the q arm of chromosome 2.
C2orf16 is a protein that in humans is encoded by the C2orf16 gene. Isoform 2 of this protein is 1,984 amino acids long. The gene contains 1 exon and is located at 2p23.3. Aliases for C2orf16 include Open Reading Frame 16 on Chromosome 2 and P-S-E-R-S-H-H-S Repeats Containing Sequence.
Transmembrane protein 151A, also known as TMEM151A, is a protein that is encoded by the TMEM151A gene.
TMEM128, also known as Transmembrane Protein 128, is a protein that in humans is encoded by the TMEM128 gene. TMEM128 has three variants, varying in 5' UTR's and start codon location. TMEM128 contains four transmembrane domains and is localized in the Endoplasmic Reticulum membrane. TMEM128 contains a variety of regulation at the gene, transcript, and protein level. While the function of TMEM128 is poorly understood, it interacts with several proteins associated with the cell cycle, signal transduction, and memory.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
SMIM15(small integral membrane protein 15) is a protein in humans that is encoded by the SMIM15 gene. It is a transmembrane protein that interacts with PBX4. Deletions where SMIM15 is located have produced mental defects and physical deformities. The gene has been found to have ubiquitous but variable expression in many tissues throughout the body.
Transmembrane protein 39B (TMEM39B) is a protein that in humans is encoded by the gene TMEM39B. TMEM39B is a multi-pass membrane protein with eight transmembrane domains. The protein localizes to the plasma membrane and vesicles. The precise function of TMEM39B is not yet well-understood by the scientific community, but differential expression is associated with survival of B cell lymphoma, and knockdown of TMEM39B is associated with decreased autophagy in cells infected with the Sindbis virus. Furthermore, the TMEM39B protein been found to interact with the SARS-CoV-2 ORF9C protein. TMEM39B is expressed at moderate levels in most tissues, with higher expression in the testis, placenta, white blood cells, adrenal gland, thymus, and fetal brain.
C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.
Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).
Chromosome 17 open reading frame 75 is a protein that in humans is encoded by the C17orf75 gene. C17orf75 is also known as SRI2 and is a human protein encoding gene located at 17q11.2 on the complementary strand. The protein this gene encodes is also known as NJMU-R1. The C17orf75 gene is ubiquitously expressed at medium-low levels throughout the body and at slightly higher levels in the brain and testes. This protein is thought to be part of a complex associated with golgin-mediated vesicle capture.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
{{cite journal}}
: Cite journal requires |journal=
(help){{cite journal}}
: Cite journal requires |journal=
(help)