FAM83A | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | FAM83A , BJ-TSA-9, family with sequence similarity 83 member A | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 2447773 HomoloGene: 13158 GeneCards: FAM83A | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Protein FAM83A (family member with sequence similarity 83) also known as tumor antigen BJ-TSA-9 is a protein that in humans is encoded by the FAM83A gene. [5]
This protein is predicted to contain one domain of unknown function 1669 (DUF1669), which places this protein into the PLDc_SuperFamily. [5] [6] [7] It has been linked to be a potential biomarker in lung, prostate, and bladder cancers. [8] [9] [10]
FAM83A is located on chromosome 8, locus q24.13, [5] and spans 27,566 base pairs. [5] There is a promoter approximately 4,000 base pairs upstream as predicted by the tool ElDorado by Genomatix. [11] [ failed verification ] Deletions in this part of the chromosome, including the FAM83A gene, often result in Langer–Giedion syndrome. [12]
The FAM83A mRNA has 10 different splice forms, [13] with transcript variant 1 being the subject of this article. This mRNA consists of approximately 2,900 base pairs. It contains a domain of unknown function 1669 (DUF1669), and is a member of the PLDc_SF superfamily. [14] It has been placed into this superfamily based on highly similar sequences between FAM83A and other N-terminus phospholipase D-like domains; however, it's missing the functional histidine and therefore only predicted to have a similar structure. [7] Four exons comprise this mRNA. [5] [15] Predictions show several possible RNA stem loop formations in the untranslated regions of the mRNA. [16]
The FAM83A gene encodes for the FAM83A protein isoform A. [5] This protein is 434 amino acids in length, and weighs approximately 45 kilodaltons. [17] Although the structure is not confirmed, it is predicted using GOR4 to have four alpha helices and five beta sheets, [18] with an isoelectric point of 8.964 [16] There appears to be no signal peptide [19] and is not a transmembrane protein. [20] Using CBLAST, it was observed to share a common sequence with the protein 3H0G, [21] however, not in the predicted functional region of FAM83A. This may still provide valuable information, though, on a possible partial structure for this protein.
FAM83A is expressed in many different areas of the human body, and at very different levels. [22] EST data suggests that this gene is expressed primarily in adults. It shows significant levels in the mouth and larynx, with other raised expression in the prostate, lung, and esophagus. [22] These normal levels are even more elevated in head and neck tumors, lung tumors, prostate cancers, and bladder carcinomas. [22] Microarray expression data also show different environmental conditions, especially irritants. Such irritants can be cigarette smoke, where FAM83A has been shown to increase expression after 24 hours exposure, [23] and house dust mite extract on bronchial epithelial cells, where FAM83A expression also increases after exposure. [24] FAM83A shows decreased expression when activating transcription factor 2 (ATF2) is knocked out. [25] Since ATF2 was not predicted to bind in the promoter region, it suggests that there is an indirect relationship between FAM83A and ATF2. With increased expression to irritants and allergens along with an indirect relationship with ATF2, it suggests FAM83A may be in a signaling pathway.
There are multiple studies that link FAM83A overexpression to lung, prostate, and bladder cancers. Researchers believe that this gene might make a good candidate for early detection of these cancers, especially lung cancer. [8] [9] It is unknown why or how FAM83A is upregulated. Studies have shown that arsenic can acetylate the promoter causing upregulation, suggesting this may be a similar mechanism to how this gene becomes unregulated in cancer. [10]
FAM83A has been shown to interact with palate, lung, and nasal epithelium carcinoma associated protein (PLUNC) through the STRING tool. [26] This information gathered came from textmining information dealing with cancer. [9]
FAM83A appears to be a relatively new gene, with BLAST and BLAT only showing orthology back through bony fishes. [15] [27] It is highly conserved through its relatives [27] The DUF1669 domain is the most highly conserved region of the protein, with areas outside of it being more variable. Below is a table of orthologs, noting that this is not a comprehensive list:
Genus Species | Organism Common Name | Divergence from Humans (MYA) [28] | NCBI Protein Accession Number | Sequence Identity | Protein Length |
---|---|---|---|---|---|
Homo sapiens | Humans | -- | NM_032899.4 [17] | 100% | 434 |
Pan troglodytes | Chimpanzee | 6.2 | XP_001147312.2 [29] | 99% | 434 |
Bos taurus | Cattle | 97.4 | XP_599662.2 [30] | 89% | 435 |
Canis lupus | Dog | 97.4 | XP_851601.2 [31] | 89% | 435 |
Mus musculus | Mouse | 91 | NP_776287.2 [32] | 87% | 436 |
Anolis carolinensis | Lizard | 324.5 | XP_003228186.1 [33] | 70% | 438 |
Gallus gallus | Chicken | 324.5 | XP_001233861.2 [34] | 69% | 439 |
Xenopus laevis | Frog | 361.2 | NP_001079963.1 [35] | 64% | 443 |
Danio rerio | Zebrafish | 454.6 | XP_001340276.3 [36] | 59% | 426 |
Uncharacterized protein KIAA1109 is a protein that in humans is encoded by the KIAA1109 gene.
Transmembrane protein 98 is a single-pass membrane protein that in humans is encoded by the TMEM98 gene. The function of this protein is currently unknown. TMEM98 is also known as UNQ536/PRO1079.
Uncharacterized protein C1orf21, also known as Proliferation-Inducing Protein 13, is a protein that in humans is encoded by the C1orf21 gene. C1orf21 is an intracellular protein that flows between the nucleus and the cytoplasm in the cell. It has been linked with cell growth and reproduction and there has been strong links with various types of cancers. There are no paralogs for this gene, however, many conserved orthologs have been found in all invertebrates. C1orf21 has low to moderate level of expression in most tissues in humans, however, it has the most expression in the skin, lung and prostate.
Iroquois-class homeodomain protein IRX-1, also known as Iroquois homeobox protein 1, is a protein that in humans is encoded by the IRX1 gene. All members of the Iroquois (IRO) family of proteins share two highly conserved features, encoding both a homeodomain and a characteristic IRO sequence motif. Members of this family are known to play numerous roles in early embryo patterning. IRX1 has also been shown to act as a tumor suppressor gene in several forms of cancer.
Transmembrane protein 131 (TMEM131) is a protein that is encoded by the TMEM131 gene in humans. The TMEM131 protein contains three domains of unknown function 3651 (DUF3651) and two transmembrane domains. This protein has been implicated as having a role in T cell function and development. TMEM131 also resides in a locus (2q11.1) that is associated with Nievergelt's Syndrome when deleted.
Transmembrane Protein 205 (TMEM205) is a protein encoded on chromosome 19 by the TMEM205 gene.
Solute carrier family 46 member 3 (SLC46A3) is a protein that in humans is encoded by the SLC46A3 gene. Also referred to as FKSG16, the protein belongs to the major facilitator superfamily (MFS) and SLC46A family. Most commonly found in the plasma membrane and endoplasmic reticulum (ER), SLC46A3 is a multi-pass membrane protein with 11 α-helical transmembrane domains. It is mainly involved in the transport of small molecules across the membrane through the substrate translocation pores featured in the MFS domain. The protein is associated with breast and prostate cancer, hepatocellular carcinoma (HCC), papilloma, glioma, obesity, and SARS-CoV. Based on the differential expression of SLC46A3 in antibody-drug conjugate (ADC)-resistant cells and certain cancer cells, current research is focused on the potential of SLC46A3 as a prognostic biomarker and therapeutic target for cancer. While protein abundance is relatively low in humans, high expression has been detected particularly in the liver, small intestine, and kidney.
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.
Chromosome 3 open reading frame 67 or C3orf67 is a protein that in humans is encoded by the gene C3orf67. The function of C3orf67 is not yet fully understood.
Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.
Chromosome X Open Reading Frame 38 (CXorf38) is a protein which, in humans, is encoded by the CXorf38 gene. CXorf38 appears in multiple studies regarding the escape of X chromosome inactivation.
Transmembrane protein 125 is a protein that, in humans, is encoded by the TMEM125 gene. It has 4 transmembrane domains and is expressed in the lungs, thyroid, pancreas, intestines, spinal cord, and brain. Though its function is currently poorly understood by the scientific community, research indicates it may be involved in colorectal and lung cancer networks. Additionally, it was identified as a cell adhesion molecule in oligodendrocytes, suggesting it may play a role in neuron myelination.
Transmembrane protein 155 is a protein that in humans is encoded by the TMEM155 gene. It is located on human chromosome 4, spanning 6,497 bases. It is also referred to as FLJ30834 and LOC132332. This protein is known to be expressed mainly in the brain, placenta, and lymph nodes and is conserved throughout most placental mammals. The function and structure of this protein is still not well understood, but its level of expression has been studied pertaining to various pathologies.
Transmembrane Protein 81 or TMEM81 is a protein that in humans is encoded by the TMEM81 gene. TMEM81 is a poorly-characterized transmembrane protein which contains an extracellular immunoglobulin domain.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
Transmembrane protein 101 (TMEM101) is a protein that in humans is encoded by the TMEM101 gene. The TMEM101 protein has been demonstrated to activate the NF-κB signaling pathway. High levels of expression of TMEM101 have been linked to breast cancer.
Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of 5 transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.
CCDC188 or coiled-coil domain containing protein is a protein that in humans is encoded by the CCDC188 gene.
Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.