FAM89A |
---|---
Identifiers |
Aliases | FAM89A, C1orf153, family with sequence similarity 89 member A
External IDs | MGI: 1916877; HomoloGene: 18887; GeneCards: FAM89A

Protein FAM89A (family with sequence similarity 89, member A) is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). The highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is differentially expressed in response to interleukin exposure, and it is implicated in immune response pathways and in various pathologies such as atherosclerosis and glioma. [5] [6]
FAM89A is a protein-encoding gene in humans, located on the minus strand of chromosome 1 at map position 1q42.2. It is also known as chromosome 1 open reading frame 153 (C1orf153). [7] [8] [9] The primary mRNA transcript of the FAM89A gene is 1,503 base pairs in length. [10] There are no other transcript variants of FAM89A. The gene is composed of two exons flanking one large intronic region. [11] FAM89A neighbors the genes TRIM67 (Tripartite Motif Containing 67), located downstream of FAM89A on the plus strand of chromosome 1, and ARV1 (ARV1 Homolog, Fatty Acid Homeostasis Modulator), located upstream of FAM89A on the plus strand of chromosome 1. [11] [12]
The FAM89A protein is 184 amino acids in length, with a predicted molecular mass of 18.6 kDa and a predicted isoelectric point of 5.64. [13] Two short sequences, GARAA and ASGG, each occur twice within the protein. The composition of FAM89A is notable for its abundance of four amino acids: leucine (14.1%), glycine (12.0%), alanine (11.4%), and serine (11.4%). Within positions 81-115, FAM89A shows five periodic repeats of leucine residues at every seventh amino acid position, which is characteristic of its predicted leucine zipper structural motif. [14] [15]
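The composition figures and the heptad leucine spacing described above can be checked with a few lines of code. The following is a minimal sketch, not the method used by the cited tools: the function names `composition` and `heptad_leucine_runs` are hypothetical, and the sequence `toy` is an invented placeholder, not the FAM89A sequence (which is not reproduced in this article).

```python
from collections import Counter


def composition(seq):
    """Return the percent composition of each residue, rounded to 0.1%."""
    counts = Counter(seq)
    return {aa: round(100 * n / len(seq), 1) for aa, n in counts.items()}


def heptad_leucine_runs(seq, residue="L", period=7, min_repeats=4):
    """Find runs of `residue` spaced exactly `period` positions apart.

    Returns a list of (1-based start position, number of repeats).
    """
    runs = []
    for start, aa in enumerate(seq):
        if aa != residue:
            continue
        # Skip positions that merely continue a run counted earlier.
        if start >= period and seq[start - period] == residue:
            continue
        repeats = 1
        pos = start + period
        while pos < len(seq) and seq[pos] == residue:
            repeats += 1
            pos += period
        if repeats >= min_repeats:
            runs.append((start + 1, repeats))
    return runs


# Invented toy sequence with five leucines at heptad spacing (NOT FAM89A).
toy = "MAG" + "LAAAAAA" * 5 + "GS"

print(composition(toy)["L"])     # 12.5
print(heptad_leucine_runs(toy))  # [(4, 5)]
```

Applied to the real protein sequence, a run of five repeats starting at position 81 would correspond to the leucine zipper pattern described above; the `min_repeats` threshold of four is an arbitrary choice for illustration.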
FAM89A contains a conserved leucine-rich adapter protein (LURAP) domain, PF14854, located at amino acid positions 84-122. [16] [17] Proteins of the LURAP superfamily are activators of the canonical NF-κB pathway, which is involved in promoting antigen presentation in dendritic cells and the production of pro-inflammatory cytokines. [18]
FAM89A is predicted to be 40% alpha helix, 11% extended strand, and 49% random coil. [19] The conserved LURAP domain is predicted to form an alpha helix. [20] [21] [22] [23]
The tertiary structure of FAM89A has not yet been determined by X-ray crystallography. I-TASSER software predicts dimerization of alpha-helical monomers, consistent with the leucine zipper motif. [21] [22] [23]
The FAM89A promoter region is 1,104 base pairs in length. [25] It contains binding sites for various transcription factors, including TFIIB (RNA polymerase II transcription factor IIB), PLAG1 (pleomorphic adenoma gene 1), MZF1 (myeloid zinc finger 1 factors), and SP1 (GC-Box factors SP1/GC). [11] [25]
FAM89A's highest expression is observed in the placenta and adipose tissue. [26] [27] RNA-sequencing data also reveal moderate FAM89A expression in the adrenal gland, lung, skin, spleen, and breast. [8] [12] Microarray hybridization supports high FAM89A expression in the placenta and moderate expression in the lung, spinal cord, skin, adrenal gland, and retina. [28]
The FAM89A protein is suggested to be localized in the nucleoplasm, Golgi apparatus, and/or vesicles. [29] [30] Reinhardt’s method for cytoplasmic/nuclear discrimination, as reported in PSORT II search results, predicts nuclear localization with a reliability score of 89. The predicted localization of FAM89A is highest for the nucleus (52.2%), followed by the mitochondria (34.8%), the cytoskeleton (8.7%), and the cytoplasm (4.3%). [14] The PredictProtein tool also supports nuclear localization. [31]
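The four localization percentages above sum to 100% and are consistent with vote shares among 23 nearest neighbours, as in a k-nearest-neighbour prediction. The sketch below illustrates that arithmetic only. The vote counts in `assumed_votes` are inferred from the reported percentages and are an assumption, not values stated in the cited source, and the function name `vote_percentages` is hypothetical.

```python
def vote_percentages(votes):
    """Convert per-compartment vote counts into percentages (rounded to 0.1%)."""
    total = sum(votes.values())
    return {loc: round(100 * n / total, 1) for loc, n in votes.items()}


# Assumed counts (12 + 8 + 2 + 1 = 23), chosen to match the reported figures.
assumed_votes = {
    "nucleus": 12,
    "mitochondria": 8,
    "cytoskeleton": 2,
    "cytoplasm": 1,
}

print(vote_percentages(assumed_votes))
# {'nucleus': 52.2, 'mitochondria': 34.8, 'cytoskeleton': 8.7, 'cytoplasm': 4.3}
```

Read this way, a little over half of the most similar reference proteins are nuclear, which is a modest majority rather than a strong consensus.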
FAM89A has three predicted phosphorylation sites located at amino acid positions 30, 32, and 168 that are conserved in distant orthologs. [32] The predicted phosphorylation site at position 32 is experimentally verified at position 28 in its paralog, FAM89B. [33] There is a possible competitive binding site for phosphorylation and O-linked β-N-acetylglucosamine (O-GlcNAc) at position 158, [34] supporting localization of FAM89A in the nucleoplasm. [29] [30]
The NetGlycate 1.0 server predicts two glycation sites, at positions 57 and 95. [35] The residues are conserved in distant FAM89A orthologs. Glycation of these lysines is considered a potentially important factor in atherosclerosis because it produces advanced glycation end products (AGEs), which are engulfed by macrophages and taken into the arterial wall. [36]
SUMOplot predicts a SUMO (small ubiquitin-like modifier) modification site at position 83. The residue is conserved in distant FAM89A orthologs.
An important human paralog of FAM89A is FAM89B, located on human chromosome 11 at map position 11q13.1. [37] FAM89B is also known as Leucine Repeat Adaptor Protein 25 (LRAP25) and Mammary Tumor Virus Receptor Homolog 1 (MTVR1). [37] Orthologs of FAM89A, but not FAM89B, are present in bivalves, crinoids, hemichordates, starfish, and horseshoe crabs. [38] Orthologs of FAM89B, but not FAM89A, are present in brachiopods and priapulids. The paralogs likely split around 736 million years ago. [39]
FAM89A is largely conserved in Euteleostomi (bony vertebrates). Its orthologs can be found in mammals, amphibians, reptiles, birds, fish, and various insects. [40] Distant FAM89A orthologs are present in octopus, scallop, ants, and bees. [41] [42] [43] [44]
Using the molecular clock technique, the rate at which FAM89A accumulates amino acid changes, compared with the rates for fibrinogen and cytochrome c, indicates that FAM89A is evolving rapidly.
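A molecular clock comparison of this kind typically converts the sequence identity between two orthologs into a corrected distance and divides it by the divergence time. The sketch below uses the standard Poisson correction, d = -ln(1 - p), where p is the fraction of differing residues. This is an assumption: the article does not state which correction was applied. The function names and the example inputs (50% identity, 100 million years) are hypothetical and are not FAM89A data.

```python
import math


def poisson_corrected_distance(percent_identity):
    """Expected amino acid changes per site, given percent identity."""
    p = 1.0 - percent_identity / 100.0  # observed fraction of differing sites
    if not 0.0 <= p < 1.0:
        raise ValueError("percent identity must be in the range (0, 100]")
    return -math.log(1.0 - p)


def substitution_rate(percent_identity, divergence_mya):
    """Changes per site per million years along one lineage.

    The factor of 2 accounts for both lineages accumulating changes
    since their common ancestor.
    """
    return poisson_corrected_distance(percent_identity) / (2.0 * divergence_mya)


# Hypothetical illustration only: orthologs that are 50% identical and
# diverged 100 million years ago.
print(poisson_corrected_distance(50.0))
print(substitution_rate(50.0, 100.0))
```

Plotting the corrected distance against divergence time for FAM89A, fibrinogen, and cytochrome c would give three lines whose slopes can be compared; a steeper slope indicates faster evolution.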
FAM89A has been experimentally determined to interact with UBXN2B (UBX Domain Protein 2B), an adaptor protein involved in the biogenesis of the Golgi apparatus and endoplasmic reticulum (ER) and in the assembly and maintenance of the ER during the cell cycle. [45] [46]
FAM89A is suggested to be involved in modulating the effects of smoking on the risk of atherosclerotic plaque burden. [5] In a 2014 study, a cohort of 264 Caribbean Hispanics with varying smoking frequencies was evaluated for carotid plaque burden. Eleven single nucleotide polymorphisms (SNPs) were identified that had a notable interaction with the effects of smoking on carotid plaque burden, including SNP rs6700792, located within the FAM89A gene. [5]
FAM89A is also suggested to be useful in discriminating between viral and bacterial infection in febrile patients. [6] A 2016 study conducted at the Division of Infectious Disease at Imperial College London evaluated blood-based transcriptomic biomarkers and found that febrile patients with bacterial infection displayed increased expression of FAM89A. [47] [48]
A 2019 study examined genes with methylation sites that are related to the development of gliomas. The researchers found that abnormal expression of FAM89A correlated with findings from glioma gene expression profiling studies. [49]
Microarray hybridization data revealed a slight decrease in FAM89A expression in response to airway epithelial cell exposure to interleukin 13 and CD8+ T lymphocyte exposure to interleukin 10. [50] [51]