FAM210B | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | FAM210B , 5A3, C20orf108, dJ1167H4.1, family with sequence similarity 210 member B | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1914267; HomoloGene: 32646; GeneCards: FAM210B; OMA:FAM210B - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
FAM210B is a gene that which in Homo sapiens encodes the protein FAM210B. It has been conserved throughout evolutionary history, and is highly expressed in multiple tissues within the human body. FAM210B's primary location is the endoplasmic reticulum.
FAM210B is found on the plus strand of chromosome 20, precisely at 20q13.2. This gene is 9,749 bases in length, corresponding to a 192 amino acid protein. [5]
Alternative names include C20orf108, Hypothetical protein LOC116151, DJ116H4.1, and 5A3. [5]
FAM210B has 2 known splice variants, formally known as FAM210B-001 an FAM210B-002. FAM210B-001 is 3 exons in length, whereas FAM210B-002 is 2 exons. [6]
The longest precursor protein is 192 amino acids in length. The molecular weight is 20.4 kdal, and isoelectric point is 10.8. [7]
There is 1 positive charge run of 4 residues at position 79, EEKK. A high scoring transmembrane sequence is present at positions 99–123, VGVSLHIGISLISLGIFYM VVSSGV. An endoplasmic reticulum retention signal (PAAK) is present at positions 187–191. [7]
The majority of the structure is composed of alpha helices. As predicted my Phyre 2, 61% of the protein forms alpha helices, and 28% forms transmembrane segments. A detailed map of the transmembrane topology is shown below. [8]
I-Tasser predicted the folding of FAM210B to be as shown below.
The primary location of the FAM210B protein is the endoplasmic reticulum. The amino acid sequence contains the endoplasmic retention signal PAAK near the C-terminus. There are 2 transmembrane domains, which have been determined and reaffirmed by multiple methods. They are highlighted in the post-translational modification image below. [9]
The following conceptual translation presents expected post-translational modifications. The modifications were predicted by MotifScan, and then searched in greater detail in Expasy [10] Modifications found include propeptide cleavage sites, C-mannosylation sites, GalNAc O-glycosylation sites, lysine glycation sites, phosphorylation sites, a Coronavirus 3C-like proteinase cleavage site, and a GPI modification site.
An important paralog for FAM210B is FAM210A. Orthologs and distant homologs have been found throughout mammalia, reptiles, birds, invertebrates, fish, amphibian, trichoplax, and fungi. [11]
Gene Name | Common name | Accession number | Sequence length | E-value | Identity | Query cover |
---|---|---|---|---|---|---|
RCE1 protein | RCE1 | AAH52622.1 | 329 | 0.063 | 42% | 28% |
Serine/Threonine-protein kinase | SBK2 | NP_001094871.2 | 348 | 1.3 | 54% | 15% |
FAM210A | FAM210A | NP_689565.2 | 272 | 1.4 | 31% | 53% |
Genus and species | Common name | Date of divergence (MYA) | Accession number | Sequence length | E-value | Identity | Query cover | Notes |
---|---|---|---|---|---|---|---|---|
Homo sapiens | Human | - | NP_543011.2 | 192 | 2.00E-138 | 100% | 100% | Mammal |
Tarsius syrichta | Philippine tarsier | 67.6 | XP_008058424.1 | 150 | 3.00E-88 | 88% | 76% | Mammal |
Fukomys damarensis | Damaraland mole-rat | 90.9 | XP_010627139.1 | 184 | 1.00E-77 | 82% | 76% | Mammal |
Orninus orca | Killer whale | 97.5 | XP_004282340.1 | 192 | 2.00E-98 | 81% | 100% | Mammal |
Camelus ferus | Bactrian camel | 97.5 | EPY80008.1 | 251 | 7.00E-72 | 87% | 69% | Mammal |
Anolis carolinensis | Carolina anole | 320.5 | XP_003223790.1 | 215 | 4.00E-60 | 65% | 78% | Reptile |
Thamnophis sirtalis | Common garter snake | 320.5 | XP_013919769.1 | 286 | 3.00E-59 | 67% | 71% | Reptile |
Ophiophagus hannah | King cobra | 320.5 | ETE65490.1 | 152 | 8.00E-59 | 68% | 67% | Reptile |
Cariama cristata | Reg-legged seriema | 320.5 | KRP59646.1 | 115 | 7.00E-58 | 78% | 59% | Bird |
Egretta garzetta | Little egret | 320.5 | KFP21403.1 | 115 | 1.00E-57 | 77% | 59% | Bird |
Anas platyrhynchos | Mallard | 320.5 | EOB08268.1 | 97 | 2.00E-14 | 44% | 50% | Bird |
Gekko japonicus | Gekko | 320.5 | XP_015277771.1 | 304 | 1.00E-11 | 34% | 57% | Reptile |
Xenopus tropicalis | Western clawed frog | 355.7 | NP_001072818.1 | 196 | 5.00E-57 | 55% | 94% | Amphibian |
Xenopus laevis | African clawed-frog | 355.7 | NP_001088884.1 | 275 | 8.00E-10 | 29% | 53% | Amphibian |
Callorhinchus milii | Australian ghostshark | 482.9 | XP_007910267.1 | 217 | 1.00E-48 | 73% | 55% | Fish |
Drosophila sechellia | Fruit fly | 847 | XP_002037687.1 | 135 | 5.00E-31 | 48% | 71% | Invertebrate |
Cerapachys biroi | Clonal raider ant | 847 | XP_011331931.1 | 344 | 3.00E-15 | 36% | 56% | Invertebrate |
Ogataea parapolymorpha | Microorganism | 1302.5 | XP_013935759.1 | 208 | 7.00E-05 | 29% | 60% | Fungus |
Populus trichocarpa | Black cottonwood | 1513.9 | XP_006379359.1 | 242 | 5.00E-04 | 33% | 49% | Plant |
Trichoplax adhaerens | Trichoplax | Unknown | XP_002116161.1 | 149 | 5.00E-29 | 40% | 63% | Trichoplax |
Mitosporidium daphinae | Parasite | Unknown | XP_013239484.1 | 162 | 1.00E-07 | 35% | 48% | Fungus |
The size of the gene family is two, and includes FAM210A and FAM210B. Gene duplication seems to have occurred when plants diverged from protists. The most distant ortholog found was Populus trichocarpa, also known as the black cottonwood tree. [12] The unrooted phylogenetic tree below demonstrates this divergence. [13] All of the organisms in the tree can also be found in the ortholog table above.
FAM210B is ubiquitously expressed in Homo sapiens . [14] It is expressed in each stage of development, and in many compromised health states including breast, cervical, liver, lung, ovarian, and pancreatic tumors. [15]
These images were taken from Allen Brain Atlas and demonstrate expression of FAM210B in the mouse brain. The images compare the expression of FAM210B (left) to dopamine beta hydroxylase (right) in a sagittal cut of the cerebellum. [16]
The following image contains the gene expression of FAM210B in patients with dengue fever or those who are convalescent. As seen by the blue dots, there is decreased rank of FAM210B in patients experiencing hemorrhagic fever and fever as compared to the healthy controls. [17]
The following image contains the gene expression of FAM210B in patients with severe bacterial pneumonia, severe influenza, and those who received the influenza vaccine. As seen in the image, FAM210B is lower in rank in the patients with severe bacterial pneumonia and are even lower in those with severe influenza, as compared to those who received the influenza vaccine. [18]
The following proteins were determined to interact with FAM210B. Their interaction was determined by affinity capture and mass spectrometric methods. [19] By analyzing the function of these proteins, more insight can be given into the function of FAM210B.
Protein | Function |
---|---|
HTR2C | Encodes transmembrane G-protein-coupled receptor. Responds to signaling through serotonin. |
LPAR1 | Protein encoded by this gene is a LPA receptor. Mediate functions such as proliferation, platelet aggregation, smooth muscle contraction, tumor cell invasion. |
REEP5 | May promote functional cell surface expression of olfactory receptors. |
RTN1 | Reticulon encoding gene, associated with the ER and involved in neuroendocrine secretion. |
RTN4 | Reticulon encoding gene, associated with the ER and involved in neuroendocrine secretion. |
TM4SF20 | Interact with integrins to function cell adhesion, proliferation, and motility. |
TSPAN5 | Mediate signal transduction events that regulate cell development, activation, growth, and motility. |
TSPAN17 | Encodes member of transmembrane 4 superfamily, function is undetermined. |
ATP6V0D1 | Encodes V-ATPase that mediates acidification of eukaryotic intracellular organelles. |
TSR3, or TSR3 Ribosome Maturation Factor, is a hypothetical human protein found on chromosome 16. Its protein is 312 amino acids long and its cDNA has 1214 base pairs. It was previously designated C16orf42.
Transmembrane protein 151B is a protein that in humans is encoded by the TMEM151B gene.
Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.
TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.
Leucine rich repeat containing 24 is a protein that, in humans, is encoded by the LRRC24 gene. The protein is represented by the official symbol LRRC24, and is alternatively known as LRRC14OS. The function of LRRC24 is currently unknown. It is a member of the leucine-rich repeat (LRR) superfamily of proteins.
OCC-1 is a protein, which in humans is encoded by the gene C12orf75. The gene is approximately 40,882 bp long and encodes 63 amino acids. OCC-1 is ubiquitously expressed throughout the human body. OCC-1 has shown to be overexpressed in various colon carcinomas. Novel splice variant of this gene was also detected in various human cancer types; in addition to encoding a novel smaller protein, OCC-1 gene produces a non-protein coding RNA splice variant lncRNA.
Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.
Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.
FAM227A is a protein that in humans is encoded by FAM227A gene. Current studies have determined the location of this gene to be in the nuclear region of the cell. FAM227A is most highly expressed in the tissues of the fallopian tube, testis, and pituitary gland. FAM227A is present in species of mammals, birds and reptiles, and gene alignment sequences have shown that FAM227A is a rapidly evolving gene.
Cilia and flagella associated protein 157 (CFAP157) also known as chromosome 9 open reading frame 117 (c9orf117) is a protein that in humans is encoded by the CFAP157 gene.
Transmembrane protein 44 is a protein that in humans is encoded by the TMEM44 gene.
TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.
Transmembrane protein 179 is a protein that in humans is encoded by the TMEM179 gene. The function of transmembrane protein 179 is not yet well understood, but it is believed to have a function in the nervous system.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
TMEM128, also known as Transmembrane Protein 128, is a protein that in humans is encoded by the TMEM128 gene. TMEM128 has three variants, varying in 5' UTR's and start codon location. TMEM128 contains four transmembrane domains and is localized in the Endoplasmic Reticulum membrane. TMEM128 contains a variety of regulation at the gene, transcript, and protein level. While the function of TMEM128 is poorly understood, it interacts with several proteins associated with the cell cycle, signal transduction, and memory.
Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
Transmembrane protein 169 (TMEM169) in humans is encoded by TMEM169 gene. The aliases of TMEM169 include FLJ34263, DKFZp781L2456, and LOC92691. TMEM169 has the highest expression in the brain, particularly the fetal brain. TMEM169 has homologs mammals, reptiles, amphibians, birds, fish, chordates and invertebrates. The most distantly related homolog of TMEM169 is Anopheles albimanus.
Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of five transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.
Transmembrane protein 271, or TMEM271 is a protein in Homo sapiens encoded by the TMEM271 gene, located at 4p16.3 on the minus strand. The protein is located on the plasma membrane of cells and highly expressed in several regions of the brain.