This article may be in need of reorganization to comply with Wikipedia's layout guidelines .(March 2020) |
FAM178B | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | FAM178B , family with sequence similarity 178 member B | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 3026913; HomoloGene: 87288; GeneCards: FAM178B; OMA:FAM178B - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
FAM178B is a protein coding gene that is located on the plus strand of chromosome 2. [5] The locus for the gene is 2q11.2. It is also known by the aliases Family with Sequence Similarity 178, Member B, and HSPC234. [6] In total there are 24 exons in the gene. FAM178B spans 110,720 base pairs, and contains 827 amino acids.
There are two isoforms of the gene transcript that exist by alternative splicing, and one gene precursor. [5]
Isoform: | Identifier: | Length in Amino Acids: | Mass (daltons): |
---|---|---|---|
1 | Q8IXR5-1 | 827 | 93,514 |
2 | Q8IXR5-2 | 119 | 13,809 |
3 | Q8IXR5-3 | 679 | 76,515 |
SLF2 (FAM178A) is an important paralog of FAM178B. SLF2 is predicted to play a role in the DNA damage response (DDR) pathway by regulating post-replication repair of UV-damaged DNA, and genomic stability maintenance. [7]
The molecular weight of the protein is 76.5 kilodaltons, [8] and the isoelectric point is 5.47.The gene has 6 transcript splice variants. [9] The protein has been phenotypically associated with bipolar disease due to its locus, [10] as well as body mass index (BMI), and cell adhesion. [11] A proposed structure for the protein can be found in the images for proposed structures. [12] The secondary structure for the FAM178B protein is predicted to be primarily alpha helices. [13] The tertiary structure of the protein may assume a coiled coil structure. [13]
FAM178B is most highly expressed in the skeletal muscle and brain tissues [14] .The structure in which it is highly concentrated is in the corpus callosum of the brain. [14] Additionally, it is of high levels in the trigeminal nerve and spinal cord. Further, there is also high concentrations of the gene found near the heart, testes and olfactory regions.
According to the Allen Brain Atlas, [15] the olfactory regions, and the hippocampus of the mouse brain showed the greatest expressions of the gene when tested experimentally.
The proposed promoter region of FAM178B protein is below. [16] A table of relevant transcription factor binding sites that correspond to the sequence and colors highlighted in the promoter region is also included.
The promoter region of FAM178B is highly conserved across its most related orthologs, and is almost identical across the human, gorilla, chimp, bonobo and gibbon organisms. [17]
Below is a table that shows the protein modifications that could be potentially made to FAM178B, and the potential effects of these modifications on the protein itself:. [18]
An important paralog of the gene is SLF2, which plays a role in the DNA damage response (DDR) pathway by regulating post replication repair of UV-damaged DNA. It also helps maintain genomic stability. [19] There are 74 known orthologs of the protein, and it can be traced back as far back as crustaceans. [5] FAM178B is heavily found in both vertebrates and invertebrates. The ortholog space for the protein FAM178B is quite large as it can be traced back 794 millions of years ago (mya) to mollusks with the apple snail, and Pacific Oyster. There are 134 known orthologs of FAM178B. However, the protein is not found in arthropods, or insects, which is interesting because those organisms also existed in that time period meaning that it was conserved across some taxa, but not others. The protein is most readily found in primates, and other non-primate mammals. The protein is also conserved across reptiles, some bony fish, and cartilaginous fish, and birds.
Below is a table illustrating some of the orthologs for FAM178B using BLAST [20] and BLAT. [21]
Scientific Name: | Common Name: | Accession Number from NCBI: | Sequence Length: (Amino Acids) | Date of Divergence from Human Lineage: (millions of years ago) | Percent Identity: | Percent Similarity: |
---|---|---|---|---|---|---|
Homo Sapiens | Human | Q8IXR5.2 | 827 | -- | -- | -- |
Macaca Mulatta | Macque | XP_014968433 | 709 | 28.1 | 80 | 83 |
Chlorocebus Sabaeus | Green Monkey | XP_008006245 | 724 | 28.1 | 91 | 94 |
Cebus Capucinus | Capuchin | XP_017379417 | 808 | 42.6 | 88 | 91 |
Otolemur Garnettii | Galago | XP_012666502 | 675 | 73.0 | 74 | 81 |
Marmota Marmota | Marmot | XP_015355109 | 692 | 88.0 | 74 | 80 |
Microtus Ochrogaste r | Vole | XP_026641579 | 561 | 88.0 | 68 | 75 |
Felis Catus | Cat | XP_019682636 | 720 | 94.0 | 71 | 77 |
Equus Caballus | Horse | XP_023474395 | 659 | 94.0 | 78 | 84 |
Panthera Tigris | Tiger | XP_007097881 | 674 | 94.0 | 75 | 80 |
Miniopterus Natalensis | Bat | XP_016077151 | 649 | 94.0 | 73 | 81 |
Bison Bison | Bison | XP_010828157 | 664 | 94.0 | 65 | 73 |
Tursiops Truncatus | Dolphin | JH473473 | 827 | 94.0 | 61 | 68 |
Alligator Mississippiensis | Alligator | XP_019343229 | 802 | 320.0 | 34 | 47 |
Pogona Vitticeps | Bearded Dragon | XP_020635880 | 1003 | 320.0 | 41 | 56 |
Apteryx Rowi | Kiwi | XP_025941058 | 512 | 320.0 | 42 | 58 |
Gallus Gallus | Chicken | XP_024998603 | 547 | 320.0 | 37 | 50 |
Rhincodon Typus | Whale Shark | XP_020388490 | 1155 | 465.0 | 34 | 53 |
Callorhinchus Milii | Australian Ghostshark | XP_007910600 | 860 | 465.0 | 33 | 50 |
Crassostrea Gigas | Pacific Oyser | EKC42969 | 1222 | 794.0 | 24 | 38 |
Pomacea Canaliculta | Applesnail | PVD30455 | 1229 | 794.0 | 24 | 45 |
Timetree [22] for FAM178B
There are 2 proteins that have high predicted values of interaction with FAM178B: LRSAM 1 and ZNF598. [23] LRSAM1 [24] is also known as leucine rich repeat and sterile alpha motif protein 1. The value for the protein is .29 and the evidence is physical from hybrid pooling approaches. LRSAM1 was previously known as Tsg-associated ligase and is located on the LRSAM1 gene. Mutations have been associated with periphery neuropathy, and sensory disorders. It is highly expressed in the spinal cord, as is FAM178B. There is currently no known structure for the protein. ZNF598 is a zinc finger protein and the value is .13. It plays a key role in ribosome quality control. The predicted structures are below for both proteins.
MENTHA [23] interacting proteins for FAM178B.
STRING [25] interacting proteins for FAM178B.
KIAA0895 is a protein that in Homo sapiens is encoded by the KIAA0895 gene. The gene encodes a protein commonly known as the KIAA0895 protein. Its aliases include hypothetical protein LOC23366, OTTHUMP00000206979, OTTHUMP00000206980, 9530077C05Rik, and 1110003N12Rik. It is located at 7p14.2.
METTL26, previously designated C16orf13, is a protein-coding gene for Methyltransferase Like 26, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.
Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.
Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.
CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.
C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Zinc finger CCHC-type containing 18 (ZCCHC18) is a protein that in humans is encoded by ZCCHC18 gene. It is also known as Smad-interacting zinc finger protein 2 (SIZN2), para-neoplastic Ma antigen family member 7b (PNMA7B), and LOC644353. Other names such as zinc finger, CCHC domain containing 12 pseudogene 1, P0CG32, ZCC18_HUMAN had been used to describe this protein.
C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.
Golgin subfamily A member 8H, also known as GOLGA8H, is a protein that in Homo sapiens is encoded by the GOLGA8H gene. Function of the GOLGA8H involves a process that is carried out at the cellular level which results in the assembly, arrangement of constituent parts, or disassembly of the Golgi apparatus.
Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.
Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of five transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.
Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).
C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1. This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer.
Zinc Finger Protein 62, also known as "ZNF62," "ZNF755," or "ZET," is a protein that in humans is encoded by the ZFP62 gene. ZFP62 is part of the C2H2 Zinc Finger family of genes.