FAM110A | |
---|---|
Aliases | FAM110A, C20orf55, F10, bA371L19.3, family with sequence similarity 110 member A |
External IDs | OMIM: 611393; MGI: 1921097; HomoloGene: 12862; GeneCards: FAM110A |
Protein FAM110A, also known as protein family with sequence similarity 110, A, C20orf55 [5] or BA371L19.3, [6] is encoded by the FAM110A gene. The gene is located on chromosome 20 [6] and is part of the FAM110 gene family, [7] which consists of FAM110A, FAM110B, and FAM110C.
In humans, FAM110A is located on the plus strand at 20p13. [6] The gene transcript spans base pairs 833,715 to 846,279, for a total transcript length of 12,564 base pairs. [5] The FAM110A mRNA transcript is predicted to contain two exons. [5] The upstream promoter region of FAM110A is predicted to be 1,111 base pairs long. [8] Six different mRNA transcripts of FAM110A are predicted, all differing in their 5' untranslated regions. [5]
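The transcript length quoted above follows directly from the two coordinates. A minimal sketch of the arithmetic, assuming the length is taken as the plain difference between the end and start positions (an inclusive count of bases would be one larger); the function name `span_length` is illustrative only:

```python
def span_length(start: int, end: int, inclusive: bool = False) -> int:
    """Length of a genomic interval given its start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    # The plain difference matches the figure quoted in the text;
    # inclusive=True counts both end points.
    return end - start + (1 if inclusive else 0)


# Coordinates reported for the human FAM110A transcript on chromosome 20
print(span_length(833_715, 846_279))  # 12564
```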
Orthologs of the human FAM110A gene have been reported in 219 organisms. [5]
Species | Taxonomic group | Date of divergence (MYA) | Accession | Sequence length | Sequence identity | Sequence similarity |
---|---|---|---|---|---|---|
Thirteen-lined ground squirrel (Ictidomys tridecemlineatus) | Mammal | 90 | XP_005320619.1 | 295 | 92% | 94% |
Domesticated dog (Canis lupus familiaris) | Mammal | 96 | XP_005634952.1 | 295 | 91% | 93% |
Armadillo (Dasypus novemcinctus) | Mammal | 105 | XP_023446141.1 | 284 | 85% | 85% |
Garter snake (Thamnophis sirtalis) | Reptile | 312 | XP_013922001.1 | 360 | 46% | 58% |
Painted turtle (Chrysemys picta bellii) | Reptile | 312 | XP_005304966.1 | 362 | 48% | 59% |
Hummingbird (Calypte anna) | Bird | 312 | XP_030319175.1 | 337 | 53% | 61% |
Chicken (Gallus gallus) | Bird | 312 | XP_003642512.1 | 331 | 56% | 62% |
Caecilian (Microcaecilia unicolor) | Amphibian | 351.8 | XP_030068004.1 | 381 | 39% | 49% |
Sterlet (Acipenser ruthenus) | Fish | 435 | XP_033863480.2 | 350 | 43% | 54% |
Zebrafish (Danio rerio) | Fish | 435 | XP_003201231.1 | 446 | 59% | 71% |
Deer tick (Ixodes scapularis) | Arthropod | 797 | XP_029825208.1 | 337 | 31% | 40% |
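The sequence identity column in the table above is the kind of figure that comes from a pairwise alignment. A minimal sketch of one common convention, assuming identity is the number of matching, non-gap columns divided by the alignment length (alignment tools differ on the denominator). The two aligned strings are made-up toy data, not FAM110A sequences, and `percent_identity` is a hypothetical helper:

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns where both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # A column counts as a match only if the residues agree and neither is a gap.
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment (hypothetical): one substitution and one gap in ten columns
seq_a = "MKSPARPG-L"
seq_b = "MKTPARPGAL"
print(percent_identity(seq_a, seq_b))  # 80.0
```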
Matrix family | Matrix information | Start position | End position | Strand | Matrix similarity |
---|---|---|---|---|---|
TGF-β induced apoptosis proteins | Cysteine-serine-rich nuclear protein 1 (AXUD1, AXIN1 up-regulated 1) | 277 | 283 | (+) | 1.00 |
CAS interacting zinc finger protein | Zinc finger protein 384 (Cas-interacting zinc finger protein - CIZ) | 305 | 315 | (+) | 1.00 |
Lim homeodomain factors | LIM homeobox transcription factor 1, alpha | 342 | 364 | (-) | 1.00 |
SWI/SNF related nucleophosphoproteins with a RING finger DNA binding motif | SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 3 | 354 | 364 | (-) | 1.00 |
SWI/SNF related nucleophosphoproteins with a RING finger DNA binding motif | SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 3 | 357 | 367 | (+) | 1.00 |
Cart-1 (cartilage homeoprotein 1) | Binding site for S8 type homeodomains | 421 | 441 | (-) | 1.00 |
NKX homeodomain factors | Homeo domain factor Nkx-2.5/Csx, tinman homolog low affinity sites | 423 | 441 | (-) | 1.00 |
TCF11 transcription factor | TCF11/LCR-F1/Nrf1 homodimers | 559 | 565 | (+) | 1.00 |
EVI1-myeloid transforming protein | MEL1 (MDS1/EVI1-like gene 1) DNA-binding domain 2 | 612 | 628 | (-) | 1.00 |
C2H2 zinc finger transcription factors 37 | Zinc finger protein 37 alpha (KOX21) | 770 | 778 | (-) | 1.00 |
C2H2 zinc finger transcription factors 2 | Zinc finger with KRAB and SCAN domains 3 | 819 | 841 | (-) | 1.00 |
Myeloid zinc finger 1 factors | Myeloid zinc finger protein MZF1 | 1002 | 1012 | (-) | 1.00 |
C2H2 zinc finger transcription factors 2 | KRAB-containing zinc finger protein 300 | 1014 | 1036 | (+) | 1.00 |
C2H2 zinc finger transcription factors 2 | Zinc finger with KRAB and SCAN domains 3 | 1029 | 1051 | (-) | 1.00 |
All human FAM110A transcript variants encode the same protein, which is 295 amino acids in length. [5] The human FAM110A protein is predicted to have a molecular weight of 31.3 kilodaltons and an isoelectric point of 10.5. [9]
Species | Accession | Repeating structures | Positions |
---|---|---|---|
Human (Homo sapiens) | NP_001035812.1 | SPARP | 162-166; 177-181 |
Chimpanzee (Pan troglodytes) | XP_003316845.1 | SPARP | 162-166; 177-181 |
Mouse (Mus musculus) | NP_001276079.1 | PATP | 11-14; 139-142 |
Chicken (Gallus gallus) | XP_015151953.1 | AVRR | 88-91; 168-171 |
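Chicken (Gallus gallus) | XP_015151953.1 | PRSA | 104-107; 285-288 |
Chicken (Gallus gallus) | XP_015151953.1 | SAGR | 106-109; 147-150 |
Chicken (Gallus gallus) | XP_015151953.1 | PAAP | 158-161; 199-202 |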
Zebrafish (Danio rerio) | XP_009302562.1 | LARP | 88-91; 348-351 |
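Repeats like those in the table above can be located by scanning a protein sequence for short motifs that occur more than once. A minimal sketch, assuming exact-match repeats of a fixed length `k` and 1-based inclusive positions as in the table. The sequence below is a made-up toy string, not the FAM110A sequence, and `find_repeats` is a hypothetical helper rather than the tool used by the cited sources:

```python
from collections import defaultdict


def find_repeats(seq: str, k: int) -> dict:
    """Map each k-residue motif that occurs more than once to its positions.

    Positions are (start, end) tuples, 1-based and inclusive.
    """
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append((i + 1, i + k))
    # Keep only motifs seen at least twice.
    return {motif: pos for motif, pos in positions.items() if len(pos) > 1}


# Toy sequence (hypothetical) containing the motif SPARP twice
toy = "MKSPARPGGLSPARPTT"
for motif, pos in find_repeats(toy, 5).items():
    print(motif, "; ".join(f"{start}-{end}" for start, end in pos))
# SPARP 3-7; 11-15
```

This simple scan also reports overlapping occurrences of a motif, which a dedicated repeat finder may filter out.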
Human FAM110A is predicted to contain methionine, asparagine, and isoleucine residues at frequencies one standard deviation below average, and serine and proline residues at frequencies one standard deviation above average. [10] Arginine residues are predicted to occur at a frequency two standard deviations above average. [10] A high frequency of arginine residues is also apparent in the chimpanzee, mouse, chicken and zebrafish FAM110A orthologs, [10] and this conservation suggests that the arginine content may be important to the function of the protein.
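Statements of the form "two standard deviations higher than average" correspond to a z-score: the observed residue frequency minus a reference mean, divided by the reference standard deviation. A minimal sketch of that calculation; the toy sequence and the reference mean and standard deviation are placeholder numbers chosen for easy arithmetic, not real proteome statistics or FAM110A data, and both function names are hypothetical:

```python
from collections import Counter


def residue_frequencies(seq: str) -> dict:
    """Fraction of the sequence made up by each residue type."""
    counts = Counter(seq)
    return {aa: n / len(seq) for aa, n in counts.items()}


def z_score(observed: float, ref_mean: float, ref_sd: float) -> float:
    """Number of reference standard deviations the observation lies from the mean."""
    return (observed - ref_mean) / ref_sd


# Toy sequence (hypothetical): 5 of 10 residues are arginine (R)
toy = "MRRSPRRAPR"
freqs = residue_frequencies(toy)

# Placeholder reference values, NOT real background statistics
z = z_score(freqs["R"], ref_mean=0.25, ref_sd=0.125)
print(z)  # 2.0
```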
FAM110A is predicted to be hydrophilic and soluble. [12]
The tertiary structure of FAM110A is predicted to be 80% disordered. [14]
The N-terminal glycine residue of FAM110A is not predicted to be myristoylated (confidence: 0.97), [15] suggesting that FAM110A is not membrane-associated.
FAM110A is predicted to contain no sulfated tyrosine residues, [16] suggesting that the protein is not secreted.
Phosphorylation analysis indicates that FAM110A is associated with the AGC and Akt kinase families. [17]
Immunofluorescent analysis of FAM110A shows that the protein localizes to the nucleoplasm, cytosol, and vesicles. [11]
Protein abbreviation | Protein name | Association type |
---|---|---|
CSNK1E | Casein kinase 1 isoform ε | Two hybrid |
DYNC1I1 | Cytoplasmic dynein 1 intermediate chain 1 | Two hybrid |
KRT15 | Keratin, type I cytoskeletal 15 | Two hybrid |
TRIM23 | E3 ubiquitin-protein ligase TRIM23 | Two hybrid |
GOLGA2 | Golgin subfamily A member 2 | Two hybrid |
FAM110A has been observed to be abnormally expressed in prostate cancer metastasis, where it co-localizes with E-cadherin and β-catenin at cell-cell adherens junctions, [18] suggesting FAM110A’s involvement in the epithelial-to-mesenchymal transition in cancer pathogenesis. The greater FAM110 gene family is aberrantly methylated in breast cancer cells, [19] and has been shown to be associated with reduced time to distant metastasis in breast cancer patients. [19]
FAM110A has been found to localize to centrosomes and accumulate at the microtubule organizing center in interphase and at spindle poles in mitosis. [7]