FAM131A

Last updated
FAM131A
Identifiers
Aliases FAM131A , C3orf40, FLAT715, PRO1378, family with sequence similarity 131 member A
External IDs MGI: 1925658; HomoloGene: 82234; GeneCards: FAM131A; OMA:FAM131A - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001171093
NM_144635
NM_001366133
NM_001366134

NM_133778

RefSeq (protein)

NP_001164564
NP_653236
NP_001353062
NP_001353063

NP_598539

Location (UCSC) Chr 3: 184.34 – 184.35 Mb Chr 16: 20.51 – 20.52 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

FAM131A (Family with Sequence Similarity 131 Member A) is a protein that is encoded by the FAM131A gene in humans. Aliases for FAM131A include C3orf40, FLAT715, and PRO1378. [5]

Contents

Gene

The gene, FAM131A, which is found on the plus strand of chromosome 3 (3q27.1), spans 7,847 base pairs in humans. [6] The FAM131A gene transcribes an mRNA sequence that is 2,437 nucleotides. [7] FAM131A is most highly expressed in the brain, [8] with a low tissue specificity. [9] [10]

Conceptual Translation of Human FAM131A. Annotations indicate start codon, exon boundaries, polyadenylation signals, polyadenylation sites, disordered region, region excluded from isoform 2, and stop codon. Amino acids conserved from distant orthologs are bolded. HSa FAM131A Conceptual Translation.pdf
Conceptual Translation of Human FAM131A. Annotations indicate start codon, exon boundaries, polyadenylation signals, polyadenylation sites, disordered region, region excluded from isoform 2, and stop codon. Amino acids conserved from distant orthologs are bolded.

Protein

The FAM131A protein in humans is 366 amino acids in length, with a theoretical molecular weight of 39.5 kDa and a theoretical isoelectric point of 4.59. [11] There have only been two isoforms found for the protein this gene encodes in humans, and isoform two is shorter at the N-terminus than isoform one due to amino acids 1-85 being absent in isoform two. [12] It was also determined that Asparagine, Threonine, and Isoleucine are represented less in the FAM131A protein in comparison to most human proteins. However, Serine is more highly represented in the FAM131A protein in comparison to most human proteins. [13] The FAM131A protein is predicted to be contained within the nucleus and in the nucleolus, [14] [15] and is predicted to be primarily localized to the nucleoli rim within the cell. [16]

Predicted tertiary structure of human FAM131A protein from AlphaFold. Predicted Structure of FAM131A.png
Predicted tertiary structure of human FAM131A protein from AlphaFold.

Post-translational modifications

Five different post-translational modification sites have been predicted for the FAM131A protein. These include three different theoretical sumoylation sites [18] and two different theoretical lysine acetylation sites. [19]

Interacting proteins

A few proteins have been found to be co-expressed alongside the FAM131 protein, including Von Willebrand Factor A Domain-Containing 5B2 (VWA5B2), [20] Grid 2 Interacting Protein (GRID2IP), [21] and Chordin (CHRD). [22] [23]

Homology

Multiple sequence alignment of FAM131 in distant orthologs. The consensus sequences indicate highly conserved amino acids with an uppercase letter, moderately conserved amino acids with a lowercase letter, and low conservation of an amino acid with a dot. MSA for Wiki-2 of FAM131A.pdf
Multiple sequence alignment of FAM131 in distant orthologs. The consensus sequences indicate highly conserved amino acids with an uppercase letter, moderately conserved amino acids with a lowercase letter, and low conservation of an amino acid with a dot.

Orthologs were found for FAM131A in mammals (sequence identity ranging from 73.6%-92.3%), reptiles (sequence identity ranging from 48.5%-56.4%), birds (sequence identity ranging from 49.6%-54.0%), amphibians (sequence identity ranging from 47.1%-52.1%), and fish (sequence identity ranging from 26.2%-56.5%). [24] The furthest date of divergence was found in fish, specifically Pretromyzon marinus , otherwise known as the Sea lamprey, at 599 million years ago. [25] FAM131A was not found in any invertebrates, which could indicate that FAM131A is restricted to vertebrates.

Table of orthologs

Species NameCommon NameDate of Divergence (mya)Accession NumberSequence Length (AA)Sequence Identity to Human Protein
Homo sapiens Humans0NP_653236366100%
Mus musculus House mouse87NP_59853936192.3%
Phascolarctos cinereus Koala160XP_02086144036273.6%
Sarcophilus harrisii Tasmanian devil160XP_03182396028364.1%
Alligator mississippiensis American alligator319XP_01933970832456.4%
Gallus gallus Chicken319XP_00364184133854.0%
Haliaeetus leucocephalus Bald eagle319XP_01057127927549.6%
Aptenodytes forsteri Emporer penguin319XP_00928634927549.6%
Python bivittatus Burmese python319XP_02502973630248.5%
Rhinatrema bivittatum Two-lined caecilian353XP_02947218529052.1%
Xenopus tropicalis Tropical clawed frog353XP_00491446034450.0%
Rana temporaria Common frog353XP_04020572134847.6%
Bufo bufo Common toad353XP_04028445726147.1%
Protopterus annectens West African lungfish408XP_043926343.136156.5%
Danio rerio Zebrafish431NP_00109362529343.4%
Oryzias latipes Japanese rice fish431XP_00407930833834.4%
Cheilinus undulatus Humphead wrasse431XP_04166011431831.4%
Amblyraja radiata Thorny skate464XP_03288807638051.8%
Petromyzon marinus Sea lamprey599XP_03280277838326.2%

Clinical significance

Studies have found having high expression of FAM131A is prognostically unfavorable for patients with ovarian cancer [26] or endometrial cancer. [27]

Related Research Articles

<span class="mw-page-title-main">Interferon-inducible GTPase 5</span> Protein-coding gene in the species Homo sapiens

Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.

<span class="mw-page-title-main">TMEM176B</span> Protein-coding gene in the species Homo sapiens

Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.

<span class="mw-page-title-main">C1orf112</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 112, is a protein that in humans is encoded by the C1orf112 gene, and is located at position 1q24.2. C1orf112 encodes for seventeen variants of mRNA, fifteen of which are functional proteins. C1orf112 has a determined precursor molecular weight of 96.6 kDa and an isoelectric point of 5.62. C1orf112 has been experimentally determined to localize to the mitochondria, although it does not contain a mitochondrial targeting sequence.

<span class="mw-page-title-main">ZCCHC18</span> Protein-coding gene in the species Homo sapiens

Zinc finger CCHC-type containing 18 (ZCCHC18) is a protein that in humans is encoded by ZCCHC18 gene. It is also known as Smad-interacting zinc finger protein 2 (SIZN2), para-neoplastic Ma antigen family member 7b (PNMA7B), and LOC644353. Other names such as zinc finger, CCHC domain containing 12 pseudogene 1, P0CG32, ZCC18_HUMAN had been used to describe this protein.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">C12orf24</span> Protein-coding gene in humans

C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.

<span class="mw-page-title-main">LSMEM2</span> Protein-coding gene in the species Homo sapiens

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">GPATCH2L</span> It is Wikipedia article of unknown gene called "GPATCH2L".

GPATCH2L is a protein that is encoded by the GPATCH2L human gene located at 14q24.3. In humans, the length of mRNA in GPATCH2L (NM_017926) is 14,021 base pairs and the gene spans bases is 62,422 nt between chr14: 76,151,922 - 76,214,343. GPATCH2L is on the positive strand. IFT43 is the gene directly before GPATCH2L on the positive strand and LOC105370575 is the uncharacterized gene on the negative strand, which is approximately one and a half the size of GPATCH2L. Known aliases for GPATCH2L contain C14orf118, FLJ20689, FLJ10033, and KIAA1152. GPATCH2L produces 28 distinct introns, 17 different mRNAs, 14 alternatively spliced variants, and 3 unspliced forms. It has 5 probable alternative promoters, 7 validated polyadenylation sites, and 6 predicted promoters of varying lengths.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

<span class="mw-page-title-main">C2orf80</span> Gene

C2orf80 is a protein that, in humans, is encoded by the c2orf80 gene. The gene c2orf80 also goes by the alias GONDA1. In humans, c2orf80 is exclusively expressed in the brain. While relatively little is known about the function of c2orf80, medical studies have shown a strong association between variations in c2orf80 and IDH-mutant gliomas, 46,XY gonadal dysgenesis, and a possible association with blood pressure.

<span class="mw-page-title-main">C5orf22</span> Protein-coding gene in the species Homo sapiens

Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).

<span class="mw-page-title-main">THAP3</span> Protein in Humans

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

<span class="mw-page-title-main">ZNF839</span> Protein which in humans is encoded by the ZNF839 gene

ZNF839 or zinc finger protein 839 is a protein which in humans is encoded by the ZNF839 gene. It is located on the long arm of chromosome 14. Zinc finger protein 839 is speculated to play a role in humoral immune response to cancer as a renal carcinoma antigen (NY-REN-50). This is because NY-REN-50 was found to be over expressed in cancer patients, especially those with renal carcinoma. Zinc finger protein 839 also plays a role in transcription regulation by metal-ion binding since it binds to DNA via C2H2-type zinc finger repeats.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000175182 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000050821 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "FAM131A family with sequence similarity 131 member A [ Homo sapiens (human) ]". www.ncbi.nlm.nih.gov. Retrieved 2022-12-14.
  6. "Human Gene FAM131A (ENST00000639617.1) from GENCODE V41". genome.ucsc.edu. Retrieved 2022-12-16.
  7. "Homo sapiens family with sequence similarity 131 member A (FAM131A), transcript variant 1, mRNA". 2021-06-26.
  8. "FAM131A Gene Expression - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-16.
  9. "Tissue Cell Type - FAM131A - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-12-16.
  10. Uhlén M, Fagerberg L, Hallström BM, Lindskog C, Oksvold P, Mardinoglu A, et al. (January 2015). "Proteomics. Tissue-based map of the human proteome". Science. 347 (6220): 1260419. doi:10.1126/science.1260419. PMID   25613900. S2CID   802377.
  11. "Expasy - Compute pI/Mw tool". web.expasy.org. Retrieved 2022-12-15.
  12. "protein FAM131A isoform 2 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-16.
  13. "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2022-12-15.
  14. "PSORT II Prediction". psort.hgc.jp. Retrieved 2022-12-15.
  15. "DeepLoc - 2.0". DTU Health Tech. Retrieved 2022-12-14.
  16. "FAM131A protein expression summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-12-14.
  17. "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2022-12-15.
  18. "GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs". sumosp.biocuckoo.org. Archived from the original on 2018-05-06. Retrieved 2022-12-16.
  19. "GPS-PAIL 2.0 - Prediction of Acetylation on Internal Lysines". pail.biocuckoo.org. Retrieved 2022-12-16.
  20. Direk K, Lau W, Small KS, Maniatis N, Andrew T (September 2014). "ABCC5 transporter is a novel type 2 diabetes susceptibility gene in European and African American populations". Annals of Human Genetics. 78 (5): 333–344. doi:10.1111/ahg.12072. PMC   4173130 . PMID   25117150.
  21. Lee E, Takita C, Wright JL, Slifer SH, Martin ER, Urbanic JJ, et al. (June 2019). "Genome-wide enriched pathway analysis of acute post-radiotherapy pain in breast cancer patients: a prospective cohort study". Human Genomics. 13 (1): 28. doi: 10.1186/s40246-019-0212-8 . PMC   6567461 . PMID   31196165.
  22. Wang YF, Yan JJ, Tseng YC, Chen RD, Hwang PP (2015-08-15). "Molecular Physiology of an Extra-renal Cl(-) Uptake Mechanism for Body Fluid Cl(-) Homeostasis". International Journal of Biological Sciences. 11 (10): 1190–1203. doi: 10.7150/ijbs.11737 . PMC   4551755 . PMID   26327813.
  23. "FAM131A protein (human) - STRING interaction network". string-db.org. Retrieved 2022-12-16.
  24. "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2022-12-16.
  25. "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2022-12-16.
  26. Zhao M, Wang T, Liu Q, Cummins S (July 2017). "Copy number alteration of neuropeptides and receptors in multiple cancers". Scientific Reports. 7 (1): 4598. Bibcode:2017NatSR...7.4598Z. doi: 10.1038/s41598-017-04832-0 . PMC   5496884 . PMID   28676692.
  27. Uhlén M, Björling E, Agaton C, Szigyarto CA, Amini B, Andersen E, et al. (December 2005). "A human protein atlas for normal and cancer tissues based on antibody proteomics". Molecular & Cellular Proteomics. 4 (12): 1920–1932. doi: 10.1074/mcp.M500279-MCP200 . PMID   16127175.