FAM63A

Last updated
MINDY1
Available structures
PDB Ortholog search: PDBe RCSB
Identifiers
Aliases MINDY1 , FAM63A, MINDY lysine 48 deubiquitinase 1, MINDY-1
External IDs OMIM: 618407 MGI: 1922257 HomoloGene: 32409 GeneCards: MINDY1
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_133858
NM_199475

RefSeq (protein)

NP_598619
NP_955769

Location (UCSC) Chr 1: 151 – 151.01 Mb Chr 3: 95.19 – 95.2 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Family with sequence similarity 63, member A is a protein that, is encoded by the FAM63A gene in humans,. It is located on the minus strand of chromosome 1 at locus 1q21.3. [5]

Evolutionarily, FAM63A orthologs are found in most vertebrates, and distant homologs of FAM63A are found in invertebrates. [6] FAM63A is ubiquitously expressed throughout human tissues, and it is present during every stage of development. [5]

It has been linked to a biomarker in chronic kidney disease and Alzheimer's disease. [7]

Gene

Locus

FAM63A is located on the minus strand of chromosome 1 at band 1q21.3, spanning 11,829 bp. Other genes surrounding FAM63A include ANXA9 and Prune. [5]

Aliases

FAM63A has two aliases KIAA1390 and PR11-316M1.5. [8]

mRNA

Primary structure

In humans, there are four isoforms of FAM63A, and there are 10 predicted isoforms. Isoform 1 of FAM63A has a molecular weight of 51.8 kilodaltons, and it contains 11 exons. [9] [10] The different isoforms tend to differ at the 5' or 3' end by truncation. Transcription produces 23 introns, 14 spliced variants, and 6 unspliced forms. [10]

Protein

Multiple sequence alignment of DUF demonstrating conservation across orthologs. Dark blue indicates complete conservation, pink identical residues, and light blue chemical similarity Multiple Sequence Alignment.png
Multiple sequence alignment of DUF demonstrating conservation across orthologs. Dark blue indicates complete conservation, pink identical residues, and light blue chemical similarity
This is a 3D depiction of the probable secondary structure of FAM63A. Wikipedia Image 3.jpg
This is a 3D depiction of the probable secondary structure of FAM63A.

Domains and motifs

FAM63A contains a domain of unknown function (DUF 544). DUF544 contains 125 amino acids, running from Met143 to Thr267. [6] Although not completely conserved, this domain is highly conserved across vertebrates, invertebrates, and plants. [11] FAM63A does not contain a transmembrane domain, and it is found primarily in nuclear regions of the cell. [12]

Two repeats of four glutamines are seen from amino acid 400-403 and from amino acid 426-429, leading to an elevated glutamine composition at the C-terminus.

Composition

FAM63A is composed of 469 amino acids. [13] There is an increased presence of glutamine found near the C terminus making FAM63A glutamine rich. FAM63A contains a greater amount of negatively charged (acidic) amino acids than positively charged (basic) amino acids which makes FAM63A a slightly acidic protein. Acidic amino acids such as aspartic acid and glutamic acid are more prevalent than the basic amino acids such as lysine and arginine. This overall acidic composition gives FAM63A an acidic isoelectric point of 4.6. [13]

Post-translational modifications

FAM63A contains 25 phosphorylation sites in humans, including 12 serine, 10 threonine, and 3 tyrosine. Additionally, there are 5 N-myristoylation sites, and there is 1 prenylation site. FAM63A contains no glycosylation sites, transmembrane domains, or signal peptides. [14]

This is a depiction of the posttranslational modifications in FAM63A. Wikipedia Image 4.gif
This is a depiction of the posttranslational modifications in FAM63A.

Secondary structure

The secondary structure for FAM63A has not been explicitly determined. There are, however, predictions for a possible secondary structure. There is a coiled-coil domain at the end of the protein, and in the predicted secondary structure, there is an alpha helix between amino acids 410 and 436. This helix is conserved throughout more distant orthologs of FAM63A. These data support each other, and it gives a confident prediction of the secondary structure. [15]

Interacting proteins

The following genes have interactions with FAM63A: GSPT2, NAA38, RNMT, CSNIK1G2, ACOX1, PSMC1, SLC25A37, MMS19, DIAPH1, ME1, GAPDH, UBC. [5] After performing a yeast two-hybrid screen, it was found that NAA38 and FAM63A interact. [16]

Homology/evolution

In FAM63A, there are several amino acids that are conserved in all vertebrates for which sequences are available. Gly239 is the only amino acid that is conserved in all vertebrates, invertebrates, and plants for which sequences are available. Because there is only one amino acid that is absolutely conserved, a possible function for the conserved Glycine was not deduced. The 25 amino acid sequence ranging from Val313 to Gly338 is the most highly conserved in all vertebrates, invertebrates, and plants for which sequences are available. Although the sequence is not absolutely conserved, it is very highly conserved, even in the most distantly related organisms like fungi and plants.

Orthologs

The protein FAM63A has several strict orthologs. These strict orthologs are found in organisms ranging from Primates to Fish. [17]

Scientific NameCommon NameDivergenceAccession NumberLengthIdentitySimilarityQuery Cover
Homo sapiens HumansN/ANP_060849.2469 aa100%100%100%
Pan paniscus Bonobo6.1 MYAXP_003817322.1469 aa99.1%99.4%100%
Mus musculus House Mouse91.0 MYANP_955769.1468 aa86.0%90.2%100%
Bos taurus Cow97.4 MYANP_001039389.1469 aa85.6%88.5%100%
Trichechus manatus latirostris West Indian Manatee104.7 MYAXP_004389621.1465 aa84.9%89.8%100%
Sarcophilus harrisii Tasmanian Devil176.1 MYAXP_003769968.1464 aa78.3%82.9%100%
Taeniopygia guttata Zebra Finch324.0 MYAXP_002191502.2335 aa43.2%80.1%53%
Gallus gallus Chicken324.5 MYAXP_003642724462 aa66.9%76.1%92%
Chrysemys picta bellii Painted Turtle324.5 MYAXP_005293753.1525 aa62.0%87.2%78%
Pelodiscus sinensis Chinese Softshell Turtle324.5 MYAXP_006119467.1502 aa61.6%77.4%94%
Alligator mississippiensis American Alligator324.5 MYAXP_006274676.1520 aa59.90%77.50%86%
Pseudopodoces humilis Ground Tit324.5 MYAXP_005533539.1502 aa58.0%82.3%78%
Anas platyrhynchos Mallard324.5 MYAXP_005026841.1415 aa57.9%78.6%83%
Xenopus tropicalis Western Clawed Frog361.2 MYAXP_002937311.1506 aa61.3%83.7%76%
Latimeria chalumnae West Indian Ocean Coelacanth430.0 MYAXP_006006147.1513 aa44.7%86.9%55%
Danio rerio Zebrafish454.6 MYAXP_005159508.1520 aa52.2%80.2%76%

FAM63A evolved through time at a relatively moderate rate.

This shows the protein conservation throughout evolution. FAM63A evolved at a medium rate compared with cytochrome c (fast) and fibrinogen (slow). Wikipedia Image 1.jpg
This shows the protein conservation throughout evolution. FAM63A evolved at a medium rate compared with cytochrome c (fast) and fibrinogen (slow).

Paralogs

The protein FAM63A has only one known paralog: FAM63B. FAM63B is predicted as having a molecular function in the cell. [18] All of the vertebrates for which sequences are available have two copies of the FAM63 gene, both A and B. FAM63A and FAM63B likely split apart around 666 million years ago, as the closest relative to Homo sapiens containing only one FAM63 is a tapeworm, which diverged 666 million years ago. [17]

Sequence NumberScientific NameCommon NameDivergenceAccession NumberLengthIdentitySimilarityQuery CoverE-value
protein FAM63B isoform a Homo sapiens HumanN/ANP_001035540.1621 aa41.9%76.7%68%2.00E-129

Expression

Promoter

The promoter region contains a number of transcription factors. [19] Those with high scores include estrogen response elements, TATA boxes, glucocorticoid response elements, and Ccaat/enchancer binding proteins. Experimental data reveals that FAM63A expression decreases when the estrogen receptor is not present, suggesting that the estrogen response elements may serve as an important promoter regulatory mechanism for this protein. [20]

Protein expression

FAM63A is a protein that is ubiquitously expressed across human tissues and throughout development. Although FAM63A is expressed ubiquitously, there are certain tissues that have higher levels of expression including the heart, thyroid, ganglia, and blood. [21]

This is a depiction of the expression levels of FAM63A throughout different human tissues. Wikipedia Image 5.jpg
This is a depiction of the expression levels of FAM63A throughout different human tissues.

Clinical significance

Although there is no specific function determined for FAM63A, there are a few researchers who have discovered possible functions. It has been postulated that FAM63A may be associated with renal function and chronic kidney disease. [7]

Figgins, Minster, and Demirci examined 17,343 functional single nucleotide polymorphisms, demonstrating a strong association between Alzheimer's disease duration and FAM63A. [22] Another gene located on 1q21, CTSS, was also strongly associated with disease duration, the authors believe that there is a strong linkage disequilibrium between the two genes. FAM63A was identified as one of 39 genes exclusively expressed in CML cells, grouped with four other genes believed to function in protein ligation.

Related Research Articles

<span class="mw-page-title-main">TSR3</span> Hypothetical human protein

TSR3, or TSR3 Ribosome Maturation Factor, is a hypothetical human protein found on chromosome 16. Its protein is 312 amino acids long and its cDNA has 1214 base pairs. It was previously designated C16orf42.

<span class="mw-page-title-main">C2CD4D</span> Mammalian protein found in Homo sapiens

C2CD4D, or C2 calcium-dependent domain-containing protein 4D is a protein product of the human genome. The gene that codes for this protein is found on chromosome 1, from 150,076,963 to 150,079,657. The gene contains 2 exons and encodes 353 amino acids. Synonyms for C2CD4D are "FAM148D" and NP_001129475. C2CD4D contains a conserved metal binding domain that is a known as Protein kinase C conserved region 2, subgroup 1. This motif is known to be a member of the C2 superfamily, which is present in phospholipases, protein kinases C, and synaptotagmins. The amino acid sequence of C2CD4D can be accessed at Prior to any post translational modification, C2CD4D has a molecular weight of 37.6 kdal. Although scientists have not yet determined where C2CD4D functions within the cell, C2CD4D has a predicted isoelectric point of 11.636 which severely limits the places in which it can be effective. In addition, C2CD4D does not contain any predicted transmembrane domains or any predicted signal peptides.

<span class="mw-page-title-main">Transmembrane protein 134</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 134 is a protein encoded by the TMEM134 gene. TMEM134 does not have any other known aliases. There are two transmembrane domains and a domain of unknown function (DUF872). Evolutionary, the majority of the organisms that have this gene are primates and mammals, although there are some organisms dating back to Drosophila and C. elegans. Through current research, there has not been any confirmed function of TMEM134.

<span class="mw-page-title-main">Fam78b</span> Protein-coding gene in the species Homo sapiens

Family with Sequence Similarity 78-Member B (FAM78B) is a protein of unknown function in humans that is encoded by the FAM78B gene (1q24.1). It has orthologous genes and predicted proteins in vertebrates and several invertebrates, but not in arthropods. It has a nuclear localization signal in the protein sequence and a miRNA target region in the mRNA sequence.

<span class="mw-page-title-main">Proser2</span> Protein-coding gene in the species Homo sapiens

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">TMEM176B</span> Protein-coding gene in the species Homo sapiens

Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.

<span class="mw-page-title-main">C6orf62</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">Chromosome 9 open reading frame 43</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">TEX9</span> Protein-coding gene in the species Homo sapiens

Testis-expressed protein 9 is a protein that in humans is encoded the TEX9 gene. TEX9 that encodes a 391-long amino acid protein containing two coiled-coil regions. The gene is conserved in many species and encodes orthologous proteins in eukarya, archaea, and one species of bacteria. The function of TEX9 is not yet fully understood, but it is suggested to have ATP-binding capabilities.

<span class="mw-page-title-main">FAM222A</span> Protein-coding gene in humans

Family with sequence similarity 222 member A or Aggregatin is a protein of unknown function. In humans it is encoded by the gene FAM222A. Aggregatin's cellular function is not well understood, however it has been implicated in Alzheimer's disease.

<span class="mw-page-title-main">TMEM221</span> Protein

Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">CCDC190</span> Protein found in humans

Coiled-Coil Domain Containing 190, also known as C1orf110, the Chromosome 1 Open Reading Frame 110, MGC48998 and CCDC190, is found to be a protein coding gene widely expressed in vertebrates. RNA-seq gene expression profile shows that this gene selectively expressed in different organs of human body like lung brain and heart. The expression product of c1orf110 is often called Coiled-coil domain-containing protein 190 with a size of 302 aa. It may get the name because a coiled-coil domain is found from position 14 to 72. At least 6 spliced variants of its mRNA and 3 isoforms of this protein can be identified, which is caused by alternative splicing in human.

<span class="mw-page-title-main">C12orf50</span> Protein-coding gene in humans

Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.

<span class="mw-page-title-main">TMEM269</span> TMEM269 Protein

Transmembrane Protein 269 (TMEM269) is a protein which in humans is encoded by the TMEM269 gene.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

<span class="mw-page-title-main">SCRN3</span> Protein-coding gene in the species Homo sapiens

Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.

<span class="mw-page-title-main">NBPF26</span> Human gene

NBPF26, or Neuroblastoma breakpoint family member 26, is a protein encoded by the NBPF26 gene in Homo sapiens. The alias for NBPF26 is notch 2 N-terminal like R (NOTCH2NLR). NBPF26 encodes 13 Olduvai domains, which are thought to contribute to the rapid expansion of the neocortex in humans.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000143409 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000038712 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 4 Gene Cards https://www.genecards.org/cgi-bin/carddisp.pl?gene=FAM63A&search=FAM63A
  6. 1 2 National Center for Bioinformation Technology - BLAST http://blast.ncbi.nlm.nih.gov/Blast.cgi
  7. 1 2 Köttgen A, Pattaro C, Böger CA, Fuchsberger C, Olden M, Glazer NL, et al. (May 2010). "New loci associated with kidney function and chronic kidney disease". Nature Genetics. 42 (5): 376–84. doi:10.1038/ng.568. PMC   2997674 . PMID   20383146.
  8. "Ubiquitin carboxyl-terminal hydrolase MINDY-1 isoform 1 [Homo sapiens] - Protein - NCBI".
  9. National Center for Bioinformation Technology - Protein https://www.ncbi.nlm.nih.gov/protein/?term=FAM63A%20AND%20homo%20sapiens
  10. 1 2 National Center for Biotechnology Information - AceView - https://www.ncbi.nlm.nih.gov/ieb/research/acembly/av.cgi?db=human&term=FAM63A&submit=Go
  11. San Diego Super Computer http://seqtool.sdsc.edu/CGI/BW.cgi#%5B%5D!
  12. PSORT II Prediction http://psort.hgc.jp/form2.html
  13. 1 2 San Diego Super Computer - http://seqtool.sdsc.edu/CGI/BW.cgi#%5B%5D!
  14. ExPASy Bioinformatics Resource Portal http://www.expasy.org/proteomics
  15. PHYRE2 -
  16. STRING - Known and Predicted Protein-Protein Interactions http://string-db.org/newstring_cgi/show_network_section.pl
  17. 1 2 National Center for Biotechnology Information - Protein https://www.ncbi.nlm.nih.gov/protein/?term=FAM63A
  18. Gene Cards https://www.genecards.org/cgi-bin/carddisp.pl?gene=FAM63B
  19. Genomatrix - ElDorado - Promoter and transcription factors for FAM63A. http://www.genomatix.de/?s=23e6b2edc9ca33fe998f299bafe56b99
  20. Estrogen receptor alpha-silenced MCF7 breast cancer cells. Profile: GDS4061/ FAM63A. ncbi.nlm.nih.gov/geo/tools/profiles
  21. National Center for Biotechnology Information - GEO Profiles https://www.ncbi.nlm.nih.gov/geo/tools/profileGraph.cgi?ID=GDS596:221856_s_at
  22. Figgins JA, Minster RL, Demirci FY, Dekosky ST, Kamboh MI (June 2009). "Association studies of 22 candidate SNPs with late-onset Alzheimer's disease". American Journal of Medical Genetics. Part B, Neuropsychiatric Genetics. 150B (4): 520–6. doi:10.1002/ajmg.b.30851. PMC   2751631 . PMID   18780302.