FAM222A

Last updated
FAM222A
Identifiers
Aliases FAM222A , C12orf34, family with sequence similarity 222 member A
External IDs MGI: 3605543 HomoloGene: 41895 GeneCards: FAM222A
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_032829

NM_001004180

RefSeq (protein)

NP_116218

NP_001004180

Location (UCSC) Chr 12: 109.71 – 109.77 Mb Chr 5: 114.71 – 114.75 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Family with sequence similarity 222 member A or Aggregatin is a protein of unknown function. In humans it is encoded by the gene FAM222A. Aggregatin's cellular function is not well understood, however it has been implicated in Alzheimer's disease. [5]

Contents

Gene

FAM222A is also called C12orf34. [6] It is located on chromosome 12 at q24.11. [7] It encompasses 56,672 bp. [7] The mRNA is 3,685 bp while the coding region is 1,359 bp. [7]

FAM222A is highly expressed in the brain and spinal cord. [8] It is expressed to a lesser extent in the cerebellum, pituitary gland, adrenal gland and testis. [8]

mRNA

It has 3 different splice variants of mRNA. [6] The most common mRNA is 3,685 bp while the coding region is 1,359 bp. [6] The mRNA consists of three exons and has two different isoforms in humans. The Kozak Sequence is not very well conserved in FAM222A and it has a non-canonical polyadenylation site.

Protein

Aggregatin is a protein made of 452 amino acids. [7] It contains a domain of unknown function called pfam15258 which is 200 amino acids long. [7]

It has been found to be part of protein plaques formed in the brains of patients with Alzheimer's disease. [5]

This graph shows the changes in amino acid sequence that FAM222A has as a function of how related the organism is to humans. It is compared to other common genes. Rate of Evolution.jpg
This graph shows the changes in amino acid sequence that FAM222A has as a function of how related the organism is to humans. It is compared to other common genes.

FAM222A has an unusually high amount of prolines with a 6 segment run from amino acids 392 to 397.

Structurally, FAM222A has 5 domains which are connected by linker regions. [9] [10]

Analysis of the amino acid sequence suggests that FAM222A is localized in the nucleus. [11]

Expression and regulation

FAM222A is highly expressed in the brain and to a lesser extent in the adrenal glands. [8] [12]

Alzheimer’s disease seems to cause an increase in FAM222A in the brain, but other degenerative diseases such as Parkinson’s do not. [13] [14]

Interacting proteins

FAM222A has been found to interact with mainly transcription factors. These include mainly  pre-B-cell leukemia transcription factors and Homeobox Meis proteins. [15]

Homologs

FAM222A has only one paralog in humans, FAM222B which is also not well characterized. [16] These two proteins only share about 20% identity.

It has many orthologs in other organisms but is restricted to jawed vertebrates, as far back as bony and cartilaginous fish. [17] Overall the protein is well conserved with a lowest identity of around 50% but certain regions are very strictly conserved such as the beginning of pfam15258 as well as the last 60-70 amino acids on the C terminus. [6]

The protein appears to be changing very slowly even in distantly related animals. It is changing at a rate just slightly higher than Cytochrome C, a highly conserved protein.]

Related Research Articles

<span class="mw-page-title-main">Fam78b</span> Protein-coding gene in the species Homo sapiens

Family with Sequence Similarity 78-Member B (FAM78B) is a protein of unknown function in humans that is encoded by the FAM78B gene (1q24.1). It has orthologous genes and predicted proteins in vertebrates and several invertebrates, but not in arthropods. It has a nuclear localization signal in the protein sequence and a miRNA target region in the mRNA sequence.

<span class="mw-page-title-main">Transmembrane protein 151b</span> Transmembrane protein

Transmembrane protein 151B is a protein that in humans is encoded by the TMEM151B gene.

<span class="mw-page-title-main">FAM63A</span> Protein-coding gene in the species Homo sapiens

Family with sequence similarity 63, member A is a protein that, in humans, is encoded by the FAM63A gene. It is located on the minus strand of chromosome 1 at locus 1q21.3.

<span class="mw-page-title-main">FAM76A</span> Protein-coding gene in the species Homo sapiens

FAM76A is a protein that in Homo sapiens is encoded by the FAM76A gene. Notable structural characteristics of FAM76A include an 83 amino acid coiled coil domain as well as a four amino acid poly-serine compositional bias. FAM76A is conserved in most chordates but it is not found in other deuterostrome phlya such as echinodermata, hemichordata, or xenacoelomorpha—suggesting that FAM76A arose sometime after chordates in the evolutionary lineage. Furthermore, FAM76A is not found in fungi, plants, archaea, or bacteria. FAM76A is predicted to localize to the nucleus and may play a role in regulating transcription.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.

<span class="mw-page-title-main">C12orf60</span> Protein-coding gene in humans

Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.

The Family with sequence similarity 149 member B1 is an uncharacterized protein encoded by the human FAM149B1 gene, with one alias KIAA0974. The protein resides in the nucleus of the cell. The predicted secondary structure of the gene contains multiple alpha-helices, with a few beta-sheet structures. The gene is conserved in mammals, birds, reptiles, fish, and some invertebrates. The protein encoded by this gene contains a DUF3719 protein domain, which is conserved across its orthologues. The protein is expressed at slightly below average levels in most human tissue types, with high expression in brain, kidney, and testes tissues, while showing relatively low expression levels in pancreas tissues.

<span class="mw-page-title-main">Chromosome 9 open reading frame 43</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

<span class="mw-page-title-main">C9orf25</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.

<span class="mw-page-title-main">C9orf50</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

<span class="mw-page-title-main">Fam89A</span> Human protein and gene

ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">TMEM169</span> Gene

Transmembrane protein 169 (TMEM169) in humans is encoded by TMEM169 gene. The aliases of TMEM169 include FLJ34263, DKFZp781L2456, and LOC92691. TMEM169 has the highest expression in the brain, particularly the fetal brain. TMEM169 has homologs mammals, reptiles, amphibians, birds, fish, chordates and invertebrates. The most distantly related homolog of TMEM169 is Anopheles albimanus.

<span class="mw-page-title-main">FAM214B</span> Protein-coding gene in the species Homo sapiens

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

<span class="mw-page-title-main">FAM120AOS</span> Protein-coding gene in the species Homo sapiens

FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">TMEM212</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of 5 transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.

<span class="mw-page-title-main">KIAA2013</span> Protein-coding gene in the species Homo sapiens

KIAA2013, also known as Q8IYS2 or MGC33867, is a single-pass transmembrane protein encoded by the KIAA2013 gene in humans. The complete function of KIAA2013 has not yet been fully elucidated.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000139438 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000041930 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 Yan T, Liang J, Gao J, Wang L, Fujioka H, Zhu X, Wang X (January 2020). "FAM222A encodes a protein which accumulates in plaques in Alzheimer's disease". Nature Communications. 11 (1): 411. doi:10.1038/s41467-019-13962-0. PMC   6972869 . PMID   31964863.
  6. 1 2 3 4 "FAM222A Gene - GeneCards | F222A Protein | F222A Antibody". www.genecards.org. Retrieved 2020-02-03.
  7. 1 2 3 4 5 "FAM222A family with sequence similarity 222 member A [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-02-03.
  8. 1 2 3 "GTEx Portal". gtexportal.org. Retrieved 2020-02-03.
  9. "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk. Retrieved 2020-04-29.
  10. "I-TASSER server for protein structure and function prediction". zhanglab.ccmb.med.umich.edu. Retrieved 2020-04-29.
  11. Reinhardt A, Hubbard T (May 1998). "Using neural networks for prediction of the subcellular location of proteins". Nucleic Acids Research. 26 (9): 2230–6. doi: 10.1093/nar/26.9.2230 . PMC   147531 . PMID   9547285.
  12. "Gene Detail :: Allen Brain Atlas: Mouse Brain". mouse.brain-map.org. Retrieved 2020-04-29.
  13. "GDS4758 / 7958577". www.ncbi.nlm.nih.gov. Retrieved 2020-04-29.
  14. "GDS3129 / 226487_at". www.ncbi.nlm.nih.gov. Retrieved 2020-04-29.
  15. "FAM222A protein (human) - STRING interaction network". string-db.org. Retrieved 2020-04-29.
  16. "FAM222B family with sequence similarity 222 member B [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-04-29.
  17. "FAM222A orthologs". NCBI. Retrieved 2020-04-29.