FAM221A

Identifiers
Aliases: FAM221A, C7orf46, family with sequence similarity 221 member A
External IDs: MGI: 2442161; HomoloGene: 18214; GeneCards: FAM221A

Orthologs (Human; Mouse)
RefSeq (mRNA), Human: NM_001127364, NM_001127365, NM_001300932, NM_199136
RefSeq (mRNA), Mouse: NM_001172216, NM_172727
RefSeq (protein), Human: NP_001120836, NP_001120837, NP_001287861, NP_954587
RefSeq (protein), Mouse: NP_001165687, NP_766315
Location (UCSC), Human: Chr 7: 23.68 – 23.7 Mb
Location (UCSC), Mouse: Chr 6: 49.34 – 49.37 Mb
PubMed search: Human [3]; Mouse [4]

Family with sequence similarity 221 member A is a protein that in humans is encoded by the FAM221A gene. The function of FAM221A is not yet well understood, but the gene may have a role in Parkinson's disease and prostate cancer.

Gene

Location and Aliases

FAM221A is located on chromosome 7 at cytogenetic band 7p15.3. [5] It has one alias, C7orf46. [6]

Expression

FAM221A is expressed at elevated levels in the liver, brain, fetal brain, thyroid and colon, with the highest expression in the spinal cord, pancreas and retina. [7]

The promoter region of FAM221A is 1222 base pairs long, as identified using ElDorado at Genomatix. [8]

Protein

Protein Analysis

The molecular weight of FAM221A is 33.1 kDa, [9] and its isoelectric point is 6.01. [10] Compared with other human proteins, FAM221A has a lower asparagine content. [9]
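As a rough illustration of how figures like these are derived from a sequence, the sketch below sums standard average residue masses and reports the fraction of one residue type. The function names and the toy peptide are hypothetical; it is not the SAPS calculation cited above, and the FAM221A sequence itself is not reproduced here. The final lines only check that 33.1 kDa spread over the 298 residues listed for the human protein in the ortholog table gives a typical mean residue mass.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def molecular_weight(sequence: str) -> float:
    """Return the average molecular weight of a peptide in daltons."""
    return sum(RESIDUE_MASS[aa] for aa in sequence.upper()) + WATER


def residue_fraction(sequence: str, residue: str) -> float:
    """Return the fraction of positions occupied by one residue type."""
    return sequence.upper().count(residue.upper()) / len(sequence)


# Hypothetical toy peptide, NOT the FAM221A sequence.
toy = "MKNLEAG"
print(round(molecular_weight(toy), 2))
print(round(residue_fraction(toy, "N"), 3))

# Sanity check on the quoted figures: 33.1 kDa over 298 residues.
mean_residue_mass = 33100 / 298
print(round(mean_residue_mass, 1))
```

A mean residue mass near 110 Da is typical for proteins, so the quoted weight and length are mutually consistent.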

Post-Translational Modifications

Predicted post-translational modifications of FAM221A include phosphorylation, glycosylation and sulfation sites. These sites are conserved in species other than Homo sapiens, including the macaque, whale, finch and, in some cases, alligator. The sites were predicted using NetPhos 3.1, [11] YinOYang 1.2 [12] and The Sulfinator. [13]

Secondary Structure

The predicted secondary structure of FAM221A consists mainly of random coils (71% of the protein) and alpha helices (21%), with extended strands making up a further 7%. Secondary structure was predicted using RaptorX, [14] and a diagram of the prediction is included below.

Figure: Secondary structure prediction of FAM221A using RaptorX.
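Percentages such as those quoted for coils, helices and strands can be tallied from a per-residue prediction string. The sketch below assumes a three-state encoding (H for helix, E for extended strand, C for coil); the function name and the toy string are hypothetical and do not reproduce the RaptorX output for FAM221A.

```python
from collections import Counter


def secondary_structure_composition(states: str) -> dict:
    """Return the percentage of residues in each predicted state."""
    counts = Counter(states)
    total = len(states)
    return {state: 100 * n / total for state, n in counts.items()}


# Hypothetical 20-residue prediction, NOT the RaptorX output for FAM221A.
toy_prediction = "CCCCHHHHHHCCCCEECCCC"
composition = secondary_structure_composition(toy_prediction)
for state in "CHE":
    print(state, f"{composition.get(state, 0.0):.0f}%")
```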

Homology/evolution

Paralogs

FAM221A has one paralog, FAM221B, which diverged from FAM221A approximately 1781 million years ago.

Orthologs

Orthologs have been found in mammals, birds, reptiles and fish. FAM221A is also conserved in invertebrates, although sequence similarity falls off more quickly in these species. Orthologs were identified using BLAST [15] and BLAT. [16] The table below lists 20 orthologs; others exist. The ortholog without an accession number was obtained using BLAT.

20 Orthologs of FAM221A
Species | Common Name | Divergence (mya) | Accession Number | Length (aa) | % Identity | % Similarity
Homo sapiens | Human | 0 | NP_954587.2 | 298 | 100 | 100
Macaca nemestrina | Southern pig-tailed macaque | 28.1 | XP_011729478.1 | 298 | 96 | 96
Condylura cristata | Star-nosed mole | 94 | XP_004677186.2 | 284 | 90 | 94
Cervus elaphus hippelaphus | Central European red deer | 94 | OWK06795.1 | 289 | 90 | 93
Delphinapterus leucas | Beluga whale | 94 | XP_022440764.1 | 298 | 90 | 92
Alligator mississippiensis | American alligator | 320 | KYO26809.1 | 366 | 78 | 86
Phalacrocorax carbo | Great cormorant | 320 | KFW96932.1 | 258 | 77 | 87
Lonchura striata domestica | Society finch | 320 | XP_021393915.1 | 298 | 76 | 85
Pelodiscus sinensis | Chinese softshell turtle | 320 | XP_014436679.1 | 236 | 76 | 85
Gallus gallus | Red junglefowl | 320 | XP_418719.1 | 296 | 75 | 84
Crocodylus porosus | Saltwater crocodile | 320 | XP_019390202.1 | 236 | 75 | 84
Amphiprion ocellaris | Ocellaris clownfish | 432 | XP_023141881.1 | 248 | 63 | 75
Salvelinus alpinus | Arctic char | 432 | XP_023832019.1 | 372 | 59 | 71
Esox lucius | Northern pike | 432 | XP_010891304.1 | 332 | 55 | 69
Ciona intestinalis | Vase tunicate | 678 | N/A | 212 | 77 | 87
Stylophora pistillata | Stylophora pistillata | 685 | XP_022787363.1 | 344 | 58 | 73
Schistosoma haematobium | Urinary blood fluke | 692 | XP_012794504.1 | 241 | 45 | 61
Crassostrea virginica | Eastern oyster | 794 | XP_022337450.1 | 324 | 59 | 72
Mizuhopecten yessoensis | Patinopecten yessoensis | 794 | XP_021377417.1 | 326 | 55 | 70
Phytophthora nicotianae | Black shank | 1781 | KUF80258.1 | 297 | 34 | 48
Chrysochromulina sp. CCMP291 | Chrysochromulina tobin | 1781 | KOO33212.1 | 280 | 28 | 42
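For readers unfamiliar with the % Identity column, the sketch below shows one common way such a figure is computed from a pairwise alignment: identical columns divided by the alignment length, which is the convention BLAST uses when reporting identities. The function name and the toy alignment are hypothetical; the alignments behind the table are not reproduced here. Percent similarity additionally counts substitutions that score positively in a substitution matrix and is not implemented in this sketch.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns that hold identical residues.

    Both inputs must already be aligned (same length, '-' for gaps).
    Gap columns count toward the alignment length but never as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100 * matches / len(aligned_a)


# Hypothetical toy alignment, NOT taken from FAM221A or its orthologs.
query = "MKT-LLE"
subject = "MKSALLD"
print(f"{percent_identity(query, subject):.1f}% identity")
```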

Divergence of FAM221A

To show when FAM221A diverged in different species, a graph was created comparing the evolutionary history of FAM221A with that of fibrinogen, which evolves quickly, and cytochrome c, which evolves slowly. As seen in the graph, FAM221A diverges at a moderate pace.

Figure: Evolutionary timeline for FAM221A in orthologs found.
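The data behind a divergence graph of this kind can be sketched from the ortholog table: each ortholog contributes a divergence time and a sequence difference relative to the human protein. The source does not state how divergence was calculated for the graph, so the Poisson correction used below, m = 100 ln(100 / identity), is an assumption chosen for illustration, and the function name is hypothetical. Only a subset of the table rows is included.

```python
import math

# (species, divergence time in mya, % identity to human),
# copied from a subset of rows in the ortholog table.
ORTHOLOGS = [
    ("Macaca nemestrina", 28.1, 96),
    ("Delphinapterus leucas", 94, 90),
    ("Gallus gallus", 320, 75),
    ("Esox lucius", 432, 55),
    ("Crassostrea virginica", 794, 59),
    ("Phytophthora nicotianae", 1781, 34),
]


def corrected_divergence(identity_percent: float) -> float:
    """Poisson-corrected changes per 100 residues (assumed correction).

    Equivalent to -100 * ln(1 - d), where d is the observed fraction
    of differing residues.
    """
    return 100 * math.log(100 / identity_percent)


# Columns: species, divergence time (mya), observed % difference,
# corrected divergence per 100 residues.
for species, mya, identity in ORTHOLOGS:
    observed = 100 - identity
    print(f"{species:25s} {mya:7.1f} {observed:3d} "
          f"{corrected_divergence(identity):6.1f}")
```

The correction allows for multiple substitutions at the same site, so the corrected value always exceeds the observed difference, and the gap widens for distant orthologs.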

Clinical significance

FAM221A is expressed at relatively high levels in the brain [17] and has been associated with neurodegenerative disorders such as Parkinson's disease [17] and Alzheimer's disease. [18] FAM221A expression has also been reported to be higher in people with prostate cancer than in healthy individuals, [19] and the gene is expressed in colorectal tumors. [20]

Interacting Proteins

Three interacting proteins have been identified: SNX2, SNX5 and SNX6.

SNX2 and SNX6 are both involved in several stages of intracellular trafficking. SNX5 facilitates cargo retrieval from endosomes to the trans-Golgi network.

References

  1. GRCh38: Ensembl release 89: ENSG00000188732 - Ensembl, May 2017.
  2. GRCm38: Ensembl release 89: ENSMUSG00000047115 - Ensembl, May 2017.
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "Entrez Gene: Family with sequence similarity 221 member A". Retrieved 2016-07-20.
  6. GeneCards Human Gene Database. "FAM221A Gene - GeneCards - F221A Protein - F221A Antibody". www.genecards.org.
  7. "GDS3113 / 125374". www.ncbi.nlm.nih.gov.
  8. "Genomatix". www.genomatix.de.
  9. "SAPS < Sequence Statistics < EMBL-EBI".
  10. "Calculation of Protein Isoelectric Point".
  11. "NetPhos 3.1 Server". www.cbs.dtu.dk.
  12. "YinOYang 1.2 Server". www.cbs.dtu.dk.
  13. "ExPASy - Sulfinator tool". web.expasy.org.
  14. "RaptorX". http://raptorx.uchicago.edu/.
  15. "NCBI BLAST".
  16. "UCSC BLAT".
  17. Mariani E, Frabetti F, Tarozzi A, Pelleri MC, Pizzetti F, Casadei R (2016). "Meta-Analysis of Parkinson's Disease Transcriptome Data Using TRAM Software: Whole Substantia Nigra Tissue and Single Dopamine Neuron Differential Gene Expression". PLOS ONE. 11 (9): e0161567. Bibcode:2016PLoSO..1161567M. doi:10.1371/journal.pone.0161567. PMC 5017670. PMID 27611585.
  18. Thonberg H, Chiang HH, Lilius L, Forsell C, Lindström AK, Johansson C, Björkström J, Thordardottir S, Sleegers K, Van Broeckhoven C, Rönnbäck A, Graff C (June 2017). "Identification and description of three families with familial Alzheimer disease that segregate variants in the SORL1 gene". Acta Neuropathologica Communications. 5 (1): 43. doi:10.1186/s40478-017-0441-9. PMC 5465543. PMID 28595629.
  19. Arredouani MS, Lu B, Bhasin M, Eljanne M, Yue W, Mosquera JM, Bubley GJ, Li V, Rubin MA, Libermann TA, Sanda MG (September 2009). "Identification of the transcription factor single-minded homologue 2 as a potential biomarker and immunotherapy target in prostate cancer". Clinical Cancer Research. 15 (18): 5794–802. doi:10.1158/1078-0432.CCR-09-0911. PMC 5573151. PMID 19737960.
  20. Khamas A, Ishikawa T, Shimokawa K, Mogushi K, Iida S, Ishiguro M, Mizushima H, Tanaka H, Uetake H, Sugihara K (2012). "Screening for epigenetically masked genes in colorectal cancer Using 5-Aza-2'-deoxycytidine, microarray and gene expression profile". Cancer Genomics & Proteomics. 9 (2): 67–75. PMID 22399497.