YIF1A

Last updated
YIF1A
Identifiers
Aliases YIF1A , 54TM, FinGER7, YIF1, YIF1P, Yip1 interacting factor homolog A, membrane trafficking protein
External IDs OMIM: 611484 MGI: 1915340 HomoloGene: 56295 GeneCards: YIF1A
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001300861
NM_020470

NM_026553

RefSeq (protein)

NP_001287790
NP_065203

NP_080829

Location (UCSC) Chr 11: 66.28 – 66.29 Mb Chr 19: 5.14 – 5.14 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Protein YIF1A is a Yip1 domain family proteins that in humans is encoded by the YIF1A gene. [5] [6] [7] [8]

Contents

Gene

YIF1A (Yip1 interacting factor homolog A) is also known as YIF1, YIF1P, FinGER7, and 54TM. [9] It has 4,591 base pairs with 8 exons, and it is located on the minus strand of chromosome 11, at 11q13.2, in humans. [10]

Promoters

There are four predicted promoter for YIIF1A. [11] The predicted promoter region with highest confidence is GXP_50494 and has 1252 base pairs long; it extends past the first exon of YIF1A. This promoter is located on the minus strand of chromosome 11.

Transcription factors

The promoter of YIF1A transcript variant 1 contains numerous transcription factor binding sites. [12] Transcription factors predicted to bind to the promoter region include the following.

Expression

The expression of YIF1A is highest in the duodenum and liver. It is also expressed at moderate levels in tissues including the colon, ovary, pancreases, spleen, and esophagus, and expressed at lower levels in a variety of other tissues. [13] [14] [15] NCBI GeoProfile data provide the tissue expression graph for YIF1A in humans; it also indicates that YIF1A is expressed at moderately to moderately low across all other tissues. [16]

mRNA

Schematic illustration of YIF1A, with domains and post-translation modifications. Schematic illustration of YIF1A.png
Schematic illustration of YIF1A, with domains and post-translation modifications.

YIF1A has isoforms 1 and 2, with exons 8 and 7 respectively. [10] The two transcripts undergo alternate splicing and are translated into proteins with 293 and 241 amino acids, respectively. [17] [18]

RNA-binding proteins

The 5' untranslated region has predicted sites for binding by RBXM, EIF4B, and FUS. The 3' untranslated region has predicted sites for binding by ELAVL1, which is AU rich elements and regulate mRNA stability. [19]

Protein

The longest protein isoform of YIF1A is 293 amino acids in length. It has an observed molecular weight of approximately 32.0 kDa with a predicted isoelectric point of approximately 8.98. [18] [20] [21]

Composition

YIF1 is a very normal protein in terms of the amino acid quantities it contains. The composition of each amino acid residue is similar to its average relative composition among human proteins. There are no charge clusters, runs, or patterns. There is a repetitive structure for protein YIF1A at [ 201- 204 and 288- 291 ] TFHL. [20]

Domain and motifs

YIF1A has a conserved domain, pfam03878 (AA 57 →287). [10] Within the domain, there are 5 transmembrane domains, 3 non-cytosolic domains, and 3 cytosolic domains. It has been hypothesized that there is a possible role in transport between the endoplasmic reticulum and Golgi. [9]

Structure

YIF1A protein structure generated by I-Tasser and visualized with iCn3D. Transmembrane domains are red, non-cytosolic domains are yellow, and cytosolic domains are deep pink. YIF1A tertiary structure.png
YIF1A protein structure generated by I-Tasser and visualized with iCn3D. Transmembrane domains are red, non-cytosolic domains are yellow, and cytosolic domains are deep pink.

The structure of YIF1A consist of approximately 59% alpha-helices, with TM helix and disordered regions making up the rest of the structure; no beta- strand was predicted. [24]

Localization

YIF1A's predicted location is in the endoplasmic reticulum, with intracellular N-terminus and an extracellular C-terminus. [25] [26]

Post-translational modifications

YIF1A undergoes methionine cleavage and N-terminal acetylation, which is one of the most common post translation modifications of eukaryotic proteins. [27] It also phosphorylated by unspecified kinases at several sites. [28] Three glycation site is predicted in lysine residue(lys 104,161, and 211). [29] YIF1A undergoes O-ß-GlcNAc modification at 5 sites, 1 of them being Yin-Yang sites. [30]

Interacting protein

Based on fluorescence microscopy, validated two hybrid, and anti tag coimmunoprecipitation, the protein that is most likely to interact with YIF1A are GPR37, SEC23IP, REEP2, and YIPF5. Studies suggest that interaction between VAPB and YIF1A control membrane delivery into dendrites. [31] It also participates in ER unfolded protein response (UPR) by inducing ERN1/IRE1. [32] Additionally, the YIF1A protein interacts with the M protein of SARS-Cov-2. [33]

Conceptual translation of Hsa_YIF1A transcript variant 1, mRNA (NM 020470) Conceptual translation ~YIF1A.png
Conceptual translation of Hsa_YIF1A transcript variant 1, mRNA (NM 020470)

Homology

YIF1A has a single Paralog called YIF1B, which is located on human chromosome 19. [9] YIF1A has 238 identified orthologs. [34] The ortholog contains vertebrates such as mammals, amphibians, and reptiles. It also has invertebrates species such as Insecta, Anthozoa, and Ascidiacea. No ortholog was found in protists, bacteria, or archaea.

The following table provides a sample of the ortholog of YIF1A.

Genus and speciesAccession Number [10] Date of Divergence (MYA) [35] Sequence Length(AA)Sequence Identity [36]
Homo sapiens (Human) NP_065203 0293100
Aotus nancymaae (Ma's night monkey) XP_012318344 4331794
Mus musculus (Mouse) NP_080829 9029393
Sus scrofa (Wild Boar) XP_013849519 9631192
Delphinapterus leucas (White whale) XP_022447094 9630691
Phascolarctos cinereus (Koala) XP_020823757 15929388
Ornithorhynchus anatinus (Platypus) XP_028915982 17729388
Chelonia mydas (Green turtle) XP_007056281 31224078
Chrysemys picta bellii (Painted turtle) XP_005305497 31229373
Microcaecilia unicolor (Amph.) XP_029470520 35230672
Rhinatrema bivittatum (Two-lined caecilian) XP_029470520 35230771
Latimeria chalumnae (Gombessa) XP_014345204 41329671
Salmo trutta (Brown trou) XP_029585843 43530970
Echeneis naucrates (live sharksucker) XP_029368074 43530866
Danio rerio (Zebrafish) NP_956225 43530765
Maylandia zebra (zebra mbuna) XP_004545672 43530863
Saccharomyces cerevisiae S288C (Baker's yeast) NP_014136 101731433
Physcomitrium patens (moss)XP_024362517127528230

Related Research Articles

<span class="mw-page-title-main">C11orf49</span> Protein-coding gene in the species Homo sapiens

C11orf49 is a protein coding gene that in humans encodes for the C11orf49 protein. It is heavily expressed in brain tissue and peripheral blood mononuclear cells, with the latter being an important component of the immune system. It is predicted that the C11orf49 protein acts as a kinase, and has been shown to interact with HTT and APOE2.

<span class="mw-page-title-main">FAM185A</span> Gene of the species Homo sapiens

The FAM185A is a protein that in humans is encoded by the FAM185A gene. The FAM185A gene is found on the positive strand of Chromosome 7 at 7q22.1. The gene begins 102,389,399bp from the p-terminus of the chromosome and ends at 102,449,672bp from the p-terminus; it covers a total of 73,308 basepairs. The protein encoded by this gene is characterized by the presence of multiple copies of DUF4098 near its C-terminus. It is described as a Long Interspersed Nuclear Element (LINE), a subclass of penaeid repetitive elements (PREs).

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.

<span class="mw-page-title-main">TMEM176B</span> Protein-coding gene in the species Homo sapiens

Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.

Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.

<span class="mw-page-title-main">CRACD-like protein</span>

CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.

<span class="mw-page-title-main">TEX9</span> Protein-coding gene in the species Homo sapiens

Testis-expressed protein 9 is a protein that in humans is encoded the TEX9 gene. TEX9 that encodes a 391-long amino acid protein containing two coiled-coil regions. The gene is conserved in many species and encodes orthologous proteins in eukarya, archaea, and one species of bacteria. The function of TEX9 is not yet fully understood, but it is suggested to have ATP-binding capabilities.

FAM71E2, also known as Family With Sequence Similarity 71 Member E2, is a protein that, in humans, is encoded by the FAM71E2 gene. Aliases include C19orf16, Protein FAM71E2, Chromosome 19 open reading frame 16, and Putative Protein FAM71E2. The gene is primarily conserved in mammals, but it is also conserved in two reptile species.

Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development

<span class="mw-page-title-main">Transmembrane protein 179</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 179 is a protein that in humans is encoded by the TMEM179 gene. The function of transmembrane protein 179 is not yet well understood, but it is believed to have a function in the nervous system.

<span class="mw-page-title-main">C1orf185</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 185, also known as C1orf185, is a protein that in humans is encoded by the C1orf185 gene. In humans, C1orf185 is a lowly expressed protein that has been found to be occasionally expressed in the circulatory system.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">LSMEM2</span> Protein-coding gene in the species Homo sapiens

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">SNAP47</span>

Synaptosome-associated protein, 47 kDal (SNAP47) is a human protein encoded by the SNAP47 gene. Other aliases of this gene are SVAP1, HEL170, ESFI5812, and HEL-S-290. SNAP47 is a synaptosome protein which is associated with the protein coding in multiple diseases, including non small cell lung cancer and schizophrenia. SNAP47 is a member of the SNAP protein family. SNAP proteins are t-snare proteins that are a component of SNARE complex. The SNARE complex mediates vesicle fusion by creating tight complex that brings vesicle and membrane together. This protein causes ubiquitous expression in testis, ovary, and many other tissues

Transmembrane protein 39B (TMEM39B) is a protein that in humans is encoded by the gene TMEM39B. TMEM39B is a multi-pass membrane protein with eight transmembrane domains. The protein localizes to the plasma membrane and vesicles. The precise function of TMEM39B is not yet well-understood by the scientific community, but differential expression is associated with survival of B cell lymphoma, and knockdown of TMEM39B is associated with decreased autophagy in cells infected with the Sindbis virus. Furthermore, the TMEM39B protein been found to interact with the SARS-CoV-2 ORF9C protein. TMEM39B is expressed at moderate levels in most tissues, with higher expression in the testis, placenta, white blood cells, adrenal gland, thymus, and fetal brain.

<span class="mw-page-title-main">CCDC190</span> Protein found in humans

Coiled-Coil Domain Containing 190, also known as C1orf110, the Chromosome 1 Open Reading Frame 110, MGC48998 and CCDC190, is found to be a protein coding gene widely expressed in vertebrates. RNA-seq gene expression profile shows that this gene selectively expressed in different organs of human body like lung brain and heart. The expression product of c1orf110 is often called Coiled-coil domain-containing protein 190 with a size of 302 aa. It may get the name because a coiled-coil domain is found from position 14 to 72. At least 6 spliced variants of its mRNA and 3 isoforms of this protein can be identified, which is caused by alternative splicing in human.

<span class="mw-page-title-main">C5orf22</span> Protein-coding gene in the species Homo sapiens

Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000174851 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000024875 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Vitale G, Alexandrov K, Ullrich O, Horiuchi H, Giner A, Dobson C, et al. (Jan 1997). "The GDP/GTP cycle of Rab5 in the regulation of endocytotic membrane traffic". Cold Spring Harbor Symposia on Quantitative Biology. 60: 211–20. doi:10.1101/SQB.1995.060.01.024. PMID   8824393.
  6. Matern H, Yang X, Andrulis E, Sternglanz R, Trepte HH, Gallwitz D (September 2000). "A novel Golgi membrane protein is part of a GTPase-binding protein complex involved in vesicle targeting". The EMBO Journal. 19 (17): 4485–92. doi:10.1093/emboj/19.17.4485. PMC   302084 . PMID   10970842.
  7. Yoshida Y, Suzuki K, Yamamoto A, Sakai N, Bando M, Tanimoto K, et al. (November 2008). "YIPF5 and YIF1A recycle between the ER and the Golgi apparatus and are involved in the maintenance of the Golgi structure". Experimental Cell Research. 314 (19): 3427–43. doi:10.1016/j.yexcr.2008.07.023. PMID   18718466.
  8. "Entrez Gene: YIF1A Yip1 interacting factor homolog A (S. cerevisiae)".
  9. 1 2 3 "YIF1A related genes - GeneCards Search Results". www.genecards.org. Retrieved 2020-06-21.
  10. 1 2 3 4 "YIF1A - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-06-21.
  11. "Genomatix: Genome Annotation and Browser: Query Input". www.genomatix.de. Retrieved 2020-07-30.
  12. "Genomatix: MatInspector Input". www.genomatix.de. Retrieved 2020-08-03.
  13. Fagerberg L, Hallström BM, Oksvold P, Kampf C, Djureinovic D, Odeberg J, et al. (February 2014). "Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics". Molecular & Cellular Proteomics. 13 (2): 397–406. doi:10.1074/mcp.M113.035600. PMC   3916642 . PMID   24309898.
  14. Duff MO, Olson S, Wei X, Garrett SC, Osman A, Bolisetty M, et al. (May 2015). "Genome-wide identification of zero nucleotide recursive splicing in Drosophila". Nature. 521 (7552): 376–9. Bibcode:2015Natur.521..376D. doi:10.1038/nature14475. PMC   4529404 . PMID   25970244.
  15. Szabo L, Morey R, Palpant NJ, Wang PL, Afari N, Jiang C, et al. (June 2015). "Statistically based splicing detection reveals neural enrichment and tissue-specific induction of circular RNA during human fetal development". Genome Biology. 16 (1): 126. doi: 10.1186/s13059-015-0690-5 . PMC   4506483 . PMID   26076956.
  16. "GDS596 / 202418_at". www.ncbi.nlm.nih.gov. Retrieved 2020-08-02.
  17. "protein YIF1A isoform 2 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-07-28.
  18. 1 2 "protein YIF1A isoform 1 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-07-28.
  19. "RBPDB: The database of RNA-binding specificities". rbpdb.ccbr.utoronto.ca. Retrieved 2020-08-01.
  20. 1 2 "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2020-07-28.
  21. "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2020-07-28.
  22. "I-TASSER server for protein structure and function prediction". zhanglab.ccmb.med.umich.edu. Retrieved 2020-08-01.
  23. "iCn3D: Web-based 3D Structure Viewer". www.ncbi.nlm.nih.gov. Retrieved 2020-08-01.
  24. "NPS@ : GOR4 secondary structure prediction". npsa-prabi.ibcp.fr. Retrieved 2020-07-28.
  25. "PredictProtein - Protein Sequence Analysis, Prediction of Structural and Functional Features". www.predictprotein.org. Retrieved 2020-07-28.
  26. "Phobius". phobius.sbc.su.se. Retrieved 2020-07-28.
  27. "TERMINUS - Welcome to terminus". terminus.unige.ch. Retrieved 2020-07-28.
  28. "NetPhosK 1.0 Server". www.cbs.dtu.dk. Archived from the original on 2021-07-09. Retrieved 2020-07-28.
  29. "NetGlycate 1.0 Server - prediction results". www.cbs.dtu.dk. Retrieved 2020-08-01.
  30. "YinOYang 1.2 Server". www.cbs.dtu.dk. Retrieved 2020-07-28.
  31. Kuijpers M, Yu KL, Teuling E, Akhmanova A, Jaarsma D, Hoogenraad CC (July 2013). "The ALS8 protein VAPB interacts with the ER-Golgi recycling protein YIF1A and regulates membrane delivery into dendrites". The EMBO Journal. 32 (14): 2056–72. doi:10.1038/emboj.2013.131. PMC   3715857 . PMID   23736259.
  32. "YIF1A protein (human) - STRING interaction network". string-db.org. Retrieved 2020-07-29.
  33. Mahen, Robert (2020-04-09). "A SARS-CoV-2-Human Protein-Protein Interaction Map Reveals Drug Targets and Potential Drug-Repurposing". doi:10.1242/prelights.18355. S2CID   243418486 . Retrieved 2020-08-05.{{cite journal}}: Cite journal requires |journal= (help)
  34. "Nucleotide BLAST: Search nucleotide databases using a nucleotide query". blast.ncbi.nlm.nih.gov. Retrieved 2020-08-03.
  35. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2020-07-02.
  36. "Human BLAT Search". genome.ucsc.edu. Retrieved 2020-07-02.

Further reading