JASPAR

Last updated
JASPAR
Database.png
Content
Descriptionan open-access database of transcription factor binding profiles
Data types
captured
Eukaryotic transcription factors, their binding sites and binding profiles
Organisms eukaryotes
Contact
AuthorsSandelin, A
Primary citationSandelin, A. et al. (2004) [1]
Release date2004
Access
Website http://jaspar.genereg.net/

JASPAR is an open access and widely used database of manually curated, non-redundant transcription factor (TF) binding profiles stored as position frequency matrices (PFM) and transcription factor flexible models (TFFM) [2] for TFs from species in six taxonomic groups. From the supplied PFMs, users may generate position-specific weight matrices (PWM). The JASPAR database was introduced in 2004. There were seven major updates and new releases in 2006, 2008, 2010, 2014, 2016, 2018, 2020 and 2022, which is the latest release of JASPAR. [3] [4] [5] [6] [7] [8] [9]

Contents

 [10] 


Availability

The JASPAR database is an open-source and freely available for scientific community at http://jaspar.genereg.net/.

Similar databases

Related Research Articles

<span class="mw-page-title-main">Transcription factor</span> Protein that regulates the rate of DNA transcription

In molecular biology, a transcription factor (TF) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The function of TFs is to regulate—turn on and off—genes in order to make sure that they are expressed in the desired cells at the right time and in the right amount throughout the life of the cell and the organism. Groups of TFs function in a coordinated fashion to direct cell division, cell growth, and cell death throughout life; cell migration and organization during embryonic development; and intermittently in response to signals from outside the cell, such as a hormone. There are 1500-1600 TFs in the human genome. Transcription factors are members of the proteome as well as regulome.

A regulatory sequence is a segment of a nucleic acid molecule which is capable of increasing or decreasing the expression of specific genes within an organism. Regulation of gene expression is an essential feature of all living organisms and viruses.

<span class="mw-page-title-main">DNA-binding protein</span> Proteins that bind with DNA, such as transcription factors, polymerases, nucleases and histones

DNA-binding proteins are proteins that have DNA-binding domains and thus have a specific or general affinity for single- or double-stranded DNA. Sequence-specific DNA-binding proteins generally interact with the major groove of B-DNA, because it exposes more functional groups that identify a base pair.

A DNA-binding domain (DBD) is an independently folded protein domain that contains at least one structural motif that recognizes double- or single-stranded DNA. A DBD can recognize a specific DNA sequence or have a general affinity to DNA. Some DNA-binding domains may also include nucleic acids in their folded structure.

The Open Regulatory Annotation Database is designed to promote community-based curation of regulatory information. Specifically, the database contains information about regulatory regions, transcription factor binding sites, regulatory variants, and haplotypes.

<span class="mw-page-title-main">KLF5</span> Protein-coding gene in the species Homo sapiens

Krueppel-like factor 5 is a protein that in humans is encoded by the KLF5 gene.

<span class="mw-page-title-main">SOX6</span> Protein-coding gene in the species Homo sapiens

Transcription factor SOX-6 is a protein that in humans is encoded by the SOX6 gene.

Anders Krogh is a bioinformatician at the University of Copenhagen, where he leads the university's bioinformatics center. He is known for his pioneering work on the use of hidden Markov models in bioinformatics, and is co-author of a widely used textbook in bioinformatics. In addition, he also co-authored one of the early textbooks on neural networks. His current research interests include promoter analysis, non-coding RNA, gene prediction and protein structure prediction.

BIOBASE is an international bioinformatics company headquartered in Wolfenbüttel, Germany. The company focuses on the generation, maintenance, and licensing of databases in the field of molecular biology, and their related software platforms.

<span class="mw-page-title-main">DNA binding site</span> Regions of DNA capable of binding to biomolecules

DNA binding sites are a type of binding site found in DNA where other molecules may bind. DNA binding sites are distinct from other binding sites in that (1) they are part of a DNA sequence and (2) they are bound by DNA-binding proteins. DNA binding sites are often associated with specialized proteins known as transcription factors, and are thus linked to transcriptional regulation. The sum of DNA binding sites of a specific transcription factor is referred to as its cistrome. DNA binding sites also encompasses the targets of other proteins, like restriction enzymes, site-specific recombinases and methyltransferases.

<span class="mw-page-title-main">Therapeutic Targets Database</span> Database of protein targets in drug design

Therapeutic Target Database (TTD) is a pharmaceutical and medical repository constructed by the Innovative Drug Research and Bioinformatics Group (IDRB) at Zhejiang University, China and the Bioinformatics and Drug Design Group at the National University of Singapore. It provides information about known and explored therapeutic protein and nucleic acid targets, the targeted disease, pathway information and the corresponding drugs directed at each of these targets. Detailed knowledge about target function, sequence, 3D structure, ligand binding properties, enzyme nomenclature and drug structure, therapeutic class, and clinical development status. TTD is freely accessible without any login requirement at https://idrblab.org/ttd/.

YEASTRACT is a curated repository of more than 48000 regulatory associations between transcription factors (TF) and target genes in Saccharomyces cerevisiae, based on more than 1200 bibliographic references. It also includes the description of about 300 specific DNA binding sites for more than a hundred characterized TFs. Further information about each Yeast gene has been extracted from the Saccharomyces Genome Database (SGD). For each gene the associated Gene Ontology (GO) terms and their hierarchy in GO was obtained from the GO consortium. Currently, YEASTRACT maintains more than 7100 terms from GO. The nucleotide sequences of the promoter and coding regions for Yeast genes were obtained from Regulatory Sequence Analysis Tools (RSAT). All the information in YEASTRACT is updated regularly to match the latest data from SGD, GO consortium, RSA Tools and recent literature on yeast regulatory networks.

In molecular biology, the BEN domain is a protein domain which is found in diverse proteins including:

TRANSFAC is a manually curated database of eukaryotic transcription factors, their genomic binding sites and DNA binding profiles. The contents of the database can be used to predict potential transcription factor binding sites.

<span class="mw-page-title-main">WRKY protein domain</span> Protein domain

The WRKY domain is found in the WRKY transcription factor family, a class of transcription factors. The WRKY domain is found almost exclusively in plants although WRKY genes appear present in some diplomonads, social amoebae and other amoebozoa, and fungi incertae sedis. They appear absent in other non-plant species. WRKY transcription factors have been a significant area of plant research for the past 20 years. The WRKY DNA-binding domain recognizes the W-box (T)TGAC(C/T) cis-regulatory element.

Transcription factors are proteins that bind genomic regulatory sites. Identification of genomic regulatory elements is essential for understanding the dynamics of developmental, physiological and pathological processes. Recent advances in chromatin immunoprecipitation followed by sequencing (ChIP-seq) have provided powerful ways to identify genome-wide profiling of DNA-binding proteins and histone modifications. The application of ChIP-seq methods has reliably discovered transcription factor binding sites and histone modification sites.

<span class="mw-page-title-main">Ivan Erill</span> Spanish computational biologist

Ivan Erill is a Spanish computational biologist known for his research in comparative genomics and molecular microbiology. His work focuses primarily on bacterial comparative genomics, through the development of computational methods for analyzing regulatory networks and their evolution.

HOCOMOCO is an open-access database providing curated and benchmarked binding motifs of human and mouse transcription factors. It captures the following data types: Homo sapiens (human) and Mus musculus (mouse) transcription factors, their DNA binding site motifs, and motif subtypes.

References

  1. Sandelin, A; Alkema, W; Engström, P; Wasserman, WW; Lenhard, B (1 January 2004). "JASPAR: an open-access database for eukaryotic transcription factor binding profiles". Nucleic Acids Research. 32 (Database issue): D91-4. doi:10.1093/nar/gkh012. PMC   308747 . PMID   14681366.
  2. Mathelier, A; Wasserman, W.W. (5 September 2013). "The Next Generation of Transcription Factor Binding Site Prediction". PLOS Computational Biology. 9 (9): e1003214. Bibcode:2013PLSCB...9E3214M. doi: 10.1371/journal.pcbi.1003214 . PMC   3764009 . PMID   24039567.
  3. Vlieghe, D; Sandelin, A; De Bleser, PJ; Vleminckx, K; Wasserman, WW; van Roy, F; Lenhard, B (1 January 2006). "A new generation of JASPAR, the open-access repository for transcription factor binding site profiles". Nucleic Acids Research. 34 (Database issue): D95-7. doi:10.1093/nar/gkj115. PMC   1347477 . PMID   16381983.
  4. Bryne, JC; Valen, E; Tang, MH; Marstrand, T; Winther, O; da Piedade, I; Krogh, A; Lenhard, B; Sandelin, A (January 2008). "JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update". Nucleic Acids Research. 36 (Database issue): D102-6. doi:10.1093/nar/gkm955. PMC   2238834 . PMID   18006571.
  5. Portales-Casamar, E; Thongjuea, S; Kwon, AT; Arenillas, D; Zhao, X; Valen, E; Yusuf, D; Lenhard, B; Wasserman, WW; Sandelin, A (January 2010). "JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles". Nucleic Acids Research. 38 (Database issue): D105-10. doi:10.1093/nar/gkp950. PMC   2808906 . PMID   19906716.
  6. Mathelier, A; Zhao, X; Zhang, AW; Parcy, F; Worsley-Hunt, R; Arenillas, DJ; Buchman, S; Chen, CY; Chou, A; Ienasescu, H; Lim, J; Shyr, C; Tan, G; Zhou, M; Lenhard, B; Sandelin, A; Wasserman, WW (January 2014). "JASPAR 2014: an extensively expanded and updated open-access database of transcription factor binding profiles". Nucleic Acids Research. 42 (Database issue): D142-7. doi:10.1093/nar/gkt997. PMC   3965086 . PMID   24194598.
  7. Mathelier, A; Fornes, O; Arenillas, DJ; Chen, CY; Denay, G; Lee, J; Shi, W; Shyr, C; Tan, G; Worsley-Hunt, R; Zhang, AW; Parcy, F; Lenhard, B; Sandelin, A; Wasserman, WW (4 January 2016). "JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles". Nucleic Acids Research. 44 (D1): D110-5. doi:10.1093/nar/gkv1176. PMC   4702842 . PMID   26531826.
  8. Khan, Aziz; Fornes, Oriol; Stigliani, Arnaud; Gheorghe, Marius; Castro-Mondragon, Jaime A.; van der Lee, Robin; Bessy, Adrien; Chèneby, Jeanne; Kulkarni, Shubhada R.; Tan, Ge; Baranasic, Damir; Arenillas, David J.; Sandelin, Albin; Vandepoele, Klaas; Lenhard, Boris; Ballester, Benoît; Wasserman, Wyeth W.; Parcy, François; Mathelier, Anthony (13 November 2017). "JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework". Nucleic Acids Research. 46 (D1): D260–D266. doi:10.1093/nar/gkx1126. PMC   5753243 . PMID   29140473.
  9. Fornes, Oriol; Castro-Mondragon, Jaime A.; Khan, Aziz; van der Lee, Robin; Zhang, Xi; Richmond, Phillip A.; Modi, Bhavi P.; Correard, Solenne; Gheorghe, Marius; Baranašić, Damir; Santana-Garcia, Walter; Tan, Ge; Chèneby, Jeanne; Ballester, Benoit; Parcy, François; Sandelin, Albin; Lenhard, Boris; Wasserman, Wyeth W.; Mathelier, Anthony (8 January 2020). "JASPAR 2020: update of the open-access database of transcription factor binding profiles". Nucleic Acids Research. 48 (D1): D87–D92. doi:10.1093/nar/gkz1001. PMC   7145627 . PMID   31701148.
  10. Xuan Lin QX, Sian S, An O, Thieffry D, Jha S, Benoukraf T (January 2019). "MethMotif: an integrative cell specific database of transcription factor binding motifs coupled with DNA methylation profiles". Nucleic Acids Research. 47 (D1): D145–D154. doi:10.1093/nar/gky1005. PMC   6323897 . PMID   30380113.