PHI-base

PHI-base
Content
Description	Pathogen-Host Interactions database
Data types; captured	phenotypes of microbial mutants
Organisms	~290 fungal, bacterial and protist pathogens of agronomic and medical importance tested on ~240 hosts
Contact
Research center	Rothamsted Research
Primary citation	PMID 39588765
Release date	May 2005
Access
Data format	XML, FASTA
Website	phibase.org
Tools
Web	PHI-base Search PHIB-BLAST PHI-Canto (Author curation)
Miscellaneous
License	Creative Commons Attribution-NoDerivatives 4.0 International License
Versioning	Yes
Data release; frequency	6 monthly
Version	4.18 (May 2025)
Curation policy	Manual Curation

Last updated January 19, 2026

The Pathogen-Host Interactions database (PHI-base)^[1] is a biological database that contains manually curated information on genes experimentally proven to affect the outcome of pathogen-host interactions. The database has been maintained by researchers at Rothamsted Research and external collaborators since 2005.^[2]^[3]^[4]^[5] PHI-base has been part of the UK node of ELIXIR, the European life-science infrastructure for biological information, since 2016.^[6]

Background

The Pathogen-Host Interactions database was developed to utilise the growing number of verified genes that mediate an organism's ability to cause disease and/or trigger host responses.^[7]

The web-accessible database catalogues experimentally verified pathogenicity, virulence, and effector genes from bacterial, fungal, and oomycete pathogens which infect animal, plant, and fungal hosts. PHI-base was the first online resource devoted to the identification and presentation of information on fungal and oomycete pathogenicity genes and their host interactions. PHI-base is a resource for the discovery of candidate targets in medically and agronomically important fungal and oomycete pathogens for intervention with synthetic chemistries and natural products (fungicides).^[8]^[9]

Each entry in PHI-base is curated by domain experts and supported by strong experimental evidence (gene disruption experiments) as well as literature references in which the experiments are described. Each gene in PHI-base is presented with its nucleotide and deduced amino acid sequence as well as a detailed structured description of the predicted protein's function during the host infection process. To facilitate data interoperability, genes are annotated using controlled vocabularies (Gene Ontology terms, EC Numbers, etc.), and links to other external data sources such as UniProt, EMBL, and the NCBI taxonomy services.

In 2016 the plant portion of PHI-base was used to establish a Semantic PHI-base search tool. PHI-base has been aligned with Ensembl Genomes since 2011, FungiDB since 2016, Global Biotic Interactions (GloBI) since 2018 and Uniprot since 2020. All new PHI-base releases are integrated by these independent databases.

Community curation

Since 2015, the website has been linked to an online literature curation tool called PHI-Canto, enabling community-driven literature curation for various pathogenic species.^[10] PHI-Canto employs a community curation framework that not only offers a curation tool but also includes a phenotype ontology and controlled vocabularies using unified languages and rules used in biology experiments. The central concept of this framework is the introduction of a 'Metagenotype', which allows the annotation and assignment of phenotypes to specific pathogen mutant-host interactions. PHI-Canto extends the single species curation tool developed for PomBase,^[11] the model organism database for fission yeast.

Current developments

Version 4.18 (May 2025) of PHI-base ^[1] provides information on 10,614 genes from 335 pathogens and 265 hosts and their impact on 23,497 interactions as well on efficacy information on ~20 drugs and the target sequences in the pathogen. PHI-base currently focuses on plant pathogenic and human pathogenic organisms including fungi, oomycetes, and bacteria. The entire contents of the database can be downloaded in a tab delimited format. Since the launch of version 4, the PHI-base is also searchable using the PHIB-BLAST search tool, which uses the BLAST algorithm to compare a user's sequence against the sequences available from PHI-base.^[12] A new gene-centric version of PHI-base (version 5.0) was released in March 2024 and announced in a press release on the Rothamsted Research website.^[13] PHI-base 5 reorganised the data structure to group interactions by pathogen and host genes, expanded the use of ontologies for phenotype annotation, and added support for phenotypes relating to gene-for-gene relationships as well as host and pathogen phenotypes observed in vitro. Both PHI-base version 4 and version 5 are currently available online in parallel during the transition period, giving users time to familiarise themselves with the new interface before PHI-base 4 is phased out in mid 2026. New data mining tools are also being developed to support version 5. All articles that cite PHI-base are listed in the database’s ‘About us’ section, ordered by year.

Applications

PHI-base is a resource for many applications including:

The discovery of conserved genes in medically and agronomically important pathogens, which may be potential targets for chemical intervention
Comparative genome analyses
Annotation of newly sequenced pathogen genomes
Functional interpretation of RNA sequencing and microarray experiments
The rapid cross-checking of phenotypic differences between pathogenic species when writing articles for peer review

PHI-base use has been cited in over 900 peer-reviewed articles.^[1]

Funding

PHI-base is funded by the Biotechnology and Biological Sciences Research Council (BBSRC)^[7] through the Growing Health and Delivering Sustainable Wheat Institute Strategic Programmes.

References

1 2 3 Urban, M.; Cuzick, A.; Seager, J.; Nonavinakere, N.; Sahoo, J.; Sahu, P.; Iyer, V. L.; Khamari, L.; Carbajo Martinez, M.; Hammond-Kosack, K.E. (2025). "PHI-base – the multi-species pathogen–host interaction database in 2025". Nucleic Acids Research. 53 (Database Issue): D826-838. doi:10.1093/nar/gkae1084. PMC 11701570 . PMID 39588765.
↑ Winnenburg, R.; Baldwin, T.K.; Urban, M.; Rawlings, C.; Köhler, J.; Hammond-Kosack, K.E. (2014). "PHI-base: a new database for pathogen host interactions". Nucleic Acids Research. 34 (Database Issue): D459-464. doi:10.1093/nar/gkj047. PMC 1347410 . PMID 16381911.
↑ Baldwin, T.K.; Winnenburg, R.; Urban, M.; Rawlings, C.; Köhler, J.; Hammond-Kosack, K.E. (2006). "The pathogen-host interactions database (PHI-base) provides insights into generic and novel themes of pathogenicity". Molecular Plant-Microbe Interactions. 19 (12): 1451–1462. Bibcode:2006MPMI...19.1451B. doi: 10.1094/mpmi-19-1451 . PMID 17153929.
↑ Winnenburg, R.; Urban, M.; Beacham, A.; Baldwin, T.K.; Holland, S.; Lindeberg, M.; Hansen, H.; Rawlings, C.; Hammond-Kosack, K.E.; Köhler, J. (2008). "PHI-base update: additions to the pathogen host interactions database". Nucleic Acids Research. 36 (Database Issue): D572-576. doi:10.1093/nar/gkm858. PMC 2238852 . PMID 17942425.
↑ Urban, M.; Pant, R.; Raghunath, A.; Irvine, A.G.; Pedro, H.; Hammond-Kosack, K.E. (2015). "The Pathogen-Host Interactions database (PHI-base): additions and future developments". Nucleic Acids Research. 43 (Database Issue): D645–D655. doi:10.1093/nar/gku1165. PMC 4383963 . PMID 25414340.
↑ Urban, Martin; Cuzick, Alayne; Seager, James; Wood, Valerie; Rutherford, Kim; Venkatesh, Shilpa Yagwakote; Sahu, Jashobanta; Iyer, S. Vijaylakshmi; Khamari, Lokanath; De Silva, Nishadi; Martinez, Manuel Carbajo; Pedro, Helder; Yates, Andrew D.; Hammond-Kosack, Kim E. (2022-01-07). "PHI-base in 2022: a multi-species phenotype database for Pathogen-Host Interactions". Nucleic Acids Research. 50 (D1): D837–D847. doi:10.1093/nar/gkab1037. ISSN 1362-4962. PMC 8728202 . PMID 34788826.
1 2 Urban, M; Cuzick, A; Seager, J; Wood, V; Rutherford, K; Venkatesh, SY; De Silva, N; Martinez, MC; Pedro, H; Yates, AD; Hassani-Pak, K; Hammond-Kosack, KE (8 January 2020). "PHI-base: the pathogen-host interactions database". Nucleic Acids Research. 48 (D1): D613–D620. doi:10.1093/nar/gkz904. PMC 7145647 . PMID 31733065.
↑ Brown, N. A.; Urban, M.; Hammond-Kosack, K.E. (2016). "The trans-kingdom identification of negative regulators of pathogen hypervirulence". FEMS Microbiol Rev. 40 (1): 19–40. doi:10.1093/femsre/fuv042. PMC 4703069 . PMID 26468211.
↑ Urban, M.; Irvine, A. G.; Raghunath, A.; Cuzick, A.; Hammond-Kosack, K.E. (2015). "Using the pathogen-host interactions database (PHI-base) to investigate plant pathogen genomes and genes implicated in virulence". Front Plant Sci. 6: 605. Bibcode:2015FrPS....6..605U. doi: 10.3389/fpls.2015.00605 . PMC 4526803 . PMID 26300902.
↑ Cuzick, Alayne; Seager, James; Wood, Valerie; Urban, Martin; Rutherford, Kim; Hammond-Kosack, Kim E (2023-07-04). "A framework for community curation of interspecies interactions literature". eLife. 12. doi: 10.7554/elife.84658 . ISSN 2050-084X. PMC 10319440 . PMID 37401199.
↑ Rutherford KM, Lera-Ramírez M, Wood V (May 2024). "PomBase: a Global Core Biodata Resource-growth, collaboration, and sustainability". Genetics. 227 (1). doi:10.1093/genetics/iyae007. PMC 11075564 . PMID 38376816.
↑ Urban, M.; Cuzick, A.; Rutherford, K.; Irvine, A. G.; Pedro, H.; Pant, R.; Sadanadan, V.; Khamari, L.; Billal, S.; Mohanty, S.; Hammond-Kosack, K. (2017). "PHI-base: a new interface and further additions for the multi-species pathogen-host interactions database". Nucleic Acids Res. 45 (D1): D604–D610. doi:10.1093/nar/gkw1089. PMC 5210566 . PMID 27915230.
↑ "Pathogen-host database refocuses on genes". 7 January 2025. Retrieved 1 August 2025.

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[Urban2025-1] 1 2 3 Urban, M.; Cuzick, A.; Seager, J.; Nonavinakere, N.; Sahoo, J.; Sahu, P.; Iyer, V. L.; Khamari, L.; Carbajo Martinez, M.; Hammond-Kosack, K.E. (2025). "PHI-base – the multi-species pathogen–host interaction database in 2025". Nucleic Acids Research. 53 (Database Issue): D826-838. doi:10.1093/nar/gkae1084. PMC 11701570 . PMID 39588765.

[pmid16381911-2] Winnenburg, R.; Baldwin, T.K.; Urban, M.; Rawlings, C.; Köhler, J.; Hammond-Kosack, K.E. (2014). "PHI-base: a new database for pathogen host interactions". Nucleic Acids Research. 34 (Database Issue): D459-464. doi:10.1093/nar/gkj047. PMC 1347410 . PMID 16381911.

[pmid17153929-3] Baldwin, T.K.; Winnenburg, R.; Urban, M.; Rawlings, C.; Köhler, J.; Hammond-Kosack, K.E. (2006). "The pathogen-host interactions database (PHI-base) provides insights into generic and novel themes of pathogenicity". Molecular Plant-Microbe Interactions. 19 (12): 1451–1462. Bibcode:2006MPMI...19.1451B. doi: 10.1094/mpmi-19-1451 . PMID 17153929.

[pmid17942425-4] Winnenburg, R.; Urban, M.; Beacham, A.; Baldwin, T.K.; Holland, S.; Lindeberg, M.; Hansen, H.; Rawlings, C.; Hammond-Kosack, K.E.; Köhler, J. (2008). "PHI-base update: additions to the pathogen host interactions database". Nucleic Acids Research. 36 (Database Issue): D572-576. doi:10.1093/nar/gkm858. PMC 2238852 . PMID 17942425.

[pmid25414340-5] Urban, M.; Pant, R.; Raghunath, A.; Irvine, A.G.; Pedro, H.; Hammond-Kosack, K.E. (2015). "The Pathogen-Host Interactions database (PHI-base): additions and future developments". Nucleic Acids Research. 43 (Database Issue): D645–D655. doi:10.1093/nar/gku1165. PMC 4383963 . PMID 25414340.

[Urban2022-6] Urban, Martin; Cuzick, Alayne; Seager, James; Wood, Valerie; Rutherford, Kim; Venkatesh, Shilpa Yagwakote; Sahu, Jashobanta; Iyer, S. Vijaylakshmi; Khamari, Lokanath; De Silva, Nishadi; Martinez, Manuel Carbajo; Pedro, Helder; Yates, Andrew D.; Hammond-Kosack, Kim E. (2022-01-07). "PHI-base in 2022: a multi-species phenotype database for Pathogen-Host Interactions". Nucleic Acids Research. 50 (D1): D837–D847. doi:10.1093/nar/gkab1037. ISSN 1362-4962. PMC 8728202 . PMID 34788826.

[:1-7] 1 2 Urban, M; Cuzick, A; Seager, J; Wood, V; Rutherford, K; Venkatesh, SY; De Silva, N; Martinez, MC; Pedro, H; Yates, AD; Hassani-Pak, K; Hammond-Kosack, KE (8 January 2020). "PHI-base: the pathogen-host interactions database". Nucleic Acids Research. 48 (D1): D613–D620. doi:10.1093/nar/gkz904. PMC 7145647 . PMID 31733065.

[pmid26468211-8] Brown, N. A.; Urban, M.; Hammond-Kosack, K.E. (2016). "The trans-kingdom identification of negative regulators of pathogen hypervirulence". FEMS Microbiol Rev. 40 (1): 19–40. doi:10.1093/femsre/fuv042. PMC 4703069 . PMID 26468211.

[pmid26300902-9] Urban, M.; Irvine, A. G.; Raghunath, A.; Cuzick, A.; Hammond-Kosack, K.E. (2015). "Using the pathogen-host interactions database (PHI-base) to investigate plant pathogen genomes and genes implicated in virulence". Front Plant Sci. 6: 605. Bibcode:2015FrPS....6..605U. doi: 10.3389/fpls.2015.00605 . PMC 4526803 . PMID 26300902.

[10] Cuzick, Alayne; Seager, James; Wood, Valerie; Urban, Martin; Rutherford, Kim; Hammond-Kosack, Kim E (2023-07-04). "A framework for community curation of interspecies interactions literature". eLife. 12. doi: 10.7554/elife.84658 . ISSN 2050-084X. PMC 10319440 . PMID 37401199.

[pmid38376816-11] Rutherford KM, Lera-Ramírez M, Wood V (May 2024). "PomBase: a Global Core Biodata Resource-growth, collaboration, and sustainability". Genetics. 227 (1). doi:10.1093/genetics/iyae007. PMC 11075564 . PMID 38376816.

[pmid27915230-12] Urban, M.; Cuzick, A.; Rutherford, K.; Irvine, A. G.; Pedro, H.; Pant, R.; Sadanadan, V.; Khamari, L.; Billal, S.; Mohanty, S.; Hammond-Kosack, K. (2017). "PHI-base: a new interface and further additions for the multi-species pathogen-host interactions database". Nucleic Acids Res. 45 (D1): D604–D610. doi:10.1093/nar/gkw1089. PMC 5210566 . PMID 27915230.

[13] "Pathogen-host database refocuses on genes". 7 January 2025. Retrieved 1 August 2025.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

v t e Bioinformatics
Databases	Sequence databases: GenBank, European Nucleotide Archive, DNA Data Bank of Japan and China National GeneBank Secondary databases: UniProt, database of protein sequences grouping together Swiss-Prot, TrEMBL and Protein Information Resource Other databases: BioNumbers, Protein Data Bank, Ensembl, InterPro, KEGG, and Gene Ontology Specialised genomic databases: BOLD, Saccharomyces Genome Database, FlyBase, VectorBase, WormBase, Rat Genome Database, PHI-base, Arabidopsis Information Resource, GISAID and Zebrafish Information Network
Software	BLAST Bowtie Clustal EMBOSS HMMER MUSCLE PANGOLIN SAMtools SOAP suite TopHat
Other	Server: ExPASy Rosalind (education platform)
Institutions	Broad Institute Computational Biology Department (CBD) Microsoft Research - University of Trento Centre for Computational and Systems Biology (COSBI) Database Center for Life Science (DBCLS) DNA Data Bank of Japan (DDBJ) European Bioinformatics Institute (EMBL-EBI) European Molecular Biology Laboratory (EMBL) Flatiron Institute J. Craig Venter Institute (JCVI) Joint Genome Institute (JGI) Max Planck Institute of Molecular Cell Biology and Genetics (MPI-CBG) US National Center for Biotechnology Information (NCBI) Japanese Institute of Genetics Netherlands Bioinformatics Centre (NBIC) Philippine Genome Center (PGC) Scripps Research Swiss Institute of Bioinformatics (SIB) Wellcome Sanger Institute Whitehead Institute
Organizations	African Society for Bioinformatics and Computational Biology (ASBCB) Australia Bioinformatics Resource (EMBL-AR) European Molecular Biology network (EMBnet) International Nucleotide Sequence Database Collaboration (INSDC) International Society for Biocuration (ISB) International Society for Computational Biology (ISCB) Student Council (ISCB-SC) Institute of Genomics and Integrative Biology (CSIR-IGIB) Japanese Society for Bioinformatics (JSBi)
Meetings	Basel Computational Biology Conference‎ ([BC²]) European Conference on Computational Biology (ECCB) Intelligent Systems for Molecular Biology (ISMB) International Conference on Bioinformatics (InCoB) International Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB) ISCB Africa ASBCB Conference on Bioinformatics Pacific Symposium on Biocomputing (PSB) Research in Computational Molecular Biology (RECOMB)
File formats	CRAM format FASTA format FASTQ format NeXML format Nexus format Pileup format SAM format Stockholm format VCF format GFF format GTF format
Related topics	Computational biology List of biobanks List of biological databases Molecular phylogenetics Sequencing Sequence database Sequence alignment
Category Commons

Content

Description	Pathogen-Host Interactions database
Data types captured	phenotypes of microbial mutants
Organisms	~290 fungal, bacterial and protist pathogens of agronomic and medical importance tested on ~240 hosts
Contact
Research center	Rothamsted Research
Primary citation	PMID 39588765
Release date	May 2005
Access
Data format	XML, FASTA
Website	phibase.org
Tools
Web	PHI-base Search PHIB-BLAST PHI-Canto (Author curation)
Miscellaneous
License	Creative Commons Attribution-NoDerivatives 4.0 International License
Versioning	Yes
Data release frequency	6 monthly
Version	4.18 (May 2025)
Curation policy	Manual Curation