Europhenome

Last updated

Europhenome [1] [2] is a resource for presenting, searching and analysing mouse phenotypes that were revealed by high throughput mouse phenotyping [3] programmes such as EUMODIC.

Contents

The EuroPhenome project provides access to raw and annotated mouse phenotyping data generated from primary pipelines such as EMPReSSlim and secondary procedures from specialist centres. Mutants of interest can be identified by searching the gene or the predicted phenotype.

Description

EuroPhenome is an open source project to develop a software system for capturing, storing, and analysing raw phenotyping data from standard operating procedures contained in EMPReSS.

EuroPhenome is primarily based in the bioinformatics group at MRC Harwell. The development of EuroPhenome is in collaboration with the Helmholtz Zentrum München, the Wellcome Trust Sanger Institute, and the Clinique de la Souris (France).

Initially, EuroPhenome was developed within the EUMORPHIA (European Union Mouse Research for Public Health and Industrial Applications) programme to capture and store pilot phenotyping data obtained on four background strains (C57BL/6J, C3H/HeBFeJ, BALB/cByJ and 129/SvPas). EUMORPHIA was a large project consisting of 18 research centres in 8 European countries, with the main focus of the project being the development of novel approaches in phenotyping, mutagenesis and informatics to improve the characterisation of mouse models for understanding human molecular physiology and pathology.

The current version of EuroPhenome is capturing data from the EUMODIC project as well as the Wellcome Trust Sanger Institute Mouse Genetics Programme, HMGU German Mouse Clinic pipeline, and the CMHD. EUMODIC is undertaking a primary phenotype assessment of up to 500 mouse mutant lines derived from ES cells developed in the EUCOMM project as well as other lines. Lines showing an interesting phenotype will be subject to a more in depth assessment.

EUMODIC is building upon the database of standardised phenotyping protocols, EMPReSS, developed by the EUMORPHIA project. EUMODIC has developed a selection of these screens, called EMPReSSslim, to enable comprehensive, high throughput, primary phenotyping of large numbers of mice.

EuroPhenome annotation of phenovariants

Phenovariants are annotated using an automated pipeline, which assigns an MP term if the mutant data is statistically different from the baseline data. This data is shown in the Phenomap and the mine for a mutant tool. A statistically significant result and the subsequent MP annotation does not necessarily mean a true phenovariant. There are other factors that could cause this result that have not been accounted for in the analysis. It is the responsibility of the user to download the data and use their expert knowledge or further analysis to decide whether they agree or not.

See also

Related Research Articles

Wellcome Sanger Institute British genomics research institute

The Wellcome Sanger Institute, previously known as The Sanger Centre and Wellcome Trust Sanger Institute, is a non-profit British genomics and genetics research institute, primarily funded by the Wellcome Trust.

The Rat Genome Database (RGD) is a database of rat genomics, genetics, physiology and functional data, as well as data for comparative genomics between rat, human and mouse. RGD is responsible for attaching biological information to the rat genome via structured vocabulary, or ontology, annotations assigned to genes and quantitative trait loci (QTL), and for consolidating rat strain data and making it available to the research community. RGD is working with groups such as the Programs for Genomic Applications at MCW and the National BioResource Project for the Rat (NBPR-Rat) in Japan to collect and make available comprehensive physiologic data for a variety of rat strains. They are also developing a suite of tools for mining and analyzing genomic, physiologic and functional data for the rat, and comparative data for rat, mouse and human.

The Zebrafish Information Network is an online biological database of information about the zebrafish. The zebrafish is a widely used model organism for genetic, genomic, and developmental studies, and ZFIN provides an integrated interface for querying and displaying the large volume of data generated by this research. To facilitate use of the zebrafish as a model of human biology, ZFIN links these data to corresponding information about other model organisms and to human disease databases. Abundant links to external sequence databases and to genome browsers are included. Gene product, gene expression, and phenotype data are annotated with terms from biomedical ontologies. ZFIN is based at the University of Oregon in the United States, with funding provided by the National Institutes of Health (NIH).

RAD18

E3 ubiquitin-protein ligase RAD18 is an enzyme that in humans is encoded by the RAD18 gene.

UBAP1

Ubiquitin-associated protein 1 is a protein that in humans is encoded by the UBAP1 gene.

IFITM3

Interferon-induced transmembrane protein 3 (IFITM3) is a protein that in humans is encoded by the IFITM3 gene. It plays a critical role in the immune system's defense against Swine Flu, where heightened levels of IFITM3 keep viral levels low, and the removal of IFITM3 allows the virus to multiply unchecked. This observation has been further advanced by a recent study from Paul Kellam's lab that shows that a single nucleotide polymorphism in the human IFITM3 gene purported to increase influenza susceptibility is overrepresented in people hospitalised with pandemic H1N1. The prevalence of this mutation is thought to be approximately 1/400 in European populations.

ATPAF2

ATP synthase mitochondrial F1 complex assembly factor 2 is an enzyme that in humans is encoded by the ATPAF2 gene.

RHOBTB3

Rho-related BTB domain-containing protein 3 is a protein that in humans is encoded by the RHOBTB3 gene.

YIPF1

Protein YIPF1 is a protein that in humans is encoded by the YIPF1 gene.

SUPV3L1

ATP-dependent RNA helicase SUPV3L1, mitochondrial is an enzyme that in humans is encoded by the SUPV3L1 gene.

PUS7L

Pseudouridylate synthase 7 homolog-like protein is an enzyme that in humans is encoded by the PUS7L gene.

SLX4

SLX4 is a protein involved in DNA repair, where it has important roles in the final steps of homologous recombination. Mutations in the gene are associated with the disease Fanconi anemia.

HP1BP3

Heterochromatin protein 1, binding protein 3 is a protein that in humans is encoded by the HP1BP3 gene. It has been identified as a novel subtype of the linker histone H1, involved in the structure of heterochromatin

TCF7L1

Transcription factor 7-like 1 , also known as TCF7L1, is a human gene.

NSUN2

NOP2/Sun domain family, member 2 is a protein that in humans is encoded by the NSUN2 gene. Alternatively spliced transcript variants encoding different isoforms have been noted for the gene.

RNASEH2B

Ribonuclease H2, subunit B is a protein that in humans is encoded by the RNASEH2B gene. RNase H2 is composed of a single catalytic subunit (A) and two non-catalytic subunits, and degrades the RNA of RNA:DNA hybrids. The non-catalytic B subunit of RNase H2 is thought to play a role in DNA replication.

Uberon is a comparative anatomy ontology representing a variety of structures found in animals, such as lungs, muscles, bones, feathers and fins. These structures are connected to other structures via relationships such as part-of and develops-from. One of the uses of this ontology is to integrate data from different biological databases, and other species-specific ontologies such as the Foundational Model of Anatomy.

International Mouse Phenotyping Consortium

The International Mouse Phenotyping Consortium (IMPC) is an international scientific endeavour to create and characterize the phenotype of 20,000 knockout mouse strains. Launched in September 2011, the consortium consists of over 15 research institutes across four continents with funding provided by the NIH, European national governments and the partner institutions.

The Mouse Genetics Project (MGP) is a large-scale mutant mouse production and phenotyping programme aimed at identifying new model organisms of disease.

The Monarch Initiative is a large scale bioinformatics web resource focused on leveraging existing biomedical knowledge to connect genotypes with phenotypes in an effort to aid research that combats genetic diseases. Monarch does this by integrating multi-species genotype, phenotype, genetic variant and disease knowledge from various existing biomedical data resources into a centralized and structured database. While this integration process has been traditionally done manually by basic researchers and clinicians on a case by case basis, The Monarch Initiative provides an aggregated and structured collection of data and tools that make biomedical knowledge exploration more efficient and effective.

References

  1. Mallon, A.-M.; Blake, A.; Hancock, J. M. (January 2008). "EuroPhenome and EMPReSS: online mouse phenotyping resource". Nucleic Acids Research. 36 (Database): D715–D718. doi:10.1093/nar/gkm728. PMC   2238991 . PMID   17905814.
  2. Morgan H, Beck T, Blake A, Gates H, Adams N, Debouzy G, Leblanc S, Lengger C, Maier H, Melvin D, Meziane H, Richardson D, Wells S, White J, Wood J, de Angelis MH, Brown SD, Hancock JM, Mallon AM (November 2010). "EuroPhenome: a repository for high-throughput mouse phenotyping data". Nucleic Acids Research. 38 (Database): D577-85. doi:10.1093/nar/gkp1007. PMC   2808931 . PMID   19933761.
  3. Gates H, Mallon AM, Brown SD (December 2010). "High-throughput mouse phenotyping". Methods. 53: 394–404. doi:10.1016/j.ymeth.2010.12.017. PMID   21185382.