Susanna-Assunta Sansone

Susanna-Assunta Sansone
Susanna-Assunta Sansone
	Susanna-Assunta Sansone
Nationality	Italian
Other names	FAIR lady
Alma mater	University of Naples Federico II ; Imperial College London (BSc, PhD)
	Scientific career
Fields	Open science ; Reproducibility ; Data management ; Data publication ; FAIR data
Institutions	University of Oxford ; Oxford e-Research Centre ; European Bioinformatics Institute ; Research Data Alliance ; Microscience Ltd
Thesis	The role of CU-ZN-cofactored superoxide dismutase in salmonella virulence (2001)
Website	eng.ox.ac.uk/people/susanna-assunta-sansone/

Last updated November 20, 2024

Susanna-Assunta Sansone is a British-Italian data scientist who is professor of data readiness at the University of Oxford where she leads the data readiness group and serves as associate director of the Oxford e-Research Centre.^[3] Her research investigates techniques for improving the interoperability, reproducibility and integrity of data.^[2]^[4]

Early life and education

Sansone is from Italy. She was an undergraduate student at the University of Naples Federico II.^[5] She earned her bachelor's degree in molecular biology and a PhD in microbiology at Imperial College London,^[6] where she worked in St Mary's Hospital, London.^[7] Her thesis investigated the role of the cofactored enzyme superoxide dismutase in the virulence of Salmonella .^[6]

Research and career

After earning her doctorate, she moved to Microscience Ltd, where she characterised vaccine strains.^[7] In 2001, Sansone joined the European Bioinformatics Institute (EBI), part of the European Molecular Biology Laboratory (EMBL) where she worked in research data management.^[7] Sansone joined the University of Oxford in 2010.^[8] She became concerned that whilst there were vast amounts of data in the public domain, the majority of it was not reusable. To make data reusable, Sansone encourages researchers to combine their data with metadata: a description of what the data means.^[9] Sansone has described data reproducibility as “the foundation of every scientific field,”.^[10]

Sansone's research investigates strategies to enable the creation of research objects that are Findable, Accessible, Interoperable and Reusable (FAIR).^[7]^[11]^[3] She co-founded the peer-reviewed journal Scientific Data in 2013, and serves as chair of the Research Data Alliance.^[12]^[13] She co-authored the FAIR data principles in 2016,^[14] a set of guidelines for the scientific ecosystem.^[15] FAIR principles have since been adopted by funding bodies, scientific publishers and the private sector.^[15] Sansone works with partners to deliver data stewardship and data governance training and to develop guidelines to make data more accessible.^[16] She is one of the co-creators the FAIR Cookbook, an online resource for life scientists to enable them to keep FAIR data.^[17] Her research has been funded by the Biotechnology and Biological Sciences Research Council (BBSRC) and the European Union.^[18]

Selected publications

Her publications^[2]^[4]^[19] include

The FAIR Guiding Principles for scientific data management and stewardship^[14]
ArrayExpress--a public database of microarray experiments and gene expression profiles^[20]
The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration^[21]
The minimum information about a genome sequence (MIGS) specification^[22]
MetaboLights—an open-access general-purpose repository for metabolomics studies and associated meta-data^[23]
COVID-19 pandemic reveals the peril of ignoring metadata standards^[24]
ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level^[25]
Toward interoperable bioscience data^[26]
Modeling biomedical experimental processes with OBI^[27]

Related Research Articles

Bioinformatics is an interdisciplinary field of science that develops methods and software tools for understanding biological data, especially when the data sets are large and complex. Bioinformatics uses biology, chemistry, physics, computer science, computer programming, information engineering, mathematics and statistics to analyze and interpret biological data. The process of analyzing and interpreting data can some times referred to as computational biology, however this distinction between the two terms is often disputed. To some, the term computational biology refers to building and using models of biological systems.

Eugene Viktorovich Koonin is a Russian-American biologist and Senior Investigator at the National Center for Biotechnology Information (NCBI). He is a recognised expert in the field of evolutionary and computational biology.

Metagenomics is the study of genetic material recovered directly from environmental or clinical samples by a method called sequencing. The broad field may also be referred to as environmental genomics, ecogenomics, community genomics or microbiomics.

The Open Biological and Biomedical Ontologies (OBO) Foundry is a group of people who build and maintain ontologies related to the life sciences. The OBO Foundry establishes a set of principles for ontology development for creating a suite of interoperable reference ontologies in the biomedical domain. Currently, there are more than a hundred ontologies that follow the OBO Foundry principles.

Barend Mons is a molecular biologist and a FAIR data specialist. The first decade of his scientific career he spent on fundamental research on malaria parasites and later on translational research for malaria vaccines. In the year 2000 he switched to advanced data stewardship and (biological) systems analytics. He is most known for innovations in scholarly collaboration, especially nanopublications, and knowledge graph based discovery.

<span class="mw-page-title-main">Metadata</span> Data

Metadata is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including:

Open scientific data or open research data is a type of open data focused on publishing observations and results of scientific activities available for anyone to analyze and reuse. A major purpose of the drive for open data is to allow the verification of scientific claims, by allowing others to look at the reproducibility of results, and to allow data from many sources to be integrated to give new knowledge.

The PhenX Toolkit is a web-based catalog of high-priority measures related to complex diseases, phenotypic traits and environmental exposures. These measures were selected by working groups of experts using a consensus process. PhenX Toolkit's mission is to provide investigators with standard measurement protocols for use in genomic, epidemiologic, clinical and translational research. Use of PhenX measures facilitates combining data from a variety of studies, and makes it easy for investigators to expand a study design beyond the primary research focus. The Toolkit is funded by the National Human Genome Research Institute (NHGRI) of the National Institutes of Health (NIH) with co-funding by the Office of the Director (OD), the National Institute of Neurological Disorders and Stroke (NINDS), and the National Heart, Lung, and Blood Institute (NHLBI). Continuously funded since 2007, PhenX has received funding from a variety of NIH institutes, including the National Institute on Drug Abuse (NIDA), the National Institute on Mental Health (NIMH), the National Cancer Institute (NCI) and the National Institute on Minority Health and Health Disparities (NIMHD). The PhenX Toolkit is available to the scientific community at no cost.

<span class="mw-page-title-main">BioMart</span>

BioMart is a community-driven project to provide a single point of access to distributed research data. The BioMart project contributes open source software and data services to the international scientific community. Although the BioMart software is primarily used by the biomedical research community, it is designed in such a way that any type of data can be incorporated into the BioMart framework. The BioMart project originated at the European Bioinformatics Institute as a data management solution for the Human Genome Project. Since then, BioMart has grown to become a multi-institute collaboration involving various database projects on five continents.

Yoshinori Ohsumi is a Japanese cell biologist specializing in autophagy, the process that cells use to destroy and recycle cellular components. Ohsumi is a professor at Tokyo Institute of Technology's Institute of Innovative Research. He received the Kyoto Prize for Basic Sciences in 2012, the 2016 Nobel Prize in Physiology or Medicine, and the 2017 Breakthrough Prize in Life Sciences for his discoveries of mechanisms for autophagy.

<span class="mw-page-title-main">Alfonso Valencia</span>

Alfonso Valencia is a Spanish biologist, ICREA Professor, current director of the Life Sciences department at Barcelona Supercomputing Center, of Spanish National Bioinformatics Institute (INB-ISCIII), and coordinator of the data pillar of the Spanish Personalised Medicine initiative, IMPaCT. From 2015 to 2018, he was President of the International Society for Computational Biology.

<span class="mw-page-title-main">MetaboLights</span> Metabolomics database

MetaboLights is a data repository founded in 2012 for cross-species and cross-platform metabolomic studies that provides primary research data and meta data for metabolomic studies as well as a knowledge base for properties of individual metabolites. The database is maintained by the European Bioinformatics Institute (EMBL-EBI) and the development is funded by Biotechnology and Biological Sciences Research Council (BBSRC). As of July 2018, the MetaboLights browse functionality consists of 383 studies, two analytical platforms, NMR spectroscopy and mass spectrometry.

Machine learning in bioinformatics is the application of machine learning algorithms to bioinformatics, including genomics, proteomics, microarrays, systems biology, evolution, and text mining.

<span class="mw-page-title-main">FAIR data</span> Data compliant with the terms of the FAIR Data Principles

FAIR data is data which meets the FAIR principles of findability, accessibility, interoperability, and reusability (FAIR). The acronym and principles were defined in a March 2016 paper in the journal Scientific Data by a consortium of scientists and organizations.

Nanoinformatics is the application of informatics to nanotechnology. It is an interdisciplinary field that develops methods and software tools for understanding nanomaterials, their properties, and their interactions with biological entities, and using that information more efficiently. It differs from cheminformatics in that nanomaterials usually involve nonuniform collections of particles that have distributions of physical properties that must be specified. The nanoinformatics infrastructure includes ontologies for nanomaterials, file formats, and data repositories.

Elisabeth Margaretha Harbers-Bik is a Dutch microbiologist and scientific integrity consultant. Bik is known for her work detecting photo manipulation in scientific publications, and identifying over 4,000 potential cases of improper research conduct. Bik is the founder of Microbiome Digest, a blog with daily updates on microbiome research, and the Science Integrity Digest blog.

Biocuration is the field of life sciences dedicated to organizing biomedical data, information and knowledge into structured formats, such as spreadsheets, tables and knowledge graphs. The biocuration of biomedical knowledge is made possible by the cooperative work of biocurators, software developers and bioinformaticians and is at the base of the work of biological databases.

Laurie A. Boyer is an American biologist who is a Professor at the Massachusetts Institute of Technology. Her research focuses on the regulation of cell fate decisions and how faulty regulation leads to disease using human stem cells and mice as models.

European Genome-phenome Archive (EGA) is a repository for human biomolecular and phenotypic data in the United Kingdom and Spain. It involves the secure storage of all potentially identifiable genetic data, phenotypic and clinical data generated by biomedical research programs.

Christina Curtis is an American scientist who is a Professor of Medicine, Genetics and Biomedical Data Science and an Endowed Scholar at Stanford University where her research investigates the evolution of tumors. She is director of Artificial Intelligence and Cancer Genomics at Stanford University School of Medicine and is on the board of directors of the American Association for Cancer Research.

References

↑ Susanna-Assunta Sansone on Twitter
1 2 3 Susanna-Assunta Sansone publications indexed by Google Scholar
1 2 "Professor Susanna-Assunta Sansone". ox.ac.uk. University of Oxford. Retrieved 18 February 2022.
1 2 Susanna-Assunta Sansone publications from Europe PubMed Central
↑ "Susanna-Assunta Sansone". elixir-europe.org. ELIXIR . Retrieved 18 February 2022.
1 2 Sansone, Susanna-Assunta (2001). The role of CU-ZN-cofactored superoxide dismutase in salmonella virulence. london.ac.uk (PhD thesis). Imperial College London (University of London). OCLC 498579453. EThOS uk.bl.ethos.246768.
1 2 3 4 "Susanna-Assunta Sansone". eng.ox.ac.uk. Retrieved 18 February 2022.
↑ "Susanna-Assunta Sansone". fairsfair.eu. 11 June 2019. Retrieved 18 February 2022.
↑ Van Noorden, Richard (2013). "Data-sharing: Everything on display". Nature. 500 (7461): 243–245. doi: 10.1038/nj7461-243a . ISSN 1476-4687. PMID 23930278.
↑ "Towards Improved Data Reproducibility". technologynetworks.com. Informatics from Technology Networks. Retrieved 18 February 2022.
↑ "Susanna-Assunta Sansone". nature.com. Retrieved 18 February 2022.
↑ "Editors & Editorial Board". nature.com. Scientific Data. Retrieved 18 February 2022.
↑ "Susanna-Assunta Sansone". rd-alliance.org. Retrieved 18 February 2022.
1 2 Mark D. Wilkinson; Michel Dumontier; IJsbrand Jan Aalbersberg; et al. (15 March 2016). "The FAIR Guiding Principles for scientific data management and stewardship". Scientific Data . 3 (1): 160018. doi:10.1038/SDATA.2016.18. ISSN 2052-4463. PMC 4792175 . PMID 26978244. Wikidata Q27942822.
1 2 "Pharma-backed Toolkit to Speed Up Adoption of FAIR Data Principles". technologynetworks.com. Informatics from Technology Networks. Retrieved 18 February 2022.
↑ "Funded Projects". sansonegroup.eng.ox.ac.uk. Retrieved 18 February 2022.
↑ "FAIR Cookbook". fairplus.github.io. Retrieved 18 February 2022.
↑ "UK government grants awarded to Susanna Sansone". ukri.org. UK Research and Innovation.
↑ Susanna-Assunta Sansone at DBLP Bibliography Server
↑ H Parkinson; M Kapushesky; M Shojatalab; et al. (28 November 2006). "ArrayExpress--a public database of microarray experiments and gene expression profiles". Nucleic Acids Research . 35 (Database issue): D747-50. doi:10.1093/NAR/GKL995. ISSN 0305-1048. PMC 1716725 . PMID 17132828. Wikidata Q33264889.
↑ Barry Smith; Michael Ashburner; Cornelius Rosse; et al. (November 2007). "The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration". Nature Biotechnology . 25 (11): 1251–5. doi:10.1038/NBT1346. ISSN 1087-0156. PMC 2814061 . PMID 17989687. Wikidata Q19671692.
↑ Dawn Field; George Garrity; Tanya Gray; et al. (May 2008). "The minimum information about a genome sequence (MIGS) specification". Nature Biotechnology . 26 (5): 541–7. doi:10.1038/NBT1360. ISSN 1087-0156. PMC 2409278 . PMID 18464787. Wikidata Q28279450.
↑ Kenneth Haug; Reza M. Salek; Pablo Conesa; et al. (January 2013). "MetaboLights--an open-access general-purpose repository for metabolomics studies and associated meta-data". Nucleic Acids Research . 41 (Database issue): D781-6. doi:10.1093/NAR/GKS1004. ISSN 0305-1048. PMC 3531110 . PMID 23109552. Wikidata Q27818909.
↑ Lynn M Schriml; Maria Chuvochina; Neil Davies; et al. (19 June 2020). "COVID-19 pandemic reveals the peril of ignoring metadata standards". Scientific Data . 7 (1): 188. doi:10.1038/S41597-020-0524-5. ISSN 2052-4463. PMID 32561801. Wikidata Q96473059.
↑ Philippe Rocca-Serra; Marco Brandizi; Eamonn Maguire; et al. (15 September 2010). "ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level". Bioinformatics . 26 (18): 2354–6. doi:10.1093/BIOINFORMATICS/BTQ415. ISSN 1367-4803. PMC 2935443 . PMID 20679334. Wikidata Q28749402.
↑ Susanna-Assunta Sansone; Philippe Rocca-Serra; Dawn Field; et al. (27 January 2012). "Toward interoperable bioscience data". Nature Genetics . 44 (2): 121–6. doi:10.1038/NG.1054. ISSN 1061-4036. PMC 3428019 . PMID 22281772. Wikidata Q28090939.
↑ Ryan R Brinkman; Mélanie Courtot; Dirk Derom; et al. (22 June 2010). "Modeling biomedical experimental processes with OBI". Journal of Biomedical Semantics . 1 Suppl 1 (Suppl 1): S7. doi: 10.1186/2041-1480-1-S1-S7 . ISSN 2041-1480. PMC 2903726 . PMID 20626927. Wikidata Q28287823.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[twitter-1] Susanna-Assunta Sansone on Twitter

[gs-2] 1 2 3 Susanna-Assunta Sansone publications indexed by Google Scholar

[expert-3] 1 2 "Professor Susanna-Assunta Sansone". ox.ac.uk. University of Oxford. Retrieved 18 February 2022.

[epmc-4] 1 2 Susanna-Assunta Sansone publications from Europe PubMed Central

[5] "Susanna-Assunta Sansone". elixir-europe.org. ELIXIR . Retrieved 18 February 2022.

[saphd-6] 1 2 Sansone, Susanna-Assunta (2001). The role of CU-ZN-cofactored superoxide dismutase in salmonella virulence. london.ac.uk (PhD thesis). Imperial College London (University of London). OCLC 498579453. EThOS uk.bl.ethos.246768.

[:0-7] 1 2 3 4 "Susanna-Assunta Sansone". eng.ox.ac.uk. Retrieved 18 February 2022.

[8] "Susanna-Assunta Sansone". fairsfair.eu. 11 June 2019. Retrieved 18 February 2022.

[9] Van Noorden, Richard (2013). "Data-sharing: Everything on display". Nature. 500 (7461): 243–245. doi: 10.1038/nj7461-243a . ISSN 1476-4687. PMID 23930278.

[10] "Towards Improved Data Reproducibility". technologynetworks.com. Informatics from Technology Networks. Retrieved 18 February 2022.

[11] "Susanna-Assunta Sansone". nature.com. Retrieved 18 February 2022.

[12] "Editors & Editorial Board". nature.com. Scientific Data. Retrieved 18 February 2022.

[13] "Susanna-Assunta Sansone". rd-alliance.org. Retrieved 18 February 2022.

[fairdata-14] 1 2 Mark D. Wilkinson; Michel Dumontier; IJsbrand Jan Aalbersberg; et al. (15 March 2016). "The FAIR Guiding Principles for scientific data management and stewardship". Scientific Data . 3 (1): 160018. doi:10.1038/SDATA.2016.18. ISSN 2052-4463. PMC 4792175 . PMID 26978244. Wikidata Q27942822.

[:1-15] 1 2 "Pharma-backed Toolkit to Speed Up Adoption of FAIR Data Principles". technologynetworks.com. Informatics from Technology Networks. Retrieved 18 February 2022.

[16] "Funded Projects". sansonegroup.eng.ox.ac.uk. Retrieved 18 February 2022.

[17] "FAIR Cookbook". fairplus.github.io. Retrieved 18 February 2022.

[18] "UK government grants awarded to Susanna Sansone". ukri.org. UK Research and Innovation.

[dblp-19] Susanna-Assunta Sansone at DBLP Bibliography Server

[20] H Parkinson; M Kapushesky; M Shojatalab; et al. (28 November 2006). "ArrayExpress--a public database of microarray experiments and gene expression profiles". Nucleic Acids Research . 35 (Database issue): D747-50. doi:10.1093/NAR/GKL995. ISSN 0305-1048. PMC 1716725 . PMID 17132828. Wikidata Q33264889.

[obo-21] Barry Smith; Michael Ashburner; Cornelius Rosse; et al. (November 2007). "The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration". Nature Biotechnology . 25 (11): 1251–5. doi:10.1038/NBT1346. ISSN 1087-0156. PMC 2814061 . PMID 17989687. Wikidata Q19671692.

[migs-22] Dawn Field; George Garrity; Tanya Gray; et al. (May 2008). "The minimum information about a genome sequence (MIGS) specification". Nature Biotechnology . 26 (5): 541–7. doi:10.1038/NBT1360. ISSN 1087-0156. PMC 2409278 . PMID 18464787. Wikidata Q28279450.

[metabolights-23] Kenneth Haug; Reza M. Salek; Pablo Conesa; et al. (January 2013). "MetaboLights--an open-access general-purpose repository for metabolomics studies and associated meta-data". Nucleic Acids Research . 41 (Database issue): D781-6. doi:10.1093/NAR/GKS1004. ISSN 0305-1048. PMC 3531110 . PMID 23109552. Wikidata Q27818909.

[covid-24] Lynn M Schriml; Maria Chuvochina; Neil Davies; et al. (19 June 2020). "COVID-19 pandemic reveals the peril of ignoring metadata standards". Scientific Data . 7 (1): 188. doi:10.1038/S41597-020-0524-5. ISSN 2052-4463. PMID 32561801. Wikidata Q96473059.

[isa-25] Philippe Rocca-Serra; Marco Brandizi; Eamonn Maguire; et al. (15 September 2010). "ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level". Bioinformatics . 26 (18): 2354–6. doi:10.1093/BIOINFORMATICS/BTQ415. ISSN 1367-4803. PMC 2935443 . PMID 20679334. Wikidata Q28749402.

[towards-26] Susanna-Assunta Sansone; Philippe Rocca-Serra; Dawn Field; et al. (27 January 2012). "Toward interoperable bioscience data". Nature Genetics . 44 (2): 121–6. doi:10.1038/NG.1054. ISSN 1061-4036. PMC 3428019 . PMID 22281772. Wikidata Q28090939.

[obi-27] Ryan R Brinkman; Mélanie Courtot; Dirk Derom; et al. (22 June 2010). "Modeling biomedical experimental processes with OBI". Journal of Biomedical Semantics . 1 Suppl 1 (Suppl 1): S7. doi: 10.1186/2041-1480-1-S1-S7 . ISSN 2041-1480. PMC 2903726 . PMID 20626927. Wikidata Q28287823.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

Authority control databases
International	ISNI
Academics	ORCID Scopus Google Scholar DBLP

Susanna-Assunta Sansone

Contents

Early life and education

Research and career

Selected publications

Related Research Articles

References