Ebbe Nielsen Challenge

Last updated

The Ebbe Nielsen Challenge is an international science competition conducted annually from 2015 onwards by the Global Biodiversity Information Facility (GBIF), with a set of cash prizes that recognize researcher(s)' submissions in creating software or approaches that successfully address a GBIF-issued challenge in the field of biodiversity informatics. It succeeds the Ebbe Nielsen Prize, which was awarded annually by GBIF between 2002 and 2014. The name of the challenge honours the memory of prominent entomologist and biodiversity informatics proponent Ebbe Nielsen, who died of a heart attack in the U.S.A. en route to the 2001 GBIF Governing Board meeting.

Contents

History

In 2001, GBIF created the Ebbe Nielsen Prize to honour the recently deceased Danish-Australian entomologist Ebbe Nielsen, who was a keen proponent of both GBIF and the biodiversity informatics discipline. [1] That prize recognized a global researcher or research team for their retrospective contribution(s) to the field of biodiversity informatics, according to criteria set out by GBIF in the terms of the award. From 2015 onwards, GBIF re-launched the award process as a competition between global individuals or teams of researchers to create new software or approaches to using biodiversity data according to a theme announced annually for each round of the competition, and also to split the prize money among multiple groups instead of a single winner as in the initial era of the Prize. [2] Calls for entries to the competition, now called the "Ebbe Nielsen Challenge", have been issued annually from 2015 to the present, with winners announced through a competitive process in all years except for 2017, when an insufficient number of entries was received. [3]

List of winners, 2015 onwards

2015

Challenge: "Make significant use of GBIF-mediated data in a way that provides new representations or insights. Your submission could involve a range of results—websites, stand-alone or mobile applications, or outputs of analyses—or could seek to improve any number of issues or processes, including (but not limited to) data analysis or visualization, data workflows, uploading, or annotations." [4]

A list of finalists for the 2015 Challenge is available here. [5]

First Prize winner:

Second Prize winner:

Details are available via the GBIF website here. [8]

2016

Challenge: "In 2016, the Challenge will focus on the question of data gaps and completeness, seeking tools, methods and mechanisms to help analyse the fitness-for-use of GBIF-mediated data and/or guide priority setting for biodiversity data mobilization. We expect both data users and data holders to benefit from this year’s emphasis on gaps and completeness." [9] [10]

First Prize winner:

Joint Second Prize winners:

Details are available via the GBIF website here. [14]

2017

Challenge: "The 2017 GBIF Ebbe Nielsen Challenge seeks submissions that repurpose these datasets [in public open-access repositories] and adapting them into the Darwin Core Archive format (DwC-A), the interoperable and reusable standard that powers the publication of almost 800 million species occurrence records from the nearly 1,000 worldwide institutions now active in the GBIF network. The 2017 Ebbe Nielsen Challenge will task developers and data scientists to create web applications, scripts or other tools that automate the discovery and extraction of relevant biodiversity data from open data repositories." [15]

No winners were announced, indicating that the 2017 prize money was not awarded to any entry.

2018

Challenge: "The 2018 Ebbe Nielsen Challenge is deliberately open-ended, so entrants have a broad remit for creating tools and techniques that advance in open science and improve the access, utility or quality of GBIF-mediated data. Challenge submissions may be new applications, visualization methods, workflows or analyses, or they build on and extend existing tools and features." [16] [17]

Joint First Prize winners:

Joint Second Prize winners:

Details are available via the GBIF website here. [24]

2019

Challenge: "The 2019 Ebbe Nielsen Challenge is deliberately open-ended, so entrants have a broad remit for creating tools and techniques that advance in open science and improve the access, utility or quality of GBIF-mediated data. Challenge submissions may be new applications, visualization methods, workflows or analyses, or they build on and extend existing tools and features. It is expected that successful entries provide practical, pragmatic solutions for the GBIF network while advancing biodiversity informatics and biodiversity data management in relation to the GBIF mission and strategic plan." [25]

First Prize winner:

Joint Second Prize winners:

Joint Third Prize winners:

Details are available via the GBIF website here. [35]

Related Research Articles

<span class="mw-page-title-main">Hackathon</span> Event in which groups of software developers work at an accelerated pace

A hackathon is an event where people engage in rapid and collaborative engineering over a relatively short period of time such as 24 or 48 hours. They are often run using agile software development practices, such as sprint-like design wherein computer programmers and others involved in software development, including graphic designers, interface designers, product managers, project managers, domain experts, and others collaborate intensively on engineering projects, such as software engineering.

Ebbe Schmidt Nielsen was a Danish entomologist influential in systematics and Lepidoptera research, and an early proponent of biodiversity informatics. The journal Invertebrate Systematics was established with significant contributions from Nielsen, and he assisted in the founding of the Global Biodiversity Information Facility (GBIF). Nielsen wrote several books, published over eighty scientific papers, and was highly regarded within the scientific community. Following his death, the GBIF organised the Ebbe Nielsen Prize in his memory, awarded annually to promising researchers in the field of biodiversity informatics. The moth Pollanisus nielseni is named after Nielsen.

<span class="mw-page-title-main">Global Biodiversity Information Facility</span> Aggregator of scientific data on biodiversity; data portal

The Global Biodiversity Information Facility (GBIF) is an international organisation that focuses on making scientific data on biodiversity available via the Internet using web services. The data are provided by many institutions from around the world; GBIF's information architecture makes these data accessible and searchable through a single portal. Data available through the GBIF portal are primarily distribution data on plants, animals, fungi, and microbes for the world, and scientific names data.

Biodiversity informatics is the application of informatics techniques to biodiversity information, such as taxonomy, biogeography or ecology. It is defined as the application of Information technology technologies to management, algorithmic exploration, analysis and interpretation of primary data regarding life, particularly at the species level organization. Modern computer techniques can yield new ways to view and analyze existing information, as well as predict future situations. Biodiversity informatics is a term that was only coined around 1992 but with rapidly increasing data sets has become useful in numerous studies and applications, such as the construction of taxonomic databases or geographic information systems. Biodiversity informatics contrasts with "bioinformatics", which is often used synonymously with the computerized handling of data in the specialized area of molecular biology.

C-squares is a system of spatially unique, location-based identifiers (geocodes) for areas on the surface of the earth, represented as cells from a latitude- and longitude-based Discrete Global Grid at a hierarchical set of resolution steps, obtained by progressively subdividing 10×10 degree World Meteorological Organization squares; the term "c-square" is also available for use to designate any component cell of the grid. Individual cell identifiers incorporate literal values of latitude and longitude in an interleaved notation, together with additional digits that support intermediate grid resolutions of 5, 0.5, 0.05 degrees, etc.

<span class="mw-page-title-main">Galaxy (computational biology)</span>

Galaxy is a scientific workflow, data integration, and data and analysis persistence and publishing platform that aims to make computational biology accessible to research scientists that do not have computer programming or systems administration experience. Although it was initially developed for genomics research, it is largely domain agnostic and is now used as a general bioinformatics workflow management system.

A taxonomic database is a database created to hold information on biological taxa – for example groups of organisms organized by species name or other taxonomic identifier – for efficient data management and information retrieval. Taxonomic databases are routinely used for the automated construction of biological checklists such as floras and faunas, both for print publication and online; to underpin the operation of web-based species information systems; as a part of biological collection management ; as well as providing, in some cases, the taxon management component of broader science or biology information systems. They are also a fundamental contribution to the discipline of biodiversity informatics.

Plazi is a Swiss-based international non-profit association supporting and promoting the development of persistent and openly accessible digital bio-taxonomic literature. Plazi is cofounder of the Biodiversity Literature Repository and is maintaining this digital taxonomic literature repository at Zenodo to provide access to FAIR data converted from taxonomic publications using the TreatmentBank service, enhances submitted taxonomic treatments by creating a version in the XML format Taxpub, and educates about the importance of maintaining open access to scientific discourse and data. It is a contributor to the evolving e-taxonomy in the field of Biodiversity Informatics.

<span class="mw-page-title-main">Dryad (repository)</span>

Dryad is an international open-access repository of research data, especially data underlying scientific and medical publications. Dryad is a curated general-purpose repository that makes data discoverable, freely reusable, and citable. The scientific, educational, and charitable mission of Dryad is to provide the infrastructure for and promote the re-use of scholarly research data.

Dotmatics is an R&D scientific software company used by scientists in the R&D process that help them be more efficient in their efforts to innovate. Founded in 2005, the company's primary office is in Boston with 14 offices around the globe. In March 2021, Dotmatics joined forces with Insightful Science through a merger. In April 2022, the two companies consolidated under the Dotmatics brand. Dotmatics' software is used by 2 million scientists and researchers and 10,000 customers.

iNaturalist is a social network of naturalists, citizen scientists, and biologists built on the concept of mapping and sharing observations of biodiversity across the globe. iNaturalist may be accessed via its website or from its mobile applications. As of 21 September 2022, iNaturalist users had contributed approximately 115,651,000 observations of plants, animals, fungi, and other organisms worldwide, and around 245,700 users were active in the previous 30 days.

The Ebbe Nielsen Prize was an international science award made annually between 2002 and 2014 by the Global Biodiversity Information Facility (GBIF), to recognize a researcher who had made substantial contributions to the field of biodiversity informatics. The prize was established in memory of prominent entomologist and biodiversity informatics proponent Ebbe Nielsen, who died of a heart attack in the U.S.A. en route to the 2001 GBIF Governing Board meeting.

<span class="mw-page-title-main">Richard Pyle</span> American ichthyologist and scuba diver

Richard Lawrence Pyle, Ph.D. is a scuba diver and ichthyologist working on Hawaii.

PrecisionFDA is a secure, collaborative, high-performance computing platform that has established a growing community of experts around the analysis of biological datasets in order to advance precision medicine, inform regulatory science, and enable improvements in health outcomes. This cloud-based platform is developed and served by the United States Food and Drug Administration (FDA). PrecisionFDA connects experts, citizen scientists, and scholars from around the world and provides them with a library of computational tools, workflow features, and reference data. The platform allows researchers to upload and compare data against reference genomes, and execute bioinformatic pipelines. The variant call file (VCF) comparator tool also enables users to compare their genetic test results to reference genomes. The platform's code is open source and available on GitHub. The platform also features a crowdsourcing model to sponsor community challenges in order to stimulate the development of innovative analytics that inform precision medicine and regulatory science. Community members from around the world come together to participate in scientific challenges, solving problems that demonstrate the effectiveness of their tools, testing the capabilities of the platform, sharing their results, and engaging the community in discussions. Globally, precisionFDA has more than 5,000 users.

<span class="mw-page-title-main">Interim Register of Marine and Nonmarine Genera</span> Taxonomic database

The Interim Register of Marine and Nonmarine Genera (IRMNG) is a taxonomic database which attempts to cover published genus names for all domains of life from 1758 in zoology up to the present, arranged in a single, internally consistent taxonomic hierarchy, for the benefit of Biodiversity Informatics initiatives plus general users of biodiversity (taxonomic) information. In addition to containing over 490,000 published genus name instances as at March 2020, the database holds over 1.7 million species names, although this component of the data is not maintained in as current or complete state as the genus-level holdings. IRMNG can be queried online for access to the latest version of the dataset and is also made available as periodic snapshots or data dumps for import/upload into other systems as desired.

<i>Nomenclator Zoologicus</i>

Nomenclator Zoologicus is one of the major compendia in the field of zoological nomenclature, compiled by Sheffield Airey Neave and his successors and published in 9 volumes over the period 1939–1994, under the auspices of the Zoological Society of London; a tenth, electronic-only volume was also produced before the project ceased. It contains over 340,000 published name instances with their authorities and details of their original publication, certain nomenclatural notes and cross references, and an indication of the taxonomic group to which each is assigned. An electronic (digitised) version of volumes 1-10 was released online by the uBio project, based at the Marine Biological Laboratory, Woods Hole, in 2004–2005.

Bionomia is a database and database entry tool which permits the name strings of collectors, and of taxonomists who determine specimen data, to be assigned to the unique person who collected or identified the specimen. If the person is living, this is done via their ORCID iD, and if dead, via their Wikidata identifier. It thereby resolves ambiguity where two or more collectors have similar names; or where one collector has worked under two names, or a single name written in two or more ways. The specimen data associated with, and used by, Bionomia are the aggregated GBIF data.

<span class="mw-page-title-main">Tony Rees (scientist)</span>

Anthony J. J. ("Tony") Rees is a British-born software developer, data manager and biologist resident in Australia since 1986, and previously a data manager with CSIRO Marine and Atmospheric Research. He is responsible for developing a number of software systems currently used in science data management, including c-squares, Taxamatch, and IRMNG, the Interim Register of Marine and Nonmarine Genera. He has also been closely involved with the development of other biodiversity informatics initiatives including the Ocean Biogeographic Information System (OBIS), AquaMaps, and the iPlant Taxonomic Name Resolution Service (TNRS).

The Biodiversity Literature Repository (BLR) is a biodiversity dedicated community created in November 11, 2013, in Zenodo, the open science repository at CERN and part of the European project OpenAIRE. The goal of BLR is to provide a long-term, stable, open repository that allows deposition of bio-taxonomic articles enhanced with custom metadata and links to data extracted from therein and deposited in BLR. As of April 25, 2021, this includes 94,443 taxonomic treatments and 293,457 figures from 48,993 articles which are made findable, accessible, interoperable and reusable FAIR data. Most of the data is uploaded on a continuous basis by Plazi using its TreatmentBank service based on their Plazi workflow, and Pensoft Publishers using BLR as repository for data published in their journals. The largest single re-user of data is the Global Biodiversity Information Facility (GBIF), using data from within 33,623 processed articles.

Vince Smith is a British entomologist and biodiversity informatician at the Natural History Museum, London.

References

  1. Kristensen, Niels P. (2001). "Ebbe Schmidt Nielsen 7 June 1950 - 6 March 2001". Nota Lepidopterologica. 24 (3): 3–9.
  2. "GBIF Ebbe Nielsen Challenge". GBIF Ebbe Nielsen Challenge.
  3. "2018 GBIF Ebbe Nielsen Challenge now open". iphylo.blogspot.com.
  4. "Winners named for first GBIF Ebbe Nielsen Challenge". www.gbif.org.
  5. "Finalists for GBIF Ebbe Nielsen Challenge". dna-barcoding.blogspot.com.
  6. "GBIF dataset metrics". chrome.google.com.
  7. "BioGUID.org - Cross-linking data Across the Biodiversity Universe through the use of unique identifiers and related data objects". devpost.com.
  8. "GBIF Ebbe Nielsen Challenge: Bringing biodiversity information to life". www.gbif.org.
  9. "GBIF 2016 Ebbe Nielsen Challenge: Analysing and addressing gaps and biases in primary biodiversity data". www.gbif.org.
  10. "GBIF 2016 Ebbe Nielsen Challenge". www.biodiv.be.
  11. "SLU scientist won a prize for 'Ignorance mapping'". www.slu.se.
  12. "sampbias - Evaluating geographic sampling biases in species distribution datasets". devpost.com.
  13. "GBIF Coverage Assessment Tools - Online tools to identify and track gaps in GBIF-mediated data to characterize species distributions in space and time". devpost.com.
  14. "Data 'ignorance mapping' entry wins 2016 Ebbe Nielsen Challenge". www.gbif.org.
  15. "Liberating species records from open data repositories for scientific discovery and reuse". www.gbif.org.
  16. "2018 GBIF Ebbe Nielsen Challenge seeks open-data innovations for biodiversity". www.gbif.org.
  17. "2018 GBIF Ebbe Nielsen Challenge". www.europeandataportal.eu.
  18. "Checklist Recipe - How we created a template to standardize species data". ropensci.org.
  19. "Prof Roderic Page Wins Ebbe Nielsen Prize". University of Glasgow.
  20. "bdverse: Biodiversity Data Cleaning Framework". www.researchgate.net.
  21. "The Smithsonian Institution's Digitization Program Office wins prize in GBIF 2018 Ebbe Nielsen Challenge". dpo.si.edu.
  22. "VectorBase team members among the six 2018 Ebbe Nielsen Challenge winners". www.vectorbase.org.
  23. "BLOG POST: Visual exploration of taxonomic, phylogenomic and biodiversity data". big4-project.eu.
  24. "Six winners top the 2018 Ebbe Nielsen Challenge". www.gbif.org.
  25. "2019 GBIF Ebbe Nielsen Challenge seeks open-data innovations for biodiversity". www.gbif.org.
  26. "WhereNext: a recommending system to identify sampling priorities based on generalized dissimilarity modeling". rpubs.com.
  27. "occCite: Tools to Enable Comprehensive Biodiversity Data Citation". github.com.
  28. "Organella". netlify.com.
  29. "Rapid Least Concern: Automated assessments of lower risk plants for the IUCN Red List of Threatened Species (Red List)". shinyapps.io.
  30. "Agile GBIF Publishing". earthcape.com/.
  31. "Biodiversity Information Review and Decision Support: the BIRDS package for R". github.com.
  32. "GB Sifter: A GBIF to GenBank Data Sifter". youtube.com.
  33. "naturaList". shinyapps.io.
  34. "GeoNature-atlas: Publish an online biodiversity atlas with GBIF data of your area". ecrins-parcnational.com.
  35. "WhereNext wins 2019 GBIF Ebbe Nielsen Challenge". www.gbif.org.