Morphobank

Last updated
MorphoBank
Developer(s) The MorphoBank Project
Initial release2001
Operating system Web-based application
Type Evolutionary research, systematics
Website morphobank.org

MorphoBank is a web application for collaborative evolutionary research, specifically phylogenetic systematics or cladistics, on the phenotype. Historically, scientists conducting research on phylogenetic systematics have worked individually or in small groups employing traditional single-user software applications such as MacClade, [1] Mesquite [2] and Nexus Data Editor. [3] As the hypotheses under study have grown more complex, large research teams have assembled to tackle the problem of discovering the Tree of Life for the estimated 4-100 million living species( Wilson 2003 , pp. 77–80) and the many thousands more extinct species known from fossils. Because the phenotype is fundamentally visual, and as phenotype-based phylogenetic studies have continued to increase in size, [4] it becomes important that observations be backed up by labeled images. Traditional desktop software applications currently in wide use do not provide robust support for team-based research or for image manipulation and storage. MorphoBank is a particularly important tool for the growing scientific field of phenomics.

Contents

The development of MorphoBank, which began in 2001, has been funded by the National Science Foundation's Directorates for Geosciences, Biological Sciences and Computer and Information Science and Engineering. The significance of the scientific work on MorphoBank has been featured in the New York Times(here and here), among other publications.

Advantages

Teams of scientists studying phylogenetics to build the Tree of Life assemble large spreadsheets of observations about species (referred to as "matrices"). These teams require simultaneous access by each team member to a single and secure copy of the team's data during a scientific research project. This single copy of the data also changes with great frequency during the data collection phase. Images that can be very helpful for documenting homology statements must be displayed, labeled and shared as homology statements develop. This cannot be accomplished elegantly with a desktop software package alone because in a desktop environment each collaborator is working on his own private copy of project data. Changes made by one participant cannot automatically propagate to others, preventing collaborators from seeing each other's data edits until they are manually (and due to the effort involved, often only periodically) merged into a single "true" dataset. In all but the smallest and most disciplined of teams, file version control and the reconciliation of changes made on multiple copies of the data emerge quickly as significant drags on productivity.

MorphoBank is an attempt to address these issues by leveraging the ubiquity of the web and modern web-based application techniques, including Ajax, web service layers, and rich web applications to provide a full-featured, net-accessible collaborative workspace for phylogenetic research. In particular, MorphoBank makes it easy to:

example of morphobank.org layout Morphobank example 5 22.jpg
example of morphobank.org layout

These tasks are difficult or impossible in most existing software applications.

History

In 2001 the National Science Foundation (NSF) sponsored a workshop, [5] at the American Museum of Natural History in New York to develop the outlines of a web-based system for a collaborative, media-rich research tool for morphological phylogenetics. An application prototype presented at the workshop was later refined with feedback from the workshop and became MorphoBank version 1.0. A grant from the US National Oceanic and Atmospheric Administration funded further revisions resulting in version 2.0, released in 2005. Current support from the NSF is funding current feature enhancements to MorphoBank. MorphoBank was hosted by Stony Brook University until late October 2021 and received back up support from the American Museum of Natural History. The current version is 3.0. Rationale for the software was described in the journal Cladistics. [6] MorphoBank has also received support from NESCENT and the San Diego Supercomputer Center. Since 2018, MorphoBank has been supported in part by Phoenix Bioinformatics, a non-profit company founded to sustain databases for the basic sciences. A permanent move of MorphoBank from Stony Brook University to Phoenix Bioinformatics was complete in late October 2021. [7]

The San Diego Supercomputer Center has previously provided technical and hosting resources to the MorphoBank project.

Usage

MorphoBank hosts the products of peer-reviewed scientific research on phenotypes. An increasing volume of systematics data is "born digital" and MorphoBank is well suited to handle this type of material. On August 24, 2007, 62 active research projects were hosted by MorphoBank, as well as 6 completed (and published) projects. By 2017 over 2000 scientists and their students were registered content builders (users are not required to register and are even more numerous) and has more than 500 publicly available projects with approximately 80,000 images that are the products of scientific research. Over 1,500 active research projects [8] are hosted by MorphoBank. The software has been used to assemble phylogenetic research on such groups as mammals, [9] from bats [10] to whales, [11] [12] bivalve molluscs, [13] arachnids, [14] fossil plants [15] and living and extinct amniotes. [16] It has also been used more broadly in evolutionary and paleontological research to host curated images associated with published research on lacewing insects [17] geckos, [18] [19] raptor birds, [20] dinosaurs, [21] frogs [22] and nematodes. [23] MorphoBank is increasingly used in conjunction with the Paleobiology Database. [24]

Example published projects:

MorphoBank has been particularly important to the Assembling the Tree of Life initiative sponsored by the National Science Foundation. MorphoBank is well-suited to such projects because of its tools for merging taxonomic, character and matrix-based data, as well as its collaborative features. [25] Highlights of this research include a collaborative matrix on mammal evolution published in Science that included over 4,000 phenomic characters scored for over 80 species, [26] a matrix on extant baleen whales featuring nearly 600 images, [27] and more.

Related Research Articles

In biology, phenetics, also known as taximetrics, is an attempt to classify organisms based on overall similarity, usually in morphology or other observable traits, regardless of their phylogeny or evolutionary relation. It is closely related to numerical taxonomy which is concerned with the use of numerical methods for taxonomic classification. Many people contributed to the development of phenetics, but the most influential were Peter Sneath and Robert R. Sokal. Their books are still primary references for this sub-discipline, although now out of print.

<span class="mw-page-title-main">Ungulate</span> Group of animals that walk on the tips of their toes or hooves

Ungulates are members of the diverse clade Euungulata which primarily consists of large mammals with hooves. Once part of the clade "Ungulata" along with the clade Paenungulata, "Ungulata" has since been determined to be a polyphyletic and thereby invalid clade based on molecular data. As a result, true ungulates had since been reclassified to the newer clade Euungulata in 2001 within the clade Laurasiatheria while Paenungulata has been reclassified to a distant clade Afrotheria. Living ungulates are divided into two orders: Perissodactyla including equines, rhinoceroses, and tapirs; and Artiodactyla including cattle, antelope, pigs, giraffes, camels, sheep, deer, and hippopotamuses, among others. Cetaceans such as whales, dolphins, and porpoises are also classified as artiodactyls, although they do not have hooves. Most terrestrial ungulates use the hoofed tips of their toes to support their body weight while standing or moving. Two other orders of ungulates, Notoungulata and Litopterna, both native to South America, became extinct at the end of the Pleistocene, around 12,000 years ago.

<span class="mw-page-title-main">Cladogram</span> Diagram used to show relations among groups of organisms with common origins

A cladogram is a diagram used in cladistics to show relations among organisms. A cladogram is not, however, an evolutionary tree because it does not show how ancestors are related to descendants, nor does it show how much they have changed, so many differing evolutionary trees can be consistent with the same cladogram. A cladogram uses lines that branch off in different directions ending at a clade, a group of organisms with a last common ancestor. There are many shapes of cladograms but they all have lines that branch off from other lines. The lines can be traced back to where they branch off. These branching off points represent a hypothetical ancestor which can be inferred to exhibit the traits shared among the terminal taxa above it. This hypothetical ancestor might then provide clues about the order of evolution of various features, adaptation, and other evolutionary narratives about ancestors. Although traditionally such cladograms were generated largely on the basis of morphological characters, DNA and RNA sequencing data and computational phylogenetics are now very commonly used in the generation of cladograms, either on their own or in combination with morphology.

<span class="mw-page-title-main">Phylogenetic tree</span> Branching diagram of evolutionary relationships between organisms

A phylogenetic tree, phylogeny or evolutionary tree is a graphical representation which shows the evolutionary history between a set of species or taxa during a specific time. In other words, it is a branching diagram or a tree showing the evolutionary relationships among various biological species or other entities based upon similarities and differences in their physical or genetic characteristics. In evolutionary biology, all life on Earth is theoretically part of a single phylogenetic tree, indicating common ancestry. Phylogenetics is the study of phylogenetic trees. The main challenge is to find a phylogenetic tree representing optimal evolutionary ancestry between a set of species or taxa. Computational phylogenetics focuses on the algorithms involved in finding optimal phylogenetic tree in the phylogenetic landscape.

<span class="mw-page-title-main">Baleen</span> Keratin structure in whales, used for flexible stiffening

Baleen is a filter-feeding system inside the mouths of baleen whales. To use baleen, the whale first opens its mouth underwater to take in water. The whale then pushes the water out, and animals such as krill are filtered by the baleen and remain as a food source for the whale. Baleen is similar to bristles and consists of keratin, the same substance found in human fingernails, skin and hair. Baleen is a skin derivative. Some whales, such as the bowhead whale, have longer baleen than others. Other whales, such as the gray whale, only use one side of their baleen. These baleen bristles are arranged in plates across the upper jaw of whales.

<span class="mw-page-title-main">Tylopoda</span> Suborder of mammals

Tylopoda is a suborder of terrestrial herbivorous even-toed ungulates belonging to the order Artiodactyla. They are found in the wild in their native ranges of South America and Asia, while Australian feral camels are introduced. The group has a long fossil history in North America and Eurasia. Tylopoda appeared during the Eocene around 50 million years ago.

<span class="mw-page-title-main">Neobalaenidae</span> Family of mammals

Neobalaenidae is a family of baleen whales including the extant pygmy right whale. Although traditionally considered related to balaenids, recent studies by Fordyce and Marx (2013) and Ludovic Dutoit and colleagues (2023) have recovered the living pygmy right whale as a member of Cetotheriidae, making it the only extant cetotheriid. Not all authors agree with this placement.

<span class="mw-page-title-main">Phylogenesis</span>

Phylogenesis is the biological process by which a taxon appears. The science that studies these processes is called phylogenetics.

In phylogenetics and computational phylogenetics, maximum parsimony is an optimality criterion under which the phylogenetic tree that minimizes the total number of character-state changes. Under the maximum-parsimony criterion, the optimal tree will minimize the amount of homoplasy. In other words, under this criterion, the shortest possible tree that explains the data is considered best. Some of the basic ideas behind maximum parsimony were presented by James S. Farris in 1970 and Walter M. Fitch in 1971.

<span class="mw-page-title-main">Substitution model</span> Description of the process by which states in sequences change into each other and back

In biology, a substitution model, also called models of DNA sequence evolution, are Markov models that describe changes over evolutionary time. These models describe evolutionary changes in macromolecules represented as sequence of symbols. Substitution models are used to calculate the likelihood of phylogenetic trees using multiple sequence alignment data. Thus, substitution models are central to maximum likelihood estimation of phylogeny as well as Bayesian inference in phylogeny. Estimates of evolutionary distances are typically calculated using substitution models. Substitution models are also central to phylogenetic invariants because they are necessary to predict site pattern frequencies given a tree topology. Substitution models are also necessary to simulate sequence data for a group of organisms related by a specific tree.

<span class="mw-page-title-main">Laurasiatheria</span> Clade of mammals

Laurasiatheria is a superorder of placental mammals that groups together true insectivores (eulipotyphlans), bats (chiropterans), carnivorans, pangolins (pholidotes), even-toed ungulates (artiodactyls), odd-toed ungulates (perissodactyls), and all their extinct relatives. From systematics and phylogenetic perspectives, it is subdivided into order Eulipotyphla and clade Scrotifera. It is a sister group to Euarchontoglires with which it forms the magnorder Boreoeutheria. Laurasiatheria was discovered on the basis of the similar gene sequences shared by the mammals belonging to it; no anatomical features have yet been found that unite the group, although a few have been suggested such as a small coracoid process, a simplified hindgut and allantoic vessels that are large to moderate in size. The Laurasiatheria clade is based on DNA sequence analyses and retrotransposon presence/absence data. The superorder originated on the northern supercontinent of Laurasia, after it split from Gondwana when Pangaea broke up. Its last common ancestor is supposed to have lived between ca. 76 to 90 million years ago.

Computational phylogenetics, phylogeny inference, or phylogenetic inference focuses on computational and optimization algorithms, heuristics, and approaches involved in phylogenetic analyses. The goal is to find a phylogenetic tree representing optimal evolutionary ancestry between a set of genes, species, or taxa. Maximum likelihood, parsimony, Bayesian, and minimum evolution are typical optimality criteria used to assess how well a phylogenetic tree topology describes the sequence data. Nearest Neighbour Interchange (NNI), Subtree Prune and Regraft (SPR), and Tree Bisection and Reconnection (TBR), known as tree rearrangements, are deterministic algorithms to search for optimal or the best phylogenetic tree. The space and the landscape of searching for the optimal phylogenetic tree is known as phylogeny search space.

Phenomics is the systematic study of traits that make up a phenotype. It was coined by UC Berkeley and LBNL scientist Steven A. Garan. As such, it is a transdisciplinary area of research that involves biology, data sciences, engineering and other fields. Phenomics is concerned with the measurement of the phenotype where a phenome is a set of traits that can be produced by a given organism over the course of development and in response to genetic mutation and environmental influences. It is also important to remember that an organisms phenotype changes with time. The relationship between phenotype and genotype enables researchers to understand and study pleiotropy. Phenomics concepts are used in functional genomics, pharmaceutical research, metabolic engineering, agricultural research, and increasingly in phylogenetics.

<span class="mw-page-title-main">Boreoeutheria</span> Magnorder of mammals containing Laurasiatheria and Euarchontoglires

Boreoeutheria is a magnorder of placental mammals that groups together superorders Euarchontoglires and Laurasiatheria. With a few exceptions male animals in the clade have a scrotum, an ancestral feature of the clade. The sub-clade Scrotifera was named after this feature.

<i>Janjucetus</i> Extinct genus of mammals

Janjucetus is an extinct genus of cetacean, and a basal baleen whale (Mysticeti), from the Late Oligocene around 25 million years ago (mya) off southeast Australia, containing one species J. hunderi. Unlike modern mysticetes, it possessed large teeth for gripping and shredding prey, and lacked baleen, and so was likely to have been a predator that captured large single prey animals rather than filter feeding. However, its teeth may have interlocked, much like those of the modern-day filter-feeding crabeater seal, which would have allowed some filter-feeding behavior. Its hunting behaviour was probably similar to the modern-day leopard seal, probably eating large fish. Like baleen whales, Janjucetus could not echolocate; however, it did have unusually large eyes, and so probably had an acute sense of vision. The only specimen was found on the Jan Juc beach, where the remains of the extinct whales Mammalodon, Prosqualodon and Waipatia have also been discovered.

<span class="mw-page-title-main">Cetruminantia</span> Taxonomic clade

The Cetruminantia are a clade made up of the Cetancodontamorpha and their closest living relatives, the Ruminantia.

<i>Aetiocetus</i> Extinct genus of mammals

Aetiocetus is a genus of extinct basal mysticete, or baleen whale that lived 33.9 to 23.03 million years ago, in the Oligocene in the North Pacific ocean, around Japan, Mexico, and Oregon, U.S. It was first described by Douglas Emlong in 1966 and currently contains known four species, A. cotylalveus, A. polydentatus, A. tomitai, and A. weltoni. These whales are remarkable for their retention of teeth and presence of nutrient foramina, indicating that they possessed baleen. Thus, Aetiocetus represents the transition from teeth to baleen in Oligocene mysticetes. Baleen is a highly derived character, or synapomorphy, of mysticetes, and is a keratinous structure that grows from the palate, or roof of the mouth, of the whale. The presence of baleen is inferred from the fossil record in the skull of Aetiocetus. Aetiocetus is known from both sides of the Pacific Ocean: it was first documented in Oregon, United States, but it is also known from Japan and Mexico. The genus is currently constrained to the Northern hemisphere and has little value in biostratigraphic studies of the Oligocene due to its limited occurrences across the Pacific.

<span class="mw-page-title-main">Artiofabula</span> Clade of mammals comprising pigs, cows, hippos, and whales, among others

Artiofabula is a clade made up of the Suina and the Cetruminantia. The clade was found in molecular phylogenetic analyses and contradicted traditional relationships based on morphological analyses.

Nancy B. Simmons is an American zoologist, mammalogist, professor, and author. Specializing in bats, Simmons has conducted extensive research on the morphology and evolutionary history of numerous bat species. She is also the curator-in-charge of the Department of Mammalogy at the American Museum of Natural History and a professor at the Richard Gilder Graduate School.

References

  1. MacClade, a computer program for phylogenetic analysis, David R. Maddison and Wayne P. Maddison.
  2. Mesquite, A modular system for evolutionary analysis, Wayne P. Maddison
  3. Nexus Data Editor A program to edit NEXUS format data files, Roderick D. M. Page
  4. Laing, Adam M.; Sharon Doyle; Maria Eugenia Leone Gold; Sterling J. Nesbitt; Maureen A. O'Leary; Alan H. Turner; Eric W. Wilberg; Karen E. Poole (2017). "Giant taxon-character matrices: the future of morphological systematics". Cladistics. 34 (3): 333–335. doi: 10.1111/cla.12197 . PMID   34645074. S2CID   91087899.
  5. MorphoBank workshop report Archived 2007-10-06 at the Wayback Machine , November 10–11, 2001
  6. O'Leary, Maureen; Kaufman, Seth (October 2011). "MorphoBank: phylophenomics in the "cloud"". Cladistics. 27 (5): 529–537. doi: 10.1111/j.1096-0031.2011.00355.x . PMID   34875801. S2CID   76652345.
  7. Dunaief, Daniel. "Power of 3: SBU's Maureen O'Leary makes MorphoBank funding plans | TBR News Media" . Retrieved 2021-08-31.
  8. "Metering | MorphoBank". morphobank.org. Retrieved 2021-08-31.
  9. O'Leary, M.A.; J.I. Bloch; J.J. Flynn; T.J. Gaudin; A. Giallombardo; N.P. Giannini; S.L. Goldber; B.P. Kraatz; Z.-X. Luo; J. Meng; X. Ni; M.J. Novacek; F.A. Perini; Z. Randall; G.W. Rougier; E.J. Sargis; M.T. Silcox; N.B. Simmons; M. Spaulding; P.M. Velazco; M. Weksler; J.R. Wible; A.L. Cirranello (2013). "The placental mammal ancestor and the post-K-Pg radiation of placentals". Science. 339 (6120): 662–667. doi:10.1126/science.1229237. hdl: 11336/7302 . PMID   23393258. S2CID   206544776.
  10. "MorphoBank Project 265, Phylogenetic relationships of Icaronycteris, Archaeonycteris, Hassianycteris, and Palaeochiropteryx to extant bat lineages, with comments on the evolution of echolocation and foraging strategies in Microchiroptera".
  11. "MorphoBank Project 470, The Comparative Osteology of the Petrotympanic Complex (Ear Region) of Extant Baleen Whales (Cetacea: Mysticeti) Data Matrix".
  12. "MorphoBank Project 578, The pygmy right whale Caperea marginata - the last of the cetotheres".
  13. "MorphoBank Project 790, Investigating the Bivalve Tree of Life -- BivAToL matrix (100+ taxa)".
  14. "MorphoBank Project 44, New genus of cyphophthalmid from the Iberian Peninsula with a phylogenetic analysis of the Sironidae (Arachnida: Opiliones: Cyphophthalmi) and a SEM database of external morphology".
  15. "MorphoBank Project 277, Matrix of Morphological Characters of Humiriaceae".
  16. "MorphoBank Project 46, Integration of Morphological Data Sets for Phylogenetic Analysis of Amniota: The Importance of Integumentary Characters and Increased Taxonomic Sampling".
  17. "MorphoBank Project 146, A new genus and species of green lacewings from Brazil (Neuroptera: Chrysopidae: Leucochrysini)".
  18. "MorphoBank Project 348, Taxonomic revision of the cape verdean reptiles ii".
  19. "MorphoBank Project 1006, Two newly recognized species of Hemidactylus (Squamata, Gekkonidae) from the Arabian Peninsula and Sinai, Egypt".
  20. "MorphoBank Project 268, Predatory functional morphology in raptors: Interdigital variation in talon size is related to prey restraint and immobilisation technique".
  21. "MorphoBank Project 494, The ontogenetic osteohistology of Tenontosaurus tilletti".
  22. "MorphoBank Project 701, Phylogenetic signal and diversity of visceral pigmentation in eight anuran families".
  23. "MorphoBank Project 774, Cryptic species unveiled: the case of the nematode Spauligodon atlanticus".
  24. Hopkins, Melanie J. (2019-09-19). "Using the Paleobiology Database and MorphoBank to facilitate collaborative research and data archival". PeerJ Preprints. 7:e27971v1.
  25. The supermatrix approach to systematics, Alan de Queiroz and John Gatesy, Trends in Ecology & Evolution, Vol.22 No.1, 2006.
  26. O'Leary, M.A.; J.I. Bloch; J.J. Flynn; T.J. Gaudin; A. Giallombardo; N.P. Giannini; S.L. Goldber; B.P. Kraatz; Z.-X. Luo; J. Meng; X. Ni; M.J. Novacek; F.A. Perini; Z. Randall; G.W. Rougier; E.J. Sargis; M.T. Silcox; N.B. Simmons; M. Spaulding; P.M. Velazco; M. Weksler; J.R. Wible; A.L. Cirranello (2013). "The placental mammal ancestor and the post-K-Pg radiation of placentals". Science. 339 (6120): 662–667. doi:10.1126/science.1229237. hdl: 11336/7302 . PMID   23393258. S2CID   206544776.
  27. Ekdale, E.G.; A. Berta; T.A. Deméré (2011). "The Comparative Osteology of the Petrotympanic Complex (Ear Region) of Extant Baleen Whales (Cetacea: Mysticeti)". PLOS ONE. 6 (6): e21311. doi: 10.1371/journal.pone.0021311 . PMC   3120854 . PMID   21731700.

Citations

Wilson, E. O. (2003), "The encyclopedia of life", TREE, 18: 77–80.