Identification key

In biology, an identification key, taxonomic key, or frequently just key, is a printed or computer-aided device that aids in the identification of biological organisms.

Historically, the most common type of identification key is the dichotomous key, a type of single-access key which offers a fixed sequence of identification steps, each with two alternatives. The earliest examples of identification keys originate in the seventeenth century, but their conceptual history can be traced back to antiquity. Modern multi-access keys allow the user to freely choose the identification steps in any order. They were traditionally implemented using punched cards but now almost exclusively take the form of computer programs.

History

Identification key published in Lamarck's Flore française, Volume 1.

The conceptual origins of the modern identification key can be traced back to antiquity. Theophrastus categorized organisms into "subdivisions" based on dichotomous characteristics. The seventeenth-century Chinese herbalist, Pao Shan, in his treatise Yeh-ts'ai Po-Iu, included a systematic categorization of plants based on their apparent characteristics specifically for the purposes of identification. [1] :2

Seventeenth-century naturalists, including John Ray, Rivinius, and Nehemiah Grew, published examples of bracketed tables. However, these examples were not strictly keys in the modern sense of an analytical device used to identify a single specimen, since they often did not lead to a single end point, and instead functioned more as synopses of classification schemes. [1] :3–8

The first analytical identification key is credited to Lamarck, who included several in his 1778 book Flore Françoise. Lamarck's key follows more or less the same design as the modern dichotomous, bracketed key. [1] :10

In 1845, Alphonso Wood became the first American to use identification keys. Other early instances of keys are found in the works of Asa Gray and W. H. Evans. [1] :12–14

Terminology

Identification keys are known historically and contemporarily by many names, including analytical key, entomological key, artificial key, [1] diagnostic key, [2] determinator, [3] and taxonomic key. [4]

Within the biological literature, identification keys are referred to simply as keys. [5] They are also commonly referred to in general as dichotomous keys, [6] though this term strictly refers to a specific type of identification key (see Types of keys).

Use

Identification keys are used in systematic biology and taxonomy to identify the genus or species of a specimen organism from a set of known taxa. They are commonly used in the fields of microbiology, plant taxonomy, and entomology, as groups of related taxa in these fields tend to be very large. [3] However, they have also been used to classify non-organisms, such as birds' nests, and in non-biological sciences such as geology. [1] :14–15 Similar methods have also been used in computer science. [7]

A user of a key selects from a series of choices, representing mutually exclusive features of the specimen, with the aim of arriving at the sole remaining identity from the group of taxa. [8] Each step in the key employs a character: a distinguishing feature of an organism that is conveniently observable. [3]

Types of keys

Identification keys are sometimes also referred to as artificial keys to differentiate them from other diagrams that visualize a classification scheme, often in the form of a key or tree structure. These diagrams are called natural keys or synopses and are not used for identifying specimens. In contrast, an artificial identification key is a tool that utilizes the characters that are easiest to observe and most practical for arriving at an identity. [2] :7 [6] :225 Identification keys can be divided into two main types.

Single-access key

User interaction steps in a single-access key. The sequence of steps follows the data structure.

A single-access key (also called a sequential key or an analytical key) has a fixed structure and sequence. The user must begin at the first step of the key and proceed until the end. A single-access key whose steps each consist of two mutually exclusive statements (leads) is called a dichotomous key. Most single-access keys are dichotomous. [3] A single-access key with more than two leads per step is referred to as polytomous. [9]
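The fixed structure of a dichotomous key can be sketched as a small binary tree. The following Python example is only an illustration: the class names Lead and Couplet, the function identify, and the three-taxon toy key are assumptions made for this sketch and are not taken from any published key or identification software.

```python
from dataclasses import dataclass
from typing import Callable, Union


@dataclass
class Lead:
    statement: str                 # one of two mutually exclusive statements
    target: Union["Couplet", str]  # the next couplet, or the name of a taxon


@dataclass
class Couplet:
    first: Lead
    second: Lead


def identify(key: Couplet, matches: Callable[[str], bool]) -> str:
    """Walk the key from its first couplet until a taxon name is reached."""
    node: Union[Couplet, str] = key
    while isinstance(node, Couplet):
        # The leads are mutually exclusive, so testing the first one suffices.
        lead = node.first if matches(node.first.statement) else node.second
        node = lead.target
    return node


# Toy key for three kinds of tree (illustrative only, not a published key).
TOY_KEY = Couplet(
    Lead("Leaves needle-like", "pine"),
    Lead("Leaves broad", Couplet(
        Lead("Leaves opposite", "maple"),
        Lead("Leaves alternate", "oak"),
    )),
)

observed = {"Leaves broad", "Leaves alternate"}
print(identify(TOY_KEY, lambda statement: statement in observed))  # oak
```

The user has no say in the order of the questions: the walk always starts at the first couplet and follows whichever lead matches the specimen. A polytomous key could be sketched the same way by storing a list of leads per step instead of exactly two.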

Presentational variants

Dichotomous keys can be presented in two main styles: linked and nested. In the linked style (also referred to as open, parallel, linked, and juxtaposition [9] :63), each pair of leads (called a couplet) is printed together. In the nested style (also referred to as closed, yoked, and indented [9] :63), the subsequent steps after choosing a lead are printed directly underneath it, in succession. To follow the second lead of the couplet, the user must skip over the nested material that follows logically from the first lead of the couplet. [2] Nested keys are more commonly known as indented, but unfortunately this refers to an accidental (albeit frequent) rather than essential quality. Nested keys may be printed without indentation to preserve space (relying solely on corresponding lead symbols) and linked keys may be indented to enhance the visibility of the couplet structure. [9] :63
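The two styles are different printings of the same underlying key. The Python sketch below renders one toy key both ways; the tuple layout, the function names render_linked and render_nested, the "1a/1b" lead symbols, and the toy taxa are all assumptions made for illustration rather than a standard notation.

```python
# A couplet is a pair of leads; a lead is (statement, target), where the
# target is either a taxon name (str) or another couplet (tuple).
TOY_KEY = (
    ("Leaves broad", (
        ("Leaves opposite", "maple"),
        ("Leaves alternate", "oak"),
    )),
    ("Leaves needle-like", "pine"),
)


def render_linked(key):
    """Linked style: both leads of a couplet are printed together."""
    lines = []
    queue = [key]  # couplets to print; list position + 1 is the couplet number
    i = 0
    while i < len(queue):
        for label, (statement, target) in zip("ab", queue[i]):
            if isinstance(target, str):
                dest = target
            else:
                queue.append(target)
                dest = f"go to {len(queue)}"
            lines.append(f"{i + 1}{label}. {statement} ... {dest}")
        i += 1
    return lines


def render_nested(key, depth=0, counter=None):
    """Nested style: the steps that follow a lead are printed beneath it."""
    counter = counter if counter is not None else [0]
    counter[0] += 1
    number = counter[0]
    pad = "  " * depth  # indentation is optional; the lead symbols carry the structure
    lines = []
    for label, (statement, target) in zip("ab", key):
        if isinstance(target, str):
            lines.append(f"{pad}{number}{label}. {statement} ... {target}")
        else:
            lines.append(f"{pad}{number}{label}. {statement}")
            lines.extend(render_nested(target, depth + 1, counter))
    return lines


print("\n".join(render_linked(TOY_KEY)))
print()
print("\n".join(render_nested(TOY_KEY)))
```

In the nested output, lead 1b appears only after everything that follows from lead 1a, which is the material a user must skip over to reach the second lead.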

Multi-access key

User interaction steps in a multi-access key. The sequence of steps is determined by the user.

A multi-access key (also called a free-access key [9] or polyclave [8]) allows a user to specify characters in any order. Therefore, a multi-access key can be thought of as "the set of all possible single-access keys that arise by permutating the order of characters." [9] :60 Although print versions of multi-access keys exist, they were historically created using punched card systems. [8] Today, multi-access keys are computer-aided tools. [9] :61
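One simple way to picture a multi-access key is as a table of taxa against characters, from which non-matching taxa are eliminated after each observation. The Python sketch below assumes such a table; the name MATRIX, the function narrow, and the three toy taxa with their character states are illustrative assumptions, not the data model of any particular program.

```python
from itertools import permutations

# Toy taxon-by-character table (illustrative only).
MATRIX = {
    "pine":  {"leaf shape": "needle-like", "fruit": "cone"},
    "maple": {"leaf shape": "broad", "leaf arrangement": "opposite", "fruit": "samara"},
    "oak":   {"leaf shape": "broad", "leaf arrangement": "alternate", "fruit": "acorn"},
}


def narrow(candidates, character, state):
    """Keep only the candidate taxa whose recorded state matches the observation."""
    return {taxon for taxon in candidates if MATRIX[taxon].get(character) == state}


# Observations made on a specimen, in whatever order was convenient.
observed = {"leaf shape": "broad", "leaf arrangement": "alternate"}

# Every ordering of the same observations leaves the same taxon.
for order in permutations(observed):
    candidates = set(MATRIX)
    for character in order:
        candidates = narrow(candidates, character, observed[character])
    print(" -> ".join(order), "=>", sorted(candidates))
```

Each ordering of the characters corresponds to one path through one possible single-access key, which is the sense of the quotation above. A punched card system performs the same elimination physically, while a computer program does it by filtering the table.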

Key construction

An early attempt to standardize the construction of keys was offered by E. B. Williamson in the June 1922 volume of Science. [10] More recently, Richard Pankhurst published guidelines and practical tips for key construction in a section of his 1978 book, Biological Identification. [2] :15–22

Identification errors may have serious consequences in both pure and applied disciplines, including ecology, medical diagnosis, pest control, and forensics. [11]

Computer-aided key construction

The first computer programs for constructing identification keys were created in the early 1970s. [12] [13] Since then, several popular programs have been developed, including DELTA, XPER, and LucID. [3] :379–80

Until recently, single-access keys were only rarely developed as computer-aided, interactive tools. Noteworthy developments in this area are the commercial LucID Phoenix application, the FRIDA/Dryades software, the KeyToNature Open Key Editor, and the open-source WikiKeys and jKey applications on biowikifarm. [9] :62

References

  1. Voss, Edward G. (December 1952). "The history of keys and phylogenetic trees in systematic biology". Journal of the Scientific Laboratories of Denison University. 43: 1–25.
  2. Pankhurst, R. J. (1978). "Conventional Identification Methods". Biological identification: the principles and practice of identification methods in biology. Baltimore: University Park Press. pp. 11–28. ISBN 978-0-8391-1344-7.
  3. Winston, Judith E. (1999). "Keys". Describing species: practical taxonomic procedure for biologists. New York: Columbia University Press. pp. 367–381. ISBN 978-0-231-06824-6.
  4. Bohemier, Kayleigh. "Yale University Library Research Guides: Taxonomic Keys: Home". guides.library.yale.edu. Retrieved 2024-10-20.
  5. "key (identification key)". A dictionary of biology. Oxford paperback reference (6th ed.). Oxford: Oxford University Press. 2008. p. 356. ISBN 978-0-19-920462-5.
  6. Lawrence, George H. M. (1951). Taxonomy of Vascular Plants (1st ed.). New York: The Macmillan Company. pp. 225–228.
  7. Payne, R. W. (1983). "Identification Keys". In Kotz, Samuel (ed.). Encyclopedia of Statistical Sciences. Vol. 4. John Wiley & Sons. pp. 6–10. ISBN 0471055514.
  8. Thain, M.; Hickman, M. (2004). "identification keys". The Penguin dictionary of biology (11th ed.). London; New York: Penguin Books. p. 363. ISBN 978-0-14-101396-1.
  9. Hagedorn, Gregor; Rambold, Gerhard; Martellos, Stefano (2010). "Types of identification keys" (PDF). In Nimis, P. L.; Vignes Lebbe, R. (eds.). Tools for Identifying Biodiversity: Progress and Problems. Edizioni Università di Trieste. pp. 59–64. ISBN 978-88-8303-295-0, via openstarts.units.it.
  10. Williamson, E. B. (1922). "Keys in Systematic Work". Science. 55 (1435): 703–704. doi:10.1126/science.55.1435.703.a. ISSN 0036-8075. JSTOR 1645312. PMID 17751446.
  11. Marshall, Steve (Fall 2000). "Comments on error rates in insect identifications" (PDF). Newsletter of the Biological Survey of Canada (Terrestrial Arthropods). 19 (2). Biological Survey of Canada: 45–47.
  12. Payne, R. W. (1984). "Computer Construction and Typesetting of Identification Keys". The New Phytologist. 96 (4): 631–634. doi:10.1111/j.1469-8137.1984.tb03597.x. ISSN 0028-646X. JSTOR 2432648.
  13. Pankhurst, R. J. (1970-02-01). "A computer program for generating diagnostic keys". The Computer Journal. 13 (2): 145–151. doi:10.1093/comjnl/13.2.145. ISSN 0010-4620.

This article incorporates text from a free content work. Licensed under CC BY-SA (license statement/permission). Text taken from Types of identification keys, Gregor Hagedorn, Gerhard Rambold, Stefano Martellos, Edizioni Università di Trieste.

Further reading

Pankhurst, Richard John (1991). Practical taxonomic computing. Cambridge: Cambridge University Press. ISBN 978-0-521-41760-0. Chapters 4–6.