Single-access key

Last updated

In phylogenetics, a single-access key (also called dichotomous key, sequential key, analytical key, [1] or pathway key) is an identification key where the sequence and structure of identification steps is fixed by the author of the key. At each point in the decision process, multiple alternatives are offered, each leading to a result or a further choice. The alternatives are commonly called "leads", and the set of leads at a given point a "couplet".

Contents

Single access keys are closely related to decision trees or self-balancing binary search trees. However, to improve the usability and reliability of keys, many single-access keys incorporate reticulation, changing the tree structure into a directed acyclic graph. Single-access keys have been in use for several hundred years. [2] They may be printed in various styles (e. g., linked, nested, indented, graphically branching) or used as interactive, computer-aided keys. In the latter case, either a longer part of the key may be displayed (optionally hyperlinked), or only a single question may be displayed at a time.

If the key has several choices it is described as polychotomous or polytomous. If the entire key consists of exactly two choices at each branching point, the key is called dichotomous. The majority of single-access keys are dichotomous.

Diagnostic ('artificial') versus synoptic ('natural') keys

Any single-access key organizes a large set of items into a structure that breaks them down into smaller, more accessible subsets, with many keys leading to the smallest available classification unit (a species or infraspecific taxon typically in the form of binomial nomenclature). However, a trade-off exists between keys that concentrate on making identification most convenient and reliable (diagnostic keys), and keys which aim to reflect the scientific classification of organisms (synoptic keys). The first type of keys limits the choice of characteristics to those most reliable, convenient, and available under certain conditions. Multiple diagnostic keys may be offered for the same group of organisms: Diagnostic keys may be designed for field (field guides) or laboratory use, for summer or winter use, and they may use geographic distribution or habitat preference of organisms as accessory characteristics. They do so at the expense of creating artificial groups in the key.

An example of a diagnostic key is shown below. It is not based on the taxonomic classification of the included species — compare with the botanical classification of oaks.

In contrast, synoptic keys follow the taxonomic classification as close as possible. Where the classification is already based on phylogenetic studies, the key represents the evolutionary relationships within the group. To achieve this, these keys often have to use more difficult characteristics, which may not always be available in the field, and which may require instruments like a hand lens or microscope. Because of convergent evolution, superficially similar species may be separated early in the key, with superficially different, but genetically closely related species being separated much later in the key. Synoptic keys are typically found in scientific treatments of a taxonomic group ("monographs").

An example of a synoptic key (corresponding to the diagnostic key shown below) is shown further below. In plants, flower and fruit characteristics often are important for primary taxonomic classification:

Example of a diagnostic dichotomous key for some eastern United States oaks based on leaf characteristics

1. Leaves usually without teeth or lobes: 2
1. Leaves usually with teeth or lobes: 5
2. Leaves evergreen: 3
2. Leaves not evergreen: 4
3. Mature plant a large tree — Southern live oak Quercus virginiana
3. Mature plant a small shrub — Dwarf live oak Quercus minima
4. Leaf narrow, about 4-6 times as long as broad — Willow oak Quercus phellos
4. Leaf broad, about 2-3 times as long as broad — Shingle oak Quercus imbricaria
5. Lobes or teeth bristle-tipped: 6
5. Lobes or teeth rounded or blunt-pointed, no bristles: 7
6. Leaves mostly with 3 lobes — Blackjack oak Quercus marilandica
6. Leaves mostly with 7-9 lobes — Northern red oak Quercus rubra
7. Leaves with 5-9 deep lobes — White oak Quercus alba
7. Leaves with 21-27 shallow lobes — Swamp chestnut oak Quercus prinus

This key first differentiates between oaks with entire leaves with normally smooth margins (live oaks, Willow oak, Shingle oak), and other oaks with lobed or toothed leaves. The following steps created smaller and smaller groups (e. g., red oak, white oak), until the species has been keyed out.

Example of a synoptic (taxonomic) dichotomous key for some eastern United States oaks, reflecting taxonomic classification

1. Styles short; acorns mature in 6 months, sweet or slightly bitter, inside of acorn shell hairless (Quercus sect. Quercus, white oaks): 2
1. Styles long, acorns mature in 18 months, very bitter, inside of acorn shell woolly (Quercus sect. Lobatae, red oaks): 5
2. Leaves evergreen: 3
2. Leaves not evergreen: 4
3. Mature plant a large tree — Southern live oak Quercus virginiana
3. Mature plant a small shrub — Dwarf live oak Quercus minima
4. Leaves with 5-9 deep lobes — White oak Quercus alba
4. Leaves with 21-27 shallow lobes — Swamp chestnut oak Quercus prinus
5. Leaves usually without teeth or lobes: 6
5. Leaves usually with teeth or lobes: 7
6. Leaf narrow, about 4-6 times as long as broad — Willow oak Quercus phellos
6. Leaf broad, about 2-3 times as long as broad — Shingle oak Quercus imbricaria
7. Leaves mostly with 3 lobes — Blackjack oak Quercus marilandica
7. Leaves mostly with 7-9 lobes — Northern red oak Quercus rubra

Structural variants of single-access keys

The distinction between dichotomous (bifurcating) and polytomous (multifurcating) keys is a structural one, and identification key software may or may not support polytomous keys. This distinction is less arbitrary than it may appear. Allowing a variable number of choices is disadvantageous in the nested display style, where for each couplet in a polytomous key the entire key must be scanned to the end to determine whether more than a second lead may exist or not. Furthermore, if the alternative lead statements are complex (involving more than one characteristic and possibly "and", "or", or "not"), two alternative statements are significantly easier to understand than couplets with more alternatives. However, the latter consideration can easily be accommodated in a polytomous key where couplets based on a single characteristic may have more than two choices, and complex statements may be limited to two alternative leads.

Another structural distinction is whether only lead statements or question-answer pairs are supported. Most traditional single-access keys use the "lead-style", where each option consists of a statement, only one of which is correct. Especially computer-aided keys occasionally use the "question-answer-style" instead, where a question is presented with a choice of answers. The second style is well known from multiple choice testing and therefore more intuitive for beginners. However, it creates problems when multiple characteristics need to be combined in a single step (as in "Flower red and spines present" versus "Flowers yellow to reddish-orange, spines absent").

Lead style

 1. Flowers red ... 2
  Flowers white ... 3
  Flowers blue ... 4

Question-answer-style

 1. What is the flower color?
  - red ... 2
  - white ... 3
  - blue ... 4

Presentation styles

Single-access keys may be presented in different styles. The two most frequently encountered styles are the

The nested style gives an excellent overview over the structure of the key. With a short key and moderate indentation it can be easy to follow and even backtrace an erroneous identification path. The nested style is problematic with polytomous keys, where each key must be scanned to the end to verify that no further leads exist within a couplet. It also does not easily support reticulation (which requires a link method similar to the one used in the linked style).

Advantages and disadvantages

A large amount of knowledge about reliable and efficient identification procedures may be incorporated in good single-access keys. Characteristics that are reliable and convenient to observe most of the time and for most species (or taxa), and which further provide a well-balanced key (the leads splitting number of species evenly) will be preferred at the start of the key. However, in practice it is difficult to achieve this goal for all taxa in all conditions. If the information for a given identification step is not available, several potential leads must be followed and identification becomes increasingly difficult.

Although software exists that helps in skipping questions in a single-access key, [3] the more general solution to this problem is the construction and use of multi-access keys, allowing a free choice of identification steps and are easily adaptable to different taxa (e.g., very small or very large) as well as different circumstances of identification (e. g., in the field or laboratory).

See also

Related Research Articles

In computer programming, an indentation style is a convention governing the indentation of blocks of code to convey program structure. This article largely addresses the free-form languages, such as C and its descendants, but can be applied to most other programming languages, where whitespace is otherwise insignificant. Indentation style is only one aspect of programming style.

YAML is a human-readable data-serialization language. It is commonly used for configuration files and in applications where data is being stored or transmitted. YAML targets many of the same communications applications as Extensible Markup Language (XML) but has a minimal syntax which intentionally differs from Standard Generalized Markup Language (SGML). It uses both Python-style indentation to indicate nesting, and a more compact format that uses [...] for lists and {...} for maps but forbids tab characters to use as indentation thus only some JSON files are valid YAML 1.2.

<span class="mw-page-title-main">Tab key</span> Key on a keyboard for tabulation

The tab keyTab ↹ on a keyboard is used to advance the cursor to the next tab stop.

In biology, an identification key, taxonomic key, or biological key is a printed or computer-aided device that aids the identification of biological entities, such as plants, animals, fossils, microorganisms, and pollen grains. Identification keys are also used in many other scientific and technical fields to identify various kinds of entities, such as diseases, soil types, minerals, or archaeological and anthropological artifacts.

A computer programming language is said to adhere to the off-side rule of syntax if blocks in that language are expressed by their indentation. The term was coined by Peter Landin, possibly as a pun on the offside rule in association football. This is contrasted with free-form languages, notably curly-bracket programming languages, where indentation has no computational meaning and indent style is only a matter of coding conventions and formatting. Off-side-rule languages are also described as having significant indentation.

Evolutionary taxonomy, evolutionary systematics or Darwinian classification is a branch of biological classification that seeks to classify organisms using a combination of phylogenetic relationship, progenitor-descendant relationship, and degree of evolutionary change. This type of taxonomy may consider whole taxa rather than single species, so that groups of species can be inferred as giving rise to new groups. The concept found its most well-known form in the modern evolutionary synthesis of the early 1940s.

<span class="mw-page-title-main">Flora</span> Plant species in a given region

Flora is all the plant life present in a particular region or time, generally the naturally occurring (indigenous) native plants. Sometimes bacteria and fungi are also referred to as flora, as in the terms gut flora or skin flora.

Automated species identification is a method of making the expertise of taxonomists available to ecologists, parataxonomists and others via digital technology and artificial intelligence. Today, most automated identification systems rely on images depicting the species for the identification. Based on precisely identified images of a species, a classifier is trained. Once exposed to a sufficient amount of training data, this classifier can then identify the trained species on previously unseen images.

Regional floras typically contain complete dichotomous keys for identification of trees and other plants to species. The following guide originates from Our Native Trees and How to Identify Them by Harriet L. Keeler and applies to some flowering trees which are indigenous to the region extending from the Atlantic Ocean to the Rocky Mountains and from Canada to the northern boundaries of the southern states, together with a few well-known and naturalized foreign trees. This guide excludes conifers and is not an exhaustive list of all trees known to occur in the region.

In biology, a reticulation of a single-access identification key connects different branches of the identification tree to improve error tolerance and identification success. In a reticulated key, multiple paths lead to the same result; the tree data structure thus changes from a simple tree to a directed acyclic graph.

A branching identification key within taxonomy, is a presentation form of a single-access key where the structure of the decision tree is displayed graphically as a branching structure, involving lines between items. Depending on the number of branches at a single point, a branching key may be dichotomous or polytomous.

In biology or medicine, a multi-access key is an identification key which overcomes the problem of the more traditional single-access keys of requiring a fixed sequence of identification steps. A multi-access key enables the user to freely choose the characteristics that are convenient to evaluate for the item to be identified.

Polychotomous key refers to the number of alternatives which a decision point may have in a non-temporal hierarchy of independent variables. The number of alternatives are equivalent to the root or nth root of a mathematical or logical variable. Decision points or independent variables with two states have a binary root that is referred to as a dichotomous key whereas, the term polychotomous key refers to roots which are greater than one or unitary and usually greater than two or binary. Polychotomous keys are used in troubleshooting to build troubleshooting charts and in classification/identification schemes with characteristics that have more than one attribute and the order of characteristics is not inherently based on the progression of time.

Psychometric software is software that is used for psychometric analysis of data from tests, questionnaires, or inventories reflecting latent psychoeducational variables. While some psychometric analyses can be performed with standard statistical software like SPSS, most analyses require specialized tools.

In biology, determination is the process of matching a specimen of an organism to a known taxon, for example identifying a plant. The term is also used in cellular biology, where it means the act of the differentiation of stem cells becoming fixed. Various methods are used, for example single or multi-access identification keys.

<span class="mw-page-title-main">Gamopetalae</span> Unranked group of plants

Gamopetalae is an artificial historical group used in the identification of plants based on Bentham and Hooker's classification system.

<span class="mw-page-title-main">Inferae</span> Group of flowering plants

Inferae is an artificial group used in the identification of plants based on Bentham and Hooker's classification. Bentham and Hooker published an excellent classification in three volumes in between 1862 and 1883. As a natural system of classification, it does not show evolutionary relationship between plants but still is a useful and popular system of classification based on a dichotomous key especially for the flowering plant groups (angiosperms). It is the most popular system of classification based on key characteristics enabling taxonomic students to quickly identify plant groups based only on physical characteristics. However, it is not a scientific group and is used for identification purposes only based on similar plant characteristics. Under the system Inferae are a group of plants based on an artificial and non scientific series. The group Inferae are Gamopetalae and dicotyledons. The group comprises;

<span class="mw-page-title-main">Bicarpellatae</span>

Bicarpellatae is an artificial group used in the identification of plants based on Bentham and Hooker's classification system. George Bentham and Joseph Dalton Hooker published an excellent classification in three volumes in between 1862 and 1883. As a natural system of classification, it does not show evolutionary relationship between plants but still is a useful and popular system of classification based on a dichotomous key especially for the flowering plant groups (angiosperms). It is the most popular system of classification based on key characteristics enabling taxonomic students to quickly identify plant groups based only on physical characteristics. However, it is not a scientific group and is used for identification purposes only based on similar plant characteristics. Under the system Bicarpellatae are a group of plants based on an artificial and non scientific series. The group Bicarpellatea are Gamopetalae and dicotyledons. The group comprises;

iSpot Web-based citizen science biodiversity project

iSpot is a website developed and hosted by the Open University with funding from the Open Air Laboratories (OPAL) network with an online community intended to connect nature enthusiasts of all levels.

Indentation plastometry is the idea of using an indentation-based procedure to obtain (bulk) mechanical properties in the form of stress-strain relationships in the plastic regime. Since indentation is a much easier and more convenient procedure than conventional tensile testing, with far greater potential for mapping of spatial variations, this is an attractive concept.

References

  1. Winston, J. 1999. Describing Species. Columbia University Press.
  2. Pankhurst, R. J. 1991. Practical Taxonomic Computing.
  3. Lucid Phoenix