Olog

Last updated February 13, 2023

The theory of ologs is an attempt to provide a rigorous mathematical framework for knowledge representation, construction of scientific models and data storage using category theory, linguistic and graphical tools. Ologs were introduced in 2010 by David Spivak,^[1] a research scientist in the Department of Mathematics, MIT.

Etymology

The term "olog" is short for "ontology log". "Ontology" derives from onto- , from the Greek ὤν, ὄντος "being; that which is", present participle of the verb εἰμί "be", and -λογία, -logia: science, study, theory.

Mathematical formalism

An olog ${\mathcal {C}}$ for a given domain is a category whose objects are boxes labeled with phrases (more specifically, singular indefinite noun phrases) relevant to the domain, and whose morphisms are directed arrows between the boxes, labeled with verb phrases also relevant to the domain. These noun and verb phrases combine to form sentences that express relationships between objects in the domain.

In every olog, the objects exist within a target category. Unless otherwise specified, the target category is taken to be ${\textbf {Set}}$ , the category of sets and functions. The boxes in the above diagram represent objects of ${\textbf {Set}}$ . For example, the box containing the phrase "an amino acid" represents the set of all amino acids, and the box containing the phrase "a side chain" represents the set of all side chains. The arrow labeled "has" that points from "an amino acid" to "a side chain" represents the function that maps each amino acid to its unique side chain.

Another target category that can be used is the Kleisli category ${\mathcal {C}}_{\mathbb {P} }$ of the power set monad. Given an $A\in Ob({\textbf {Set}})$ , $\mathbb {P} (A)$ is then the power set of A. The natural transformation $\eta$ maps $a\in A$ to the singleton $\{a\}$ , and the natural transformation $\mu$ maps a set of sets to its union. The Kleisli category ${\mathcal {C}}_{\mathbb {P} }$ is the category with the objects matching those in $\mathbb {P}$ , and morphisms that establish binary relations. Given a morphism $f:A\to B$ , and given $a\in A$ and $b\in B$ , we define the morphism $R$ by saying that $(a,b)\in R$ whenever $b\in f(a)$ . The verb phrases used with this target category would need to make sense with objects that are subsets: for example, "is related to" or "is greater than".

Another possible target category is the Kleisli category of probability distributions, called the Giry monad.^[2] This provides a generalization of Markov decision processes.

Ologs and databases

An olog ${\mathcal {C}}$ can also be viewed as a database schema. Every box (object of ${\mathcal {C}}$ ) in the olog is a table $T$ and the arrows (morphisms) emanating from the box are columns in ${\mathcal {C}}$ . The assignment of a particular instance to an object of ${\mathcal {C}}$ is done through a functor $I:{\mathcal {C}}\to {\textbf {Set}}$ . In the example above, the box "an amino acid" will be represented as a table whose number of rows is equal to the number of types of amino acids and whose number of columns is three, one column for each arrow emanating from that box.

Relations between ologs

"Communication" between different ologs which in practice can be communication between different models or world-views is done using functors. Spivak coins the notions of a 'meaningful' and 'strongly meaningful' functors.^[1] Let ${\mathcal {C}}$ and ${\mathcal {D}}$ be two ologs, $I:{\mathcal {C}}\to {\textbf {Set}}$ , $J:{\mathcal {D}}\to {\textbf {Set}}$ functors (see the section on ologs and databases) and $F:{\mathcal {C}}\to {\mathcal {D}}$ a functor. $F$ is called a schema mapping. We say that a $F$ is meaningful if there exists a natural transformation $m:I\to F^{*}J$ (the pullback of J by F).

Taking as an example ${\mathcal {C}}$ and ${\mathcal {D}}$ as two different scientific models, the functor $F$ is meaningful if "predictions", which are objects in ${\textbf {Set}}$ , made by the first model ${\mathcal {C}}$ can be translated to the second model ${\mathcal {D}}$ .

We say that $F$ is strongly meaningful if given an object $X\in {\mathcal {C}}$ we have $I(X)=J(F(X))$ . This equality is equivalent to requiring $m$ to be a natural isomorphism.

Sometimes it will be hard to find a meaningful functor $F$ from ${\mathcal {C}}$ to ${\mathcal {D}}$ . In such a case we may try to define a new olog ${\mathcal {B}}$ which represents the common ground of ${\mathcal {C}}$ and ${\mathcal {D}}$ and find meaningful functors $F_{\mathcal {C}}:{\mathcal {B}}\to {\mathcal {C}}$ and $F_{\mathcal {D}}:{\mathcal {B}}\to {\mathcal {D}}$ .

If communication between ologs is limited to a two-way communication as described above then we may think of a collection of ologs as nodes of a graph and of the edges as functors connecting the ologs. If a simultaneous communication between more than two ologs is allowed then the graph becomes a symmetric simplicial complex.

Rules of good practice

Spivak provides some rules of good practice for writing an olog whose morphisms have a functional nature (see the first example in the section Mathematical formalism).^[1] The text in a box should adhere to the following rules:

begin with the word "a" or "an". (Example: "an amino acid").
refer to a distinction made and recognizable by the olog's author.
refer to a distinction for which there is well defined functor whose range is ${\textbf {Set}}$ , i.e. an instance can be documented. (Example: there is a set of all amino acids).
declare all variables in a compound structure. (Example: instead of writing in a box "a man and a woman" write "a man $m$ and a woman $w$ " or "a pair $(m,w)$ where $m$ is a man and $w$ is a woman").

The first three rules ensure that the objects (the boxes) defined by the olog's author are well-defined sets. The fourth rule improves the labeling of arrows in an olog.

Applications

This concept was used in a paper published in the December 2011 issue of BioNanoScience by David Spivak and others to establish a scientific analogy between spider silk and musical composition.^[3]

Related Research Articles

In mathematics, especially in category theory and homotopy theory, a groupoid generalises the notion of group in several equivalent ways. A groupoid can be seen as a:

In mathematics, more specifically in category theory, a universal property is a property that characterizes up to an isomorphism the result of some constructions. Thus, universal properties can be used for defining some objects independently from the method chosen for constructing them. For example, the definitions of the integers from the natural numbers, of the rational numbers from the integers, of the real numbers from the rational numbers, and of polynomial rings from the field of their coefficients can all be done in terms of universal properties. In particular, the concept of universal property allows a simple proof that all constructions of real numbers are equivalent: it suffices to prove that they satisfy the same universal property.

In category theory, a branch of mathematics, a natural transformation provides a way of transforming one functor into another while respecting the internal structure of the categories involved. Hence, a natural transformation can be considered to be a "morphism of functors". Informally, the notion of a natural transformation states that a particular map between functors can be done consistently over an entire category.

In mathematics, specifically category theory, adjunction is a relationship that two functors may exhibit, intuitively corresponding to a weak form of equivalence between two related categories. Two functors that stand in this relationship are known as adjoint functors, one being the left adjoint and the other the right adjoint. Pairs of adjoint functors are ubiquitous in mathematics and often arise from constructions of "optimal solutions" to certain problems, such as the construction of a free group on a set in algebra, or the construction of the Stone–Čech compactification of a topological space in topology.

In category theory, a branch of mathematics, an enriched category generalizes the idea of a category by replacing hom-sets with objects from a general monoidal category. It is motivated by the observation that, in many practical applications, the hom-set often has additional structure that should be respected, e.g., that of being a vector space of morphisms, or a topological space of morphisms. In an enriched category, the set of morphisms associated with every pair of objects is replaced by an object in some fixed monoidal category of "hom-objects". In order to emulate the (associative) composition of morphisms in an ordinary category, the hom-category must have a means of composing hom-objects in an associative manner: that is, there must be a binary operation on objects giving us at least the structure of a monoidal category, though in some contexts the operation may also need to be commutative and perhaps also to have a right adjoint.

In mathematics, a concrete category is a category that is equipped with a faithful functor to the category of sets. This functor makes it possible to think of the objects of the category as sets with additional structure, and of its morphisms as structure-preserving functions. Many important categories have obvious interpretations as concrete categories, for example the category of topological spaces and the category of groups, and trivially also the category of sets itself. On the other hand, the homotopy category of topological spaces is not concretizable, i.e. it does not admit a faithful functor to the category of sets.

In category theory, a branch of mathematics, a monad is a monoid in the category of endofunctors. An endofunctor is a functor mapping a category to itself, and a monad is an endofunctor together with two natural transformations required to fulfill certain coherence conditions. Monads are used in the theory of pairs of adjoint functors, and they generalize closure operators on partially ordered sets to arbitrary categories. Monads are also useful in the theory of datatypes and in functional programming languages, allowing languages with non-mutable states to do things such as simulate for-loops; see Monad.

In category theory, a branch of mathematics, a functor category $is a category where the objects are the functors and the morphisms are natural transformations between the functors. Functor categories are of interest for two main reasons:$

In mathematics, a comma category is a construction in category theory. It provides another way of looking at morphisms: instead of simply relating objects of a category to one another, morphisms become objects in their own right. This notion was introduced in 1963 by F. W. Lawvere, although the technique did not become generally known until many years later. Several mathematical concepts can be treated as comma categories. Comma categories also guarantee the existence of some limits and colimits. The name comes from the notation originally used by Lawvere, which involved the comma punctuation mark. The name persists even though standard notation has changed, since the use of a comma as an operator is potentially confusing, and even Lawvere dislikes the uninformative term "comma category".

In mathematics, the derived categoryD(A) of an abelian category A is a construction of homological algebra introduced to refine and in a certain sense to simplify the theory of derived functors defined on A. The construction proceeds on the basis that the objects of D(A) should be chain complexes in A, with two such chain complexes considered isomorphic when there is a chain map that induces an isomorphism on the level of homology of the chain complexes. Derived functors can then be defined for chain complexes, refining the concept of hypercohomology. The definitions lead to a significant simplification of formulas otherwise described (not completely faithfully) by complicated spectral sequences.

Fibred categories are abstract entities in mathematics used to provide a general framework for descent theory. They formalise the various situations in geometry and algebra in which inverse images of objects such as vector bundles can be defined. As an example, for each topological space there is the category of vector bundles on the space, and for every continuous map from a topological space X to another topological space Y is associated the pullback functor taking bundles on Y to bundles on X. Fibred categories formalise the system consisting of these categories and inverse image functors. Similar setups appear in various guises in mathematics, in particular in algebraic geometry, which is the context in which fibred categories originally appeared. Fibered categories are used to define stacks, which are fibered categories with "descent". Fibrations also play an important role in categorical semantics of type theory, and in particular that of dependent type theories.

This is a glossary of properties and concepts in category theory in mathematics.

In category theory, a Kleisli category is a category naturally associated to any monad T. It is equivalent to the category of free T-algebras. The Kleisli category is one of two extremal solutions to the question Does every monad arise from an adjunction? The other extremal solution is the Eilenberg–Moore category. Kleisli categories are named for the mathematician Heinrich Kleisli.

In category theory, a monoidal monad $is a monad on a monoidal category such that the functor is a lax monoidal functor and the natural transformations and are monoidal natural transformations. In other words, is equipped with coherence maps and satisfying certain properties, and the unit and multiplication are monoidal natural transformations. By monoidality of, the morphisms and are necessarily equal.$

In category theory, a branch of mathematics, the diagonal functor $is given by, which maps objects as well as morphisms. This functor can be employed to give a succinct alternate description of the product of objects within the category : a product is a universal arrow from to . The arrow comprises the projection maps.$

In homological algebra, the hyperhomology or hypercohomology is a generalization of (co)homology functors which takes as input not objects in an abelian category $but instead chain complexes of objects, so objects in . It is a sort of cross between the derived functor cohomology of an object and the homology of a chain complex since hypercohomology corresponds to the derived global sections functor .$

In mathematics, mixed Hodge modules are the culmination of Hodge theory, mixed Hodge structures, intersection cohomology, and the decomposition theorem yielding a coherent framework for discussing variations of degenerating mixed Hodge structures through the six functor formalism. Essentially, these objects are a pair of a filtered D-module $together with a perverse sheaf such that the functor from the Riemann-Hilbert correspondence sends to . This makes it possible to construct a Hodge structure on intersection cohomology, one of the key problems when the subject was discovered. This was solved by Morihiko Saito who found a way to use the filtration on a coherent D-module as an analogue of the Hodge filtration for a Hodge structure. This made it possible to give a Hodge structure on an intersection cohomology sheaf, the simple objects in the Abelian category of perverse sheaves.$

In mathematics a stack or 2-sheaf is, roughly speaking, a sheaf that takes values in categories rather than sets. Stacks are used to formalise some of the main constructions of descent theory, and to construct fine moduli stacks when fine moduli spaces do not exist.

In mathematics, the quotient of an abelian category $by a Serre subcategory is the abelian category which, intuitively, is obtained from by ignoring all objects from . There is a canonical exact functor whose kernel is, and is in a certain sense the most general abelian category with this property.$

In mathematics, an Abelian 2-group is a higher dimensional analogue of an Abelian group, in the sense of higher algebra, which were originally introduced by Alexander Grothendieck while studying abstract structures surrounding Abelian varieties and Picard groups. More concretely, they are given by groupoids $which have a bifunctor which acts formally like the addition an Abelian group. Namely, the bifunctor has a notion of commutativity, associativity, and an identity structure. Although this seems like a rather lofty and abstract structure, there are several examples of Abelian 2-groups. In fact, some of which provide prototypes for more complex examples of higher algebraic structures, such as Abelian n-groups.$

References

1 2 3 Spivak (2011). "Ologs: A categorical framework for knowledge representation". PLOS ONE. 7 (1): e24274. arXiv: 1102.1889v1 . Bibcode:2012PLoSO...724274S. doi: 10.1371/journal.pone.0024274 .
↑ Giry monad at the nLab
↑ Giesa, Tristan; Spivak, David I.; Buehler, Markus J. (2011). "Reoccurring patterns in hierarchical protein materials and music: The power of analogies". BioNanoScience. 1 (4): 153–161. arXiv: 1111.5297 . doi:10.1007/s12668-011-0022-5. S2CID 5178100.

External links

Spivak, David I. "Categorical informatics". math.mit.edu. Retrieved 2 May 2017.
Spivak, David I. (2014). Category theory for the sciences. Cambridge, MA: MIT Press. ISBN 9780262028134. OCLC 876833252.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[Spivak-1] 1 2 3 Spivak (2011). "Ologs: A categorical framework for knowledge representation". PLOS ONE. 7 (1): e24274. arXiv: 1102.1889v1 . Bibcode:2012PLoSO...724274S. doi: 10.1371/journal.pone.0024274 .

[2] Giry monad at the nLab

[3] Giesa, Tristan; Spivak, David I.; Buehler, Markus J. (2011). "Reoccurring patterns in hierarchical protein materials and music: The power of analogies". BioNanoScience. 1 (4): 153–161. arXiv: 1111.5297 . doi:10.1007/s12668-011-0022-5. S2CID 5178100.

[1]

[2]

[3]