Dependent type

Last updated

In computer science and logic, a dependent type is a type whose definition depends on a value. It is an overlapping feature of type theory and type systems. In intuitionistic type theory, dependent types are used to encode logic's quantifiers like "for all" and "there exists". In functional programming languages like Agda, ATS, Coq, F*, Epigram, Idris, and Lean, dependent types help reduce bugs by enabling the programmer to assign types that further restrain the set of possible implementations.

Contents

Two common examples of dependent types are dependent functions and dependent pairs. The return type of a dependent function may depend on the value (not just type) of one of its arguments. For instance, a function that takes a positive integer may return an array of length , where the array length is part of the type of the array. (Note that this is different from polymorphism and generic programming, both of which include the type as an argument.) A dependent pair may have a second value the type of which depends on the first value. Sticking with the array example, a dependent pair may be used to pair an array with its length in a type-safe way.

Dependent types add complexity to a type system. Deciding the equality of dependent types in a program may require computations. If arbitrary values are allowed in dependent types, then deciding type equality may involve deciding whether two arbitrary programs produce the same result; hence the decidability of type checking may depend on the given type theory's semantics of equality, that is, whether the type theory is intensional or extensional. [1]

History

In 1934, Haskell Curry noticed that the types used in typed lambda calculus, and in its combinatory logic counterpart, followed the same pattern as axioms in propositional logic. Going further, for every proof in the logic, there was a matching function (term) in the programming language. One of Curry's examples was the correspondence between simply typed lambda calculus and intuitionistic logic. [2]

Predicate logic is an extension of propositional logic, adding quantifiers. Howard and de Bruijn extended lambda calculus to match this more powerful logic by creating types for dependent functions, which correspond to "for all", and dependent pairs, which correspond to "there exists". [3]

(Because of this and other work by Howard, propositions-as-types is known as the Curry–Howard correspondence.)

Formal definition

Loosely speaking, dependent types are similar to the type of an indexed family of sets. More formally, given a type in a universe of types , one may have a family of types, which assigns to each term a type . We say that the type B(a) varies with a.

Π type

A function whose type of return value varies with its argument (i.e. there is no fixed codomain) is a dependent function and the type of this function is called dependent product type, pi-type (Π type) or dependent function type. [4] From a family of types we may construct the type of dependent functions , whose terms are functions that take a term and return a term in . For this example, the dependent function type is typically written as or .

If is a constant function, the corresponding dependent product type is equivalent to an ordinary function type. That is, is judgmentally equal to when B does not depend on x.

The name 'Π-type' comes from the idea that these may be viewed as a Cartesian product of types. Π-types can also be understood as models of universal quantifiers.

For example, if we write for n-tuples of real numbers, then would be the type of a function which, given a natural number n, returns a tuple of real numbers of size n. The usual function space arises as a special case when the range type does not actually depend on the input. E.g. is the type of functions from natural numbers to the real numbers, which is written as in typed lambda calculus.

For a more concrete example, taking A to be the type of unsigned integers from 0 to 255 (the ones that fit into 8 bits or 1 byte) and B(a) = Xa for a : A, then devolves into the product of X0 × X1 × X2 × ... × X253 × X254 × X255.

Σ type

The dual of the dependent product type is the dependent pair type, dependent sum type, sigma-type, or (confusingly) dependent product type. [4] Sigma-types can also be understood as existential quantifiers. Continuing the above example, if, in the universe of types , there is a type and a family of types , then there is a dependent pair type . (The alternative notations are similar to that of Π types.)

The dependent pair type captures the idea of an ordered pair where the type of the second term is dependent on the value of the first. If then and . If B is a constant function, then the dependent pair type becomes (is judgementally equal to) the product type, that is, an ordinary Cartesian product . [4] [ full citation needed ]

For a more concrete example, taking A to again be type of unsigned integers from 0 to 255, and B(a) to again be equal to Xa for 256 more arbitrary Xa, then devolves into the sum X0 + X1 + X2 + ... + X253 + X254 + X255.

Example as existential quantification

Let be some type, and let . By the Curry–Howard correspondence, B can be interpreted as a logical predicate on terms of A. For a given , whether the type B(a) is inhabited indicates whether a satisfies this predicate. The correspondence can be extended to existential quantification and dependent pairs: the proposition is true if and only if the type is inhabited.

For example, is less than or equal to if and only if there exists another natural number such that m + k = n. In logic, this statement is codified by existential quantification:

This proposition corresponds to the dependent pair type:

That is, a proof of the statement that m is less than or equal to n is a pair that contains both a non-negative number k, which is the difference between m and n, and a proof of the equality m + k = n.

Systems of the lambda cube

Henk Barendregt developed the lambda cube as a means of classifying type systems along three axes. The eight corners of the resulting cube-shaped diagram each correspond to a type system, with simply typed lambda calculus in the least expressive corner, and calculus of constructions in the most expressive. The three axes of the cube correspond to three different augmentations of the simply typed lambda calculus: the addition of dependent types, the addition of polymorphism, and the addition of higher kinded type constructors (functions from types to types, for example). The lambda cube is generalized further by pure type systems.

First order dependent type theory

The system of pure first order dependent types, corresponding to the logical framework LF, is obtained by generalising the function space type of the simply typed lambda calculus to the dependent product type.

Second order dependent type theory

The system of second order dependent types is obtained from by allowing quantification over type constructors. In this theory the dependent product operator subsumes both the operator of simply typed lambda calculus and the binder of System F.

Higher order dependently typed polymorphic lambda calculus

The higher order system extends to all four forms of abstraction from the lambda cube: functions from terms to terms, types to types, terms to types and types to terms. The system corresponds to the calculus of constructions whose derivative, the calculus of inductive constructions is the underlying system of the Coq proof assistant.

Simultaneous programming language and logic

The Curry–Howard correspondence implies that types can be constructed that express arbitrarily complex mathematical properties. If the user can supply a constructive proof that a type is inhabited (i.e., that a value of that type exists) then a compiler can check the proof and convert it into executable computer code that computes the value by carrying out the construction. The proof checking feature makes dependently typed languages closely related to proof assistants. The code-generation aspect provides a powerful approach to formal program verification and proof-carrying code, since the code is derived directly from a mechanically verified mathematical proof.

Comparison of languages with dependent types

LanguageActively developedParadigm [lower-alpha 1] Tactics Proof terms Termination checking Types can depend on [lower-alpha 2] Universes Proof irrelevance Program extraction Extraction erases irrelevant terms
Ada 2012 Yes [5] ImperativeYes [6] No ?Any term [lower-alpha 3]  ? ? Ada  ?
Agda Yes [7] Purely functional Few/limited [lower-alpha 4] YesYes (optional)Any termYes (optional) [lower-alpha 5] Proof-irrelevant arguments [9] Proof-irrelevant propositions [10] Haskell, JavaScriptYes [9]
ATS Yes [11] Functional / imperativeNo [12] YesYesStatic terms [13]  ?YesYesYes
Cayenne NoPurely functionalNoYesNoAny termNoNo ? ?
Gallina
(Coq)
Yes [14] Purely functionalYesYesYesAny termYes [lower-alpha 6] Yes [15] Haskell, Scheme and OCaml Yes
Dependent ML No [lower-alpha 7]  ? ?Yes ?Natural numbers ? ? ? ?
F* Yes [16] Functional and imperativeYes [17] YesYes (optional)Any pure termYesYes OCaml, F#, and C Yes
Guru No [18] Purely functional [19] hypjoin [20] Yes [19] YesAny termNoYesCarrawayYes
Idris Yes [21] Purely functional [22] Yes [23] YesYes (optional)Any termYesNoYesYes [23]
Lean YesPurely functionalYesYesYesAny termYesYesYesYes
Matita Yes [24] Purely functionalYesYesYesAny termYesYes OCaml Yes
NuPRL YesPurely functionalYesYesYesAny termYes ?Yes ?
PVS Yes ?Yes ? ? ? ? ? ? ?
Sage Archived 2020-11-09 at the Wayback Machine No [lower-alpha 8] Purely functionalNoNoNo ?No ? ? ?
Twelf Yes Logic programming  ?YesYes (optional)Any (LF) termNoNo ? ?
  1. This refers to the core language, not to any tactic (theorem proving procedure) or code generation sublanguage.
  2. Subject to semantic constraints, such as universe constraints
  3. Static_Predicate for restricted terms, Dynamic_Predicate for Assert-like checking of any term in type cast
  4. Ring solver [8]
  5. Optional universes, optional universe polymorphism, and optional explicitly specified universes
  6. Universes, automatically inferred universe constraints (not the same as Agda's universe polymorphism) and optional explicit printing of universe constraints
  7. Has been superseded by ATS
  8. Last Sage paper and last code snapshot are both dated 2006

See also

Related Research Articles

In mathematics, one can often define a direct product of objects already known, giving a new one. This induces a structure on the Cartesian product of the underlying sets from that of the contributing objects. More abstractly, one talks about the product in category theory, which formalizes these notions.

Lambda calculus is a formal system in mathematical logic for expressing computation based on function abstraction and application using variable binding and substitution. It is a universal model of computation that can be used to simulate any Turing machine. It was introduced by the mathematician Alonzo Church in the 1930s as part of his research into the foundations of mathematics.

In mathematical logic, model theory is the study of the relationship between formal theories, and their models. The aspects investigated include the number and size of models of a theory, the relationship of different models to each other, and their interaction with the formal language itself. In particular, model theorists also investigate the sets that can be defined in a model of a theory, and the relationship of such definable sets to each other. As a separate discipline, model theory goes back to Alfred Tarski, who first used the term "Theory of Models" in publication in 1954. Since the 1970s, the subject has been shaped decisively by Saharon Shelah's stability theory.

In mathematics, a product is the result of multiplication, or an expression that identifies objects to be multiplied, called factors. For example, 21 is the product of 3 and 7, and is the product of and . When one factor is an integer, the product is called a multiple.

In mathematics and theoretical computer science, a type theory is the formal presentation of a specific type system. Type theory is the academic study of type systems.

In logic and proof theory, natural deduction is a kind of proof calculus in which logical reasoning is expressed by inference rules closely related to the "natural" way of reasoning. This contrasts with Hilbert-style systems, which instead use axioms as much as possible to express the logical laws of deductive reasoning.

A mathematical symbol is a figure or a combination of figures that is used to represent a mathematical object, an action on mathematical objects, a relation between mathematical objects, or for structuring the other symbols that occur in a formula. As formulas are entirely constituted with symbols of various types, many symbols are needed for expressing all mathematics.

In programming language theory and proof theory, the Curry–Howard correspondence is the direct relationship between computer programs and mathematical proofs.

The ultraproduct is a mathematical construction that appears mainly in abstract algebra and mathematical logic, in particular in model theory and set theory. An ultraproduct is a quotient of the direct product of a family of structures. All factors need to have the same signature. The ultrapower is the special case of this construction in which all factors are equal.

Intuitionistic type theory is a type theory and an alternative foundation of mathematics. Intuitionistic type theory was created by Per Martin-Löf, a Swedish mathematician and philosopher, who first published it in 1972. There are multiple versions of the type theory: Martin-Löf proposed both intensional and extensional variants of the theory and early impredicative versions, shown to be inconsistent by Girard's paradox, gave way to predicative versions. However, all versions keep the core design of constructive logic using dependent types.

In mathematics, particularly in operator theory and C*-algebra theory, the continuous functional calculus is a functional calculus which allows the application of a continuous function to normal elements of a C*-algebra.

In probability theory and related fields, Malliavin calculus is a set of mathematical techniques and ideas that extend the mathematical field of calculus of variations from deterministic functions to stochastic processes. In particular, it allows the computation of derivatives of random variables. Malliavin calculus is also called the stochastic calculus of variations. P. Malliavin first initiated the calculus on infinite dimensional space. Then, the significant contributors such as S. Kusuoka, D. Stroock, J-M. Bismut, Shinzo Watanabe, I. Shigekawa, and so on finally completed the foundations.

In functional analysis, a branch of mathematics, the Borel functional calculus is a functional calculus, which has particularly broad scope. Thus for instance if T is an operator, applying the squaring function ss2 to T yields the operator T2. Using the functional calculus for larger classes of functions, we can for example define rigorously the "square root" of the (negative) Laplacian operator −Δ or the exponential

<span class="mw-page-title-main">Lambda cube</span>

In mathematical logic and type theory, the λ-cube is a framework introduced by Henk Barendregt to investigate the different dimensions in which the calculus of constructions is a generalization of the simply typed λ-calculus. Each dimension of the cube corresponds to a new kind of dependency between terms and types. Here, "dependency" refers to the capacity of a term or type to bind a term or type. The respective dimensions of the λ-cube correspond to:

In logic, a logical framework provides a means to define a logic as a signature in a higher-order type theory in such a way that provability of a formula in the original logic reduces to a type inhabitation problem in the framework type theory. This approach has been used successfully for (interactive) automated theorem proving. The first logical framework was Automath; however, the name of the idea comes from the more widely known Edinburgh Logical Framework, LF. Several more recent proof tools like Isabelle are based on this idea. Unlike a direct embedding, the logical framework approach allows many logics to be embedded in the same type system.

The simply typed lambda calculus, a form of type theory, is a typed interpretation of the lambda calculus with only one type constructor that builds function types. It is the canonical and simplest example of a typed lambda calculus. The simply typed lambda calculus was originally introduced by Alonzo Church in 1940 as an attempt to avoid paradoxical use of the untyped lambda calculus.

Axiomatic constructive set theory is an approach to mathematical constructivism following the program of axiomatic set theory. The same first-order language with "" and "" of classical set theory is usually used, so this is not to be confused with a constructive types approach. On the other hand, some constructive theories are indeed motivated by their interpretability in type theories.

In the branches of mathematical logic known as proof theory and type theory, a pure type system (PTS), previously known as a generalized type system (GTS), is a form of typed lambda calculus that allows an arbitrary number of sorts and dependencies between any of these. The framework can be seen as a generalisation of Barendregt's lambda cube, in the sense that all corners of the cube can be represented as instances of a PTS with just two sorts. In fact, Barendregt (1991) framed his cube in this setting. Pure type systems may obscure the distinction between types and terms and collapse the type hierarchy, as is the case with the calculus of constructions, but this is not generally the case, e.g. the simply typed lambda calculus allows only terms to depend on terms.

<span class="mw-page-title-main">Cartesian product</span> Mathematical set formed from two given sets

In mathematics, specifically set theory, the Cartesian product of two sets A and B, denoted A × B, is the set of all ordered pairs (a, b) where a is in A and b is in B. In terms of set-builder notation, that is

In intuitionistic type theory (ITT), a discipline within mathematical logic, induction-recursion is a feature for simultaneously declaring a type and function on that type. It allows the creation of larger types, such as universes, than inductive types. The types created still remain predicative inside ITT.

References

  1. Hofmann, Martin (1995), Extensional concepts in intensional type theory (PDF)
  2. Sørensen, Morten Heine B.; Urzyczyn, Pawel (1998), Lectures on the Curry-Howard Isomorphism, CiteSeerX   10.1.1.17.7385
  3. Bove, Ana; Dybjer, Peter (2008), Dependent Types at Work (PDF), Chalmers University of Technology
  4. 1 2 "ΠΣ: Dependent Types without the Sugar" (PDF).
  5. "GNAT Community download page".
  6. "§3.2.4 Subtype Predicates". Ada Reference Manual (2012 ed.).
  7. "Agda download page".
  8. "Agda Ring Solver".
  9. 1 2 "Announce: Agda 2.2.8". Archived from the original on 2011-07-18. Retrieved 2010-09-28.
  10. "Agda 2.6.0 changelog".
  11. "ATS2 downloads".
  12. "email from ATS inventor Hongwei Xi".
  13. Xi, Hongwei (March 2017). "Applied Type System: An Approach to Practical Programming with Theorem-Proving" (PDF). arXiv: 1703.08683 .
  14. "Coq CHANGES in Subversion repository".
  15. "Introduction of SProp in Coq 8.10".
  16. "F* changes on GitHub". GitHub .
  17. "F* v0.9.5.0 release notes on GitHub". GitHub .
  18. "Guru SVN".
  19. 1 2 Aaron Stump (6 April 2009). "Verified Programming in Guru" (PDF). Archived from the original (PDF) on 29 December 2009. Retrieved 28 September 2010.
  20. Petcher, Adam (May 2008). Deciding Joinability Modulo Ground Equations in Operational Type Theory (PDF) (MSc). Washington University. Retrieved 14 October 2010.
  21. "Idris git repository". GitHub . 17 May 2022.
  22. Brady, Edwin. "Idris, a language with dependent types — extended abstract" (PDF). CiteSeerX   10.1.1.150.9442 .
  23. 1 2 Brady, Edwin. "How does Idris compare to other dependently-typed programming languages?".
  24. "Matita SVN". Archived from the original on 2006-05-08. Retrieved 2010-09-29.

Further reading