Descriptive complexity theory

Last updated January 14, 2024

Descriptive complexity is a branch of computational complexity theory and of finite model theory that characterizes complexity classes by the type of logic needed to express the languages in them. For example, PH, the union of all complexity classes in the polynomial hierarchy, is precisely the class of languages expressible by statements of second-order logic. This connection between complexity and the logic of finite structures allows results to be transferred easily from one area to the other, facilitating new proof methods and providing additional evidence that the main complexity classes are somehow "natural" and not tied to the specific abstract machines used to define them.

The first main result of descriptive complexity was Fagin's theorem, shown by Ronald Fagin in 1974. It established that NP is precisely the set of languages expressible by sentences of existential second-order logic; that is, second-order logic excluding universal quantification over relations, functions, and subsets. Many other classes were later characterized in such a manner.

The setting

When we use the logic formalism to describe a computational problem, the input is a finite structure, and the elements of that structure are the domain of discourse. Usually the input is either a string (of bits or over an alphabet) and the elements of the logical structure represent positions of the string, or the input is a graph and the elements of the logical structure represent its vertices. The length of the input will be measured by the size of the respective structure. Whatever the structure is, we can assume that there are relations that can be tested, for example " $E(x,y)$ is true if and only if there is an edge from $x$ to $y$ " (in case of the structure being a graph), or " $P(n)$ is true if and only if the $n$ th letter of the string is 1." These relations are the predicates for the first-order logic system. We also have constants, which are special elements of the respective structure, for example if we want to check reachability in a graph, we will have to choose two constants s (start) and t (terminal).

In descriptive complexity theory we often assume that there is a total order over the elements and that we can check equality between elements. This lets us consider elements as numbers: the element $x$ represents the number $n$ if and only if there are $(n-1)$ elements $y$ with $y<x$ . Thanks to this we also may have the primitive predicate "bit", where $bit(x,k)$ is true if only the $k$ th bit of the binary expansion of $x$ is 1. (We can replace addition and multiplication by ternary relations such that $plus(x,y,z)$ is true if and only if $x+y=z$ and $times(x,y,z)$ is true if and only if $x*y=z$ ).

Overview of characterisations of complexity classes

If we restrict ourselves to ordered structures with a successor relation and basic arithmetical predicates, then we get the following characterisations:

First-order logic defines the class AC⁰, the languages recognized by polynomial-size circuits of bounded depth, which equals the languages recognized by a concurrent random access machine in constant time.^[1]
First-order logic augmented with symmetric or deterministic transitive closure operators yield L, problems solvable in logarithmic space.^[2]
First-order logic with a transitive closure operator yields NL, the problems solvable in nondeterministic logarithmic space.^[3]
First-order logic with a least fixed point operator gives P, the problems solvable in deterministic polynomial time.^[3]
Existential second-order logic yields NP.^[3]
Universal second-order logic (excluding existential second-order quantification) yields co-NP.^[4]
Second-order logic corresponds to the polynomial hierarchy PH.^[3]
Second-order logic with a transitive closure (commutative or not) yields PSPACE, the problems solvable in polynomial space.^[5]
Second-order logic with a least fixed point operator gives EXPTIME, the problems solvable in exponential time.^[6]
HO, the complexity class defined by higher-order logic, is equal to ELEMENTARY ^[7]

Sub-polynomial time

FO without any operators

In circuit complexity, first-order logic with arbitrary predicates can be shown to be equal to AC⁰, the first class in the AC hierarchy. Indeed, there is a natural translation from FO's symbols to nodes of circuits, with $\forall ,\exists$ being $\land$ and $\lor$ of size $n$ . First-order logic in a signature with arithmetical predicates characterises the restriction of the AC⁰ family of circuits to those constructible in alternating logarithmic time.^[1] First-order logic in a signature with only the order relation corresponds to the set of star-free languages.^[8]^[9]

Transitive closure logic

First-order logic gains substantially in expressive power when it is augmented with an operator that computes the transitive closure of a binary relation. The resulting transitive closure logic is known to characterise non-deterministic logarithmic space (NL) on ordered structures. This was used by Immerman to show that NL is closed under complement (i. e. that NL = co-NL).^[10]

When restricting the transitive closure operator to deterministic transitive closure, the resulting logic exactly characterises logarithmic space on ordered structures.

Second-order Krom formulae

On structures that have a successor function, NL can also be characterised by second-order Krom formulae.

SO-Krom is the set of boolean queries definable with second-order formulae in conjunctive normal form such that the first-order quantifiers are universal and the quantifier-free part of the formula is in Krom form, which means that the first-order formula is a conjunction of disjunctions, and in each "disjunction" there are at most two variables. Every second-order Krom formula is equivalent to an existential second-order Krom formula.

SO-Krom characterises NL on structures with a successor function.^[11]

Polynomial time

On ordered structures, first-order least fixed-point logic captures PTIME:

First-order least fixed-point logic

FO[LFP] is the extension of first-order logic by a least fixed-point operator, which expresses the fixed-point of a monotone expression. This augments first-order logic with the ability to express recursion. The Immerman–Vardi theorem, shown independently by Immerman and Vardi, shows that FO[LFP] characterises PTIME on ordered structures.^[12]^[13]

As of 2022, it is still open whether there is a natural logic characterising PTIME on unordered structures.

The Abiteboul–Vianu theorem states that FO[LFP]=FO[PFP] on all structures if and only if FO[LFP]=FO[PFP]; hence if and only if P=PSPACE. This result has been extended to other fixpoints.^[14]

Second-order Horn formulae

In the presence of a successor function, PTIME can also be characterised by second-order Horn formulae.

SO-Horn is the set of boolean queries definable with SO formulae in disjunctive normal form such that the first-order quantifiers are all universal and the quantifier-free part of the formula is in Horn form, which means that it is a big AND of OR, and in each "OR" every variable except possibly one are negated.

This class is equal to P on structures with a successor function.^[15]

Those formulae can be transformed to prenex formulas in existential second-order Horn logic.^[11]

Non-deterministic polynomial time

Fagin's theorem

Ronald Fagin's 1974 proof that the complexity class NP was characterised exactly by those classes of structures axiomatizable in existential second-order logic was the starting point of descriptive complexity theory.^[4]^[16]

Since the complement of an existential formula is a universal formula, it follows immediately that co-NP is characterized by universal second-order logic.^[4]

SO, unrestricted second-order logic, is equal to the Polynomial hierarchy PH. More precisely, we have the following generalisation of Fagin's theorem: The set of formulae in prenex normal form where existential and universal quantifiers of second order alternate k times characterise the kth level of the polynomial hierarchy.^[17]

Unlike most other characterisations of complexity classes, Fagin's theorem and its generalisation do not presuppose a total ordering on the structures. This is because existential second-order logic is itself sufficiently expressive to refer to the possible total orders on a structure using second-order variables.^[18]

Beyond NP

Partial fixed point is PSPACE

The class of all problems computable in polynomial space, PSPACE, can be characterised by augmenting first-order logic with a more expressive partial fixed-point operator.

Partial fixed-point logic, FO[PFP], is the extension of first-order logic with a partial fixed-point operator, which expresses the fixed-point of a formula if there is one and returns 'false' otherwise.

Partial fixed-point logic characterises PSPACE on ordered structures.^[19]

Transitive closure is PSPACE

Second-order logic can be extended by a transitive closure operator in the same way as first-order logic, resulting in SO[TC]. The TC operator can now also take second-order variables as argument. SO[TC] characterises PSPACE. Since ordering can be referenced in second-order logic, this characterisation does not presuppose ordered structures.^[20]

Elementary functions

The time complexity class ELEMENTARY of elementary functions can be characterised by HO, the complexity class of structures that can be recognized by formulas of higher-order logic. Higher-order logic is an extension of first-order logic and second-order logic with higher-order quantifiers. There is a relation between the $i$ th order and non-deterministic algorithms the time of which is bounded by $i-1$ levels of exponentials.^[21]

Definition

We define higher-order variables. A variable of order $i>1$ has an arity $k$ and represents any set of $k$ -tuples of elements of order $i-1$ . They are usually written in upper-case and with a natural number as exponent to indicate the order. Higher-order logic is the set of first-order formulae where we add quantification over higher-order variables; hence we will use the terms defined in the FO article without defining them again.

HO $^{i}$ is the set of formulae with variables of order at most $i$ . HO $_{j}^{i}$ is the subset of formulae of the form $\phi =\exists {\overline {X_{1}^{i}}}\forall {\overline {X_{2}^{i}}}\dots Q{\overline {X_{j}^{i}}}\psi$ , where $Q$ is a quantifier and $Q{\overline {X^{i}}}$ means that ${\overline {X^{i}}}$ is a tuple of variable of order $i$ with the same quantification. So HO $_{j}^{i}$ is the set of formulae with $j$ alternations of quantifiers of order $i$ , beginning with $\exists$ , followed by a formula of order $i-1$ .

Using the standard notation of the tetration, $\exp _{2}^{0}(x)=x$ and $\exp _{2}^{i+1}(x)=2^{\exp _{2}^{i}(x)}$ . $\exp _{2}^{i+1}(x)=2^{2^{2^{2^{\dots ^{2^{x}}}}}}$ with $i$ times $2$

Normal form

Every formula of order $i$ th is equivalent to a formula in prenex normal form, where we first write quantification over variable of $i$ th order and then a formula of order $i-1$ in normal form.

Relation to complexity classes

HO is equal to the class ELEMENTARY of elementary functions. To be more precise, ${\mathsf {HO}}_{0}^{i}={\mathsf {NTIME}}(\exp _{2}^{i-2}(n^{O(1)}))$ , meaning a tower of $(i-2)$ 2s, ending with $n^{c}$ , where $c$ is a constant. A special case of this is that $\exists {\mathsf {SO}}={\mathsf {HO}}_{0}^{2}={\mathsf {NTIME}}(n^{O(1)})={\color {Blue}{\mathsf {NP}}}$ , which is exactly Fagin's theorem. Using oracle machines in the polynomial hierarchy, ${\mathsf {HO}}_{j}^{i}={\color {Blue}{\mathsf {NTIME}}}(\exp _{2}^{i-2}(n^{O(1)})^{\Sigma _{j}^{\mathsf {P}}})$

Notes

1 2 Immerman 1999, p. 86
↑ Grädel, Erich; Schalthöfer, Svenja (2019). Choiceless Logarithmic Space. Leibniz International Proceedings in Informatics (LIPIcs). Vol. 138. pp. 31:1–31:15. doi:10.4230/LIPICS.MFCS.2019.31. ISBN 9783959771177.
1 2 3 4 Immerman 1999, p. 242
1 2 3 Fagin, Ron (1974). "Generalized first-order spectra and polynomial-time recognizable sets". In Karp, Richard (ed.). Complexity of Computation. pp. 43–73.
↑ Immerman 1999, p. 243
↑ Abiteboul, Serge; Vardi, Moshe Y.; Vianu, Victor (1997-01-15). "Fixpoint logics, relational machines, and computational complexity". Journal of the ACM . 44 (1): 30–56. doi: 10.1145/256292.256295 . ISSN 0004-5411. S2CID 11338470.
↑ Hella, Lauri; Turull-Torres, José María (2006). "Computing queries with higher-order logics". Theoretical Computer Science . Essex, UK: Elsevier Science Publishers Ltd. 355 (2): 197–214. doi: 10.1016/j.tcs.2006.01.009 . ISSN 0304-3975.
↑ Robert., McNaughton (1971). Counter-free automata. M.I.T. Press. ISBN 0-262-13076-9. OCLC 651199926.
↑ Immerman 1999, p. 22
↑ Immerman, Neil (1988). "Nondeterministic Space is Closed under Complementation". SIAM Journal on Computing . 17 (5): 935–938. doi:10.1137/0217058. ISSN 0097-5397.
1 2 Immerman 1999, p. 153–4
↑ Immerman, Neil (1986). "Relational queries computable in polynomial time". Information and Control . 68 (1–3): 86–104. doi: 10.1016/s0019-9958(86)80029-8 .
↑ Vardi, Moshe Y. (1982). "The complexity of relational query languages (Extended Abstract)". Proceedings of the fourteenth annual ACM symposium on Theory of computing - STOC '82. STOC '82. New York, NY, USA: ACM. pp. 137–146. CiteSeerX 10.1.1.331.6045 . doi:10.1145/800070.802186. ISBN 978-0897910705. S2CID 7869248.
↑ Serge Abiteboul, Moshe Y. Vardi, Victor Vianu: Fixpoint logics, relational machines, and computational complexity Journal of the ACM archive, Volume 44, Issue 1 (January 1997), Pages: 30-56, ISSN 0004-5411
↑ Grädel, Erich (1992-07-13). "Capturing complexity classes by fragments of second-order logic". Theoretical Computer Science. 101 (1): 35–57. doi: 10.1016/0304-3975(92)90149-A . ISSN 0304-3975.
↑ Immerman 1999, p. 115
↑ Immerman 1999, p. 121
↑ Immerman 1999, p. 181
↑ Abiteboul, S.; Vianu, V. (1989). "Fixpoint extensions of first-order logic and datalog-like languages". [1989] Proceedings. Fourth Annual Symposium on Logic in Computer Science. IEEE Comput. Soc. Press. pp. 71–79. doi:10.1109/lics.1989.39160. ISBN 0-8186-1954-6. S2CID 206437693.
↑ Harel, D.; Peleg, D. (1984-01-01). "On static logics, dynamic logics, and complexity classes". Information and Control. 60 (1): 86–102. doi: 10.1016/S0019-9958(84)80023-6 . ISSN 0019-9958.
↑ Hella, Lauri; Turull-Torres, José María (2006). "Computing queries with higher-order logics". Theoretical Computer Science. Essex, UK: Elsevier Science Publishers Ltd. 355 (2): 197–214. doi: 10.1016/j.tcs.2006.01.009 . ISSN 0304-3975.

Related Research Articles

In computational complexity theory, NP is a complexity class used to classify decision problems. NP is the set of decision problems for which the problem instances, where the answer is "yes", have proofs verifiable in polynomial time by a deterministic Turing machine, or alternatively the set of problems that can be solved in polynomial time by a nondeterministic Turing machine.

In logic and mathematics, second-order logic is an extension of first-order logic, which itself is an extension of propositional logic. Second-order logic is in turn extended by higher-order logic and type theory.

In computational complexity theory, P, also known as PTIME or DTIME(n^O(1)), is a fundamental complexity class. It contains all decision problems that can be solved by a deterministic Turing machine using a polynomial amount of computation time, or polynomial time.

In computational complexity theory, the polynomial hierarchy is a hierarchy of complexity classes that generalize the classes NP and co-NP. Each class in the hierarchy is contained within PSPACE. The hierarchy can be defined using oracle machines or alternating Turing machines. It is a resource-bounded counterpart to the arithmetical hierarchy and analytical hierarchy from mathematical logic. The union of the classes in the hierarchy is denoted PH.

In computational complexity theory, the complexity class ELEMENTARY of elementary recursive functions is the union of the classes

In computational complexity theory, an alternating Turing machine (ATM) is a non-deterministic Turing machine (NTM) with a rule for accepting computations that generalizes the rules used in the definition of the complexity classes NP and co-NP. The concept of an ATM was set forth by Chandra and Stockmeyer and independently by Kozen in 1976, with a joint journal publication in 1981.

In computational complexity theory, NL is the complexity class containing decision problems that can be solved by a nondeterministic Turing machine using a logarithmic amount of memory space.

Finite model theory is a subarea of model theory. Model theory is the branch of logic which deals with the relation between a formal language (syntax) and its interpretations (semantics). Finite model theory is a restriction of model theory to interpretations on finite structures, which have a finite universe.

In complexity theory, the Karp–Lipton theorem states that if the Boolean satisfiability problem (SAT) can be solved by Boolean circuits with a polynomial number of logic gates, then

Fagin's theorem is the oldest result of descriptive complexity theory, a branch of computational complexity theory that characterizes complexity classes in terms of logic-based descriptions of their problems rather than by the behavior of algorithms for solving those problems. The theorem states that the set of all properties expressible in existential second-order logic is precisely the complexity class NP.

In mathematics and computer science, the BIT predicate, sometimes written $,$ is a predicate that tests whether the $th$ bit of the number is 1, when $is written as a binary number. Its mathematical applications include modeling the membership relation of hereditarily finite sets, and defining the adjacency relation of the Rado graph. In computer science, it is used for efficient representations of set data structures using bit vectors, in defining the private information retrieval problem from communication complexity, and in descriptive complexity theory to formulate logical descriptions of complexity classes.$

In database theory, a conjunctive query is a restricted form of first-order queries using the logical conjunction operator. Many first-order queries can be written as conjunctive queries. In particular, a large part of queries issued on relational databases can be expressed in this way. Conjunctive queries also have a number of desirable theoretical properties that larger classes of queries do not share.

In mathematical logic, monadic second-order logic (MSO) is the fragment of second-order logic where the second-order quantification is limited to quantification over sets. It is particularly important in the logic of graphs, because of Courcelle's theorem, which provides algorithms for evaluating monadic second-order formulas over graphs of bounded treewidth. It is also of fundamental importance in automata theory, where the Büchi–Elgot–Trakhtenbrot theorem gives a logical characterization of the regular languages.

In computational complexity theory, the language TQBF is a formal language consisting of the true quantified Boolean formulas. A (fully) quantified Boolean formula is a formula in quantified propositional logic where every variable is quantified, using either existential or universal quantifiers, at the beginning of the sentence. Such a formula is equivalent to either true or false. If such a formula evaluates to true, then that formula is in the language TQBF. It is also known as QSAT.

In proof theory, a branch of mathematical logic, elementary function arithmetic (EFA), also called elementary arithmetic and exponential function arithmetic, is the system of arithmetic with the usual elementary properties of 0, 1, +, ×, $, together with induction for formulas with bounded quantifiers.$

In mathematical logic, the spectrum of a sentence is the set of natural numbers occurring as the size of a finite model in which a given sentence is true. By a result in descriptive complexity, a set of natural numbers is a spectrum if and only if it can be recognized in non-deterministic exponential time.

In the mathematical fields of graph theory and finite model theory, the logic of graphs deals with formal specifications of graph properties using sentences of mathematical logic. There are several variations in the types of logical operation that can be used in these sentences. The first-order logic of graphs concerns sentences in which the variables and predicates concern individual vertices and edges of a graph, while monadic second-order graph logic allows quantification over sets of vertices or edges. Logics based on least fixed point operators allow more general predicates over tuples of vertices, but these predicates can only be constructed through fixed-point operators, restricting their power.

In mathematical logic, fixed-point logics are extensions of classical predicate logic that have been introduced to express recursion. Their development has been motivated by descriptive complexity theory and their relationship to database query languages, in particular to Datalog.

Bounded arithmetic is a collective name for a family of weak subtheories of Peano arithmetic. Such theories are typically obtained by requiring that quantifiers be bounded in the induction axiom or equivalent postulates. The main purpose is to characterize one or another class of computational complexity in the sense that a function is provably total if and only if it belongs to a given complexity class. Further, theories of bounded arithmetic present uniform counterparts to standard propositional proof systems such as Frege system and are, in particular, useful for constructing polynomial-size proofs in these systems. The characterization of standard complexity classes and correspondence to propositional proof systems allows to interpret theories of bounded arithmetic as formal systems capturing various levels of feasible reasoning.

Descriptive Complexity is a book in mathematical logic and computational complexity theory by Neil Immerman. It concerns descriptive complexity theory, an area in which the expressibility of mathematical properties using different types of logic is shown to be equivalent to their computability in different types of resource-bounded models of computation. It was published in 1999 by Springer-Verlag in their book series Graduate Texts in Computer Science.

References

Immerman, Neil (1999). Descriptive complexity. Springer. ISBN 0-387-98600-6. OCLC 901297152.

External links

Neil Immerman's descriptive complexity page, including a diagram

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[Immerman_1999,_p._86-1] 1 2 Immerman 1999, p. 86

[2] Grädel, Erich; Schalthöfer, Svenja (2019). Choiceless Logarithmic Space. Leibniz International Proceedings in Informatics (LIPIcs). Vol. 138. pp. 31:1–31:15. doi:10.4230/LIPICS.MFCS.2019.31. ISBN 9783959771177.

[:0-3] 1 2 3 4 Immerman 1999, p. 242

[:1-4] 1 2 3 Fagin, Ron (1974). "Generalized first-order spectra and polynomial-time recognizable sets". In Karp, Richard (ed.). Complexity of Computation. pp. 43–73.

[5] Immerman 1999, p. 243

[6] Abiteboul, Serge; Vardi, Moshe Y.; Vianu, Victor (1997-01-15). "Fixpoint logics, relational machines, and computational complexity". Journal of the ACM . 44 (1): 30–56. doi: 10.1145/256292.256295 . ISSN 0004-5411. S2CID 11338470.

[7] Hella, Lauri; Turull-Torres, José María (2006). "Computing queries with higher-order logics". Theoretical Computer Science . Essex, UK: Elsevier Science Publishers Ltd. 355 (2): 197–214. doi: 10.1016/j.tcs.2006.01.009 . ISSN 0304-3975.

[8] Robert., McNaughton (1971). Counter-free automata. M.I.T. Press. ISBN 0-262-13076-9. OCLC 651199926.

[9] Immerman 1999, p. 22

[10] Immerman, Neil (1988). "Nondeterministic Space is Closed under Complementation". SIAM Journal on Computing . 17 (5): 935–938. doi:10.1137/0217058. ISSN 0097-5397.

[:2-11] 1 2 Immerman 1999, p. 153–4

[12] Immerman, Neil (1986). "Relational queries computable in polynomial time". Information and Control . 68 (1–3): 86–104. doi: 10.1016/s0019-9958(86)80029-8 .

[13] Vardi, Moshe Y. (1982). "The complexity of relational query languages (Extended Abstract)". Proceedings of the fourteenth annual ACM symposium on Theory of computing - STOC '82. STOC '82. New York, NY, USA: ACM. pp. 137–146. CiteSeerX 10.1.1.331.6045 . doi:10.1145/800070.802186. ISBN 978-0897910705. S2CID 7869248.

[avv-14] Serge Abiteboul, Moshe Y. Vardi, Victor Vianu: Fixpoint logics, relational machines, and computational complexity Journal of the ACM archive, Volume 44, Issue 1 (January 1997), Pages: 30-56, ISSN 0004-5411

[15] Grädel, Erich (1992-07-13). "Capturing complexity classes by fragments of second-order logic". Theoretical Computer Science. 101 (1): 35–57. doi: 10.1016/0304-3975(92)90149-A . ISSN 0304-3975.

[16] Immerman 1999, p. 115

[17] Immerman 1999, p. 121

[18] Immerman 1999, p. 181

[19] Abiteboul, S.; Vianu, V. (1989). "Fixpoint extensions of first-order logic and datalog-like languages". [1989] Proceedings. Fourth Annual Symposium on Logic in Computer Science. IEEE Comput. Soc. Press. pp. 71–79. doi:10.1109/lics.1989.39160. ISBN 0-8186-1954-6. S2CID 206437693.

[20] Harel, D.; Peleg, D. (1984-01-01). "On static logics, dynamic logics, and complexity classes". Information and Control. 60 (1): 86–102. doi: 10.1016/S0019-9958(84)80023-6 . ISSN 0019-9958.

[21] Hella, Lauri; Turull-Torres, José María (2006). "Computing queries with higher-order logics". Theoretical Computer Science. Essex, UK: Elsevier Science Publishers Ltd. 355 (2): 197–214. doi: 10.1016/j.tcs.2006.01.009 . ISSN 0304-3975.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]