Word problem (mathematics)

Last updated

In computational mathematics, a word problem is the problem of deciding whether two given expressions are equivalent with respect to a set of rewriting identities. A prototypical example is the word problem for groups, but there are many other instances as well. A deep result of computational theory is that answering this question is in many important cases undecidable. [1]

Contents

Background and motivation

In computer algebra one often wishes to encode mathematical expressions using an expression tree. But there are often multiple equivalent expression trees. The question naturally arises of whether there is an algorithm which, given as input two expressions, decides whether they represent the same element. Such an algorithm is called a solution to the word problem. For example, imagine that are symbols representing real numbers - then a relevant solution to the word problem would, given the input , produce the output EQUAL, and similarly produce NOT_EQUAL from .

The most direct solution to a word problem takes the form of a normal form theorem and algorithm which maps every element in an equivalence class of expressions to a single encoding known as the normal form - the word problem is then solved by comparing these normal forms via syntactic equality. [1] For example one might decide that is the normal form of , , and , and devise a transformation system to rewrite those expressions to that form, in the process proving that all equivalent expressions will be rewritten to the same normal form. [2] But not all solutions to the word problem use a normal form theorem - there are algebraic properties which indirectly imply the existence of an algorithm. [1]

While the word problem asks whether two terms containing constants are equal, a proper extension of the word problem known as the unification problem asks whether two terms containing variables have instances that are equal, or in other words whether the equation has any solutions. As a common example, is a word problem in the integer group ℤ, while is a unification problem in the same group; since the former terms happen to be equal in ℤ, the latter problem has the substitution as a solution.

History

One of the most deeply studied cases of the word problem is in the theory of semigroups and groups. A timeline of papers relevant to the Novikov-Boone theorem is as follows: [3] [4]

The word problem for semi-Thue systems

The accessibility problem for string rewriting systems (semi-Thue systems or semigroups) can be stated as follows: Given a semi-Thue system and two words (strings) , can be transformed into by applying rules from ? Note that the rewriting here is one-way. The word problem is the accessibility problem for symmetric rewrite relations, i.e. Thue systems. [27]

The accessibility and word problems are undecidable, i.e. there is no general algorithm for solving this problem. [28] This even holds if we limit the systems to have finite presentations, i.e. a finite set of symbols and a finite set of relations on those symbols. [27] Even the word problem restricted to ground terms is not decidable for certain finitely presented semigroups. [29] [30]

The word problem for groups

Given a presentation for a group G, the word problem is the algorithmic problem of deciding, given as input two words in S, whether they represent the same element of G. The word problem is one of three algorithmic problems for groups proposed by Max Dehn in 1911. It was shown by Pyotr Novikov in 1955 that there exists a finitely presented group G such that the word problem for G is undecidable. [31]

The word problem in combinatorial calculus and lambda calculus

One of the earliest proofs that a word problem is undecidable was for combinatory logic: when are two strings of combinators equivalent? Because combinators encode all possible Turing machines, and the equivalence of two Turing machines is undecidable, it follows that the equivalence of two strings of combinators is undecidable. Alonzo Church observed this in 1936. [32]

Likewise, one has essentially the same problem in (untyped) lambda calculus: given two distinct lambda expressions, there is no algorithm which can discern whether they are equivalent or not; equivalence is undecidable. For several typed variants of the lambda calculus, equivalence is decidable by comparison of normal forms.

The word problem for abstract rewriting systems

Solving the word problem: deciding if
x
-
*
y
{\displaystyle x{\stackrel {*}{\leftrightarrow }}y}
usually requires heuristic search (red, green), while deciding
x
|=
y
|
{\displaystyle x\downarrow =y\downarrow }
is straightforward (grey). Solving the word problem without and with completion svg.svg
Solving the word problem: deciding if usually requires heuristic search (red, green), while deciding is straightforward (grey).

The word problem for an abstract rewriting system (ARS) is quite succinct: given objects x and y are they equivalent under ? [29] The word problem for an ARS is undecidable in general. However, there is a computable solution for the word problem in the specific case where every object reduces to a unique normal form in a finite number of steps (i.e. the system is convergent): two objects are equivalent under if and only if they reduce to the same normal form. [33] The Knuth-Bendix completion algorithm can be used to transform a set of equations into a convergent term rewriting system.

The word problem in universal algebra

In universal algebra one studies algebraic structures consisting of a generating set A, a collection of operations on A of finite arity, and a finite set of identities that these operations must satisfy. The word problem for an algebra is then to determine, given two expressions (words) involving the generators and operations, whether they represent the same element of the algebra modulo the identities. The word problems for groups and semigroups can be phrased as word problems for algebras. [1]

The word problem on free Heyting algebras is difficult. [34] The only known results are that the free Heyting algebra on one generator is infinite, and that the free complete Heyting algebra on one generator exists (and has one more element than the free Heyting algebra).

The word problem for free lattices

Example computation of xz ~ xz∧(xy)
xz∧(xy)~xz
by 5.sincexz~xz
by 1.sincexz=xz
 
 
xz~xz∧(xy)
by 7.sincexz~xzandxz~xy
by 1.sincexz=xzby 6.sincexz~x
by 5.sincex~x
by 1.sincex=x

The word problem on free lattices and more generally free bounded lattices has a decidable solution. Bounded lattices are algebraic structures with the two binary operations ∨ and ∧ and the two constants (nullary operations) 0 and 1. The set of all well-formed expressions that can be formulated using these operations on elements from a given set of generators X will be called W(X). This set of words contains many expressions that turn out to denote equal values in every lattice. For example, if a is some element of X, then a1 = 1 and a1 = a. The word problem for free bounded lattices is the problem of determining which of these elements of W(X) denote the same element in the free bounded lattice FX, and hence in every bounded lattice.

The word problem may be resolved as follows. A relation ≤~ on W(X) may be defined inductively by setting w~v if and only if one of the following holds:

  1.  w = v (this can be restricted to the case where w and v are elements of X),
  2.  w = 0,
  3.  v = 1,
  4.  w = w1w2 and both w1~v and w2~v hold,
  5.  w = w1w2 and either w1~v or w2~v holds,
  6.  v = v1v2 and either w~v1 or w~v2 holds,
  7.  v = v1v2 and both w~v1 and w~v2 hold.

This defines a preorder~ on W(X), so an equivalence relation can be defined by w ~ v when w~v and v~w. One may then show that the partially ordered quotient set W(X)/~ is the free bounded lattice FX. [35] [36] The equivalence classes of W(X)/~ are the sets of all words w and v with w~v and v~w. Two well-formed words v and w in W(X) denote the same value in every bounded lattice if and only if w~v and v~w; the latter conditions can be effectively decided using the above inductive definition. The table shows an example computation to show that the words xz and xz∧(xy) denote the same value in every bounded lattice. The case of lattices that are not bounded is treated similarly, omitting rules 2 and 3 in the above construction of ≤~.

Example: A term rewriting system to decide the word problem in the free group

Bläsius and Bürckert [37] demonstrate the Knuth–Bendix algorithm on an axiom set for groups. The algorithm yields a confluent and noetherian term rewrite system that transforms every term into a unique normal form. [38] The rewrite rules are numbered incontiguous since some rules became redundant and were deleted during the algorithm run. The equality of two terms follows from the axioms if and only if both terms are transformed into literally the same normal form term. For example, the terms

, and

share the same normal form, viz. ; therefore both terms are equal in every group. As another example, the term and has the normal form and , respectively. Since the normal forms are literally different, the original terms cannot be equal in every group. In fact, they are usually different in non-abelian groups.

Group axioms used in Knuth–Bendix completion
A1
A2
A3  
Term rewrite system obtained from Knuth–Bendix completion
R1
R2
R3
R4
R8
R11
R12
R13
R14
R17  

See also

Related Research Articles

<span class="mw-page-title-main">Context-free grammar</span> Type of formal grammar

In formal language theory, a context-free grammar (CFG) is a formal grammar whose production rules can be applied to a nonterminal symbol regardless of its context. In particular, in a context-free grammar, each production rule is of the form

<span class="mw-page-title-main">Formal language</span> Sequence of words formed by specific rules

In logic, mathematics, computer science, and linguistics, a formal language consists of words whose letters are taken from an alphabet and are well-formed according to a specific set of rules.

<span class="mw-page-title-main">Semigroup</span> Algebraic structure consisting of a set with an associative binary operation

In mathematics, a semigroup is an algebraic structure consisting of a set together with an associative internal binary operation on it.

In mathematics, especially in the area of abstract algebra known as combinatorial group theory, the word problem for a finitely generated group G is the algorithmic problem of deciding whether two words in the generators represent the same element. More precisely, if A is a finite set of generators for G then the word problem is the membership problem for the formal language of all words in A and a formal set of inverses that map to the identity under the natural map from the free monoid with involution on A to the group G. If B is another finite generating set for G, then the word problem over the generating set B is equivalent to the word problem over the generating set A. Thus one can speak unambiguously of the decidability of the word problem for the finitely generated group G.

In logic and computer science, unification is an algorithmic process of solving equations between symbolic expressions.

The Post correspondence problem is an undecidable decision problem that was introduced by Emil Post in 1946. Because it is simpler than the halting problem and the Entscheidungsproblem it is often used in proofs of undecidability.

<span class="mw-page-title-main">Semiring</span> Algebraic ring that need not have additive negative elements

In abstract algebra, a semiring is an algebraic structure similar to a ring, but without the requirement that each element must have an additive inverse.

In mathematics, computer science, and logic, rewriting covers a wide range of methods of replacing subterms of a formula with other terms. Such methods may be achieved by rewriting systems. In their most basic form, they consist of a set of objects, plus relations on how to transform those objects.

In computer science and mathematical logic the Turing degree or degree of unsolvability of a set of natural numbers measures the level of algorithmic unsolvability of the set.

In mathematics, a Thue equation is a Diophantine equation of the form

In computability theory, a Turing reduction from a decision problem to a decision problem is an oracle machine which decides problem given an oracle for . It can be understood as an algorithm that could be used to solve if it had available to it a subroutine for solving B. The concept can be analogously applied to function problems.

In theoretical computer science and mathematical logic a string rewriting system (SRS), historically called a semi-Thue system, is a rewriting system over strings from a alphabet. Given a binary relation between fixed strings over the alphabet, called rewrite rules, denoted by , an SRS extends the rewriting relation to all strings in which the left- and right-hand side of the rules appear as substrings, that is , where , , , and are strings.

In mathematics, a proof of impossibility is a proof that demonstrates that a particular problem cannot be solved as described in the claim, or that a particular set of problems cannot be solved in general. Such a case is also known as a negative proof, proof of an impossibility theorem, or negative result. Proofs of impossibility often are the resolutions to decades or centuries of work attempting to find a solution, eventually proving that there is no solution. Proving that something is impossible is usually much harder than the opposite task, as it is often necessary to develop a proof that works in general, rather than to just show a particular example. Impossibility theorems are usually expressible as negative existential propositions or universal propositions in logic.

In mathematics, an automatic semigroup is a finitely generated semigroup equipped with several regular languages over an alphabet representing a generating set. One of these languages determines "canonical forms" for the elements of the semigroup, the other languages determine if two canonical forms represent elements that differ by multiplication by a generator.

In the mathematical subject of group theory, small cancellation theory studies groups given by group presentations satisfying small cancellation conditions, that is where defining relations have "small overlaps" with each other. Small cancellation conditions imply algebraic, geometric and algorithmic properties of the group. Finitely presented groups satisfying sufficiently strong small cancellation conditions are word hyperbolic and have word problem solvable by Dehn's algorithm. Small cancellation methods are also used for constructing Tarski monsters, and for solutions of Burnside's problem.

In the mathematical subject of geometric group theory, a Dehn function, named after Max Dehn, is an optimal function associated to a finite group presentation which bounds the area of a relation in that group in terms of the length of that relation. The growth type of the Dehn function is a quasi-isometry invariant of a finitely presented group. The Dehn function of a finitely presented group is also closely connected with non-deterministic algorithmic complexity of the word problem in groups. In particular, a finitely presented group has solvable word problem if and only if the Dehn function for a finite presentation of this group is recursive. The notion of a Dehn function is motivated by isoperimetric problems in geometry, such as the classic isoperimetric inequality for the Euclidean plane and, more generally, the notion of a filling area function that estimates the area of a minimal surface in a Riemannian manifold in terms of the length of the boundary curve of that surface.

Combinatorics on words is a fairly new field of mathematics, branching from combinatorics, which focuses on the study of words and formal languages. The subject looks at letters or symbols, and the sequences they form. Combinatorics on words affects various areas of mathematical study, including algebra and computer science. There have been a wide range of contributions to the field. Some of the first work was on square-free words by Axel Thue in the early 1900s. He and colleagues observed patterns within words and tried to explain them. As time went on, combinatorics on words became useful in the study of algorithms and coding. It led to developments in abstract algebra and answering open questions.

In computability theory, the halting problem is the problem of determining, from a description of an arbitrary computer program and an input, whether the program will finish running, or continue to run forever. Alan Turing proved in 1936 that a general algorithm to solve the halting problem for all possible program–input pairs cannot exist.

In the mathematical subject of group theory, the Adian–Rabin theorem is a result that states that most "reasonable" properties of finitely presentable groups are algorithmically undecidable. The theorem is due to Sergei Adian (1955) and, independently, Michael O. Rabin (1958).

References

  1. 1 2 3 4 Evans, Trevor (1978). "Word problems". Bulletin of the American Mathematical Society. 84 (5): 790. doi: 10.1090/S0002-9904-1978-14516-9 .
  2. Cohen, Joel S. (2002). Computer algebra and symbolic computation: elementary algorithms. Natick, Mass.: A K Peters. pp. 90–92. ISBN   1568811586.
  3. 1 2 3 4 5 6 7 Miller, Charles F. (2014). Downey, Rod (ed.). "Turing machines to word problems" (PDF). Turing's Legacy: 330. doi:10.1017/CBO9781107338579.010. hdl:11343/51723. ISBN   9781107338579 . Retrieved 6 December 2021.
  4. Stillwell, John (1982). "The word problem and the isomorphism problem for groups". Bulletin of the American Mathematical Society. 6 (1): 33–56. doi: 10.1090/S0273-0979-1982-14963-1 .
  5. Müller-Stach, Stefan (12 September 2021). "Max Dehn, Axel Thue, and the Undecidable". p. 13. arXiv: 1703.09750 [math.HO].
  6. Steinby, Magnus; Thomas, Wolfgang (2000). "Trees and term rewriting in 1910: on a paper by Axel Thue". Bulletin of the European Association for Theoretical Computer Science. 72: 256–269. CiteSeerX   10.1.1.32.8993 . MR   1798015.
  7. Dehn, Max (1911). "Über unendliche diskontinuierliche Gruppen". Mathematische Annalen . 71 (1): 116–144. doi:10.1007/BF01456932. ISSN   0025-5831. MR   1511645. S2CID   123478582.
  8. Dehn, Max (1912). "Transformation der Kurven auf zweiseitigen Flächen". Mathematische Annalen . 72 (3): 413–421. doi:10.1007/BF01456725. ISSN   0025-5831. MR   1511705. S2CID   122988176.
  9. Greendlinger, Martin (June 1959). "Dehn's algorithm for the word problem". Communications on Pure and Applied Mathematics. 13 (1): 67–83. doi:10.1002/cpa.3160130108.
  10. Lyndon, Roger C. (September 1966). "On Dehn's algorithm". Mathematische Annalen. 166 (3): 208–228. doi:10.1007/BF01361168. hdl: 2027.42/46211 . S2CID   36469569.
  11. Schupp, Paul E. (June 1968). "On Dehn's algorithm and the conjugacy problem". Mathematische Annalen. 178 (2): 119–130. doi:10.1007/BF01350654. S2CID   120429853.
  12. Power, James F. (27 August 2013). "Thue's 1914 paper: a translation". arXiv: 1308.5858 [cs.FL].
  13. See History of the Church–Turing thesis. The dates are based on On Formally Undecidable Propositions of Principia Mathematica and Related Systems and Systems of Logic Based on Ordinals.
  14. Post, Emil L. (March 1947). "Recursive Unsolvability of a problem of Thue" (PDF). Journal of Symbolic Logic. 12 (1): 1–11. doi:10.2307/2267170. JSTOR   2267170. S2CID   30320278 . Retrieved 6 December 2021.
  15. Mostowski, Andrzej (September 1951). "A. Markov. Névožmoinost' nékotoryh algoritmov v téorii associativnyh sistém (Impossibility of certain algorithms in the theory of associative systems). Doklady Akadémii Nauk SSSR, vol. 77 (1951), pp. 19–20". Journal of Symbolic Logic. 16 (3): 215. doi:10.2307/2266407. JSTOR   2266407.
  16. Turing, A. M. (September 1950). "The Word Problem in Semi-Groups With Cancellation". The Annals of Mathematics. 52 (2): 491–505. doi:10.2307/1969481. JSTOR   1969481.
  17. Novikov, P. S. (1955). "On the algorithmic unsolvability of the word problem in group theory". Proceedings of the Steklov Institute of Mathematics (in Russian). 44: 1–143. Zbl   0068.01301.
  18. Boone, William W. (1954). "Certain Simple, Unsolvable Problems of Group Theory. I". Indagationes Mathematicae (Proceedings). 57: 231–237. doi:10.1016/S1385-7258(54)50033-8.
  19. Boone, William W. (1957). "Certain Simple, Unsolvable Problems of Group Theory. VI". Indagationes Mathematicae (Proceedings). 60: 227–232. doi: 10.1016/S1385-7258(57)50030-9 .
  20. Britton, J. L. (October 1958). "The Word Problem for Groups". Proceedings of the London Mathematical Society. s3-8 (4): 493–506. doi:10.1112/plms/s3-8.4.493.
  21. Boone, William W. (1958). "The word problem" (PDF). Proceedings of the National Academy of Sciences. 44 (10): 1061–1065. Bibcode:1958PNAS...44.1061B. doi: 10.1073/pnas.44.10.1061 . PMC   528693 . PMID   16590307. Zbl   0086.24701.
  22. Boone, William W. (September 1959). "The Word Problem". The Annals of Mathematics. 70 (2): 207–265. doi:10.2307/1970103. JSTOR   1970103.
  23. Higman, G. (8 August 1961). "Subgroups of finitely presented groups". Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences. 262 (1311): 455–475. Bibcode:1961RSPSA.262..455H. doi:10.1098/rspa.1961.0132. S2CID   120100270.
  24. Britton, John L. (January 1963). "The Word Problem". The Annals of Mathematics. 77 (1): 16–32. doi:10.2307/1970200. JSTOR   1970200.
  25. Simpson, Stephen G. (18 May 2005). "A Slick Proof of the Unsolvability of the Word Problem for Finitely Presented Groups" (PDF). Retrieved 6 December 2021.
  26. "Subgroups of finitely presented groups". Mathematics of the USSR-Sbornik. 103 (145): 147–236. 13 February 1977. doi:10.1070/SM1977v032n02ABEH002376.
  27. 1 2 Matiyasevich, Yuri; Sénizergues, Géraud (January 2005). "Decision problems for semi-Thue systems with a few rules". Theoretical Computer Science. 330 (1): 145–169. doi: 10.1016/j.tcs.2004.09.016 .
  28. Davis, Martin (1978). "What is a Computation?" (PDF). Mathematics Today Twelve Informal Essays: 257–259. doi:10.1007/978-1-4613-9435-8_10. ISBN   978-1-4613-9437-2 . Retrieved 5 December 2021.
  29. 1 2 Baader, Franz; Nipkow, Tobias (5 August 1999). Term Rewriting and All That. Cambridge University Press. pp. 59–60. ISBN   978-0-521-77920-3.
  30. Novikov, P. S. (1955). "On the algorithmic unsolvability of the word problem in group theory". Trudy Mat. Inst. Steklov (in Russian). 44: 1–143.
  31. Statman, Rick (2000). "On the Word Problem for Combinators". Rewriting Techniques and Applications. Lecture Notes in Computer Science. 1833: 203–213. doi:10.1007/10721975_14. ISBN   978-3-540-67778-9.
  32. Beke, Tibor (May 2011). "Categorification, term rewriting and the Knuth–Bendix procedure". Journal of Pure and Applied Algebra. 215 (5): 730. doi: 10.1016/j.jpaa.2010.06.019 .
  33. Peter T. Johnstone, Stone Spaces, (1982) Cambridge University Press, Cambridge, ISBN   0-521-23893-5. (See chapter 1, paragraph 4.11)
  34. Whitman, Philip M. (January 1941). "Free Lattices". The Annals of Mathematics. 42 (1): 325–329. doi:10.2307/1969001. JSTOR   1969001.
  35. Whitman, Philip M. (1942). "Free Lattices II". Annals of Mathematics. 43 (1): 104–115. doi:10.2307/1968883. JSTOR   1968883.
  36. K. H. Bläsius and H.-J. Bürckert, ed. (1992). Deduktionsssysteme. Oldenbourg. p. 291.; here: p.126, 134
  37. Apply rules in any order to a term, as long as possible; the result doesn't depend on the order; it is the term's normal form.