Deterministic finite automaton

Last updated January 29, 2025

In the theory of computation, a branch of theoretical computer science, a deterministic finite automaton (DFA)—also known as deterministic finite acceptor (DFA), deterministic finite-state machine (DFSM), or deterministic finite-state automaton (DFSA)—is a finite-state machine that accepts or rejects a given string of symbols, by running through a state sequence uniquely determined by the string.^[1]Deterministic refers to the uniqueness of the computation run. In search of the simplest models to capture finite-state machines, Warren McCulloch and Walter Pitts were among the first researchers to introduce a concept similar to finite automata in 1943.^[2]^[3]

The figure illustrates a deterministic finite automaton using a state diagram. In this example automaton, there are three states: S₀, S₁, and S₂ (denoted graphically by circles). The automaton takes a finite sequence of 0s and 1s as input. For each state, there is a transition arrow leading out to a next state for both 0 and 1. Upon reading a symbol, a DFA jumps deterministically from one state to another by following the transition arrow. For example, if the automaton is currently in state S₀ and the current input symbol is 1, then it deterministically jumps to state S₁. A DFA has a start state (denoted graphically by an arrow coming in from nowhere) where computations begin, and a set of accept states (denoted graphically by a double circle) which help define when a computation is successful.

A DFA is defined as an abstract mathematical concept, but is often implemented in hardware and software for solving various specific problems such as lexical analysis and pattern matching. For example, a DFA can model software that decides whether or not online user input such as email addresses are syntactically valid.^[4]

DFAs have been generalized to nondeterministic finite automata (NFA) which may have several arrows of the same label starting from a state. Using the powerset construction method, every NFA can be translated to a DFA that recognizes the same language. DFAs, and NFAs as well, recognize exactly the set of regular languages.^[1]

Formal definition

A deterministic finite automaton $M$ is a 5-tuple, $(Q, Σ, δ, q 0, F)$ , consisting of

a finite set of states $Q$
a finite set of input symbols called the alphabet $Σ$
a transition function $δ : Q \times Σ \to Q$
an initial (or start) state $q_{0}\in Q$
a set of accepting (or final) states $F\subseteq Q$

Let $w = a 1 a 2 ... a n$ be a string over the alphabet $Σ$ . The automaton $M$ accepts the string $w$ if a sequence of states, $r 0, r 1, ..., r n$ , exists in $Q$ with the following conditions:

$r 0 = q 0$
$r i +1 = δ (r i, a i +1)$ , for $i = 0, ..., n - 1$
$r_{n}\in F$ .

In words, the first condition says that the machine starts in the start state $q 0$ . The second condition says that given each character of string $w$ , the machine will transition from state to state according to the transition function $δ$ . The last condition says that the machine accepts $w$ if the last input of $w$ causes the machine to halt in one of the accepting states. Otherwise, it is said that the automaton rejects the string. The set of strings that $M$ accepts is the language recognized by $M$ and this language is denoted by $L (M)$ .

A deterministic finite automaton without accept states and without a starting state is known as a transition system or semiautomaton.

For more comprehensive introduction of the formal definition see automata theory.

Example

The following example is of a DFA $M$ , with a binary alphabet, which requires that the input contains an even number of 0s.

$M = (Q, Σ, δ, q 0, F)$ where

$Q = {S 1, S 2}$
$Σ = {0, 1}$
$q 0 = S 1$
$F = {S 1}$ and
$δ$ is defined by the following state transition table:

	0	1
S₁	S₂	S₁
S₂	S₁	S₂

The state $S 1$ represents that there has been an even number of 0s in the input so far, while $S 2$ signifies an odd number. A 1 in the input does not change the state of the automaton. When the input ends, the state will show whether the input contained an even number of 0s or not. If the input did contain an even number of 0s, $M$ will finish in state $S 1$ , an accepting state, so the input string will be accepted.

The language recognized by $M$ is the regular language given by the regular expression (1*) (0 (1*) 0 (1*))*, where * is the Kleene star, e.g., 1* denotes any number (possibly zero) of consecutive ones.

Variations

Complete and incomplete

According to the above definition, deterministic finite automata are always complete: they define from each state a transition for each input symbol.

While this is the most common definition, some authors use the term deterministic finite automaton for a slightly different notion: an automaton that defines at most one transition for each state and each input symbol; the transition function is allowed to be partial.^[5] When no transition is defined, such an automaton halts.

Local automata

A local automaton is a DFA, not necessarily complete, for which all edges with the same label lead to a single vertex. Local automata accept the class of local languages, those for which membership of a word in the language is determined by a "sliding window" of length two on the word.^[6]^[7]

A Myhill graph over an alphabet A is a directed graph with vertex set A and subsets of vertices labelled "start" and "finish". The language accepted by a Myhill graph is the set of directed paths from a start vertex to a finish vertex: the graph thus acts as an automaton.^[6] The class of languages accepted by Myhill graphs is the class of local languages.^[8]

Randomness

When the start state and accept states are ignored, a DFA of $n$ states and an alphabet of size $k$ can be seen as a digraph of $n$ vertices in which all vertices have $k$ out-arcs labeled $1, ..., k$ (a $k$ -out digraph). It is known that when $k \geq 2$ is a fixed integer, with high probability, the largest strongly connected component (SCC) in such a $k$ -out digraph chosen uniformly at random is of linear size and it can be reached by all vertices.^[9] It has also been proven that if $k$ is allowed to increase as $n$ increases, then the whole digraph has a phase transition for strong connectivity similar to Erdős–Rényi model for connectivity.^[10]

In a random DFA, the maximum number of vertices reachable from one vertex is very close to the number of vertices in the largest SCC with high probability.^[9]^[11] This is also true for the largest induced sub-digraph of minimum in-degree one, which can be seen as a directed version of $1$ -core.^[10]

Closure properties

The upper left automaton recognizes the language of all binary strings containing at least one occurrence of "00". The lower right automaton recognizes all binary strings with an even number of "1". The lower left automaton is obtained as product of the former two, it recognizes the intersection of both languages. Intersection1.png — The upper left automaton recognizes the language of all binary strings containing at least one occurrence of "00". The lower right automaton recognizes all binary strings with an even number of "1". The lower left automaton is obtained as product of the former two, it recognizes the intersection of both languages.

If DFAs recognize the languages that are obtained by applying an operation on the DFA recognizable languages then DFAs are said to be closed under the operation. The DFAs are closed under the following operations.

Union
Intersection^[12] (see picture)
Concatenation
Complement
Kleene closure
Reversal^[13]
Quotient^[13]
Substitution^[14]
Homomorphism^[13]^[14]

For each operation, an optimal construction with respect to the number of states has been determined in state complexity research. Since DFAs are equivalent to nondeterministic finite automata (NFA), these closures may also be proved using closure properties of NFA.

As a transition monoid

A run of a given DFA can be seen as a sequence of compositions of a very general formulation of the transition function with itself. Here we construct that function.

For a given input symbol $a\in \Sigma$ , one may construct a transition function $\delta _{a}:Q\rightarrow Q$ by defining $\delta _{a}(q)=\delta (q,a)$ for all $q\in Q$ . (This trick is called currying.) From this perspective, $\delta _{a}$ "acts" on a state in Q to yield another state. One may then consider the result of function composition repeatedly applied to the various functions $\delta _{a}$ , $\delta _{b}$ , and so on. Given a pair of letters $a,b\in \Sigma$ , one may define a new function ${\widehat {\delta }}_{ab}=\delta _{a}\circ \delta _{b}$ , where $\circ$ denotes function composition.

Clearly, this process may be recursively continued, giving the following recursive definition of ${\widehat {\delta }}:Q\times \Sigma ^{\star }\rightarrow Q$ :

{\widehat {\delta }}(q,\epsilon )=q

, where

\epsilon

is the empty string and

{\widehat {\delta }}(q,wa)=\delta _{a}({\widehat {\delta }}(q,w))

, where

w\in \Sigma ^{*},a\in \Sigma

and

q\in Q

.

${\widehat {\delta }}$ is defined for all words $w\in \Sigma ^{*}$ . A run of the DFA is a sequence of compositions of ${\widehat {\delta }}$ with itself.

Repeated function composition forms a monoid. For the transition functions, this monoid is known as the transition monoid, or sometimes the transformation semigroup. The construction can also be reversed: given a ${\widehat {\delta }}$ , one can reconstruct a $\delta$ , and so the two descriptions are equivalent.

Advantages and disadvantages

DFAs are one of the most practical models of computation, since there is a trivial linear time, constant-space, online algorithm to simulate a DFA on a stream of input. Also, there are efficient algorithms to find a DFA recognizing:

the complement of the language recognized by a given DFA.
the union/intersection of the languages recognized by two given DFAs.

Because DFAs can be reduced to a canonical form (minimal DFAs), there are also efficient algorithms to determine:

whether a DFA accepts any strings (Emptiness Problem)
whether a DFA accepts all strings (Universality Problem)
whether two DFAs recognize the same language (Equality Problem)
whether the language recognized by a DFA is included in the language recognized by a second DFA (Inclusion Problem)
the DFA with a minimum number of states for a particular regular language (Minimization Problem)

DFAs are equivalent in computing power to nondeterministic finite automata (NFAs). This is because, firstly any DFA is also an NFA, so an NFA can do what a DFA can do. Also, given an NFA, using the powerset construction one can build a DFA that recognizes the same language as the NFA, although the DFA could have exponentially larger number of states than the NFA.^[15]^[16] However, even though NFAs are computationally equivalent to DFAs, the above-mentioned problems are not necessarily solved efficiently also for NFAs. The non-universality problem for NFAs is PSPACE complete since there are small NFAs with shortest rejecting word in exponential size. A DFA is universal if and only if all states are final states, but this does not hold for NFAs. The Equality, Inclusion and Minimization Problems are also PSPACE complete since they require forming the complement of an NFA which results in an exponential blow up of size.^[17]

On the other hand, finite-state automata are of strictly limited power in the languages they can recognize; many simple languages, including any problem that requires more than constant space to solve, cannot be recognized by a DFA. The classic example of a simply described language that no DFA can recognize is bracket or Dyck language, i.e., the language that consists of properly paired brackets such as word "(()())". Intuitively, no DFA can recognize the Dyck language because DFAs are not capable of counting: a DFA-like automaton needs to have a state to represent any possible number of "currently open" parentheses, meaning it would need an unbounded number of states. Another simpler example is the language consisting of strings of the form aⁿbⁿ for some finite but arbitrary number of a's, followed by an equal number of b's.^[18]

DFA identification from labeled words

Given a set of positive words $S^{+}\subset \Sigma ^{*}$ and a set of negative words $S^{-}\subset \Sigma ^{*}$ one can construct a DFA that accepts all words from $S^{+}$ and rejects all words from $S^{-}$ : this problem is called DFA identification (synthesis, learning). While some DFA can be constructed in linear time, the problem of identifying a DFA with the minimal number of states is NP-complete.^[19] The first algorithm for minimal DFA identification has been proposed by Trakhtenbrot and Barzdin^[20] and is called the TB-algorithm. However, the TB-algorithm assumes that all words from $\Sigma$ up to a given length are contained in either $S^{+}\cup S^{-}$ .

Later, K. Lang proposed an extension of the TB-algorithm that does not use any assumptions about $S^{+}$ and $S^{-}$ , the Traxbar algorithm.^[21] However, Traxbar does not guarantee the minimality of the constructed DFA. In his work^[19] E.M. Gold also proposed a heuristic algorithm for minimal DFA identification. Gold's algorithm assumes that $S^{+}$ and $S^{-}$ contain a characteristic set of the regular language; otherwise, the constructed DFA will be inconsistent either with $S^{+}$ or $S^{-}$ . Other notable DFA identification algorithms include the RPNI algorithm,^[22] the Blue-Fringe evidence-driven state-merging algorithm,^[23] and Windowed-EDSM.^[24] Another research direction is the application of evolutionary algorithms: the smart state labeling evolutionary algorithm^[25] allowed to solve a modified DFA identification problem in which the training data (sets $S^{+}$ and $S^{-}$ ) is noisy in the sense that some words are attributed to wrong classes.

Yet another step forward is due to application of SAT solvers by Marjin J. H. Heule and S. Verwer: the minimal DFA identification problem is reduced to deciding the satisfiability of a Boolean formula.^[26] The main idea is to build an augmented prefix-tree acceptor (a trie containing all input words with corresponding labels) based on the input sets and reduce the problem of finding a DFA with $C$ states to coloring the tree vertices with $C$ states in such a way that when vertices with one color are merged to one state, the generated automaton is deterministic and complies with $S^{+}$ and $S^{-}$ . Though this approach allows finding the minimal DFA, it suffers from exponential blow-up of execution time when the size of input data increases. Therefore, Heule and Verwer's initial algorithm has later been augmented with making several steps of the EDSM algorithm prior to SAT solver execution: the DFASAT algorithm.^[27] This allows reducing the search space of the problem, but leads to loss of the minimality guarantee. Another way of reducing the search space has been proposed by Ulyantsev et al.^[28] by means of new symmetry breaking predicates based on the breadth-first search algorithm: the sought DFA's states are constrained to be numbered according to the BFS algorithm launched from the initial state. This approach reduces the search space by $C!$ by eliminating isomorphic automata.

Equivalent models

Read-only right-moving Turing machines

Read-only right-moving Turing machines are a particular type of Turing machine that only moves right; these are almost exactly equivalent to DFAs.^[29] The definition based on a singly infinite tape is a 7-tuple

M=\langle Q,\Gamma ,b,\Sigma ,\delta ,q_{0},F\rangle ,

where

Q

is a finite set of states;

\Gamma

is a finite set of the tape alphabet/symbols;

b\in \Gamma

is the blank symbol (the only symbol allowed to occur on the tape infinitely often at any step during the computation);

\Sigma

, a subset of

\Gamma

not including b, is the set of input symbols;

\delta :Q\times \Gamma \to Q\times \Gamma \times \{R\}

is a function called the transition function , R is a right movement (a right shift);

q_{0}\in Q

is the initial state;

F\subseteq Q

is the set of final or accepting states.

The machine always accepts a regular language. There must exist at least one element of the set $F$ (a HALT state) for the language to be nonempty.

Example of a 3-state, 2-symbol read-only Turing machine

tape symbol	Write symbol	Move tape	Next state	Write symbol	Move tape	Next state	Write symbol	Move tape	Next state
	Current state A			Current state B			Current state C
0	1	R	B	1	R	A	1	R	B
1	1	R	C	1	R	B	1	N	HALT

Q=\{A,B,C,{\text{HALT}}\};

\Gamma =\{0,1\};

b=0

, "blank";

\Sigma =\varnothing

, empty set;

\delta =

see state-table above;

q_{0}=A

, initial state;

F=

the one element set of final states:

\{{\text{HALT}}\}

.

Notes

1 2 Hopcroft, Motwani & Ullman 2006.
↑ McCulloch & Pitts 1943.
↑ Rabin & Scott 1959.
↑ Bai, Gina R.; Clee, Brian; Shrestha, Nischal; Chapman, Carl; Wright, Cimone; Stolee, Kathryn T. (2019). "Exploring tools and strategies used during regular expression composition tasks". In Guéhéneuc, Yann-Gaël; Khomh, Foutse; Sarro, Federica (eds.). Proceedings of the 27th International Conference on Program Comprehension, ICPC 2019, Montreal, QC, Canada, May 25-31, 2019. IEEE / ACM. pp. 197–208. doi:10.1109/ICPC.2019.00039. ISBN 978-1-7281-1519-1.
↑ Mogensen, Torben Ægidius (2011). "Lexical Analysis". Introduction to Compiler Design. Undergraduate Topics in Computer Science. London: Springer. p. 12. doi:10.1007/978-0-85729-829-4_1. ISBN 978-0-85729-828-7.
1 2 Lawson 2004, p. 129.
↑ Sakarovitch 2009, p. 228.
↑ Lawson 2004, p. 128.
1 2 Grusho, A. A. (1973). "Limit distributions of certain characteristics of random automaton graphs". Mathematical Notes of the Academy of Sciences of the USSR. 4: 633–637. doi:10.1007/BF01095785. S2CID 121723743.
1 2 Cai, Xing Shi; Devroye, Luc (October 2017). "The graph structure of a deterministic automaton chosen at random". Random Structures & Algorithms. 51 (3): 428–458. arXiv: 1504.06238 . doi:10.1002/rsa.20707. S2CID 13013344.
↑ Carayol, Arnaud; Nicaud, Cyril (February 2012). Distribution of the number of accessible states in a random deterministic automaton. STACS'12 (29th Symposium on Theoretical Aspects of Computer Science). Vol. 14. Paris, France. pp. 194–205.
↑ Hopcroft & Ullman 1979, pp. 59–60.
1 2 3 Rose, Gene F. (1968). "Closures which Preserve Finiteness in Families of Languages". Journal of Computer and System Sciences . 2 (2): 148–168. doi:10.1016/S0022-0000(68)80029-7.
1 2 Spanier, E. (1969). "Grammars and languages". American Mathematical Monthly. 76 (4): 335–342. doi:10.1080/00029890.1969.12000214. JSTOR 2316423. MR 0241205.
↑ Sakarovitch 2009, p. 105.
↑ Lawson 2004, p. 63.
↑ Esparza Estaun, Francisco Javier; Sickert, Salomon; Blondin, Michael (16 November 2016). "Operations and tests on sets: Implementation on DFAs" (PDF). Automata and Formal Languages 2017/18. Archived from the original (PDF) on 8 August 2018.
↑ Lawson 2004, p. 46.
1 2 Gold, E. M. (1978). "Complexity of Automaton Identification from Given Data". Information and Control. 37 (3): 302–320. doi:10.1016/S0019-9958(78)90562-4.
↑ De Vries, A. (28 June 2014). Finite Automata: Behavior and Synthesis. Elsevier. ISBN 9781483297293.
↑ Lang, Kevin J. (1992). "Random DFA's can be approximately learned from sparse uniform examples". Proceedings of the fifth annual workshop on Computational learning theory - COLT '92. pp. 45–52. doi:10.1145/130385.130390. ISBN 089791497X. S2CID 7480497.
↑ Oncina, J.; García, P. (1992). "Inferring Regular Languages in Polynomial Updated Time". Pattern Recognition and Image Analysis. Series in Machine Perception and Artificial Intelligence. Vol. 1. pp. 49–61. doi:10.1142/9789812797902_0004. ISBN 978-981-02-0881-3.
↑ Lang, Kevin J.; Pearlmutter, Barak A.; Price, Rodney A. (1998). "Results of the Abbadingo one DFA learning competition and a new evidence-driven state merging algorithm". Grammatical Inference (PDF). Lecture Notes in Computer Science. Vol. 1433. pp. 1–12. doi:10.1007/BFb0054059. ISBN 978-3-540-64776-8.
↑ Adriaans, Pieter; Fernau, Henning; Zaanen, Menno van (23 September 2002). Beyond EDSM | Proceedings of the 6th International Colloquium on Grammatical Inference: Algorithms and Applications. Springer. pp. 37–48. ISBN 9783540442394.
↑ Lucas, S.M.; Reynolds, T.J. (2005). "Learning deterministic finite automata with a smart state labeling evolutionary algorithm". IEEE Transactions on Pattern Analysis and Machine Intelligence. 27 (7): 1063–1074. doi:10.1109/TPAMI.2005.143. PMID 16013754. S2CID 14062047.
↑ Heule, M. J. H. (2010). "Exact DFA Identification Using SAT Solvers". Grammatical Inference: Theoretical Results and Applications. Grammatical Inference: Theoretical Results and Applications. ICGI 2010. Lecture Notes in Computer Science. Lecture Notes in Computer Science. Vol. 6339. pp. 66–79. doi:10.1007/978-3-642-15488-1_7. ISBN 978-3-642-15487-4.
↑ Heule, Marijn J. H.; Verwer, Sicco (2013). "Software model synthesis using satisfiability solvers". Empirical Software Engineering. 18 (4): 825–856. doi:10.1007/s10664-012-9222-z. hdl: 2066/103766 . S2CID 17865020.
↑ Ulyantsev, Vladimir; Zakirzyanov, Ilya; Shalyto, Anatoly (2015). "BFS-Based Symmetry Breaking Predicates for DFA Identification". Language and Automata Theory and Applications. Lecture Notes in Computer Science. Vol. 8977. pp. 611–622. doi:10.1007/978-3-319-15579-1_48. ISBN 978-3-319-15578-4.
↑ Davis, Martin; Ron Sigal; Elaine J. Weyuker (1994). Second Edition: Computability, Complexity, and Languages and Logic: Fundamentals of Theoretical Computer Science (2nd ed.). San Diego: Academic Press, Harcourt, Brace & Company. ISBN 0-12-206382-1.

Related Research Articles

A finite-state machine (FSM) or finite-state automaton, finite automaton, or simply a state machine, is a mathematical model of computation. It is an abstract machine that can be in exactly one of a finite number of states at any given time. The FSM can change from one state to another in response to some inputs; the change from one state to another is called a transition. An FSM is defined by a list of its states, its initial state, and the inputs that trigger each transition. Finite-state machines are of two types—deterministic finite-state machines and non-deterministic finite-state machines. For any non-deterministic finite-state machine, an equivalent deterministic one can be constructed.

In the theory of computation, a branch of theoretical computer science, a pushdown automaton (PDA) is a type of automaton that employs a stack.

Automata theory is the study of abstract machines and automata, as well as the computational problems that can be solved using them. It is a theory in theoretical computer science with close connections to mathematical logic. The word automata comes from the Greek word αὐτόματος, which means "self-acting, self-willed, self-moving". An automaton is an abstract self-propelled computing device which follows a predetermined sequence of operations automatically. An automaton with a finite number of states is called a finite automaton (FA) or finite-state machine (FSM). The figure on the right illustrates a finite-state machine, which is a well-known type of automaton. This automaton consists of states and transitions. As the automaton sees a symbol of input, it makes a transition to another state, according to its transition function, which takes the previous state and current input symbol as its arguments.

In computer science and automata theory, a deterministic Büchi automaton is a theoretical machine which either accepts or rejects infinite inputs. Such a machine has a set of states and a transition function, which determines which state the machine should move to from its current state when it reads the next input character. Some states are accepting states and one state is the start state. The machine accepts an input if and only if it will pass through an accepting state infinitely many times as it reads the input.

In automata theory, a finite-state machine is called a deterministic finite automaton (DFA), if

In automata theory, an alternating finite automaton (AFA) is a nondeterministic finite automaton whose transitions are divided into existential and universal transitions. For example, let A be an alternating automaton.

A finite-state transducer (FST) is a finite-state machine with two memory tapes, following the terminology for Turing machines: an input tape and an output tape. This contrasts with an ordinary finite-state automaton, which has a single tape. An FST is a type of finite-state automaton (FSA) that maps between two sets of symbols. An FST is more general than an FSA. An FSA defines a formal language by defining a set of accepted strings, while an FST defines a relation between sets of strings.

In the theory of computation and automata theory, the powerset construction or subset construction is a standard method for converting a nondeterministic finite automaton (NFA) into a deterministic finite automaton (DFA) which recognizes the same formal language. It is important in theory because it establishes that NFAs, despite their additional flexibility, are unable to recognize any language that cannot be recognized by some DFA. It is also important in practice for converting easier-to-construct NFAs into more efficiently executable DFAs. However, if the NFA has n states, the resulting DFA may have up to 2ⁿ states, an exponentially larger number, which sometimes makes the construction impractical for large NFAs.

In automata theory, a deterministic pushdown automaton is a variation of the pushdown automaton. The class of deterministic pushdown automata accepts the deterministic context-free languages, a proper subset of context-free languages.

In computer science, in particular in automata theory, a two-way finite automaton is a finite automaton that is allowed to re-read its input.

In quantum computing, quantum finite automata (QFA) or quantum state machines are a quantum analog of probabilistic automata or a Markov decision process. They provide a mathematical abstraction of real-world quantum computers. Several types of automata may be defined, including measure-once and measure-many automata. Quantum finite automata can also be understood as the quantization of subshifts of finite type, or as a quantization of Markov chains. QFAs are, in turn, special cases of geometric finite automata or topological finite automata.

In mathematics and computer science, the probabilistic automaton (PA) is a generalization of the nondeterministic finite automaton; it includes the probability of a given transition into the transition function, turning it into a transition matrix. Thus, the probabilistic automaton also generalizes the concepts of a Markov chain and of a subshift of finite type. The languages recognized by probabilistic automata are called stochastic languages; these include the regular languages as a subset. The number of stochastic languages is uncountable.

A queue machine, queue automaton, or pullup automaton (PUA) is a finite-state machine with the ability to store and retrieve data from an infinite-memory queue. Its design is similar to a pushdown automaton but differs by replacing the stack with this queue. A queue machine is a model of computation equivalent to a Turing machine, and therefore it can process the same class of formal languages.

A read-only Turing machine or two-way deterministic finite-state automaton (2DFA) is class of models of computability that behave like a standard Turing machine and can move in both directions across input, except cannot write to its input tape. The machine in its bare form is equivalent to a deterministic finite automaton in computational power, and therefore can only parse a regular language.

In automata theory, DFA minimization is the task of transforming a given deterministic finite automaton (DFA) into an equivalent DFA that has a minimum number of states. Here, two DFAs are called equivalent if they recognize the same regular language. Several different algorithms accomplishing this task are known and described in standard textbooks on automata theory.

In computer science, more specifically in automata and formal language theory, nested words are a concept proposed by Alur and Madhusudan as a joint generalization of words, as traditionally used for modelling linearly ordered structures, and of ordered unranked trees, as traditionally used for modelling hierarchical structures. Finite-state acceptors for nested words, so-called nested word automata, then give a more expressive generalization of finite automata on words. The linear encodings of languages accepted by finite nested word automata gives the class of visibly pushdown languages. The latter language class lies properly between the regular languages and the deterministic context-free languages. Since their introduction in 2004, these concepts have triggered much research in that area.

In computer science and mathematical logic, an infinite-tree automaton is a state machine that deals with infinite tree structures. It can be seen as an extension of top-down finite-tree automata to infinite trees or as an extension of infinite-word automata to infinite trees.

In computational learning theory, induction of regular languages refers to the task of learning a formal description of a regular language from a given set of example strings. Although E. Mark Gold has shown that not every regular language can be learned this way, approaches have been investigated for a variety of subclasses. They are sketched in this article. For learning of more general grammars, see Grammar induction.

In theoretical computer science and formal language theory, a weighted automaton or weighted finite-state machine is a generalization of a finite-state machine in which the edges have weights, for example real numbers or integers. Finite-state machines are only capable of answering decision problems; they take as input a string and produce a Boolean output, i.e. either "accept" or "reject". In contrast, weighted automata produce a quantitative output, for example a count of how many answers are possible on a given input string, or a probability of how likely the input string is according to a probability distribution. They are one of the simplest studied models of quantitative automata.

In automata theory, an unambiguous finite automaton (UFA) is a nondeterministic finite automaton (NFA) such that each word has at most one accepting path. Each deterministic finite automaton (DFA) is an UFA, but not vice versa. DFA, UFA, and NFA recognize exactly the same class of formal languages. On the one hand, an NFA can be exponentially smaller than an equivalent DFA. On the other hand, some problems are easily solved on DFAs and not on UFAs. For example, given an automaton A, an automaton A′ which accepts the complement of A can be computed in linear time when A is a DFA, whereas it is known that this cannot be done in polynomial time for UFAs. Hence UFAs are a mix of the worlds of DFA and of NFA; in some cases, they lead to smaller automata than DFA and quicker algorithms than NFA.

References

Hopcroft, John E.; Ullman, Jeffrey D. (1979). Introduction to Automata Theory, Languages, and Computation (1st ed.). Addison-Wesley. ISBN 0-201-02988-X.(accessible to patrons with print disabilities)
Hopcroft, John E.; Motwani, Rajeev; Ullman, Jeffrey D. (2006) [1979]. Introduction to Automata Theory, Languages, and Computation (3rd ed.). Addison-Wesley. ISBN 0-321-45536-3.
Lawson, Mark V. (2004). Finite automata. Chapman and Hall/CRC. ISBN 1-58488-255-7. Zbl 1086.68074.
McCulloch, W. S.; Pitts, W. (1943). "A Logical Calculus of the Ideas Immanent in Nervous Activity". Bulletin of Mathematical Biophysics. 5 (4): 115–133. doi:10.1007/BF02478259. PMID 2185863.
Rabin, M. O.; Scott, D. (1959). "Finite automata and their decision problems". IBM J. Res. Dev. 3 (2): 114–125. doi:10.1147/rd.32.0114.
Sakarovitch, Jacques (2009). Elements of automata theory. Translated from the French by Reuben Thomas. Cambridge: Cambridge University Press. ISBN 978-0-521-84425-3. Zbl 1188.68177.