State complexity

Last updated July 30, 2024

State complexity is an area of theoretical computer science dealing with the size of abstract automata, such as different kinds of finite automata. The classical result in the area is that simulating an $n$ -state nondeterministic finite automaton by a deterministic finite automaton requires exactly $2^{n}$ states in the worst case.

Transformation between variants of finite automata
The 2DFA vs. 2NFA problem and logarithmic space
State complexity of operations for finite automata
Union
Intersection
Complementation
Concatenation
Kleene star
Reversal
Finite automata over a unary alphabet
Transformation between models
Union 2
Intersection 2
Complementation 2
Concatenation 2
Kleene star 2
Further reading
References

Transformation between variants of finite automata

Finite automata can be deterministic and nondeterministic, one-way (DFA, NFA) and two-way (2DFA, 2NFA). Other related classes are unambiguous (UFA), self-verifying (SVFA) and alternating (AFA) finite automata. These automata can also be two-way (2UFA, 2SVFA, 2AFA).

All these machines can accept exactly the regular languages. However, the size of different types of automata necessary to accept the same language (measured in the number of their states) may be different. For any two types of finite automata, the state complexity tradeoff between them is an integer function $f$ where $f(n)$ is the least number of states in automata of the second type sufficient to recognize every language recognized by an $n$ -state automaton of the first type. The following results are known.

NFA to DFA: $2^{n}$ states. This is the subset construction by Rabin and Scott,^[1] proved optimal by Lupanov.^[2]

UFA to DFA: $2^{n}$ states, see Leung,^[3] An earlier lower bound by Schmidt^[4] was smaller.
NFA to UFA: $2^{n}-1$ states, see Leung.^[3] There was an earlier smaller lower bound by Schmidt.^[4]
SVFA to DFA: $\Theta (3^{n/3})$ states, see Jirásková and Pighizzini ^[5]
2DFA to DFA: $n(n^{n}-(n-1)^{n})$ states, see Kapoutsis.^[6] Earlier construction by Shepherdson ^[7] used more states, and an earlier lower bound by Moore^[8] was smaller.
2DFA to NFA: ${\binom {2n}{n+1}}=O({\frac {4^{n}}{\sqrt {n}}})$ , see Kapoutsis.^[6] Earlier construction by Birget ^[9] used more states.
2NFA to NFA: ${\binom {2n}{n+1}}$ $State complexity$ , see Kapoutsis.^[6]
- 2NFA to NFA accepting the complement: $O(4^{n})$ states, see Vardi.^[10]
AFA to DFA: $2^{2^{n}}$ states, see Chandra, Kozen and Stockmeyer.^[11]
AFA to NFA: $2^{n}$ states, see Fellah, Jürgensen and Yu.^[12]
2AFA to DFA: $2^{n2^{n}}$ , see Ladner, Lipton and Stockmeyer.^[13]
2AFA to NFA: $2^{\Theta (n\log n)}$ , see Geffert and Okhotin.^[14]

The 2DFA vs. 2NFA problem and logarithmic space

Unsolved problem in computer science:

Does every

n

-state 2NFA have an equivalent

\operatorname {poly} (n)

-state 2DFA?

(more unsolved problems in computer science)

It is an open problem whether all 2NFAs can be converted to 2DFAs with polynomially many states, i.e. whether there is a polynomial $p(n)$ such that for every $n$ -state 2NFA there exists a $p(n)$ -state 2DFA. The problem was raised by Sakoda and Sipser,^[15] who compared it to the P vs. NP problem in the computational complexity theory. Berman and Lingas ^[16] discovered a formal relation between this problem and the L vs. NL open problem. This relation was further elaborated by Kapoutsis.^[17]

State complexity of operations for finite automata

Given a binary regularity-preserving operation on languages $\circ$ and a family of automata X (DFA, NFA, etc.), the state complexity of $\circ$ is an integer function $f(m,n)$ such that

for each m-state X-automaton A and n-state X-automaton B there is an $f(m,n)$ -state X-automaton for $L(A)\circ L(B)$ , and
for all integers m, n there is an m-state X-automaton A and an n-state X-automaton B such that every X-automaton for $L(A)\circ L(B)$ must have at least $f(m,n)$ states.

Analogous definition applies for operations with any number of arguments.

The first results on state complexity of operations for DFAs were published by Maslov ^[18] and by Yu, Zhuang and Salomaa. ^[19] Holzer and Kutrib ^[20] pioneered the state complexity of operations on NFA. The known results for basic operations are listed below.

Union

If language $L_{1}$ requires m states and language $L_{2}$ requires n states, how many states does $L_{1}\cup L_{2}$ require?

DFA: $mn$ states, see Maslov^[18] and Yu, Zhuang and Salomaa.^[19]
NFA: $m+n+1$ states, see Holzer and Kutrib.^[20]
UFA: at least $\min(n,m)^{\Omega (\log(\min(n,m)))}$ ;^[21] between $mn+m+n$ and $m+nm2^{0.79m}$ states, see Jirásek, Jirásková and Šebej.^[22]
SVFA: $mn$ states, see Jirásek, Jirásková and Szabari.^[23]
2DFA: between $m+n$ and $4m+n+4$ states, see Kunc and Okhotin.^[24]
2NFA: $m+n$ states, see Kunc and Okhotin.^[25]

Intersection

How many states does $L_{1}\cap L_{2}$ require?

DFA: $mn$ states, see Maslov^[18] and Yu, Zhuang and Salomaa.^[19]
NFA: $mn$ states, see Holzer and Kutrib.^[20]
UFA: $mn$ states, see Jirásek, Jirásková and Šebej.^[22]
SVFA: $mn$ states, see Jirásek, Jirásková and Szabari.^[23]
2DFA: between $m+n$ and $m+n+1$ states, see Kunc and Okhotin.^[24]
2NFA: between $m+n$ and $m+n+1$ states, see Kunc and Okhotin.^[25]

Complementation

If language L requires n states then how many states does its complement require?

DFA: $n$ states, by exchanging accepting and rejecting states.
NFA: $2^{n}$ states, see Birget.^[26] or Jirásková^[27]
UFA: at least $n^{{\tilde {\Omega }}(\log n)}$ states, see Göös, Kiefer and Yuan,^[21] (this follows an earlier bound by Raskin^[28]); and at most ${\sqrt {n+1}}\cdot 2^{0.5n}$ states, see Indzhev and Kiefer.^[29]
SVFA: $n$ states, by exchanging accepting and rejecting states.
2DFA: at least $n$ and at most $4n$ states, see Geffert, Mereghetti and Pighizzini.^[30]

Concatenation

How many states does $L_{1}L_{2}=\{w_{1}w_{2}\mid w_{1}\in L_{1},w_{2}\in L_{2}\}$ require?

DFA: $m\cdot 2^{n}-2^{n-1}$ states, see Maslov ^[18] and Yu, Zhuang and Salomaa.^[19]
NFA: $m+n$ states, see Holzer and Kutrib.^[20]
UFA: ${\frac {3}{4}}2^{m+n}-1$ states, see Jirásek, Jirásková and Šebej.^[22]
SVFA: $\Theta (3^{n/3}2^{m})$ states, see Jirásek, Jirásková and Szabari.^[23]
2DFA: at least ${\frac {2^{\Omega (n)}}{\log m}}$ and at most $2m^{m+1}\cdot 2^{n^{n+1}}$ states, see Jirásková and Okhotin.^[31]

Kleene star

DFA: ${\frac {3}{4}}2^{n}$ states, see Maslov^[18] and Yu, Zhuang and Salomaa.^[19]
NFA: $n+1$ states, see Holzer and Kutrib.^[20]
UFA: ${\frac {3}{4}}2^{n}$ states, see Jirásek, Jirásková and Šebej.^[22]
SVFA: ${\frac {3}{4}}2^{n}$ states, see Jirásek, Jirásková and Szabari.^[23]
2DFA: at least ${\frac {1}{n}}2^{{\frac {n}{2}}-1}$ and at most $2^{O(n^{n+1})}$ states, see Jirásková and Okhotin.^[31]

Reversal

DFA: $2^{n}$ states, see Mirkin,^[32] Leiss,^[33] and Yu, Zhuang and Salomaa.^[19]
NFA: $n+1$ states, see Holzer and Kutrib.^[20]
UFA: $n$ states.
SVFA: $2n+1$ states, see Jirásek, Jirásková and Szabari.^[23]
2DFA: between $n+1$ and $n+2$ states, see Jirásková and Okhotin.^[31]

Finite automata over a unary alphabet

State complexity of finite automata with a one-letter (unary) alphabet, pioneered by Chrobak,^[34] is different from the multi-letter case.

Let $g(n)=e^{\Theta ({\sqrt {n\ln n}})}$ be Landau's function.

Transformation between models

For a one-letter alphabet, transformations between different types of finite automata are sometimes more efficient than in the general case.

NFA to DFA: $g(n)+O(n^{2})$ states, see Chrobak.^[34]
2DFA to DFA: $g(n)+O(n)$ states, see Chrobak^[34] and Kunc and Okhotin.^[35]
2NFA to DFA: $O(g(n))$ states, see Mereghetti and Pighizzini.^[36] and Geffert, Mereghetti and Pighizzini.^[37]
NFA to 2DFA: at most $O(n^{2})$ states, see Chrobak.^[34]
2NFA to 2DFA: at most $n^{O(\log n)}$ states, proved by implementing the method of Savitch's theorem, see Geffert, Mereghetti and Pighizzini.^[37]
UFA to DFA: $e^{\Theta ({\sqrt[{3}]{n(\ln n)^{2}}})}$ , see Okhotin.^[38]
NFA to UFA: $g(n)+O(n^{2})$ , see Okhotin.^[38]