Subshift of finite type

Last updated

In mathematics, subshifts of finite type are used to model dynamical systems, and in particular are the objects of study in symbolic dynamics and ergodic theory. They also describe the set of all possible sequences executed by a finite state machine. The most widely studied shift spaces are the subshifts of finite type.

Contents

Motivating examples

directed graph
A
-
B
-
C
-
A
{\displaystyle A\to B\to C\to A} Directed.svg
directed graph

Consider a directed graph .

The (one-sided) shifts of finite type are all sequences, infinite on one end only, that can be made up of the letters , like . The (two-sided) shifts of finite type are similar, but are infinite on both ends.

Now when we drop to the subset of subshifts, defined by the graph , we restrict transitions to only those allowed by the graph. This gives us only three possible one-sided subshifts: . Similarly, there are only three possible two-sided subshifts.

Other directed graphs on the letters give us other subshifts. For example, if we add another arrow , then instead of 3, we have uncountably infinitely many possible subshifts.

Markov and non-Markov measures

The hidden part of a hidden Markov model, whose observable states is non-Markovian. Blackwell HMM example.png
The hidden part of a hidden Markov model, whose observable states is non-Markovian.

Given a Markov transition matrix and an invariant distribution on the states, we can impose a probability measure on the set of subshifts. For example, consider the Markov chain given on the left on the states , with invariant distribution . If we "forget" the distinction between , we project this space of subshifts on into another space of subshifts on , and this projection also projects the probability measure down to a probability measure on the subshifts on .

The curious thing is that the probability measure on the subshifts on is not created by a Markov chain on , not even multiple orders. Intuitively, this is because if one observes a long sequence of , then one would become increasingly sure that the , meaning that the observable part of the system can be affected by something infinitely in the past. [1] [2]

Conversely, there exists a space of subshifts on 6 symbols, projected to subshifts on 2 symbols, such that any Markov measure on the smaller subshift has a preimage measure that is not Markov of any order (Example 2.6 [2] ).

Definition

Let V be a finite set of n symbols (alphabet). Let X denote the set of all bi-infinite sequences of elements of V together with the shift operator T. We endow V with the discrete topology and X with the product topology. A symbolic flow or subshift is a closed T-invariant subset Y of X [3] and the associated language LY is the set of finite subsequences of Y. [4]

Now let A be an n × n adjacency matrix with entries in {0, 1}. Using these elements we construct a directed graph G = (V, E) with V the set of vertices and E the set of edges containing the directed edge xy in E if and only if Ax, y = 1. Let Y be the set of all infinite admissible sequences of edges, where by admissible it is meant that the sequence is a walk of the graph, and the sequence can be either one-sided or two-sided infinite. Let T be the left shift operator on such sequences; it plays the role of the time-evolution operator of the dynamical system. A subshift of finite type is then defined as a pair (Y, T) obtained in this way. If the sequence extends to infinity in only one direction, it is called a one-sided subshift of finite type, and if it is bilateral, it is called a two-sided subshift of finite type.

Formally, one may define the sequences of edges as

This is the space of all sequences of symbols such that the symbol p can be followed by the symbol q only if the (p, q)-th entry of the matrix A is 1. The space of all bi-infinite sequences is defined analogously:

The shift operator T maps a sequence in the one- or two-sided shift to another by shifting all symbols to the left, i.e.

Clearly this map is only invertible in the case of the two-sided shift.

A subshift of finite type is called transitive if G is strongly connected: there is a sequence of edges from any one vertex to any other vertex. It is precisely transitive subshifts of finite type which correspond to dynamical systems with orbits that are dense.

An important special case is the full n-shift: it has a graph with an edge that connects every vertex to every other vertex; that is, all of the entries of the adjacency matrix are 1. The full n-shift corresponds to the Bernoulli scheme without the measure.

Terminology

By convention, the term shift is understood to refer to the full n-shift. A subshift is then any subspace of the full shift that is shift-invariant (that is, a subspace that is invariant under the action of the shift operator), non-empty, and closed for the product topology defined below. Some subshifts can be characterized by a transition matrix, as above; such subshifts are then called subshifts of finite type. Often, subshifts of finite type are called simply shifts of finite type. Subshifts of finite type are also sometimes called topological Markov shifts.

Examples

Many chaotic dynamical systems are isomorphic to subshifts of finite type; examples include systems with transverse homoclinic connections, diffeomorphisms of closed manifolds with a positive metric entropy, the Prouhet–Thue–Morse system, the Chacon system (this is the first system shown to be weakly mixing but not strongly mixing), Sturmian systems and Toeplitz systems. [5]

Generalizations

A sofic system is an image of a subshift of finite type where different edges of the transition graph may be mapped to the same symbol. For example, if one only watches the output from a hidden Markov chain, then the output appears to be a sofic system. [1] It may be regarded as the set of labellings of paths through an automaton: a subshift of finite type then corresponds to an automaton which is deterministic. [6] Such systems correspond to regular languages.

Context-free systems are defined analogously, and are generated by phrase structure grammars.

A renewal system is defined to be the set of all infinite concatenations of some fixed finite collection of finite words.

Subshifts of finite type are identical to free (non-interacting) one-dimensional Potts models (n-letter generalizations of Ising models), with certain nearest-neighbor configurations excluded. Interacting Ising models are defined as subshifts together with a continuous function of the configuration space[ when defined as? ] (continuous with respect to the product topology, defined below); the partition function and Hamiltonian are explicitly expressible in terms of this function.[ clarification needed ]

Subshifts may be quantized in a certain way, leading to the idea of the quantum finite automata.

Topology

A subshift has a natural topology, derived from the product topology on where

and V is given the discrete topology. A basis for the topology of which induces the topology of the subshift, is the family of cylinder sets

The cylinder sets are clopen sets in Every open set in is a countable union of cylinder sets. Every open set in the subshift is the intersection of an open set of with the subshift. With respect to this topology, the shift T is a homeomorphism; that is, with respect to this topology, it is continuous with continuous inverse.

The space is homeomorphic to a Cantor set.

Metric

A variety of different metrics can be defined on a shift space. One can define a metric on a shift space by considering two points to be "close" if they have many initial symbols in common; this is the p-adic metric. In fact, both the one- and two-sided shift spaces are compact metric spaces.

Measure

A subshift of finite type may be endowed with any one of several different measures, thus leading to a measure-preserving dynamical system. A common object of study is the Markov measure, which is an extension of a Markov chain to the topology of the shift.

A Markov chain is a pair (P, π) consisting of the transition matrix, an n × n matrix P = (pij) for which all pij 0 and

for all i. The stationary probability vectorπ = (πi) has all πi 0, and has

A Markov chain, as defined above, is said to be compatible with the shift of finite type if pij = 0 whenever Aij = 0. The Markov measure of a cylinder set may then be defined by

The Kolmogorov–Sinai entropy with relation to the Markov measure is

Zeta function

The Artin–Mazur zeta function is defined as the formal power series

where Fix(T n) is the set of fixed points of the n-fold shift. [7] It has a product formula

where γ runs over the closed orbits. [7] For subshifts of finite type, the zeta function is a rational function of z: [8]

See also

Notes

  1. 1 2 Sofic Measures: Characterizations of Hidden Markov Chains by Linear Algebra, Formal Languages, and Symbolic Dynamics - Karl Petersen, Mathematics 210, Spring 2006, University of North Carolina at Chapel Hill
  2. 1 2 Boyle, Mike; Petersen, Karl (2010-01-13), Hidden Markov processes in the context of symbolic dynamics, arXiv: 0907.1858
  3. Xie (1996) p.21
  4. Xie (1996) p.22
  5. Matthew Nicol and Karl Petersen, (2009) "Ergodic Theory: Basic Examples and Constructions", Encyclopedia of Complexity and Systems Science, Springer https://doi.org/10.1007/978-0-387-30440-3_177
  6. Pytheas Fogg (2002) p.205
  7. 1 2 Brin & Stuck (2002) p.60
  8. Brin & Stuck (2002) p.61

Related Research Articles

In mathematics, more specifically in functional analysis, a Banach space is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vectors and is complete in the sense that a Cauchy sequence of vectors always converges to a well-defined limit that is within the space.

In mathematics, one can often define a direct product of objects already known, giving a new one. This induces a structure on the Cartesian product of the underlying sets from that of the contributing objects. More abstractly, one talks about the product in category theory, which formalizes these notions.

<span class="mw-page-title-main">Sequence</span> Finite or infinite ordered list of elements

In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members. The number of elements is called the length of the sequence. Unlike a set, the same elements can appear multiple times at different positions in a sequence, and unlike a set, the order does matter. Formally, a sequence can be defined as a function from natural numbers to the elements at each position. The notion of a sequence can be generalized to an indexed family, defined as a function from an arbitrary index set.

In mathematical analysis and in probability theory, a σ-algebra on a set X is a nonempty collection Σ of subsets of X closed under complement, countable unions, and countable intersections. The ordered pair is called a measurable space.

In mathematics, a topological space is, roughly speaking, a geometrical space in which closeness is defined but cannot necessarily be measured by a numeric distance. More specifically, a topological space is a set whose elements are called points, along with an additional structure called a topology, which can be defined as a set of neighbourhoods for each point that satisfy some axioms formalizing the concept of closeness. There are several equivalent definitions of a topology, the most commonly used of which is the definition through open sets, which is easier than the others to manipulate.

In mathematics, the Lp spaces are function spaces defined using a natural generalization of the p-norm for finite-dimensional vector spaces. They are sometimes called Lebesgue spaces, named after Henri Lebesgue, although according to the Bourbaki group they were first introduced by Frigyes Riesz.

In mathematics, a measure-preserving dynamical system is an object of study in the abstract formulation of dynamical systems, and ergodic theory in particular. Measure-preserving systems obey the Poincaré recurrence theorem, and are a special case of conservative systems. They provide the formal, mathematical basis for a broad range of physical systems, and, in particular, many systems from classical mechanics as well as systems in thermodynamic equilibrium.

In mathematics, specifically order theory, a well-quasi-ordering or wqo on a set is a quasi-ordering of for which every infinite sequence of elements from contains an increasing pair with

In statistical mechanics, the Potts model, a generalization of the Ising model, is a model of interacting spins on a crystalline lattice. By studying the Potts model, one may gain insight into the behaviour of ferromagnets and certain other phenomena of solid-state physics. The strength of the Potts model is not so much that it models these physical systems well; it is rather that the one-dimensional case is exactly solvable, and that it has a rich mathematical formulation that has been studied extensively.

<span class="mw-page-title-main">Dyadic transformation</span> Doubling map on the unit interval

The dyadic transformation is the mapping

A Dynkin system, named after Eugene Dynkin, is a collection of subsets of another universal set satisfying a set of axioms weaker than those of 𝜎-algebra. Dynkin systems are sometimes referred to as 𝜆-systems or d-system. These set families have applications in measure theory and probability.

In mathematics, the Bernoulli scheme or Bernoulli shift is a generalization of the Bernoulli process to more than two possible outcomes. Bernoulli schemes appear naturally in symbolic dynamics, and are thus important in the study of dynamical systems. Many important dynamical systems exhibit a repellor that is the product of the Cantor set and a smooth manifold, and the dynamics on the Cantor set are isomorphic to that of the Bernoulli shift. This is essentially the Markov partition. The term shift is in reference to the shift operator, which may be used to study Bernoulli schemes. The Ornstein isomorphism theorem shows that Bernoulli shifts are isomorphic when their entropy is equal.

In mathematics, the cylinder sets form a basis of the product topology on a product of sets; they are also a generating family of the cylinder σ-algebra.

In mathematics, nuclear spaces are topological vector spaces that can be viewed as a generalization of finite dimensional Euclidean spaces and share many of their desirable properties. Nuclear spaces are however quite different from Hilbert spaces, another generalization of finite dimensional Euclidean spaces. They were introduced by Alexander Grothendieck.

In mathematics, a π-system on a set is a collection of certain subsets of such that

In mathematics, ergodicity expresses the idea that a point of a moving system, either a dynamical system or a stochastic process, will eventually visit all parts of the space that the system moves in, in a uniform and random sense. This implies that the average behavior of the system can be deduced from the trajectory of a "typical" point. Equivalently, a sufficiently large collection of random samples from a process can represent the average statistical properties of the entire process. Ergodicity is a property of the system; it is a statement that the system cannot be reduced or factored into smaller components. Ergodic theory is the study of systems possessing ergodicity.

In mathematics, a topological space is said to be limit point compact or weakly countably compact if every infinite subset of has a limit point in This property generalizes a property of compact spaces. In a metric space, limit point compactness, compactness, and sequential compactness are all equivalent. For general topological spaces, however, these three notions of compactness are not equivalent.

In symbolic dynamics and related branches of mathematics, a shift space or subshift is a set of infinite words that represent the evolution of a discrete system. In fact, shift spaces and symbolic dynamical systems are often considered synonyms. The most widely studied shift spaces are the subshifts of finite type and the sofic shifts.

In mathematics, the concept of graph dynamical systems can be used to capture a wide range of processes taking place on graphs or networks. A major theme in the mathematical and computational analysis of GDSs is to relate their structural properties and the global dynamics that result.

In mathematics, a Markov odometer is a certain type of topological dynamical system. It plays a fundamental role in ergodic theory and especially in orbit theory of dynamical systems, since a theorem of H. Dye asserts that every ergodic nonsingular transformation is orbit-equivalent to a Markov odometer.

References

Further reading