Potts model

Last updated

In statistical mechanics, the Potts model, a generalization of the Ising model, is a model of interacting spins on a crystalline lattice. [1] By studying the Potts model, one may gain insight into the behaviour of ferromagnets and certain other phenomena of solid-state physics. The strength of the Potts model is not so much that it models these physical systems well; it is rather that the one-dimensional case is exactly solvable, and that it has a rich mathematical formulation that has been studied extensively.

Contents

The model is named after Renfrey Potts, who described the model near the end of his 1951 Ph.D. thesis. [2] The model was related to the "planar Potts" or "clock model", which was suggested to him by his advisor, Cyril Domb. The four-state Potts model is sometimes known as the Ashkin–Teller model, [3] after Julius Ashkin and Edward Teller, who considered an equivalent model in 1943.

The Potts model is related to, and generalized by, several other models, including the XY model, the Heisenberg model and the N-vector model. The infinite-range Potts model is known as the Kac model. When the spins are taken to interact in a non-Abelian manner, the model is related to the flux tube model, which is used to discuss confinement in quantum chromodynamics. Generalizations of the Potts model have also been used to model grain growth in metals, coarsening in foams, and statistical properties of proteins. [4] A further generalization of these methods by James Glazier and Francois Graner, known as the cellular Potts model, [5] has been used to simulate static and kinetic phenomena in foam and biological morphogenesis.

Definition

Vector Potts model

The Potts model consists of spins that are placed on a lattice; the lattice is usually taken to be a two-dimensional rectangular Euclidean lattice, but is often generalized to other dimensions and lattice structures.

Originally, Domb suggested that the spin takes one of possible values [ citation needed ], distributed uniformly about the circle, at angles

where and that the interaction Hamiltonian is given by

with the sum running over the nearest neighbor pairs over all lattice sites, and is a coupling constant, determining the interaction strength. This model is now known as the vector Potts model or the clock model. Potts provided the location in two dimensions of the phase transition for . In the limit , this becomes the XY model.

Standard Potts model

What is now known as the standard Potts model was suggested by Potts in the course of his study of the model above and is defined by a simpler Hamiltonian:

where is the Kronecker delta, which equals one whenever and zero otherwise.

The standard Potts model is equivalent to the Ising model and the 2-state vector Potts model, with . The standard Potts model is equivalent to the three-state vector Potts model, with .

Generalized Potts model

A generalization of the Potts model is often used in statistical inference and biophysics, particularly for modelling proteins through direct coupling analysis. [4] [6] This generalized Potts model consists of 'spins' which each may take on states: (with no particular ordering). The Hamiltonian is,

where is the energetic cost of spin being in state while spin is in state , and is the energetic cost of spin being in state . Note: . This model resembles the Sherrington-Kirkpatrick model in that couplings can be heterogeneous and non-local. There is no explicit lattice structure in this model.

Physical properties

Phase transitions

Despite its simplicity as a model of a physical system, the Potts model is useful as a model system for the study of phase transitions. For example, for the standard ferromagnetic Potts model in , a phase transition exists for all real values , [7] with the critical point at . The phase transition is continuous (second order) for [8] and discontinuous (first order) for . [9]

For the clock model, there is evidence that the corresponding phase transitions are infinite order BKT transitions, [10] and a continuous phase transition is observed when . [10] Further use is found through the model's relation to percolation problems and the Tutte and chromatic polynomials found in combinatorics. For integer values of , the model displays the phenomenon of 'interfacial adsorption' [11] with intriguing critical wetting properties when fixing opposite boundaries in two different states [ clarification needed ].

Relation with the random cluster model

The Potts model has a close relation to the Fortuin-Kasteleyn random cluster model, another model in statistical mechanics. Understanding this relationship has helped develop efficient Markov chain Monte Carlo methods for numerical exploration of the model at small , and led to the rigorous proof of the critical temperature of the model. [7]

At the level of the partition function , the relation amounts to transforming the sum over spin configurations into a sum over edge configurations i.e. sets of nearest neighbor pairs of the same color. The transformation is done using the identity with . [12] This leads to rewriting the partition function as

where the clusters are the connected components of the union of closed segments . This is proportional to the partition function of the random cluster model with the open edge probability . An advantage of the random cluster formulation is that can be an arbitrary complex number, rather than a natural integer.

Measure-theoretic description

The one dimensional Potts model may be expressed in terms of a subshift of finite type, and thus gains access to all of the mathematical techniques associated with this formalism. In particular, it can be solved exactly using the techniques of transfer operators. (However, Ernst Ising used combinatorial methods to solve the Ising model, which is the "ancestor" of the Potts model, in his 1924 PhD thesis). This section develops the mathematical formalism, based on measure theory, behind this solution.

While the example below is developed for the one-dimensional case, many of the arguments, and almost all of the notation, generalizes easily to any number of dimensions. Some of the formalism is also broad enough to handle related models, such as the XY model, the Heisenberg model and the N-vector model.

Topology of the space of states

Let Q = {1, ..., q} be a finite set of symbols, and let

be the set of all bi-infinite strings of values from the set Q. This set is called a full shift. For defining the Potts model, either this whole space, or a certain subset of it, a subshift of finite type, may be used. Shifts get this name because there exists a natural operator on this space, the shift operator τ : QZQZ, acting as

This set has a natural product topology; the base for this topology are the cylinder sets

that is, the set of all possible strings where k+1 spins match up exactly to a given, specific set of values ξ0, ..., ξk. Explicit representations for the cylinder sets can be gotten by noting that the string of values corresponds to a q-adic number, however the natural topology of the q-adic numbers is finer than the above product topology.

Interaction energy

The interaction between the spins is then given by a continuous function V : QZR on this topology. Any continuous function will do; for example

will be seen to describe the interaction between nearest neighbors. Of course, different functions give different interactions; so a function of s0, s1 and s2 will describe a next-nearest neighbor interaction. A function V gives interaction energy between a set of spins; it is not the Hamiltonian, but is used to build it. The argument to the function V is an element sQZ, that is, an infinite string of spins. In the above example, the function V just picked out two spins out of the infinite string: the values s0 and s1. In general, the function V may depend on some or all of the spins; currently, only those that depend on a finite number are exactly solvable.

Define the function Hn : QZR as

This function can be seen to consist of two parts: the self-energy of a configuration [s0, s1, ..., sn] of spins, plus the interaction energy of this set and all the other spins in the lattice. The n → ∞ limit of this function is the Hamiltonian of the system; for finite n, these are sometimes called the finite state Hamiltonians.

Partition function and measure

The corresponding finite-state partition function is given by

with C0 being the cylinder sets defined above. Here, β = 1/kT, where k is Boltzmann's constant, and T is the temperature. It is very common in mathematical treatments to set β = 1, as it is easily regained by rescaling the interaction energy. This partition function is written as a function of the interaction V to emphasize that it is only a function of the interaction, and not of any specific configuration of spins. The partition function, together with the Hamiltonian, are used to define a measure on the Borel σ-algebra in the following way: The measure of a cylinder set, i.e. an element of the base, is given by

One can then extend by countable additivity to the full σ-algebra. This measure is a probability measure; it gives the likelihood of a given configuration occurring in the configuration space QZ. By endowing the configuration space with a probability measure built from a Hamiltonian in this way, the configuration space turns into a canonical ensemble.

Most thermodynamic properties can be expressed directly in terms of the partition function. Thus, for example, the Helmholtz free energy is given by

Another important related quantity is the topological pressure, defined as

which will show up as the logarithm of the leading eigenvalue of the transfer operator of the solution.

Free field solution

The simplest model is the model where there is no interaction at all, and so V = c and Hn = c (with c constant and independent of any spin configuration). The partition function becomes

If all states are allowed, that is, the underlying set of states is given by a full shift, then the sum may be trivially evaluated as

If neighboring spins are only allowed in certain specific configurations, then the state space is given by a subshift of finite type. The partition function may then be written as

where card is the cardinality or count of a set, and Fix is the set of fixed points of the iterated shift function:

The q × q matrix A is the adjacency matrix specifying which neighboring spin values are allowed.

Interacting model

The simplest case of the interacting model is the Ising model, where the spin can only take on one of two values, sn ∈ {−1, 1} and only nearest neighbor spins interact. The interaction potential is given by

This potential can be captured in a 2 × 2 matrix with matrix elements

with the index σ, σ′ ∈ {−1, 1}. The partition function is then given by

The general solution for an arbitrary number of spins, and an arbitrary finite-range interaction, is given by the same general form. In this case, the precise expression for the matrix M is a bit more complex.

The goal of solving a model such as the Potts model is to give an exact closed-form expression for the partition function and an expression for the Gibbs states or equilibrium states in the limit of n → ∞, the thermodynamic limit.

Applications

Signal and image processing

The Potts model has applications in signal reconstruction. Assume that we are given noisy observation of a piecewise constant signal g in Rn. To recover g from the noisy observation vector f in Rn, one seeks a minimizer of the corresponding inverse problem, the Lp-Potts functional Pγ(u) which is defined by

The jump penalty forces piecewise constant solutions and the data term couples the minimizing candidate u to the data f. The parameter γ > 0 controls the tradeoff between regularity and data fidelity. There are fast algorithms for the exact minimization of the L1 and the L2-Potts functional. [13]

In image processing, the Potts functional is related to the segmentation problem. [14] However, in two dimensions the problem is NP-hard. [15]

See also

Related Research Articles

<span class="mw-page-title-main">Helmholtz free energy</span> Thermodynamic potential

In thermodynamics, the Helmholtz free energy is a thermodynamic potential that measures the useful work obtainable from a closed thermodynamic system at a constant temperature (isothermal). The change in the Helmholtz energy during a process is equal to the maximum amount of work that the system can perform in a thermodynamic process in which temperature is held constant. At constant temperature, the Helmholtz free energy is minimized at equilibrium.

<span class="mw-page-title-main">Lattice model (physics)</span>

In mathematical physics, a lattice model is a mathematical model of a physical system that is defined on a lattice, as opposed to a continuum, such as the continuum of space or spacetime. Lattice models originally occurred in the context of condensed matter physics, where the atoms of a crystal automatically form a lattice. Currently, lattice models are quite popular in theoretical physics, for many reasons. Some models are exactly solvable, and thus offer insight into physics beyond what can be learned from perturbation theory. Lattice models are also ideal for study by the methods of computational physics, as the discretization of any continuum model automatically turns it into a lattice model. The exact solution to many of these models includes the presence of solitons. Techniques for solving these include the inverse scattering transform and the method of Lax pairs, the Yang–Baxter equation and quantum groups. The solution of these models has given insights into the nature of phase transitions, magnetization and scaling behaviour, as well as insights into the nature of quantum field theory. Physical lattice models frequently occur as an approximation to a continuum theory, either to give an ultraviolet cutoff to the theory to prevent divergences or to perform numerical computations. An example of a continuum theory that is widely studied by lattice models is the QCD lattice model, a discretization of quantum chromodynamics. However, digital physics considers nature fundamentally discrete at the Planck scale, which imposes upper limit to the density of information, aka Holographic principle. More generally, lattice gauge theory and lattice field theory are areas of study. Lattice models are also used to simulate the structure and dynamics of polymers.

The Ising model, named after the physicists Ernst Ising and Wilhelm Lenz, is a mathematical model of ferromagnetism in statistical mechanics. The model consists of discrete variables that represent magnetic dipole moments of atomic "spins" that can be in one of two states. The spins are arranged in a graph, usually a lattice, allowing each spin to interact with its neighbors. Neighboring spins that agree have a lower energy than those that disagree; the system tends to the lowest energy but heat disturbs this tendency, thus creating the possibility of different structural phases. The model allows the identification of phase transitions as a simplified model of reality. The two-dimensional square-lattice Ising model is one of the simplest statistical models to show a phase transition.

In physics and probability theory, Mean-field theory (MFT) or Self-consistent field theory studies the behavior of high-dimensional random (stochastic) models by studying a simpler model that approximates the original by averaging over degrees of freedom. Such models consider many individual components that interact with each other.

In physics, the Hamilton–Jacobi equation, named after William Rowan Hamilton and Carl Gustav Jacob Jacobi, is an alternative formulation of classical mechanics, equivalent to other formulations such as Newton's laws of motion, Lagrangian mechanics and Hamiltonian mechanics.

In mathematics, the Gibbs measure, named after Josiah Willard Gibbs, is a probability measure frequently seen in many problems of probability theory and statistical mechanics. It is a generalization of the canonical ensemble to infinite systems. The canonical ensemble gives the probability of the system X being in state x as

In solid-state physics, the tight-binding model is an approach to the calculation of electronic band structure using an approximate set of wave functions based upon superposition of wave functions for isolated atoms located at each atomic site. The method is closely related to the LCAO method used in chemistry. Tight-binding models are applied to a wide variety of solids. The model gives good qualitative results in many cases and can be combined with other models that give better results where the tight-binding model fails. Though the tight-binding model is a one-electron model, the model also provides a basis for more advanced calculations like the calculation of surface states and application to various kinds of many-body problem and quasiparticle calculations.

The quantum Heisenberg model, developed by Werner Heisenberg, is a statistical mechanical model used in the study of critical points and phase transitions of magnetic systems, in which the spins of the magnetic systems are treated quantum mechanically. It is related to the prototypical Ising model, where at each site of a lattice, a spin represents a microscopic magnetic dipole to which the magnetic moment is either up or down. Except the coupling between magnetic dipole moments, there is also a multipolar version of Heisenberg model called the multipolar exchange interaction.

In statistical mechanics, the two-dimensional square lattice Ising model is a simple lattice model of interacting magnetic spins. The model is notable for having nontrivial interactions, yet having an analytical solution. The model was solved by Lars Onsager for the special case that the external magnetic field H = 0. An analytical solution for the general case for has yet to be found.

In statistical mechanics, the corner transfer matrix describes the effect of adding a quadrant to a lattice. Introduced by Rodney Baxter in 1968 as an extension of the Kramers-Wannier row-to-row transfer matrix, it provides a powerful method of studying lattice models. Calculations with corner transfer matrices led Baxter to the exact solution of the hard hexagon model in 1980.

The Swendsen–Wang algorithm is the first non-local or cluster algorithm for Monte Carlo simulation for large systems near criticality. It has been introduced by Robert Swendsen and Jian-Sheng Wang in 1987 at Carnegie Mellon.

A flavor of the k·p perturbation theory used for calculating the structure of multiple, degenerate electronic bands in bulk and quantum well semiconductors. The method is a generalization of the single band k·p theory.

Dynamical mean-field theory (DMFT) is a method to determine the electronic structure of strongly correlated materials. In such materials, the approximation of independent electrons, which is used in density functional theory and usual band structure calculations, breaks down. Dynamical mean-field theory, a non-perturbative treatment of local interactions between electrons, bridges the gap between the nearly free electron gas limit and the atomic limit of condensed-matter physics.

The model is a simplified statistical mechanical spin model. It is a generalization of the Ising model. Although it can be defined on an arbitrary graph, it is integrable only on one and two-dimensional lattices, in several special cases.

Heat transfer physics describes the kinetics of energy storage, transport, and energy transformation by principal energy carriers: phonons, electrons, fluid particles, and photons. Heat is energy stored in temperature-dependent motion of particles including electrons, atomic nuclei, individual atoms, and molecules. Heat is transferred to and from matter by the principal energy carriers. The state of energy stored within matter, or transported by the carriers, is described by a combination of classical and quantum statistical mechanics. The energy is different made (converted) among various carriers. The heat transfer processes are governed by the rates at which various related physical phenomena occur, such as the rate of particle collisions in classical mechanics. These various states and kinetics determine the heat transfer, i.e., the net rate of energy storage or transport. Governing these process from the atomic level to macroscale are the laws of thermodynamics, including conservation of energy.

In computational solid state physics, Continuous-time quantum Monte Carlo (CT-QMC) is a family of stochastic algorithms for solving the Anderson impurity model at finite temperature. These methods first expand the full partition function as a series of Feynman diagrams, employ Wick's theorem to group diagrams into determinants, and finally use Markov chain Monte Carlo to stochastically sum up the resulting series.

In statistical mechanics, probability theory, graph theory, etc. the random cluster model is a random graph that generalizes and unifies the Ising model, Potts model, and percolation model. It is used to study random combinatorial structures, electrical networks, etc. It is also referred to as the RC model or sometimes the FK representation after its founders Cees Fortuin and Piet Kasteleyn.

The transverse field Ising model is a quantum version of the classical Ising model. It features a lattice with nearest neighbour interactions determined by the alignment or anti-alignment of spin projections along the axis, as well as an external magnetic field perpendicular to the axis which creates an energetic bias for one x-axis spin direction over the other.

In statistical mechanics, Lee–Yang theory, sometimes also known as Yang–Lee theory, is a scientific theory which seeks to describe phase transitions in large physical systems in the thermodynamic limit based on the properties of small, finite-size systems. The theory revolves around the complex zeros of partition functions of finite-size systems and how these may reveal the existence of phase transitions in the thermodynamic limit.

The three-state Potts CFT, also known as the parafermion CFT, is a conformal field theory in two dimensions. It is a minimal model with central charge . It is considered to be the simplest minimal model with a non-diagonal partition function in Virasoro characters, as well as the simplest non-trivial CFT with the W-algebra as a symmetry.

References

  1. Wu, F. Y. (1982-01-01). "The Potts model". Reviews of Modern Physics. 54 (1): 235–268. Bibcode:1982RvMP...54..235W. doi:10.1103/RevModPhys.54.235.
  2. Potts, R. B. (January 1952). "Some generalized order-disorder transformations". Mathematical Proceedings of the Cambridge Philosophical Society. 48 (1): 106–109. Bibcode:1952PCPS...48..106P. doi:10.1017/S0305004100027419. ISSN   1469-8064. S2CID   122689941.
  3. Ashkin, J.; Teller, E. (1943-09-01). "Statistics of Two-Dimensional Lattices with Four Components". Physical Review. 64 (5–6): 178–184. Bibcode:1943PhRv...64..178A. doi:10.1103/PhysRev.64.178.
  4. 1 2 Shimagaki, Kai; Weigt, Martin (2019-09-19). "Selection of sequence motifs and generative Hopfield-Potts models for protein families". Physical Review E. 100 (3): 032128. arXiv: 1905.11848 . Bibcode:2019PhRvE.100c2128S. doi:10.1103/PhysRevE.100.032128. PMID   31639992. S2CID   167217593.
  5. Graner, François; Glazier, James A. (1992-09-28). "Simulation of biological cell sorting using a two-dimensional extended Potts model". Physical Review Letters. 69 (13): 2013–2016. Bibcode:1992PhRvL..69.2013G. doi:10.1103/PhysRevLett.69.2013. PMID   10046374.
  6. Mehta, Pankaj; Bukov, Marin; Wang, Ching-Hao; Day, Alexandre G. R.; Richardson, Clint; Fisher, Charles K.; Schwab, David J. (2019-05-30). "A high-bias, low-variance introduction to Machine Learning for physicists". Physics Reports. 810: 1–124. arXiv: 1803.08823 . Bibcode:2019PhR...810....1M. doi:10.1016/j.physrep.2019.03.001. ISSN   0370-1573. PMC   6688775 . PMID   31404441.
  7. 1 2 Beffara, Vincent; Duminil-Copin, Hugo (2012-08-01). "The self-dual point of the two-dimensional random-cluster model is critical for q ≥ 1". Probability Theory and Related Fields. 153 (3): 511–542. doi: 10.1007/s00440-011-0353-8 . ISSN   1432-2064. S2CID   55391558.
  8. Duminil-Copin, Hugo; Sidoravicius, Vladas; Tassion, Vincent (2017-01-01). "Continuity of the Phase Transition for Planar Random-Cluster and Potts Models with $${1 \le q \le 4}$$". Communications in Mathematical Physics. 349 (1): 47–107. arXiv: 1505.04159 . doi:10.1007/s00220-016-2759-8. ISSN   1432-0916. S2CID   119153736.
  9. Duminil-Copin, Hugo; Gagnebin, Maxime; Harel, Matan; Manolescu, Ioan; Tassion, Vincent (2017-09-05). "Discontinuity of the phase transition for the planar random-cluster and Potts models with $q>4$". arXiv: 1611.09877 [math.PR].
  10. 1 2 Li, Zi-Qian; Yang, Li-Ping; Xie, Z. Y.; Tu, Hong-Hao; Liao, Hai-Jun; Xiang, T. (2020). "Critical properties of the two-dimensional $q$-state clock model". Physical Review E. 101 (6): 060105. arXiv: 1912.11416v3 . Bibcode:2020PhRvE.101f0105L. doi:10.1103/PhysRevE.101.060105. PMID   32688489. S2CID   209460838.
  11. Selke, Walter; Huse, David A. (1983-06-01). "Interfacial adsorption in planar potts models". Zeitschrift für Physik B: Condensed Matter. 50 (2): 113–116. Bibcode:1983ZPhyB..50..113S. doi:10.1007/BF01304093. ISSN   1431-584X. S2CID   121502987.
  12. Sokal, Alan D. (2005). "The multivariate Tutte polynomial (alias Potts model) for graphs and matroids". Surveys in Combinatorics 2005. pp. 173–226. arXiv: math/0503607 . doi:10.1017/CBO9780511734885.009. ISBN   9780521615235. S2CID   17904893.
  13. Friedrich, F.; Kempe, A.; Liebscher, V.; Winkler, G. (2008). "Complexity Penalized M-Estimation: Fast Computation". Journal of Computational and Graphical Statistics. 17 (1): 201–224. doi:10.1198/106186008X285591. ISSN   1061-8600. JSTOR   27594299. S2CID   117951377.
  14. Krähenbühl, Philipp; Koltun, Vladlen (2011). "Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials". Advances in Neural Information Processing Systems. Curran Associates, Inc. 24. arXiv: 1210.5644 .
  15. Boykov, Y.; Veksler, O.; Zabih, R. (November 2001). "Fast approximate energy minimization via graph cuts". IEEE Transactions on Pattern Analysis and Machine Intelligence. 23 (11): 1222–1239. doi:10.1109/34.969114. ISSN   1939-3539.