# Quantum statistical mechanics

Last updated

Quantum statistical mechanics is statistical mechanics applied to quantum mechanical systems. In quantum mechanics a statistical ensemble (probability distribution over possible quantum states) is described by a density operator S, which is a non-negative, self-adjoint, trace-class operator of trace 1 on the Hilbert space H describing the quantum system. This can be shown under various mathematical formalisms for quantum mechanics. One such formalism is provided by quantum logic.

## Expectation

From classical probability theory, we know that the expectation of a random variable X is defined by its distribution DX by

${\displaystyle \mathbb {E} (X)=\int _{\mathbb {R} }\lambda \,d\,\operatorname {D} _{X}(\lambda )}$

assuming, of course, that the random variable is integrable or that the random variable is non-negative. Similarly, let A be an observable of a quantum mechanical system. A is given by a densely defined self-adjoint operator on H. The spectral measure of A defined by

${\displaystyle \operatorname {E} _{A}(U)=\int _{U}\lambda d\operatorname {E} (\lambda ),}$

uniquely determines A and conversely, is uniquely determined by A. EA is a boolean homomorphism from the Borel subsets of R into the lattice Q of self-adjoint projections of H. In analogy with probability theory, given a state S, we introduce the distribution of A under S which is the probability measure defined on the Borel subsets of R by

${\displaystyle \operatorname {D} _{A}(U)=\operatorname {Tr} (\operatorname {E} _{A}(U)S).}$

Similarly, the expected value of A is defined in terms of the probability distribution DA by

${\displaystyle \mathbb {E} (A)=\int _{\mathbb {R} }\lambda \,d\,\operatorname {D} _{A}(\lambda ).}$

Note that this expectation is relative to the mixed state S which is used in the definition of DA.

Remark. For technical reasons, one needs to consider separately the positive and negative parts of A defined by the Borel functional calculus for unbounded operators.

One can easily show:

${\displaystyle \mathbb {E} (A)=\operatorname {Tr} (AS)=\operatorname {Tr} (SA).}$

Note that if S is a pure state corresponding to the vector ψ, then:

${\displaystyle \mathbb {E} (A)=\langle \psi |A|\psi \rangle .}$

The trace of an operator A is written as follows:

${\displaystyle \operatorname {Tr} (A)=\sum _{m}\langle m|A|m\rangle .}$

## Von Neumann entropy

Of particular significance for describing randomness of a state is the von Neumann entropy of Sformally defined by

${\displaystyle \operatorname {H} (S)=-\operatorname {Tr} (S\log _{2}S)}$.

Actually, the operator S log2S is not necessarily trace-class. However, if S is a non-negative self-adjoint operator not of trace class we define Tr(S) = +. Also note that any density operator S can be diagonalized, that it can be represented in some orthonormal basis by a (possibly infinite) matrix of the form

${\displaystyle {\begin{bmatrix}\lambda _{1}&0&\cdots &0&\cdots \\0&\lambda _{2}&\cdots &0&\cdots \\\vdots &\vdots &\ddots &\\0&0&&\lambda _{n}&\\\vdots &\vdots &&&\ddots \end{bmatrix}}}$

and we define

${\displaystyle \operatorname {H} (S)=-\sum _{i}\lambda _{i}\log _{2}\lambda _{i}.}$

The convention is that ${\displaystyle \;0\log _{2}0=0}$, since an event with probability zero should not contribute to the entropy. This value is an extended real number (that is in [0, ]) and this is clearly a unitary invariant of S.

Remark. It is indeed possible that H(S) = + for some density operator S. In fact T be the diagonal matrix

${\displaystyle T={\begin{bmatrix}{\frac {1}{2(\log _{2}2)^{2}}}&0&\cdots &0&\cdots \\0&{\frac {1}{3(\log _{2}3)^{2}}}&\cdots &0&\cdots \\\vdots &\vdots &\ddots &\\0&0&&{\frac {1}{n(\log _{2}n)^{2}}}&\\\vdots &\vdots &&&\ddots \end{bmatrix}}}$

T is non-negative trace class and one can show T log2T is not trace-class.

Theorem. Entropy is a unitary invariant.

In analogy with classical entropy (notice the similarity in the definitions), H(S) measures the amount of randomness in the state S. The more dispersed the eigenvalues are, the larger the system entropy. For a system in which the space H is finite-dimensional, entropy is maximized for the states S which in diagonal form have the representation

${\displaystyle {\begin{bmatrix}{\frac {1}{n}}&0&\cdots &0\\0&{\frac {1}{n}}&\dots &0\\\vdots &\vdots &\ddots &\vdots \\0&0&\cdots &{\frac {1}{n}}\end{bmatrix}}}$

For such an S, H(S) = log2n. The state S is called the maximally mixed state.

Recall that a pure state is one of the form

${\displaystyle S=|\psi \rangle \langle \psi |,}$

for ψ a vector of norm 1.

Theorem. H(S) = 0 if and only if S is a pure state.

For S is a pure state if and only if its diagonal form has exactly one non-zero entry which is a 1.

Entropy can be used as a measure of quantum entanglement.

## Gibbs canonical ensemble

Consider an ensemble of systems described by a Hamiltonian H with average energy E. If H has pure-point spectrum and the eigenvalues ${\displaystyle E_{n}}$ of H go to + sufficiently fast, er H will be a non-negative trace-class operator for every positive r.

The Gibbs canonical ensemble is described by the state

${\displaystyle S={\frac {\mathrm {e} ^{-\beta H}}{\operatorname {Tr} (\mathrm {e} ^{-\beta H})}}.}$

Where β is such that the ensemble average of energy satisfies

${\displaystyle \operatorname {Tr} (SH)=E}$

and

${\displaystyle \operatorname {Tr} (\mathrm {e} ^{-\beta H})=\sum _{n}\mathrm {e} ^{-\beta E_{n}}=Z(\beta )}$

This is called the partition function; it is the quantum mechanical version of the canonical partition function of classical statistical mechanics. The probability that a system chosen at random from the ensemble will be in a state corresponding to energy eigenvalue ${\displaystyle E_{m}}$ is

${\displaystyle {\mathcal {P}}(E_{m})={\frac {\mathrm {e} ^{-\beta E_{m}}}{\sum _{n}\mathrm {e} ^{-\beta E_{n}}}}.}$

Under certain conditions, the Gibbs canonical ensemble maximizes the von Neumann entropy of the state subject to the energy conservation requirement.[ clarification needed ]

## Grand canonical ensemble

For open systems where the energy and numbers of particles may fluctuate, the system is described by the grand canonical ensemble, described by the density matrix

${\displaystyle \rho ={\frac {\mathrm {e} ^{\beta (\sum _{i}\mu _{i}N_{i}-H)}}{\operatorname {Tr} \left(\mathrm {e} ^{\beta (\sum _{i}\mu _{i}N_{i}-H)}\right)}}.}$

where the N1, N2, ... are the particle number operators for the different species of particles that are exchanged with the reservoir. Note that this is a density matrix including many more states (of varying N) compared to the canonical ensemble.

The grand partition function is

${\displaystyle {\mathcal {Z}}(\beta ,\mu _{1},\mu _{2},\cdots )=\operatorname {Tr} (\mathrm {e} ^{\beta (\sum _{i}\mu _{i}N_{i}-H)})}$

## Related Research Articles

In probability theory and statistics, the exponential distribution is the probability distribution of the time between events in a Poisson point process, i.e., a process in which events occur continuously and independently at a constant average rate. It is a particular case of the gamma distribution. It is the continuous analogue of the geometric distribution, and it has the key property of being memoryless. In addition to being used for the analysis of Poisson point processes it is found in various other contexts.

In linear algebra, an n-by-n square matrix A is called invertible if there exists an n-by-n square matrix B such that

In mathematics, the adjoint representation of a Lie group G is a way of representing the elements of the group as linear transformations of the group's Lie algebra, considered as a vector space. For example, if G is , the Lie group of real n-by-n invertible matrices, then the adjoint representation is the group homomorphism that sends an invertible n-by-n matrix to an endomorphism of the vector space of all linear transformations of defined by: .

In physics, a partition function describes the statistical properties of a system in thermodynamic equilibrium. Partition functions are functions of the thermodynamic state variables, such as the temperature and volume. Most of the aggregate thermodynamic variables of the system, such as the total energy, free energy, entropy, and pressure, can be expressed in terms of the partition function or its derivatives. The partition function is dimensionless, it is a pure number.

In statistics, the Wishart distribution is a generalization to multiple dimensions of the gamma distribution. It is named in honor of John Wishart, who first formulated the distribution in 1928.

In mathematics, the spectral radius of a square matrix or a bounded linear operator is the largest absolute value of its eigenvalues. It is sometimes denoted by ρ(·).

In mathematics, the determinant of a skew-symmetric matrix can always be written as the square of a polynomial in the matrix entries, a polynomial with integer coefficients that only depend on the size of the matrix. The value of this polynomial, when applied to the coefficients of a skew-symmetric matrix, is called the Pfaffian of that matrix. The term Pfaffian was introduced by Cayley (1852) who indirectly named them after Johann Friedrich Pfaff. The Pfaffian is nonvanishing only for 2n × 2n skew-symmetric matrices, in which case it is a polynomial of degree n.

The Gell-Mann matrices, developed by Murray Gell-Mann, are a set of eight linearly independent 3×3 traceless Hermitian matrices used in the study of the strong interaction in particle physics. They span the Lie algebra of the SU(3) group in the defining representation.

In quantum mechanics, a quantum operation is a mathematical formalism used to describe a broad class of transformations that a quantum mechanical system can undergo. This was first discussed as a general stochastic transformation for a density matrix by George Sudarshan. The quantum operation formalism describes not only unitary time evolution or symmetry transformations of isolated systems, but also the effects of measurement and transient interactions with an environment. In the context of quantum computation, a quantum operation is called a quantum channel.

In probability and statistics, the Dirichlet distribution, often denoted , is a family of continuous multivariate probability distributions parameterized by a vector of positive reals. It is a multivariate generalization of the beta distribution, hence its alternative name of multivariate beta distribution (MBD). Dirichlet distributions are commonly used as prior distributions in Bayesian statistics, and in fact the Dirichlet distribution is the conjugate prior of the categorical distribution and multinomial distribution.

In quantum mechanics, the Hellmann–Feynman theorem relates the derivative of the total energy with respect to a parameter, to the expectation value of the derivative of the Hamiltonian with respect to that same parameter. According to the theorem, once the spatial distribution of the electrons has been determined by solving the Schrödinger equation, all the forces in the system can be calculated using classical electrostatics.

In mathematics, a logarithm of a matrix is another matrix such that the matrix exponential of the latter matrix equals the original matrix. It is thus a generalization of the scalar logarithm and in some sense an inverse function of the matrix exponential. Not all matrices have a logarithm and those matrices that do have a logarithm may have more than one logarithm. The study of logarithms of matrices leads to Lie theory since when a matrix has a logarithm then it is in a Lie group and the logarithm is the corresponding element of the vector space of the Lie algebra.

The time-evolving block decimation (TEBD) algorithm is a numerical scheme used to simulate one-dimensional quantum many-body systems, characterized by at most nearest-neighbour interactions. It is dubbed Time-evolving Block Decimation because it dynamically identifies the relevant low-dimensional Hilbert subspaces of an exponentially larger original Hilbert space. The algorithm, based on the Matrix Product States formalism, is highly efficient when the amount of entanglement in the system is limited, a requirement fulfilled by a large class of quantum many-body systems in one dimension.

In quantum information theory, quantum relative entropy is a measure of distinguishability between two quantum states. It is the quantum mechanical analog of relative entropy.

In statistics, Bayesian multivariate linear regression is a Bayesian approach to multivariate linear regression, i.e. linear regression where the predicted outcome is a vector of correlated random variables rather than a single scalar random variable. A more general treatment of this approach can be found in the article MMSE estimator.

In mathematics, a matrix function is a function which maps a matrix to another matrix.

In quantum mechanics, and especially quantum information theory, the purity of a normalized quantum state is a scalar defined as

Given a Hilbert space with a tensor product structure a product numerical range is defined as a numerical range with respect to the subset of product vectors. In some situations, especially in the context of quantum mechanics product numerical range is known as local numerical range

In mathematics, a Bunce–Deddens algebra, named after John W. Bunce and James A. Deddens, is a certain type of direct limit of matrix algebras over the continuous functions on the circle. They are therefore examples of simple unital AT algebras. In the inductive system defining these algebras, the connecting maps between each stage are given by embeddings between families of shift operators with periodic weights.

In mathematics, there are many kinds of inequalities involving matrices and linear operators on Hilbert spaces. This article covers some important operator inequalities connected with traces of matrices.

## References

• J. von Neumann, Mathematical Foundations of Quantum Mechanics, Princeton University Press, 1955.
• F. Reif, Statistical and Thermal Physics, McGraw-Hill, 1965.