Bennett acceptance ratio

The Bennett acceptance ratio method (BAR) is an algorithm for estimating the difference in free energy between two systems (usually systems simulated on a computer). It was suggested by Charles H. Bennett in 1976.[1]

Preliminaries

Take a system in a certain super (i.e. Gibbs) state. By performing a Metropolis Monte Carlo walk, it is possible to sample the landscape of states that the system moves between, using the equation

$$p(\mathrm{State}_x \rightarrow \mathrm{State}_y) = \min\left(e^{-\beta\,\Delta U},\, 1\right) = M(\beta\,\Delta U)$$

where $\Delta U = U(\mathrm{State}_y) - U(\mathrm{State}_x)$ is the difference in potential energy, $\beta = 1/kT$ (T is the temperature in kelvins, while k is the Boltzmann constant), and $M(x) = \min(e^{-x}, 1)$ is the Metropolis function. The resulting states are then sampled according to the Boltzmann distribution of the super state at temperature T. Alternatively, if the system is dynamically simulated in the canonical ensemble (also called the NVT ensemble), the resulting states along the simulated trajectory are likewise distributed. Averaging along the trajectory (in either formulation) is denoted by angle brackets $\langle \cdots \rangle$.
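For concreteness, here is a minimal sketch (not part of the original article) of such a Metropolis walk for a one-dimensional configuration with an arbitrary potential function `U`; the step size, number of steps, and example potential are illustrative choices only.

```python
import numpy as np

rng = np.random.default_rng(0)

def metropolis_walk(U, beta, n_steps=50_000, step=0.5, x0=0.0):
    """Sample configurations x with Boltzmann weight exp(-beta * U(x)) by a Metropolis walk."""
    x = x0
    samples = []
    for _ in range(n_steps):
        x_trial = x + rng.uniform(-step, step)        # propose a trial configuration
        dU = U(x_trial) - U(x)                        # potential energy difference
        # accept with probability M(beta*dU) = min(1, exp(-beta*dU))
        if dU <= 0.0 or rng.random() < np.exp(-beta * dU):
            x = x_trial
        samples.append(x)
    return np.array(samples)

# example: a harmonic potential U(x) = x**2 at beta = 1
# samples = metropolis_walk(lambda x: x**2, beta=1.0)
```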

Suppose that two super states of interest, A and B, are given. We assume that they have a common configuration space, i.e., they share all of their micro states, but the energies associated to these (and hence the probabilities) differ because of a change in some parameter (such as the strength of a certain interaction). The basic question to be addressed is, then, how can the Helmholtz free energy change $\Delta F = F_B - F_A$ on moving between the two super states be calculated from sampling in both ensembles? The kinetic energy part of the free energy is equal between the states, so it can be ignored. (For the NpT ensemble, the analogous quantity is the Gibbs free energy.)

The general case

Bennett shows that for every function $f$ satisfying the condition $f(x)/f(-x) = e^{-x}$ (which is essentially the detailed balance condition), and for every energy offset $C$, one has the exact relationship

$$\Delta F = F_B - F_A = kT \ln \frac{\left\langle f\!\left(\beta(U_A - U_B + C)\right)\right\rangle_B}{\left\langle f\!\left(\beta(U_B - U_A - C)\right)\right\rangle_A} + C$$

where $U_A$ and $U_B$ are the potential energies of the same configurations, calculated using potential function A (when the system is in super state A) and potential function B (when the system is in super state B), respectively.
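As an illustration (a sketch, not code from Bennett's paper), the relationship can be turned into an estimator once the cross energy differences have been recorded. Here `du_A` is assumed to hold $U_B - U_A$ evaluated on configurations sampled in super state A, and `du_B` the same quantity evaluated on configurations sampled in super state B; both names are conventions of this sketch.

```python
import numpy as np

def bar_general(du_A, du_B, f, beta, C=0.0):
    """Exact Bennett relation for any f with f(x)/f(-x) = exp(-x) and any offset C.

    du_A : U_B - U_A evaluated on configurations sampled in super state A
    du_B : U_B - U_A evaluated on configurations sampled in super state B
    Returns an estimate of Delta F = F_B - F_A.
    """
    kT = 1.0 / beta
    numerator = np.mean(f(beta * (C - np.asarray(du_B))))    # <f(beta*(U_A - U_B + C))>_B
    denominator = np.mean(f(beta * (np.asarray(du_A) - C)))  # <f(beta*(U_B - U_A - C))>_A
    return kT * np.log(numerator / denominator) + C
```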

The basic case

Substituting for $f$ the Metropolis function defined above (which satisfies the detailed balance condition), and setting $C$ to zero, gives

$$\Delta F = F_B - F_A = kT \ln \frac{\left\langle M\!\left(\beta(U_A - U_B)\right)\right\rangle_B}{\left\langle M\!\left(\beta(U_B - U_A)\right)\right\rangle_A}$$

The advantage of this formulation (apart from its simplicity) is that it can be computed without performing two simulations, one in each specific ensemble. Indeed, it is possible to define an extra kind of "potential switching" Metropolis trial move (taken every fixed number of steps), such that a single sampling from the "mixed" ensemble suffices for the computation.
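Continuing the sketch above, the basic case corresponds to passing the Metropolis function and the default offset $C = 0$:

```python
import numpy as np

def metropolis_function(x):
    return np.minimum(1.0, np.exp(-x))   # M(x) = min(e^-x, 1), satisfies the detailed balance condition

# delta_F = bar_general(du_A, du_B, metropolis_function, beta)   # C defaults to zero
```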

The most efficient case

Bennett explores which specific expression for ΔF is the most efficient, in the sense of yielding the smallest standard error for a given simulation time. He shows that the optimal choice is to take

  1. $f(x) = \dfrac{1}{1 + e^{x}}$, which is essentially the Fermi–Dirac distribution (and indeed satisfies the detailed balance condition).
  2. $C = \Delta F$. This value, of course, is not known (it is exactly what one is trying to compute), but it can be approximately chosen in a self-consistent manner, as sketched below.
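A hedged sketch of that self-consistent choice, reusing the `bar_general` function from the sketch above and assuming roughly equal sample sizes from the two ensembles (see assumption 3 below); the tolerance and initial guess are arbitrary.

```python
import numpy as np

def fermi_function(x):
    return 1.0 / (1.0 + np.exp(x))           # also satisfies the detailed balance condition

def bar_optimal(du_A, du_B, beta, tol=1e-8, max_iter=100):
    """Iterate the offset C until the estimated Delta F equals C (Bennett's optimal choice)."""
    C = 0.5 * (np.mean(du_A) + np.mean(du_B))   # crude initial guess for Delta F
    for _ in range(max_iter):
        dF = bar_general(du_A, du_B, fermi_function, beta, C)
        if abs(dF - C) < tol:
            break
        C = dF
    return dF
```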

Some assumptions needed for the efficiency are the following:

  1. The densities of the two super states (in their common configuration space) should have a large overlap. Otherwise, a chain of super states between A and B may be needed, such that the overlap of each two consecutive super states is adequate.
  2. The sample size should be large. In particular, as successive states are correlated, the simulation time should be much larger than the correlation time.
  3. The cost of simulating both ensembles should be approximately equal, in which case the system is sampled roughly equally in both super states. Otherwise, the optimal expression for C is modified, and the sampling should devote equal amounts of simulation time (rather than equal numbers of time steps) to the two ensembles.

Multistate Bennett acceptance ratio

The multistate Bennett acceptance ratio (MBAR) is a generalization of the Bennett acceptance ratio that calculates the (relative) free energies of several super states simultaneously. It essentially reduces to the BAR method when only two super states are involved.
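A rough sketch of the MBAR self-consistent equations (an illustration, not the reference implementation): given the reduced potential $u_k(x_n) = \beta U_k(x_n)$ of every pooled sample evaluated in every super state, the dimensionless free energies $f_k$ are iterated to self-consistency. The array names and tolerances are assumptions of this sketch.

```python
import numpy as np
from scipy.special import logsumexp

def mbar_free_energies(u_kn, N_k, tol=1e-10, max_iter=1000):
    """Self-consistent MBAR estimate of dimensionless free energies f_k (defined up to a constant).

    u_kn : (K, N) array, reduced potential beta*U_k of every pooled sample n in every super state k
    N_k  : (K,) array, number of samples drawn from each super state (sum of N_k equals N)
    """
    N_k = np.asarray(N_k, dtype=float)
    K, N = u_kn.shape
    f_k = np.zeros(K)
    for _ in range(max_iter):
        # log of the mixture denominator sum_k N_k * exp(f_k - u_k(x_n)) for each sample n
        log_denom = logsumexp(np.log(N_k)[:, None] + f_k[:, None] - u_kn, axis=0)
        f_new = -logsumexp(-u_kn - log_denom[None, :], axis=1)
        f_new -= f_new[0]                      # fix the arbitrary additive constant
        if np.max(np.abs(f_new - f_k)) < tol:
            return f_new
        f_k = f_new
    return f_k
```

With reduced potentials, the relative free energies then follow as $\Delta F_{ij} = (f_j - f_i)/\beta$.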

Relation to other methods

The perturbation theory method

This method, also called free energy perturbation (FEP), involves sampling from state A only. It requires that all the high probability configurations of super state B are contained in the high probability configurations of super state A, which is a much more stringent requirement than the overlap condition stated above.

The exact (infinite order) result

$$\Delta F = kT \ln \left\langle e^{-\beta(U_A - U_B)} \right\rangle_B$$

or

$$\Delta F = -kT \ln \left\langle e^{-\beta(U_B - U_A)} \right\rangle_A$$

This exact result can be obtained from the general BAR method, using (for example) the Metropolis function, in the limit $C \to \infty$. Indeed, in that case, the denominator of the general case expression above tends to 1, while the numerator tends to $e^{-\beta C} \left\langle e^{-\beta(U_A - U_B)} \right\rangle_B$, so that the offset $C$ cancels and the first expression is recovered. A direct derivation from the definitions is more straightforward, though.
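A brief sketch of the corresponding estimator (an illustration; `du_A` again denotes $U_B - U_A$ on configurations sampled in A, as assumed above), using a log-sum-exp to keep the exponential average numerically stable:

```python
import numpy as np
from scipy.special import logsumexp

def fep_forward(du_A, beta):
    """Free energy perturbation sampling only in A: Delta F = -kT * ln < exp(-beta*(U_B - U_A)) >_A."""
    du_A = np.asarray(du_A)
    return -(1.0 / beta) * (logsumexp(-beta * du_A) - np.log(len(du_A)))
```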

The second order (approximate) result

Assuming that the energy differences $\beta(U_B - U_A)$ are small, and Taylor expanding the second exact perturbation theory expression to second order, one gets the approximation

$$\Delta F \approx \left\langle U_B - U_A \right\rangle_A - \frac{\beta}{2} \left( \left\langle (U_B - U_A)^2 \right\rangle_A - \left\langle U_B - U_A \right\rangle_A^2 \right)$$

Note that the first term is the expected value of the energy difference, while the second is essentially its variance.
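In code form (again only a sketch using the assumed `du_A` array), the approximation is just a sample mean minus $\beta/2$ times a sample variance:

```python
import numpy as np

def fep_second_order(du_A, beta):
    """Second order (Gaussian) approximation: <U_B - U_A>_A - (beta/2) * Var_A(U_B - U_A)."""
    du_A = np.asarray(du_A)
    return du_A.mean() - 0.5 * beta * du_A.var()
```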

The first order inequalities

Using the concavity of the logarithm appearing in the exact perturbation analysis result, together with Jensen's inequality, gives an inequality at the linear (first order) level; combined with the analogous result for the B ensemble, one gets the following version of the Gibbs–Bogoliubov inequality:

$$\left\langle U_B - U_A \right\rangle_B \le \Delta F \le \left\langle U_B - U_A \right\rangle_A$$

Note that the inequality agrees with the negative sign of the coefficient of the (positive) variance term in the second order result.
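These bounds can be evaluated directly from the same stored energy differences (a sketch, with the same assumed arrays as above):

```python
import numpy as np

def gibbs_bogoliubov_bounds(du_A, du_B):
    """Return (lower, upper) bounds on Delta F:  <U_B - U_A>_B  <=  Delta F  <=  <U_B - U_A>_A."""
    return float(np.mean(du_B)), float(np.mean(du_A))
```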

The thermodynamic integration method

Writing the potential energy as depending on a continuous parameter, $U = U(\lambda)$, with $U(\lambda = 0) = U_A$ and $U(\lambda = 1) = U_B$, one has the exact result

$$\frac{\partial F}{\partial \lambda} = \left\langle \frac{\partial U}{\partial \lambda} \right\rangle_\lambda$$

This can either be verified directly from the definitions or seen from the limit of the above Gibbs–Bogoliubov inequalities when the two super states become infinitesimally close. We can therefore write

$$\Delta F = F_B - F_A = \int_0^1 \left\langle \frac{\partial U}{\partial \lambda} \right\rangle_\lambda \, d\lambda$$

which is the thermodynamic integration (or TI) result. It can be approximated by dividing the range between states A and B into many values of $\lambda$ at which the expectation value is estimated, and performing numerical integration.
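A minimal sketch of that numerical integration (an illustration; the $\lambda$ grid and the per-window averages $\langle \partial U/\partial \lambda \rangle_\lambda$ are assumed to have been estimated beforehand):

```python
import numpy as np

def thermodynamic_integration(lambdas, dU_dlambda_means):
    """Trapezoidal integration of <dU/dlambda>_lambda over a lambda path from 0 (state A) to 1 (state B)."""
    lam = np.asarray(lambdas, dtype=float)
    g = np.asarray(dU_dlambda_means, dtype=float)
    return float(np.sum(0.5 * (g[1:] + g[:-1]) * np.diff(lam)))
```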

Implementation

The Bennett acceptance ratio method is implemented in modern molecular dynamics packages such as GROMACS. Python-based implementations of MBAR and BAR are also available for download.


References

  1. Charles H. Bennett (1976). "Efficient estimation of free energy differences from Monte Carlo data". Journal of Computational Physics 22: 245–268.