The Gaussian network model (GNM) is a representation of a biological macromolecule as an elastic mass-and-spring network to study, understand, and characterize the mechanical aspects of its long-time large-scale dynamics. The model has a wide range of applications from small proteins such as enzymes composed of a single domain, to large macromolecular assemblies such as a ribosome or a viral capsid. Protein domain dynamics plays key roles in a multitude of molecular recognition and cell signalling processes. Protein domains, connected by intrinsically disordered flexible linker domains, induce long-range allostery via protein domain dynamics. The resultant dynamic modes cannot be generally predicted from static structures of either the entire protein or individual domains.
The Gaussian network model is a minimalist, coarse-grained approach to study biological molecules. In the model, proteins are represented by nodes corresponding to α-carbons of the amino acid residues. Similarly, DNA and RNA structures are represented with one to three nodes for each nucleotide. The model uses the harmonic approximation to model interactions. This coarse-grained representation makes the calculations computationally inexpensive.
At the molecular level, many biological phenomena, such as catalytic activity of an enzyme, occur within the range of nano- to millisecond timescales. All atom simulation techniques, such as molecular dynamics simulations, rarely reach microsecond trajectory length, depending on the size of the system and accessible computational resources. Normal mode analysis in the context of GNM, or elastic network (EN) models in general, provides insights on the longer-scale functional dynamic behaviors of macromolecules. Here, the model captures native state functional motions of a biomolecule at the cost of atomic detail. The inference obtained from this model is complementary to atomic detail simulation techniques.
Another model for protein dynamics based on elastic mass-and-spring networks is the Anisotropic Network Model.
The Gaussian network model was proposed by Bahar, Atilgan, Haliloglu and Erman in 1997. [1] [2] The GNM is often analyzed using normal mode analysis, which offers an analytical formulation and unique solution for each structure. The GNM normal mode analysis differs from other normal mode analyses in that it is exclusively based on inter-residue contact topology, influenced by the theory of elasticity of Flory [3] and the Rouse model [4] and does not take the three-dimensional directionality of motions into account.
Figure 2 shows a schematic view of elastic network studied in GNM. Metal beads represent the nodes in this Gaussian network (residues of a protein) and springs represent the connections between the nodes (covalent and non-covalent interactions between residues). For nodes i and j, equilibrium position vectors, R0i and R0j, equilibrium distance vector, R0ij, instantaneous fluctuation vectors, ΔRi and ΔRj, and instantaneous distance vector, Rij, are shown in Figure 2. Instantaneous position vectors of these nodes are defined by Ri and Rj. The difference between equilibrium position vector and instantaneous position vector of residue i gives the instantaneous fluctuation vector, ΔRi = Ri - R0i. Hence, the instantaneous fluctuation vector between nodes i and j is expressed as ΔRij = ΔRj - ΔRi = Rij - R0ij.
The potential energy of the network in terms of ΔRi is
where γ is a force constant uniform for all springs and Γij is the ijth element of the Kirchhoff (or connectivity) matrix of inter-residue contacts, Γ, defined by
rc is a cutoff distance for spatial interactions and taken to be 7 Å for amino acid pairs (represented by their α-carbons).
Expressing the X, Y and Z components of the fluctuation vectors ΔRi as ΔXT = [ΔX1 ΔX2 ..... ΔXN], ΔYT = [ΔY1 ΔY2 ..... ΔYN], and ΔZT = [ΔZ1 ΔZ2 ..... ΔZN], above equation simplifies to
In the GNM, the probability distribution of all fluctuations, P(ΔR) is isotropic
and Gaussian
where kB is the Boltzmann constant and T is the absolute temperature. p(ΔY) and p(ΔZ) are expressed similarly. N-dimensional Gaussian probability density function with random variable vector x, mean vector μ and covariance matrix Σ is
normalizes the distribution and |Σ| is the determinant of the covariance matrix.
Similar to Gaussian distribution, normalized distribution for ΔXT = [ΔX1 ΔX2 ..... ΔXN] around the equilibrium positions can be expressed as
The normalization constant, also the partition function ZX, is given by
where is the covariance matrix in this case. ZY and ZZ are expressed similarly. This formulation requires inversion of the Kirchhoff matrix. In the GNM, the determinant of the Kirchhoff matrix is zero, hence calculation of its inverse requires eigenvalue decomposition. Γ−1 is constructed using the N-1 non-zero eigenvalues and associated eigenvectors. Expressions for p(ΔY) and p(ΔZ) are similar to that of p(ΔX). The probability distribution of all fluctuations in GNM becomes
For this mass and spring system, the normalization constant in the preceding expression is the overall GNM partition function, ZGNM,
The expectation values of residue fluctuations, <ΔRi2> (also called mean-square fluctuations, MSFs), and their cross-correlations, <ΔRi · ΔRj> can be organized as the diagonal and off-diagonal terms, respectively, of a covariance matrix. Based on statistical mechanics, the covariance matrix for ΔX is given by
The last equality is obtained by inserting the above p(ΔX) and taking the (generalized Gaussian) integral. Since,
<ΔRi2> and <ΔRi · ΔRj> follows
The GNM normal modes are found by diagonalization of the Kirchhoff matrix, Γ = UΛUT. Here, U is a unitary matrix, UT = U−1, of the eigenvectors ui of Γ and Λ is the diagonal matrix of eigenvalues λi. The frequency and shape of a mode is represented by its eigenvalue and eigenvector, respectively. Since the Kirchhoff matrix is positive semi-definite, the first eigenvalue, λ1, is zero and the corresponding eigenvector have all its elements equal to 1/√N. This shows that the network model translationally invariant.
Cross-correlations between residue fluctuations can be written as a sum over the N-1 nonzero modes as
It follows that, [ΔRi · ΔRj], the contribution of an individual mode is expressed as
where [uk]i is the ith element of uk.
By definition, a diagonal element of the Kirchhoff matrix, Γii, is equal to the degree of a node in GNM that represents the corresponding residue's coordination number. This number is a measure of the local packing density around a given residue. The influence of local packing density can be assessed by series expansion of Γ−1 matrix. Γ can be written as a sum of two matrices, Γ = D + O, containing diagonal elements and off-diagonal elements of Γ.
This expression shows that local packing density makes a significant contribution to expected fluctuations of residues. [5] The terms that follow inverse of the diagonal matrix, are contributions of positional correlations to expected fluctuations.
Equilibrium fluctuations of biological molecules can be experimentally measured. In X-ray crystallography the B-factor (also called Debye-Waller or temperature factor) of each atom is a measure of its mean-square fluctuation near its equilibrium position in the native structure. In NMR experiments, this measure can be obtained by calculating root-mean-square differences between different models. In many applications and publications, including the original articles, it has been shown that expected residue fluctuations obtained by the GNM are in good agreement with the experimentally measured native state fluctuations. [6] [7] The relation between B-factors, for example, and expected residue fluctuations obtained from GNM is as follows
Figure 3 shows an example of GNM calculation for the catalytic domain of the protein Cdc25B, a cell division cycle dual-specificity phosphatase.
Diagonalization of the Kirchhoff matrix decomposes the conformational motions into a spectrum of collective modes. The expected values of fluctuations and cross-correlations are obtained from linear combinations of fluctuations along these normal modes. The contribution of each mode is scaled with the inverse of that modes frequency. Hence, slow (low frequency) modes contribute most to the expected fluctuations. Along the few slowest modes, motions are shown to be collective and global and potentially relevant to functionality of the biomolecules. Fast (high frequency) modes, on the other hand, describe uncorrelated motions not inducing notable changes in the structure. GNM-based methods do not provide real dynamics but only an approximation based on the combination and interpolation of normal modes. [8] Their applicability strongly depends on how collective the motion is. [8] [9]
There are several major areas in which the Gaussian network model and other elastic network models have proved to be useful. [10] These include:
In practice, two kinds of calculations can be performed. The first kind (the GNM per se) makes use of the Kirchhoff matrix. [1] [2] The second kind (more specifically called either the Elastic Network Model or the Anisotropic Network Model) makes use of the Hessian matrix associated to the corresponding set of harmonic springs. [38] Both kinds of models can be used online, using the following servers.
In statistical mechanics and information theory, the Fokker–Planck equation is a partial differential equation that describes the time evolution of the probability density function of the velocity of a particle under the influence of drag forces and random forces, as in Brownian motion. The equation can be generalized to other observables as well. The Fokker-Planck equation has multiple applications in information theory, graph theory, data science, finance, economics etc.
Polymer physics is the field of physics that studies polymers, their fluctuations, mechanical properties, as well as the kinetics of reactions involving degradation and polymerisation of polymers and monomers respectively.
The Lotka–Volterra equations, also known as the Lotka–Volterra predator–prey model, are a pair of first-order nonlinear differential equations, frequently used to describe the dynamics of biological systems in which two species interact, one as a predator and the other as prey. The populations change through time according to the pair of equations:
Hemorheology, also spelled haemorheology, or blood rheology, is the study of flow properties of blood and its elements of plasma and cells. Proper tissue perfusion can occur only when blood's rheological properties are within certain levels. Alterations of these properties play significant roles in disease processes. Blood viscosity is determined by plasma viscosity, hematocrit and mechanical properties of red blood cells. Red blood cells have unique mechanical behavior, which can be discussed under the terms erythrocyte deformability and erythrocyte aggregation. Because of that, blood behaves as a non-Newtonian fluid. As such, the viscosity of blood varies with shear rate. Blood becomes less viscous at high shear rates like those experienced with increased flow such as during exercise or in peak-systole. Therefore, blood is a shear-thinning fluid. Contrarily, blood viscosity increases when shear rate goes down with increased vessel diameters or with low flow, such as downstream from an obstruction or in diastole. Blood viscosity also increases with increases in red cell aggregability.
In classical statistical mechanics, the equipartition theorem relates the temperature of a system to its average energies. The equipartition theorem is also known as the law of equipartition, equipartition of energy, or simply equipartition. The original idea of equipartition was that, in thermal equilibrium, energy is shared equally among all of its various forms; for example, the average kinetic energy per degree of freedom in translational motion of a molecule should equal that in rotational motion.
In theoretical physics and mathematics, a Wess–Zumino–Witten (WZW) model, also called a Wess–Zumino–Novikov–Witten model, is a type of two-dimensional conformal field theory named after Julius Wess, Bruno Zumino, Sergei Novikov and Edward Witten. A WZW model is associated to a Lie group, and its symmetry algebra is the affine Lie algebra built from the corresponding Lie algebra. By extension, the name WZW model is sometimes used for any conformal field theory whose symmetry algebra is an affine Lie algebra.
In probability theory and mathematical physics, a random matrix is a matrix-valued random variable—that is, a matrix in which some or all elements are random variables. Many important properties of physical systems can be represented mathematically as matrix problems. For example, the thermal conductivity of a lattice can be computed from the dynamical matrix of the particle-particle interactions within the lattice.
In spatial statistics the theoretical variogram, denoted , is a function describing the degree of spatial dependence of a spatial random field or stochastic process . The semivariogram is half the variogram.
In physics, the Thomas precession, named after Llewellyn Thomas, is a relativistic correction that applies to the spin of an elementary particle or the rotation of a macroscopic gyroscope and relates the angular velocity of the spin of a particle following a curvilinear orbit to the angular velocity of the orbital motion.
The Debye–Waller factor (DWF), named after Peter Debye and Ivar Waller, is used in condensed matter physics to describe the attenuation of x-ray scattering or coherent neutron scattering caused by thermal motion. It is also called the B factor, atomic B factor, or temperature factor. Often, "Debye–Waller factor" is used as a generic term that comprises the Lamb–Mössbauer factor of incoherent neutron scattering and Mössbauer spectroscopy.
In differential geometry, the notion of torsion is a manner of characterizing a twist or screw of a moving frame around a curve. The torsion of a curve, as it appears in the Frenet–Serret formulas, for instance, quantifies the twist of a curve about its tangent vector as the curve evolves. In the geometry of surfaces, the geodesic torsion describes how a surface twists about a curve on the surface. The companion notion of curvature measures how moving frames "roll" along a curve "without twisting".
The normal-inverse Gaussian distribution is a continuous probability distribution that is defined as the normal variance-mean mixture where the mixing density is the inverse Gaussian distribution. The NIG distribution was noted by Blaesild in 1977 as a subclass of the generalised hyperbolic distribution discovered by Ole Barndorff-Nielsen. In the next year Barndorff-Nielsen published the NIG in another paper. It was introduced in the mathematical finance literature in 1997.
Contact mechanics is the study of the deformation of solids that touch each other at one or more points. A central distinction in contact mechanics is between stresses acting perpendicular to the contacting bodies' surfaces and frictional stresses acting tangentially between the surfaces. Normal contact mechanics or frictionless contact mechanics focuses on normal stresses caused by applied normal forces and by the adhesion present on surfaces in close contact, even if they are clean and dry. Frictional contact mechanics emphasizes the effect of friction forces.
The Anisotropic Network Model (ANM) is a simple yet powerful tool made for normal mode analysis of proteins, which has been successfully applied for exploring the relation between function and dynamics for many proteins. It is essentially an Elastic Network Model for the Cα atoms with a step function for the dependence of the force constants on the inter-particle distance.
In statistical mechanics, thermal fluctuations are random deviations of an atomic system from its average state, that occur in a system at equilibrium. All thermal fluctuations become larger and more frequent as the temperature increases, and likewise they decrease as temperature approaches absolute zero.
In computational fluid dynamics, the Stochastic Eulerian Lagrangian Method (SELM) is an approach to capture essential features of fluid-structure interactions subject to thermal fluctuations while introducing approximations which facilitate analysis and the development of tractable numerical methods. SELM is a hybrid approach utilizing an Eulerian description for the continuum hydrodynamic fields and a Lagrangian description for elastic structures. Thermal fluctuations are introduced through stochastic driving fields. Approaches also are introduced for the stochastic fields of the SPDEs to obtain numerical methods taking into account the numerical discretization artifacts to maintain statistical principles, such as fluctuation-dissipation balance and other properties in statistical mechanics.
The Pomeranchuk instability is an instability in the shape of the Fermi surface of a material with interacting fermions, causing Landau’s Fermi liquid theory to break down. It occurs when a Landau parameter in Fermi liquid theory has a sufficiently negative value, causing deformations of the Fermi surface to be energetically favourable. It is named after the Soviet physicist Isaak Pomeranchuk.
In solid mechanics, the linear stability analysis of an elastic solution is studied using the method of incremental deformations superposed on finite deformations. The method of incremental deformation can be used to solve static, quasi-static and time-dependent problems. The governing equations of the motion are ones of the classical mechanics, such as the conservation of mass and the balance of linear and angular momentum, which provide the equilibrium configuration of the material. The main corresponding mathematical framework is described in the main Raymond Ogden's book Non-linear elastic deformations and in Biot's book Mechanics of incremental deformations, which is a collection of his main papers.
The hyperbolastic functions, also known as hyperbolastic growth models, are mathematical functions that are used in medical statistical modeling. These models were originally developed to capture the growth dynamics of multicellular tumor spheres, and were introduced in 2005 by Mohammad Tabatabai, David Williams, and Zoran Bursac. The precision of hyperbolastic functions in modeling real world problems is somewhat due to their flexibility in their point of inflection. These functions can be used in a wide variety of modeling problems such as tumor growth, stem cell proliferation, pharma kinetics, cancer growth, sigmoid activation function in neural networks, and epidemiological disease progression or regression.
Single-particle trajectories (SPTs) consist of a collection of successive discrete points causal in time. These trajectories are acquired from images in experimental data. In the context of cell biology, the trajectories are obtained by the transient activation by a laser of small dyes attached to a moving molecule.