Solvent model

Last updated

In computational chemistry, a solvent model is a computational method that accounts for the behavior of solvated condensed phases. [1] [2] [3] Solvent models enable simulations and thermodynamic calculations applicable to reactions and processes which take place in solution. These include biological, chemical and environmental processes. [1] Such calculations can lead to new predictions about the physical processes occurring by improved understanding.

Contents

Solvent models have been extensively tested and reviewed in the scientific literature. The various models can generally be divided into two classes, explicit and implicit models, all of which have their own advantages and disadvantages. Implicit models are generally computationally efficient and can provide a reasonable description of the solvent behavior, but fail to account for the local fluctuations in solvent density around a solute molecule. The density fluctuation behavior is due to solvent ordering around a solute and is particularly prevalent when one is considering water as the solvent. Explicit models are often less computationally economical, but can provide a physical spatially resolved description of the solvent. However, many of these explicit models are computationally demanding and can fail to reproduce some experimental results, often due to certain fitting methods and parametrization. Hybrid methodologies are another option. These methods incorporate aspects of implicit and explicit aiming to minimize computational cost while retaining at least some spatial resolution of the solvent. These methods can require more experience to use them correctly and often contain post-calculation correction terms. [4]

Implicit models

Implicit solvents or continuum solvents, are models in which one accepts the assumption that implicit solvent molecules can be replaced by a homogeneously polarizable medium as long as this medium, to a good approximation, gives equivalent properties. [1] No explicit solvent molecules are present and so explicit solvent coordinates are not given. Continuum models consider thermally averaged and usually isotropic solvents, [3] which is why only a small number of parameters can be used to represent the solvent with reasonable accuracy in many situations. The main parameter is the dielectric constant (ε), this is often supplemented with further parameters, for example solvent surface tension. The dielectric constant is the value responsible for defining the degree of polarizability of the solvent. Generally speaking, for implicit solvents, a calculation proceeds by encapsulating a solute in a tiled cavity (See the figure below). The cavity containing the solute is embedded in homogeneously polarizable continuum describing the solvent. The solute's charge distribution meets the continuous dielectric field at the surface of the cavity and polarizes the surrounding medium, which causes a change in the polarization on the solute. This defines the reaction potential, a response to the change in polarization. This recursive reaction potential is then iterated to self-consistency. Continuum models have widespread use, including use in force field methods and quantum chemical situations. In quantum chemistry, where charge distributions come from ab initio methods (Hartree-Fock (HF), Post-HF and density functional theory (DFT)) the implicit solvent models represent the solvent as a perturbation to the solute Hamiltonian. In general, mathematically, these approaches can be thought of in the following way: [3] [5] [6] [7]

Polarize continuum model cavity image - created using Geomview and Gaussian PCM - cavity.svg
Polarize continuum model cavity image - created using Geomview and Gaussian

Note here that the implicit nature of the solvent is shown mathematically in the equation above, as the equation is only dependent on solute molecule coordinates . The second right hand term is composed of interaction operators. These interaction operators calculate the systems responses as a result of going from a gaseous infinitely separated system to one in a continuum solution. If one is therefore modelling a reaction this process is akin to modelling the reaction in the gas phase and providing a perturbation to the Hamiltonian in this reaction. [4]

Top: Four interaction operators generally considered in the continuum solvation models. Bottom: Five contributing Gibbs energy terms from continuum solvation models. [5]

The interaction operators have a clear meaning and are physically well defined. 1st - cavity creation; a term accounting for the energy spent to build a cavity in the solvent of suitable size and shape as to house the solute. Physically, this is energy cost of compressing the solvents structure when creating a void in the solvent. 2nd term - electrostatic energy; This term deals with the polarization of the solute and solvent. 3rd term - quantum mechanical dispersion energy; can be approximated using an averaging procedure for the solvent charge distribution. [5] 4th term - an approximation for the quantum mechanical exchange repulsion; given the implicit solvent this term can only be approximated against high level theoretical calculations.

These models can make useful contributions when the solvent being modelled can be modelled by a single function i.e. it is not varying significantly from the bulk. They can also be a useful way to include approximate solvent effects where the solvent is not an active constituent in the reaction or process. Additionally, if computer resources are limited, considerable computational resources can be saved by evoking the implicit solvent approximation instead of explicit solvent molecules. Implicit solvent models have been applied to model the solvent in computational investigations of reactions and to predict hydration Gibbs energy (ΔhydG). [8] Several standard models exist and have all been used successfully in a number of situations. The Polarizable continuum model (PCM) is a commonly used implicit model and has seeded the birth of several variants. [5] The model is based on the Poisson-Boltzmann equation, which is an expansion of the original Poisson's equation. Solvation Models (SMx) and the Solvation Model based on Density (SMD) have also seen wide spread use. SMx models (where x is an alphanumeric label to show the version) are based on the generalized Born equation. This is an approximation of Poisson's equation suitable for arbitrary cavity shapes. The SMD model solves the Poisson-Boltzmann equation analogously to PCM, but does so using a set of specifically parametrised radii which construct the cavity. [9] The COSMO solvation model is another popular implicit solvation model. [10] This model uses the scaled conductor boundary condition, which is a fast and robust approximation to the exact dielectric equations and reduces the outlying charge errors as compared to PCM. [11] The approximations lead to a root mean square deviation in the order of 0.07 kcal/mol to the exact solutions. [12]

Explicit models

Explicit solvent models treat explicitly (i.e. the coordinates and usually at least some of the molecular degrees of freedom are included) the solvent molecules. This is a more intuitively realistic picture in which there are direct, specific solvent interactions with a solute, in contrast to continuum models. These models generally occur in the application of molecular mechanics (MM) and dynamics (MD) or Monte Carlo (MC) simulations, although some quantum chemical calculations do use solvent clusters. Molecular dynamics simulations allow scientists to study the time evolution of a chemical system in discrete time intervals. These simulations often utilize molecular mechanics force fields which are generally empirical, parametrized functions which can efficiently calculate the properties and motions of large systems. [6] [7] Parametrization is often to a higher level theory or experimental data. MC simulations allow one to explore the potential energy surface of a system by perturbing the system and calculating the energy after the perturbation. Prior criteria are defined to aid the algorithm in deciding whether to accept the newly perturbed system or not.

Explicit solvent snap shot Explicit-solvent.svg
Explicit solvent snap shot

In general, force field methods are based on similar energy evaluation functionals which usually contain terms representing the bond stretching, angle bending, torsions and terms for repulsion and dispersion, such as the Buckingham potential or Lennard-Jones potential. Commonly used solvents, such as water, often have idealized models generated. These idealized models allow one to reduce the degrees of freedom which are to be evaluated in the energy calculation without a significant loss in the overall accuracy; although this can lead certain models becoming useful only in specific circumstances. Models such as TIPXP (where X is an integer suggesting the number of sites used for energy evaluation) [13] and the simple point charge model (SPC) of water have been used extensively. A typical model of this kind uses a fixed number of sites (often three for water), on each site is placed a parametrized point charge and repulsion and dispersion parameter. These models are commonly geometrically constrained with aspects of the geometry fixed such as the bond length or angles. [14]

Advancements around 2010 onwards in explicit solvent modelling see the use of a new generation of polarizable force fields, which are currently being created. These force fields are able to account for changes in the molecular charge distribution. A number of these force fields are being developed to utilise multipole moments, as opposed to point charges, given that multipole moments can reflect the charge anisotropy of the molecules. One such method is the Atomic Multipole Optimised Energetics for Biomolecular Applications (AMOEBA) force field. [15] This method has been used to study the solvation dynamics of ions. [1] Other emerging polarizable forcefields which have been applied to condensed phase systems are; the Sum of Interactions between Fragments ab initio computed (SIBFA) [16] and the Quantum Chemical Topology Force Field (QCTFF). [17] Polarizable water models are also being produced. The so-called charge on spring (COS) model gives water models with the ability to polarize due to one of the interaction sites being flexible (on spring). [18]

Hybrid models

Hybrid models, as then name suggests, are in the middle between explicit and implicit models. Hybrid models can usually be considered closer to one or other implicit or explicit. Mixed quantum mechanics and molecular mechanics models,(QM/MM) schemes, can be thought of in this context. QM/MM methods here are closer to explicit models. One can imagine having a QM core treatment containing the solute and may be a small number of explicit solvent molecules. The second layer could then comprise MM water molecules, with a final third layer of implicit solvent representing the bulk. The Reference Interaction Site Model (RISM) can be thought of being closer to implicit solvent representations. RISM allows the solvent density to fluctuate in a local environment, achieving a description of the solvent shell behaviour. [1] [2] [5]

QM/MM methods enable a section of the system to be calculated using quantum mechanics, for example the active site in a biological molecule, whilst the rest of the system is modeled using MM force fields. By continuing to a third layer with an implicit solvent the bulk water effect can be modeled more cheaply than using all explicit solvent molecules. There are many different combinations that can be used with the QM/MM technique. Alternatively, a few explicit solvent molecules can be added to a QM region and the rest of the solvent treated implicitly. Previous work has shown mixed results upon the addition of explicit solvent molecules to an implicit solvent. One example added up to three explicit water molecules to a QM calculation with an implicit COSMO water model. The results suggest that using either implicit or explicit solvent alone provide a good approximation to experiment, however, the mixed models had mixed results and possibly some dependence on the number of added explicit solvent molecules. [19]

a RISM solvent field RISM matlab.svg
a RISM solvent field

RISM, a classical statistical mechanics methodology, has it roots in the integral equation theory of liquids (IET). By statistically modeling of the solvent, an appreciation of the dynamics of the system can be acquired. This is more useful than a static model as the dynamics of the solvent can be important in some processes. The statistical modeling is done using radial distribution function (RDF). RDF are probabilistic functions which can represent the probability of locating solvent atoms/molecules in a specific area or at a specific distance from the reference point; generally taken as the solute molecule. As the probability of locating solvent atoms and molecules from the reference point can be determined in RISM theory, solvent shell structure can be directly derived. [20]

The molecular Ornstein-Zernike equation (MOZ) is the starting point for RISM calculations. [5] Within the MOZ equations a solvated system can be defined in 3D space by three spatial coordinates (r) and three angles (Θ). Using relative RDF's the MOZ equations for the solvated system can define the total correlation function h(r - r';ʘ - ʘ'). The equations have a high dimensionality (6D).

is the total correlation function, is the radial distribution function accounting for the direct effects of one molecule on another separated by r. [5]

It is a common approximation to assume spherical symmetry, allowing one to remove the orientational (angular) degrees of freedom. The MOZ equation splits the total correlation function in two. First the direct correlation function c(r), concerned with the effect of one particle on one other over the distance r. The second, the indirect correlation function, accounts for the effects of a third particle in a system. The indirect correlation function is given as the direct correlation function between the first and the third particles in addition to the total correlation function between the second and third particles . [21]

Ornstein-Zernike equation with the assumption of spherical symmetry. ρ is the liquid density, r is the separating distance, h(r) is the total correlation function, c(r) is the direct correlation function.

h(r) and c(r) are the solutions to the MOZ equations. In order to solve for h(r) and c(r), another equation must be introduced. This new equation is called a closure relation. The exact closure relation is unknown, due to the so-called bridge functions exact form being unclear, we, therefore, must introduce approximations. There are several valid approximations, the first was the HyperNetted Chain (HNC), which sets the unknown terms in the closure relation to zero. Although appearing crude the HNC has been generally quite successfully applied, although it shows slow convergence and divergent behaviour in some cases. [22] A modern alternative closure relation has been suggested the Partially Linearised HyperNetted Chain (PLHNC) or Kovalenko Hirata closure. [23] The PLHNC partially linearises the exponential function if it exceeds its cutoff value. This causes a much more reliable convergence of the equations. [4]

The PLHNC closure, where and is the interaction potential, a typical interaction potential is shown below. T(r) is the indirect correlation function, as it is the difference of the total and the direct correlation functions.

There are various approximations of the RISM equations. Two popular approximations are 3D RISM and 1D RISM. [1] There are known deficiencies in these approximate RISM models. 3D RISM makes a poor estimation of the cavity creation term. 1D RISM has been found to not be properly accounting for the spatial correlations of the solvent density around the solute. However, both methods are quick to calculate, 1D RISM can be calculated in a matter of seconds on a modern computer, making it an attractive model for high through put computation. Both 3D RISM and 1D RISM have had correction schemes proposed which make the predictions reach a comparable level of accuracy to traditional implicit and explicit models. [22] [24] [25]

The COSMO-RS model is another hybrid model using the surface polarization charge density derived from continuum COSMO calculations to estimate the interaction energies with neighbored molecules. COSMO-RS is able to account for a major part of reorientation and strong directional interactions like hydrogen bonding within the first solvation shell. It yields thermodynamically consistent mixture thermodynamics and is often used in addition to UNIFAC in chemical engineering applications.

Applications to QSAR and QSPR

Quantitative Structure–Activity Relationships (QSAR)/Quantitative Structure–Property Relationships (QSPR), whilst unable to directly model the physical process occurring in a condensed solvent phase, can provide useful predictions of solvent and solvation properties and activities; such as the solubility of a solute. [26] [27] [28] [4] These methods come in a varied way from simple regression models to sophisticated machine learning methods. Generally, QSAR/QSPR methods require descriptors; these come in many different forms and are used to represent physical features and properties of a system of interest. Descriptors are generally single numerical values which hold some information about a physical property. [29] A regression model or statistical learning model is then applied to find a correlation between the descriptor(s) and the property of interest. Once trained on some known data these model can be applied to similar unknown data to make predictions. Typically the known data comes from experimental measurement, although there is no reason why similar methods can not be used to correlate descriptor(s) with theoretical or predicted values. It is currently debated whether if more accurate experimental data was used to train these models whether the prediction from such models would be more accurate. [30]

More recently the rise of deep learning has provided many methods to generate embedded representations of molecules. [31] [27] Some of these methods have also been applied to solvation properties such as solubility prediction [32]

[27]

Related Research Articles

<span class="mw-page-title-main">Computational chemistry</span> Branch of chemistry

Computational chemistry is a branch of chemistry that uses computer simulations to assist in solving chemical problems. It uses methods of theoretical chemistry incorporated into computer programs to calculate the structures and properties of molecules, groups of molecules, and solids. The importance of this subject stems from the fact that, with the exception of some relatively recent findings related to the hydrogen molecular ion, achieving an accurate quantum mechanical depiction of chemical systems analytically, or in a closed form, is not feasible. The complexity inherent in the many-body problem exacerbates the challenge of providing detailed descriptions of quantum mechanical systems. While computational results normally complement information obtained by chemical experiments, it can occasionally predict unobserved chemical phenomena.

<span class="mw-page-title-main">Solvation</span> Association of molecules of a solvent with molecules or ions of a solute

Solvation describes the interaction of a solvent with dissolved molecules. Both ionized and uncharged molecules interact strongly with a solvent, and the strength and nature of this interaction influence many properties of the solute, including solubility, reactivity, and color, as well as influencing the properties of the solvent such as its viscosity and density. If the attractive forces between the solvent and solute particles are greater than the attractive forces holding the solute particles together, the solvent particles pull the solute particles apart and surround them. The surrounded solute particles then move away from the solid solute and out into the solution. Ions are surrounded by a concentric shell of solvent. Solvation is the process of reorganizing solvent and solute molecules into solvation complexes and involves bond formation, hydrogen bonding, and van der Waals forces. Solvation of a solute by water is called hydration.

<span class="mw-page-title-main">Solubility</span> Capacity of a substance to dissolve in a solvent in a homogeneous way

In chemistry, solubility is the ability of a substance, the solute, to form a solution with another substance, the solvent. Insolubility is the opposite property, the inability of the solute to form such a solution.

Circular dichroism (CD) is dichroism involving circularly polarized light, i.e., the differential absorption of left- and right-handed light. Left-hand circular (LHC) and right-hand circular (RHC) polarized light represent two possible spin angular momentum states for a photon, and so circular dichroism is also referred to as dichroism for spin angular momentum. This phenomenon was discovered by Jean-Baptiste Biot, Augustin Fresnel, and Aimé Cotton in the first half of the 19th century. Circular dichroism and circular birefringence are manifestations of optical activity. It is exhibited in the absorption bands of optically active chiral molecules. CD spectroscopy has a wide range of applications in many different fields. Most notably, UV CD is used to investigate the secondary structure of proteins. UV/Vis CD is used to investigate charge-transfer transitions. Near-infrared CD is used to investigate geometric and electronic structure by probing metal d→d transitions. Vibrational circular dichroism, which uses light from the infrared energy region, is used for structural studies of small organic molecules, and most recently proteins and DNA.

<span class="mw-page-title-main">Molecular dynamics</span> Computer simulations to discover and understand chemical properties

Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of the system. In the most common version, the trajectories of atoms and molecules are determined by numerically solving Newton's equations of motion for a system of interacting particles, where forces between the particles and their potential energies are often calculated using interatomic potentials or molecular mechanical force fields. The method is applied mostly in chemical physics, materials science, and biophysics.

Density-functional theory (DFT) is a computational quantum mechanical modelling method used in physics, chemistry and materials science to investigate the electronic structure of many-body systems, in particular atoms, molecules, and the condensed phases. Using this theory, the properties of a many-electron system can be determined by using functionals, i.e. functions of another function. In the case of DFT, these are functionals of the spatially dependent electron density. DFT is among the most popular and versatile methods available in condensed-matter physics, computational physics, and computational chemistry.

<span class="mw-page-title-main">Solvation shell</span> Solvent interface of a solute

A solvation shell or solvation sheath is the solvent interface of any chemical compound or biomolecule that constitutes the solute in a solution. When the solvent is water it is called a hydration shell or hydration sphere. The number of solvent molecules surrounding each unit of solute is called the hydration number of the solute.

<span class="mw-page-title-main">Molecular mechanics</span> Use of classical mechanics to model molecular systems

Molecular mechanics uses classical mechanics to model molecular systems. The Born–Oppenheimer approximation is assumed valid and the potential energy of all systems is calculated as a function of the nuclear coordinates using force fields. Molecular mechanics can be used to study molecule systems ranging in size and complexity from small to large biological systems or material assemblies with many thousands to millions of atoms.

In the physical sciences, a partition coefficient (P) or distribution coefficient (D) is the ratio of concentrations of a compound in a mixture of two immiscible solvents at equilibrium. This ratio is therefore a comparison of the solubilities of the solute in these two liquids. The partition coefficient generally refers to the concentration ratio of un-ionized species of compound, whereas the distribution coefficient refers to the concentration ratio of all species of the compound.

In chemical thermodynamics, the fugacity of a real gas is an effective partial pressure which replaces the mechanical partial pressure in an accurate computation of chemical equilibrium. It is equal to the pressure of an ideal gas which has the same temperature and molar Gibbs free energy as the real gas.

<span class="mw-page-title-main">Force field (chemistry)</span> Concept on molecular modeling

In the context of chemistry, molecular physics and physical chemistry and molecular modelling, a force field is a computational model that is used to describe the forces between atoms within molecules or between molecules as well as in crystals. Force fields are a variety of interatomic potentials. More precisely, the force field refers to the functional form and parameter sets used to calculate the potential energy of a system of the atomistic level. Force fields are usually used in molecular dynamics or Monte Carlo simulations. The parameters for a chosen energy function may be derived from classical laboratory experiment data, calculations in quantum mechanics, or both. Force fields utilize the same concept as force fields in classical physics, with the main difference that the force field parameters in chemistry describe the energy landscape on the atomistic level. From a force field, the acting forces on every particle are derived as a gradient of the potential energy with respect to the particle coordinates.

In probability theory, the Gillespie algorithm generates a statistically correct trajectory of a stochastic equation system for which the reaction rates are known. It was created by Joseph L. Doob and others, presented by Dan Gillespie in 1976, and popularized in 1977 in a paper where he uses it to simulate chemical or biochemical systems of reactions efficiently and accurately using limited computational power. As computers have become faster, the algorithm has been used to simulate increasingly complex systems. The algorithm is particularly useful for simulating reactions within cells, where the number of reagents is low and keeping track of every single reaction is computationally feasible. Mathematically, it is a variant of a dynamic Monte Carlo method and similar to the kinetic Monte Carlo methods. It is used heavily in computational systems biology.

Implicit solvation is a method to represent solvent as a continuous medium instead of individual “explicit” solvent molecules, most often used in molecular dynamics simulations and in other applications of molecular mechanics. The method is often applied to estimate free energy of solute-solvent interactions in structural and chemical processes, such as folding or conformational transitions of proteins, DNA, RNA, and polysaccharides, association of biological macromolecules with ligands, or transport of drugs across biological membranes.

<span class="mw-page-title-main">Water model</span> Aspect of computational chemistry

In computational chemistry, a water model is used to simulate and thermodynamically calculate water clusters, liquid water, and aqueous solutions with explicit solvent. The models are determined from quantum mechanics, molecular mechanics, experimental results, and these combinations. To imitate a specific nature of molecules, many types of models have been developed. In general, these can be classified by the following three points; (i) the number of interaction points called site, (ii) whether the model is rigid or flexible, (iii) whether the model includes polarization effects.

<span class="mw-page-title-main">COSMO solvation model</span>

COSMO is a calculation method for determining the electrostatic interaction of a molecule with a solvent. COSMO is a dielectric continuum model. These models can be used in computational chemistry to model solvation effects. COSMO has become a popular method of these solvation models in recent years. The COSMO formalism is similar to the method proposed earlier by Hoshi et al. The COSMO approach is based - as many other dielectric continuum models - on the surface segmentation of a molecule surface.

Stokesian dynamics is a solution technique for the Langevin equation, which is the relevant form of Newton's 2nd law for a Brownian particle. The method treats the suspended particles in a discrete sense while the continuum approximation remains valid for the surrounding fluid, i.e., the suspended particles are generally assumed to be significantly larger than the molecules of the solvent. The particles then interact through hydrodynamic forces transmitted via the continuum fluid, and when the particle Reynolds number is small, these forces are determined through the linear Stokes equations. In addition, the method can also resolve non-hydrodynamic forces, such as Brownian forces, arising from the fluctuating motion of the fluid, and interparticle or external forces. Stokesian Dynamics can thus be applied to a variety of problems, including sedimentation, diffusion and rheology, and it aims to provide the same level of understanding for multiphase particulate systems as molecular dynamics does for statistical properties of matter. For rigid particles of radius suspended in an incompressible Newtonian fluid of viscosity and density , the motion of the fluid is governed by the Navier–Stokes equations, while the motion of the particles is described by the coupled equation of motion:

In fluid dynamics, the Darcy friction factor formulae are equations that allow the calculation of the Darcy friction factor, a dimensionless quantity used in the Darcy–Weisbach equation, for the description of friction losses in pipe flow as well as open-channel flow.

The polarizable continuum model (PCM) is a commonly used method in computational chemistry to model solvation effects. If it is necessary to consider each solvent molecule as a separate molecule, the computational cost of modeling a solvent-mediated chemical reaction would grow prohibitively high. Modeling the solvent as a polarizable continuum, rather than individual molecules, makes ab initio computation feasible. Two types of PCMs have been popularly used: the dielectric PCM (D-PCM) in which the continuum is polarizable and the conductor-like PCM (C-PCM) in which the continuum is conductor-like similar to COSMO Solvation Model.

A depletion force is an effective attractive force that arises between large colloidal particles that are suspended in a dilute solution of depletants, which are smaller solutes that are preferentially excluded from the vicinity of the large particles. One of the earliest reports of depletion forces that lead to particle coagulation is that of Bondy, who observed the separation or "creaming" of rubber latex upon addition of polymer depletant molecules to solution. More generally, depletants can include polymers, micelles, osmolytes, ink, mud, or paint dispersed in a continuous phase.

COSMO-RS is a quantum chemistry based equilibrium thermodynamics method with the purpose of predicting chemical potentials µ in liquids. It processes the screening charge density σ on the surface of molecules to calculate the chemical potential µ of each species in solution. Perhaps in dilute solution a constant potential must be considered. As an initial step a quantum chemical COSMO calculation for all molecules is performed and the results are stored in a database. In a separate step COSMO-RS uses the stored COSMO results to calculate the chemical potential of the molecules in a liquid solvent or mixture. The resulting chemical potentials are the basis for other thermodynamic equilibrium properties such as activity coefficients, solubility, partition coefficients, vapor pressure and free energy of solvation. The method was developed to provide a general prediction method with no need for system specific adjustment.

References

  1. 1 2 3 4 5 6 Skyner, R.; McDonagh, J. L., Groom, C. R., van Mourik, T., Mitchell, J. B. O.; Groom, C. R.; Van Mourik, T.; Mitchell, J. B. O. (2015). "A Review of Methods for the Calculation of Solution Free Energies and the Modelling of Systems in Solution". Phys. Chem. Chem. Phys. 17 (9): 6174–91. Bibcode:2015PCCP...17.6174S. doi: 10.1039/C5CP00288E . hdl: 10023/6096 . PMID   25660403.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  2. 1 2 Tomasi, J.; Mennucci, B., Cammi, R.; Cammi, Roberto (2005). "Quantum Mechanical Continuum Solvation Models". Chemical Reviews. 105 (8): 2999–3093. doi:10.1021/cr9904009. PMID   16092826.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  3. 1 2 3 Cramer, C. J.; Truhlar, D. G. (1999). "Implicit Solvation Models: Equilibria, Structure, Spectra, and Dynamics". Chemical Reviews. 99 (8): 2161–2200. doi:10.1021/cr960149m. PMID   11849023.
  4. 1 2 3 4 McDonagh, J. L. (2015). Computing the Aqueous solubility of Organic Drug-Like Molecules and Understanding Hydrophobicity (Thesis). University of St. Andrews. hdl:10023/6534.
  5. 1 2 3 4 5 6 7 Mennucci, B.; Cammi, R. Continuum solvation models in chemical physics: from theory to applications. Wiley Online Library. ISBN   9780470515235.
  6. 1 2 Cramer, C. J. (2013). Essentials of Computational Chemistry: Theories and Models. John Wiley & Sons.
  7. 1 2 Jensen, F. (2007). Introduction to computational chemistry. John Wiley and Sons.
  8. Palmer, D. S.; McDonagh, J. L., Mitchell, J. B. O., van Mourik, T., Fedorov, M. V.; Mitchell, John B. O.; Van Mourik, Tanja; Fedorov, Maxim V. (2012). "First-Principles Calculation of the Intrinsic Aqueous Solubility of Crystalline Druglike Molecules". Journal of Chemical Theory and Computation. 8 (9): 3322–3337. doi:10.1021/ct300345m. hdl: 10023/25470 . PMID   26605739.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  9. Marenich, A. V.; Truhlar, Donald G. (2009). "Universal solvation model based on solute electron density and on a continuum model of the solvent defined by the bulk dielectric constant and atomic surface tensions". The Journal of Physical Chemistry B. 113 (18): 6378–6396. doi: 10.1021/jp810292n . PMID   19366259.
  10. Klamt, A.; et al. (1993). "COSMO: a new approach to dielectric screening in solvents with explicit expressions for the screening energy and its gradient". Journal of the Chemical Society, Perkin Transactions 2 (5): 799–805. doi:10.1039/P29930000799.
  11. Klamt, A.; et al. (1996). "Treatment of outlying charge in continuum solvation models". The Journal of Chemical Physics. 105 (22): 9972–9980. Bibcode:1996JChPh.105.9972K. doi:10.1063/1.472829.
  12. Klamt, A.; et al. (2015). "A Comprehensive Comparison of the IEFPCM and SS(V)PE Continuum Solvation Methods with the COSMO Approach". Journal of Chemical Theory and Computation. 11 (9): 4220–4225. doi:10.1021/acs.jctc.5b00601. PMID   26575917.
  13. Price, Daniel J., D. J.; Brooks, C. L. (2004). "A modified TIP3P water potential for simulation with Ewald summation". The Journal of Chemical Physics. 121 (20): 10096–10103. Bibcode:2004JChPh.12110096P. doi:10.1063/1.1808117. PMID   15549884.
  14. Berendsen, H. J. C.; Grigera, J. R.; Straatsma, T. P. (1987). "The missing term in effective pair potentials". The Journal of Physical Chemistry. 91 (24): 6269–6271. doi:10.1021/j100308a038.
  15. Ponder, J. W.; et al. (2010). "Current Status of the AMOEBA Polarizable Force Field". The Journal of Physical Chemistry B. 114 (8): 2549–2564. doi:10.1021/jp910674d. PMC   2918242 . PMID   20136072.
  16. Goldwaser, E.; et al. (2014). "Conformational analysis of a polyconjugated protein-binding ligand by joint quantum chemistry and polarizable molecular mechanics. Addressing the issues of anisotropy, conjugation, polarization, and multipole transferability". Journal of Molecular Modeling. 20 (11): 1–24. doi:10.1007/s00894-014-2472-5. PMID   25367040. S2CID   14085710.
  17. Liem, S. Y.; et al. (2014). "The hydration of serine: multipole moments versus point charges". Physical Chemistry Chemical Physics. 16 (9): 4122–4134. Bibcode:2014PCCP...16.4122L. doi:10.1039/C3CP54723J. PMID   24448691.
  18. Haibo, Y.; van Gunsteren, W. F. (2004). "Charge-on-spring polarizable water models revisited: From water clusters to liquid water to ice". Journal of Chemical Physics. 121 (19): 9549–64. Bibcode:2004JChPh.121.9549Y. doi:10.1063/1.1805516. PMID   15538877.
  19. Kamerlin, S. C. L.; Warshel, Arieh (2009). "Are Mixed Explicit/Implicit Solvation Models Reliable for Studying Phosphate Hydrolysis? A Comparative Study of Continuum, Explicit and Mixed Solvation Models". ChemPhysChem. 10 (7): 1125–1134. doi:10.1002/cphc.200800753. PMID   19301306. S2CID   25817085.
  20. Ratkova, Ekaterina L. (2015). "Solvation Thermodynamics of Organic Molecules by the Molecular Integral Equation Theory: Approaching Chemical Accuracy" (PDF). Chemical Reviews. 115 (13): 6312–6356. doi:10.1021/cr5000283. PMID   26073187.
  21. Pratt, L. R.; Chandler, D. (1977). "Theory of the hydrophobic effect". The Journal of Chemical Physics. 67 (8): 3683–3704. Bibcode:1977JChPh..67.3683P. doi:10.1063/1.435308.
  22. 1 2 Ratkova, E. L.; Fedorov, M. V. (2011). "Combination of RISM and Cheminformatics for Efficient Predictions of Hydration Free Energy of Polyfragment Molecules: Application to a Set of Organic Pollutants". Journal of Chemical Theory and Computation. 7 (5): 1450–1457. doi:10.1021/ct100654h. PMID   26610135.
  23. Kovalenko, A.; Hirata, F. (1999). "Self-consistent description of a metal-water interface by the Kohn-Sham density functional theory and the three-dimensional reference interaction site model". The Journal of Chemical Physics. 110 (20): 10095–10112. Bibcode:1999JChPh.11010095K. doi:10.1063/1.478883.
  24. Palmer, D. S.; Ratkova, Ekaterina L; Fedorov, Maxim V (2010). "Towards a universal method for calculating hydration free energies: a 3D reference interaction site model with partial molar volume correction". Journal of Physics: Condensed Matter. 22 (49): 492101. Bibcode:2010JPCM...22W2101P. doi:10.1088/0953-8984/22/49/492101. PMID   21406779. S2CID   818982.
  25. Misin, M.; Maxim V. Fedorov, David S. Palmer; Palmer, David S. (2015). "Communication: Accurate hydration free energies at a wide range of temperatures from 3D-RISM" (PDF). Journal of Chemical Physics. 142 (9): 091105. Bibcode:2015JChPh.142i1105M. doi:10.1063/1.4914315. PMID   25747054.
  26. McDonagh, J. L.; Nath; De Ferrari, Luna; Van Mourik, Tanja; Mitchell, John B. O. (2014). "Uniting Cheminformatics and Chemical Theory To Predict the Intrinsic Aqueous Solubility of Crystalline Druglike Molecules". Journal of Chemical Information and Modeling. 54 (3): 844–856. doi:10.1021/ci4005805. PMC   3965570 . PMID   24564264.
  27. 1 2 3 Lusci, A.; Pollastri, G.; Baldi, P. (2013). "Deep Architectures and Deep Learning in Chemoinformatics: The Prediction of Aqueous Solubility for Drug-Like Molecules". Journal of Chemical Information and Modeling. 53 (7): 1563–1575. doi:10.1021/ci400187y. PMC   3739985 . PMID   23795551.
  28. Palmer, D. S.; et al. (2008). "Predicting intrinsic aqueous solubility by a thermodynamic cycle". Molecular Pharmaceutics. 5 (2): 266–279. doi:10.1021/mp7000878. PMID   18290628.
  29. Leach, A. R.; Gillet, V. J. (2007). An introduction to chemoinformatics. Springer. ISBN   978-1-4020-6291-9.
  30. Palmer, D. S.; Mitchell, J. B. O. (2014). "Is Experimental Data Quality the Limiting Factor in Predicting the Aqueous Solubility of Druglike Molecules?". Molecular Pharmaceutics. 11 (8): 2962–2972. doi: 10.1021/mp500103r . hdl: 10023/19095 . PMID   24919008.
  31. jaeger, S. (2018). "Mol2vec: unsupervised machine learning approach with chemical intuition". Journal of Chemical Information and Modeling. 58 (1): 27–35. doi:10.1021/acs.jcim.7b00616. PMID   29268609. S2CID   34512664.
  32. Conn, J. G. M. (2023). "Blinded Predictions and Post Hoc Analysis of the Second Solubility Challenge Data: Exploring Training Data and Feature Set Selection for Machine and Deep Learning Models". Journal of Chemical Information and Modeling. 63 (4): 1099–1113. doi:10.1021/acs.jcim.2c01189. PMC   9976279 . PMID   36758178.