CHARMM

Last updated
CHARMM
Developer(s) Martin Karplus, Accelrys
Initial release1983;40 years ago (1983)
Stable release
c47b1 / 2022;1 year ago (2022) [1]
Preview release
c48a1 / 2022;1 year ago (2022) [1]
Written in FORTRAN 77-95, CUDA
Operating system Unix-like: Linux, macOS, AIX, iOS [2]
Platform x86, ARM, Nvidia GPU; Cray XT4, XT5 [2]
Available inEnglish
Type Molecular dynamics
License Proprietary
Website www.academiccharmm.org

Chemistry at Harvard Macromolecular Mechanics (CHARMM) is the name of a widely used set of force fields for molecular dynamics, and the name for the molecular dynamics simulation and analysis computer software package associated with them. [3] [4] [5] The CHARMM Development Project involves a worldwide network of developers working with Martin Karplus and his group at Harvard to develop and maintain the CHARMM program. Licenses for this software are available, for a fee, to people and groups working in academia.

Contents

Force fields

The CHARMM force fields for proteins include: united-atom (sometimes termed extended atom) CHARMM19, [6] all-atom CHARMM22 [7] and its dihedral potential corrected variant CHARMM22/CMAP, as well as later versions CHARMM27 and CHARMM36 and various modifications such as CHARMM36m and CHARMM36IDPSFF. [8] In the CHARMM22 protein force field, the atomic partial charges were derived from quantum chemical calculations of the interactions between model compounds and water. Furthermore, CHARMM22 is parametrized for the TIP3P explicit water model. Nevertheless, it is often used with implicit solvents. In 2006, a special version of CHARMM22/CMAP was reparametrized for consistent use with implicit solvent GBSW. [9]

The CHARMM22 force field has the following potential energy function: [7] [10]

The bond, angle, dihedral, and nonbonded terms are similar to those found in other force fields such as AMBER. The CHARMM force field also includes an improper term accounting for out-of-plane bending (which applies to any set of four atoms that are not successively bonded), where is the force constant and is the out-of-plane angle. The Urey-Bradley term is a cross-term that accounts for 1,3 nonbonded interactions not accounted for by the bond and angle terms; is the force constant and is the distance between the 1,3 atoms.

For DNA, RNA, and lipids, CHARMM27 [11] is used. Some force fields may be combined, for example CHARMM22 and CHARMM27 for the simulation of protein-DNA binding. Also, parameters for NAD+, sugars, fluorinated compounds, etc., may be downloaded. These force field version numbers refer to the CHARMM version where they first appeared, but may of course be used with subsequent versions of the CHARMM executable program. Likewise, these force fields may be used within other molecular dynamics programs that support them.

In 2009, a general force field for drug-like molecules (CGenFF) was introduced. It "covers a wide range of chemical groups present in biomolecules and drug-like molecules, including a large number of heterocyclic scaffolds". [12] The general force field is designed to cover any combination of chemical groups. This inevitably comes with a decrease in accuracy for representing any particular subclass of molecules. Users are repeatedly warned in Mackerell's website not to use the CGenFF parameters for molecules for which specialized force fields already exist (as mentioned above for proteins, nucleic acids, etc.).

CHARMM also includes polarizable force fields using two approaches. One is based on the fluctuating charge (FQ) model, also termed Charge Equilibration (CHEQ). [13] [14] The other is based on the Drude shell or dispersion oscillator model. [15] [16]

Parameters for all of these force fields may be downloaded from the Mackerell website for free. [17]

Molecular dynamics program

The CHARMM program allows for generating and analysing a wide range of molecular simulations. The most basic kinds of simulation are minimizing a given structure and production runs of a molecular dynamics trajectory. More advanced features include free energy perturbation (FEP), quasi-harmonic entropy estimation, correlation analysis and combined quantum, and quantum mechanics - molecular mechanics (QM/MM) methods.

CHARMM is one of the oldest programs for molecular dynamics. It has accumulated many features, some of which are duplicated under several keywords with slight variants. This is an inevitable result of the many outlooks and groups working on CHARMM worldwide. The changelog file, and CHARMM's source code, are good places to look for the names and affiliations of the main developers. The involvement and coordination by Charles L. Brooks III's group at the University of Michigan is salient.

Software history

Around 1969, there was considerable interest in developing potential energy functions for small molecules. CHARMM originated at Martin Karplus's group at Harvard. Karplus and his then graduate student Bruce Gelin decided the time was ripe to develop a program that would make it possible to take a given amino acid sequence and a set of coordinates (e.g., from the X-ray structure) and to use this information to calculate the energy of the system as a function of the atomic positions. Karplus has acknowledged the importance of major inputs in the development of the (at the time nameless) program, including:

In the 1980s, finally a paper appeared and CHARMM made its public début. Gelin's program had by then been considerably restructured. For the publication, Bob Bruccoleri came up with the name HARMM (HARvard Macromolecular Mechanics), but it seemed inappropriate. So they added a C for Chemistry. Karplus said: "I sometimes wonder if Bruccoleri's original suggestion would have served as a useful warning to inexperienced scientists working with the program." [18] CHARMM has continued to grow and the latest release of the executable program was made in 2015 as CHARMM40b2.

Running CHARMM under Unix-Linux

The general syntax for using the program is:

charmm -i filename.inp -o filename.out

Volunteer computing

Docking@Home, hosted by University of Delaware, one of the projects which use an open-source platform for the distributed computing, BOINC, used CHARMM to analyze the atomic details of protein-ligand interactions in terms of molecular dynamics (MD) simulations and minimizations.

World Community Grid, sponsored by IBM, ran a project named The Clean Energy Project [19] which also used CHARMM in its first phase which has completed.

See also

Related Research Articles

<span class="mw-page-title-main">Molecular dynamics</span> Computer simulations to discover and understand chemical properties

Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of the system. In the most common version, the trajectories of atoms and molecules are determined by numerically solving Newton's equations of motion for a system of interacting particles, where forces between the particles and their potential energies are often calculated using interatomic potentials or molecular mechanical force fields. The method is applied mostly in chemical physics, materials science, and biophysics.

GROMACS is a molecular dynamics package mainly designed for simulations of proteins, lipids, and nucleic acids. It was originally developed in the Biophysical Chemistry department of University of Groningen, and is now maintained by contributors in universities and research centers worldwide. GROMACS is one of the fastest and most popular software packages available, and can run on central processing units (CPUs) and graphics processing units (GPUs). It is free, open-source software released under the GNU General Public License (GPL), and starting with version 4.6, the GNU Lesser General Public License (LGPL).

<span class="mw-page-title-main">AMBER</span>

Assisted Model Building with Energy Refinement (AMBER) is a family of force fields for molecular dynamics of biomolecules originally developed by Peter Kollman's group at the University of California, San Francisco.

<span class="mw-page-title-main">Molecular mechanics</span> Use of classical mechanics to model molecular systems

Molecular mechanics uses classical mechanics to model molecular systems. The Born–Oppenheimer approximation is assumed valid and the potential energy of all systems is calculated as a function of the nuclear coordinates using force fields. Molecular mechanics can be used to study molecule systems ranging in size and complexity from small to large biological systems or material assemblies with many thousands to millions of atoms.

<span class="mw-page-title-main">Molecular modelling</span> Discovering chemical properties by physical simulations

Molecular modelling encompasses all methods, theoretical and computational, used to model or mimic the behaviour of molecules. The methods are used in the fields of computational chemistry, drug design, computational biology and materials science to study molecular systems ranging from small chemical systems to large biological molecules and material assemblies. The simplest calculations can be performed by hand, but inevitably computers are required to perform molecular modelling of any reasonably sized system. The common feature of molecular modelling methods is the atomistic level description of the molecular systems. This may include treating atoms as the smallest individual unit, or explicitly modelling protons and neutrons with its quarks, anti-quarks and gluons and electrons with its photons.

A chemical file format is a type of data file which is used specifically for depicting molecular data. One of the most widely used is the chemical table file format, which is similar to Structure Data Format (SDF) files. They are text files that represent multiple chemical structure records and associated data fields. The XYZ file format is a simple format that usually gives the number of atoms in the first line, a comment on the second, followed by a number of lines with atomic symbols and cartesian coordinates. The Protein Data Bank Format is commonly used for proteins but is also used for other types of molecules. There are many other types which are detailed below. Various software systems are available to convert from one format to another.

<span class="mw-page-title-main">Force field (chemistry)</span> Concept on molecular modeling

In the context of chemistry and molecular modelling, a force field is a computational method that is used to estimate the forces between atoms within molecules and also between molecules. More precisely, the force field refers to the functional form and parameter sets used to calculate the potential energy of a system of atoms or coarse-grained particles in molecular mechanics, molecular dynamics, or Monte Carlo simulations. The parameters for a chosen energy function may be derived from experiments in physics and chemistry, calculations in quantum mechanics, or both. Force fields are interatomic potentials and utilize the same concept as force fields in classical physics, with the difference that the force field parameters in chemistry describe the energy landscape, from which the acting forces on every particle are derived as a gradient of the potential energy with respect to the particle coordinates.

<span class="mw-page-title-main">Martin Karplus</span>

Martin Karplus is an Austrian and American theoretical chemist. He is the Director of the Biophysical Chemistry Laboratory, a joint laboratory between the French National Center for Scientific Research and the University of Strasbourg, France. He is also the Theodore William Richards Professor of Chemistry, emeritus at Harvard University. Karplus received the 2013 Nobel Prize in Chemistry, together with Michael Levitt and Arieh Warshel, for "the development of multiscale models for complex chemical systems".

In computational biology, protein pKa calculations are used to estimate the pKa values of amino acids as they exist within proteins. These calculations complement the pKa values reported for amino acids in their free state, and are used frequently within the fields of molecular modeling, structural bioinformatics, and computational biology.

Implicit solvation is a method to represent solvent as a continuous medium instead of individual “explicit” solvent molecules, most often used in molecular dynamics simulations and in other applications of molecular mechanics. The method is often applied to estimate free energy of solute-solvent interactions in structural and chemical processes, such as folding or conformational transitions of proteins, DNA, RNA, and polysaccharides, association of biological macromolecules with ligands, or transport of drugs across biological membranes.

<span class="mw-page-title-main">Water model</span> Aspect of computational chemistry

In computational chemistry, a water model is used to simulate and thermodynamically calculate water clusters, liquid water, and aqueous solutions with explicit solvent. The models are determined from quantum mechanics, molecular mechanics, experimental results, and these combinations. To imitate a specific nature of molecules, many types of models have been developed. In general, these can be classified by the following three points; (i) the number of interaction points called site, (ii) whether the model is rigid or flexible, (iii) whether the model includes polarization effects.

Drude particles are model oscillators used to simulate the effects of electronic polarizability in the context of a classical molecular mechanics force field. They are inspired by the Drude model of mobile electrons and are used in the computational study of proteins, nucleic acids, and other biomolecules.

Free energy perturbation (FEP) is a method based on statistical mechanics that is used in computational chemistry for computing free energy differences from molecular dynamics or Metropolis Monte Carlo simulations.

This is a list of computer programs that are predominantly used for molecular mechanics calculations.

<span class="mw-page-title-main">MDynaMix</span>

Molecular Dynamics of Mixtures (MDynaMix) is a computer software package for general purpose molecular dynamics to simulate mixtures of molecules, interacting by AMBER- and CHARMM-like force fields in periodic boundary conditions. Algorithms are included for NVE, NVT, NPT, anisotropic NPT ensembles, and Ewald summation to treat electrostatic interactions. The code was written in a mix of Fortran 77 and 90. The package runs on Unix and Unix-like (Linux) workstations, clusters of workstations, and on Windows in sequential mode.

<span class="mw-page-title-main">Ascalaph Designer</span>

Ascalaph Designer is a computer program for general purpose molecular modelling for molecular design and simulations. It provides a graphical environment for the common programs of quantum and classical molecular modelling ORCA, NWChem, Firefly, CP2K and MDynaMix . The molecular mechanics calculations cover model building, energy optimizations and molecular dynamics. Firefly covers a wide range of quantum chemistry methods. Ascalaph Designer is free and open-source software, released under the GNU General Public License, version 2 (GPLv2).

MacroModel is a computer program for molecular modelling of organic compounds and biopolymers. It features various chemistry force fields, plus energy minimizing algorithms, to predict geometry and relative conformational energies of molecules. MacroModel is maintained by Schrödinger, LLC.

Local elevation is a technique used in computational chemistry or physics, mainly in the field of molecular simulation. It was developed in 1994 by Huber, Torda and van Gunsteren to enhance the searching of conformational space in molecular dynamics simulations and is available in the GROMOS software for molecular dynamics simulation. The method was, together with the conformational flooding method, the first to introduce memory dependence into molecular simulations. Many recent methods build on the principles of the local elevation technique, including the Engkvist-Karlström, adaptive biasing force, Wang–Landau, metadynamics, adaptively biased molecular dynamics, adaptive reaction coordinate forces, and local elevation umbrella sampling methods. The basic principle of the method is to add a memory-dependent potential energy term in the simulation so as to prevent the simulation to revisit already sampled configurations, which leads to the increased probability of discovering new configurations. The method can be seen as a continuous variant of the Tabu search method.

Discovery Studio is a suite of software for simulating small molecule and macromolecule systems. It is developed and distributed by Dassault Systemes BIOVIA.

In the context of chemistry and molecular modelling, the Interface force field (IFF) is a force field for classical molecular simulations of atoms, molecules, and assemblies up to the large nanometer scale, covering compounds from across the periodic table. It employs a consistent classical Hamiltonian energy function for metals, oxides, and organic compounds, linking biomolecular and materials simulation platforms into a single platform. The reliability is often higher than that of density functional theory calculations at more than a million times lower computational cost. IFF includes a physical-chemical interpretation for all parameters as well as a surface model database that covers different cleavage planes and surface chemistry of included compounds. The Interface Force Field is compatible with force fields for the simulation of primarily organic compounds and can be used with common molecular dynamics and Monte Carlo codes. Structures and energies of included chemical elements and compounds are rigorously validated and property predictions are up to a factor of 100 more accurate relative to earlier models.

References

  1. 1 2 "Versions - CHARMM". CHARMM (Chemistry at HARvard Macromolecular Mechanics). Harvard University. Retrieved 2021-03-29.
  2. 1 2 "Installation". CHARMM (Chemistry at HARvard Macromolecular Mechanics). Harvard University. 2016. Retrieved 2021-03-29.
  3. Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M (1983). "CHARMM: A program for macromolecular energy, minimization, and dynamics calculations". J. Comput. Chem. 4 (2): 187–217. doi:10.1002/jcc.540040211. S2CID   91559650.
  4. MacKerell, A.D. Jr.; Brooks, B.; Brooks, C. L., III; Nilsson, L.; Roux, B.; Won, Y.; Karplus, M. (1998). "CHARMM: The Energy Function and Its Parameterization with an Overview of the Program". In Schleyer, P.v.R.; et al. (eds.). The Encyclopedia of Computational Chemistry. Vol. 1. Chichester: John Wiley & Sons. pp. 271–277.{{cite encyclopedia}}: CS1 maint: multiple names: authors list (link)
  5. Brooks BR, Brooks CL 3rd, Mackerell AD Jr, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S, Caflisch A, Caves L, Cui Q, Dinner AR, Feig M, Fischer S, Gao J, Hodoscek M, Im W, Kuczera K, Lazaridis T, Ma J, Ovchinnikov V, Paci E, Pastor RW, Post CB, Pu JZ, Schaefer M, Tidor B, Venable RM, Woodcock HL, Wu X, Yang W, York DM, Karplus M (29 July 2009). "CHARMM: The biomolecular simulation program". Journal of Computational Chemistry. 30 (10): 1545–1614. doi:10.1002/jcc.21287. PMC   2810661 . PMID   19444816.
  6. Reiher, III WH (1985). Theoretical studies of hydrogen bonding (Thesis). Harvard University.
  7. 1 2 MacKerell AD Jr; et al. (1998). "All-atom empirical potential for molecular modeling and dynamics studies of proteins". J Phys Chem B. 102 (18): 3586–3616. doi:10.1021/jp973084f. PMID   24889800.
  8. MacKerell AD Jr, Feig M, Brooks III CL (2004). "Extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations". J Comput Chem. 25 (11): 1400–1415. doi:10.1002/jcc.20065. PMID   15185334. S2CID   11076418.
  9. Brooks CL, Chen J, Im W (2006). "Balancing solvation and intramolecular interactions: toward a consistent generalized born force field (CMAP opt. for GBSW)". J Am Chem Soc. 128 (11): 3728–3736. doi:10.1021/ja057216r. PMC   2596729 . PMID   16536547.
  10. Vanommeslaeghe, K.; MacKerell, A. D. (May 2015). "CHARMM additive and polarizable force fields for biophysics and computer-aided drug design". Biochimica et Biophysica Acta (BBA) - General Subjects. 1850 (5): 861–871. doi:10.1016/j.bbagen.2014.08.004. ISSN   0006-3002. PMC   4334745 . PMID   25149274.
  11. MacKerell AD Jr, Banavali N, Foloppe N (2001). "Development and current status of the CHARMM force field for nucleic acids". Biopolymers. 56 (4): 257–265. doi:10.1002/1097-0282(2000)56:4<257::AID-BIP10029>3.0.CO;2-W. PMID   11754339. S2CID   19502363.
  12. Vanommeslaeghe K, Hatcher E, Acharya C, Kundu S, Zhong S, Shim J, Darian E, Guvench O, Lopes P, Vorobyov I, Mackerell AD Jr (2009). "CHARMM general force field: A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields". J Comput Chem. 31 (4): 671–90. doi:10.1002/jcc.21367. PMC   2888302 . PMID   19575467.
  13. Patel S, Brooks CL 3rd (2004). "CHARMM fluctuating charge force field for proteins: I parameterization and application to bulk organic liquid simulations". J Comput Chem. 25 (1): 1–15. doi:10.1002/jcc.10355. PMID   14634989. S2CID   39320318.
  14. Patel S, Mackerell AD Jr, Brooks CL 3rd (2004). "CHARMM fluctuating charge force field for proteins: II protein/solvent properties from molecular dynamics simulations using a nonadditive electrostatic model". J Comput Chem. 25 (12): 1504–1514. doi: 10.1002/jcc.20077 . PMID   15224394. S2CID   16741310.
  15. Lamoureux G, Roux B (2003). "Modeling induced polarization with classical Drude oscillators: Theory and molecular dynamics simulation algorithm". J Chem Phys. 119 (6): 3025–3039. Bibcode:2003JChPh.119.3025L. doi: 10.1063/1.1589749 .
  16. Lamoureux G, Harder E, Vorobyov IV, Roux B, MacKerell AD (2006). "A polarizable model of water for molecular dynamics simulations of biomolecules". Chem Phys Lett. 418 (1–3): 245–249. Bibcode:2006CPL...418..245L. doi:10.1016/j.cplett.2005.10.135.
  17. Mackerell website
  18. Karplus M (2006). "Spinach on the ceiling: a theoretical chemist's return to biology". Annu Rev Biophys Biomol Struct. 35 (1): 1–47. doi: 10.1146/annurev.biophys.33.110502.133350 . PMID   16689626.
  19. The Clean Energy Project