Molecular replacement

Last updated

Molecular replacement (MR) [1] is a method of solving the phase problem in X-ray crystallography. MR relies upon the existence of a previously solved protein structure which is similar to our unknown structure from which the diffraction data is derived. This could come from a homologous protein, or from the lower-resolution protein NMR structure of the same protein. [2]

Contents

The first goal of the crystallographer is to obtain an electron density map, density being related with diffracted wave as follows:

With usual detectors the intensity is being measured, and all the information about phase () is lost. Then, in the absence of phases (Φ), we are unable to complete the shown Fourier transform relating the experimental data from X-ray crystallography (in reciprocal space) to real-space electron density, into which the atomic model is built. MR tries to find the model which fits best experimental intensities among known structures.

Principles of Patterson-based molecular replacement

We can derive a Patterson map for the intensities, which is an interatomic vector map created by squaring the structure factor amplitudes and setting all phases to zero. This vector map contains a peak for each atom related to every other atom, with a large peak at 0,0,0, where vectors relating atoms to themselves "pile up". Such a map is far too noisy to derive any high resolution structural informationhowever if we generate Patterson maps for the data derived from our unknown structure, and from the structure of a previously solved homologue, in the correct orientation and position within the unit cell, the two Patterson maps should be closely correlated. This principle lies at the heart of MR, and can allow us to infer information about the orientation and location of an unknown molecule with its unit cell.

Due to historic limitations in computing power, an MR search is typically divided into two steps: rotation and translation.

Rotation function

In the rotation function, our unknown Patterson map is compared to Patterson maps derived from our known homologue structure in different orientations. Historically r-factors and/or correlation coefficients were used to score the rotation function, however, modern programs use maximum likelihood-based algorithms. The highest correlation (and therefore scores) are obtained when the two structures (known and unknown) are in similar orientation(s)these can then be output in Euler angles or spherical polar angles.

Translation function

In the translation function, the now correctly oriented known model can be correctly positioned by translating it to the correct co-ordinates within the asymmetric unit. This is accomplished by moving the model, calculating a new Patterson map, and comparing it to the unknown-derived Patterson map. This brute-force search is computationally expensive and fast translation functions are now more commonly used. Positions with high correlations are output in Cartesian coordinates.

Using de novo predicted structures in molecular replacement

With the improvement of de novo protein structure prediction, many protocols including MR-Rosetta, QUARK, AWSEM-Suite and I-TASSER-MR can generate a lot of native-like decoy structures that are useful to solve the phase problem by molecular replacement. [3]

The next step

Following this, we should have correctly oriented and translated phasing models, from which we can derive phases which are (hopefully) accurate enough to derive electron density maps. These can be used to build and refine an atomic model of our unknown structure.

Related Research Articles

<span class="mw-page-title-main">Atomic orbital</span> Function describing an electron in an atom

In atomic theory and quantum mechanics, an atomic orbital is a function describing the location and wave-like behavior of an electron in an atom. This function can be used to calculate the probability of finding any electron of an atom in any specific region around the atom's nucleus. The term atomic orbital may also refer to the physical region or space where the electron can be calculated to be present, as predicted by the particular mathematical form of the orbital.

<span class="mw-page-title-main">Crystallography</span> Scientific study of crystal structures

Crystallography is the experimental science of determining the arrangement of atoms in crystalline solids. Crystallography is a fundamental subject in the fields of materials science and solid-state physics. The word crystallography is derived from the Ancient Greek word κρύσταλλος, with its meaning extending to all solids with some degree of transparency, and γράφειν. In July 2012, the United Nations recognised the importance of the science of crystallography by proclaiming that 2014 would be the International Year of Crystallography.

<span class="mw-page-title-main">X-ray crystallography</span> Technique used for determining crystal structures and identifying mineral compounds

X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angles and intensities of these diffracted beams, a crystallographer can produce a three-dimensional picture of the density of electrons within the crystal. From this electron density, the mean positions of the atoms in the crystal can be determined, as well as their chemical bonds, their crystallographic disorder, and various other information.

<span class="mw-page-title-main">Chemical structure</span> Organized way in which molecules are ordered and sorted

A chemical structure of a molecule is a spatial arrangement of its atoms and their chemical bonds. Its determination includes a chemist's specifying the molecular geometry and, when feasible and necessary, the electronic structure of the target molecule or other solid. Molecular geometry refers to the spatial arrangement of atoms in a molecule and the chemical bonds that hold the atoms together and can be represented using structural formulae and by molecular models; complete electronic structure descriptions include specifying the occupation of a molecule's molecular orbitals. Structure determination can be applied to a range of targets from very simple molecules to very complex ones.

<span class="mw-page-title-main">Structural bioinformatics</span> Bioinformatics subfield

Structural bioinformatics is the branch of bioinformatics that is related to the analysis and prediction of the three-dimensional structure of biological macromolecules such as proteins, RNA, and DNA. It deals with generalizations about macromolecular 3D structures such as comparisons of overall folds and local motifs, principles of molecular folding, evolution, binding interactions, and structure/function relationships, working both from experimentally solved structures and from computational models. The term structural has the same meaning as in structural biology, and structural bioinformatics can be seen as a part of computational structural biology. The main objective of structural bioinformatics is the creation of new methods of analysing and manipulating biological macromolecular data in order to solve problems in biology and generate new knowledge.

In computational physics and chemistry, the Hartree–Fock (HF) method is a method of approximation for the determination of the wave function and the energy of a quantum many-body system in a stationary state.

<span class="mw-page-title-main">Molecular geometry</span> Study of the 3D shapes of molecules

Molecular geometry is the three-dimensional arrangement of the atoms that constitute a molecule. It includes the general shape of the molecule as well as bond lengths, bond angles, torsional angles and any other geometrical parameters that determine the position of each atom.

In physics, the phase problem is the problem of loss of information concerning the phase that can occur when making a physical measurement. The name comes from the field of X-ray crystallography, where the phase problem has to be solved for the determination of a structure from diffraction data. The phase problem is also met in the fields of imaging and signal processing. Various approaches of phase retrieval have been developed over the years.

Electron crystallography is a method to determine the arrangement of atoms in solids using a transmission electron microscope (TEM). It can involve the use of high-resolution transmission electron microscopy images, electron diffraction patterns including convergent-beam electron diffraction or combinations of these. It has been successful in determining some bulk structures, and also surface structures. Two related methods are low-energy electron diffraction which has solved the structure of many surfaces, and reflection high-energy electron diffraction which is used to monitor surfaces often during growth.

The Patterson function is used to solve the phase problem in X-ray crystallography. It was introduced in 1935 by Arthur Lindo Patterson while he was a visiting researcher in the laboratory of Bertram Eugene Warren at MIT.

In X-ray crystallography, a difference density map shows the spatial distribution of the difference between the measured electron density of the crystal and the electron density explained by the current model.

Multiple isomorphous replacement (MIR) is historically the most common approach to solving the phase problem in X-ray crystallography studies of proteins. For protein crystals this method is conducted by soaking the crystal of a sample to be analyzed with a heavy atom solution or co-crystallization with the heavy atom. The addition of the heavy atom (or ion) to the structure should not affect the crystal formation or unit cell dimensions in comparison to its native form, hence, they should be isomorphic.

Resolution in terms of electron density is a measure of the resolvability in the electron density map of a molecule. In X-ray crystallography, resolution is the highest resolvable peak in the diffraction pattern, while resolution in cryo-electron microscopy is a frequency space comparison of two halves of the data, which strives to correlate with the X-ray definition.

A crystallographic database is a database specifically designed to store information about the structure of molecules and crystals. Crystals are solids having, in all three dimensions of space, a regularly repeating arrangement of atoms, ions, or molecules. They are characterized by symmetry, morphology, and directionally dependent physical properties. A crystal structure describes the arrangement of atoms, ions, or molecules in a crystal.

<span class="mw-page-title-main">Coot (software)</span>

The program Coot is used to display and manipulate atomic models of macromolecules, typically of proteins or nucleic acids, using 3D computer graphics. It is primarily focused on building and validation of atomic models into three-dimensional electron density maps obtained by X-ray crystallography methods, although it has also been applied to data from electron microscopy.

<span class="mw-page-title-main">Crystallographic image processing</span>

Crystallographic image processing (CIP) is traditionally understood as being a set of key steps in the determination of the atomic structure of crystalline matter from high-resolution electron microscopy (HREM) images obtained in a transmission electron microscope (TEM) that is run in the parallel illumination mode. The term was created in the research group of Sven Hovmöller at Stockholm University during the early 1980s and became rapidly a label for the "3D crystal structure from 2D transmission/projection images" approach. Since the late 1990s, analogous and complementary image processing techniques that are directed towards the achieving of goals with are either complementary or entirely beyond the scope of the original inception of CIP have been developed independently by members of the computational symmetry/geometry, scanning transmission electron microscopy, scanning probe microscopy communities, and applied crystallography communities.

Resolution by Proxy (ResProx) is a method for assessing the equivalent X-ray resolution of NMR-derived protein structures. ResProx calculates resolution from coordinate data rather than from electron density or other experimental inputs. This makes it possible to calculate the resolution of a structure regardless of how it was solved. ResProx was originally designed to serve as a simple, single-number evaluation that allows straightforward comparison between the quality/resolution of X-ray structures and the quality of a given NMR structure. However, it can also be used to assess the reliability of an experimentally reported X-ray structure resolution, to evaluate protein structures solved by unconventional or hybrid means and to identify fraudulent structures deposited in the PDB. ResProx incorporates more than 25 different structural features to determine a single resolution-like value. ResProx values are reported in Angstroms. Tests on thousands of X-ray structures show that ResProx values match very closely to resolution values reported by X-ray crystallographers. Resolution-by-proxy values can be calculated for newly determined protein structures using a freely accessible ResProx web server. This server accepts protein coordinate data and generates a resolution estimate for that input structure.

<span class="mw-page-title-main">Multipole density formalism</span>

The Multipole Density Formalism is an X-ray crystallography method of electron density modelling proposed by Niels K. Hansen and Philip Coppens in 1978. Unlike the commonly used Independent Atom Model, the Hansen-Coppens Formalism presents an aspherical approach, allowing one to model the electron distribution around a nucleus separately in different directions and therefore describe numerous chemical features of a molecule inside the unit cell of an examined crystal in detail.

In crystallography, direct methods is a set of techniques used for structure determination using diffraction data and a priori information. It is a solution to the crystallographic phase problem, where phase information is lost during a diffraction measurement. Direct methods provides a method of estimating the phase information by establishing statistical relationships between the recorded amplitude information and phases of strong reflections.

This is a timeline of crystallography.

References

  1. Ch 10 in "Principles of Protein X-ray Crystallography", by Jan Drenth (2nd Edn.) Springer, 1999
  2. Ramelot, TA; Raman, S; Kuzin, AP; Xiao, R; Ma, LC; Acton, TB; Hunt, JF; Montelione, GT; Baker, D; Kennedy, MA (April 2009). "Improving NMR protein structure quality by Rosetta refinement: a molecular replacement study". Proteins. 75 (1): 147–67. doi:10.1002/prot.22229. PMC   3612016 . PMID   18816799.
  3. Jin, Shikai; Miller, Mitchell D.; Chen, Mingchen; Schafer, Nicholas P.; Lin, Xingcheng; Chen, Xun; Phillips, George N.; Wolynes, Peter G. (1 November 2020). "Molecular-replacement phasing using predicted protein structures from AWSEM-Suite". IUCrJ. 7 (6): 1168–1178. doi: 10.1107/S2052252520013494 . PMC   7642774 . PMID   33209327.