Rietveld refinement is a technique described by Hugo Rietveld for use in the characterisation of crystalline materials. The neutron and X-ray diffraction of powder samples results in a pattern characterised by reflections (peaks in intensity) at certain positions. The height, width and position of these reflections can be used to determine many aspects of the material's structure.
The Rietveld method uses a least squares approach to refine a theoretical line profile until it matches the measured profile. The introduction of this technique was a significant step forward in the diffraction analysis of powder samples as, unlike other techniques at that time, it was able to deal reliably with strongly overlapping reflections.
The method was first implemented in 1967, [1] and reported in 1969 [2] for the diffraction of monochromatic neutrons where the reflection-position is reported in terms of the Bragg angle, 2θ. This terminology will be used here although the technique is equally applicable to alternative scales such as x-ray energy or neutron time-of-flight. The only wavelength and technique independent scale is in reciprocal space units or momentum transfer Q, which is historically rarely used in powder diffraction but very common in all other diffraction and optics techniques. The relation is
The most common powder X-ray diffraction (XRD) refinement technique used today is based on the method proposed in the 1960s by Hugo Rietveld. [2] The Rietveld method fits a calculated profile (including all structural and instrumental parameters) to experimental data. It employs the non-linear least squares method, and requires the reasonable initial approximation of many free parameters, including peak shape, unit cell dimensions and coordinates of all atoms in the crystal structure. Other parameters can be guessed while still being reasonably refined. In this way one can refine the crystal structure of a powder material from PXRD data. The successful outcome of the refinement is directly related to the quality of the data, the quality of the model (including initial approximations), and the experience of the user.
The Rietveld method is an incredibly powerful technique which began a remarkable era for powder XRD and materials science in general. Powder XRD is at heart a very basic experimental technique with diverse applications and experimental options. Despite being slightly limited by the one-dimensionality of PXRD data and limited resolution, powder XRD's power is astonishing. It is possible to determine the accuracy of a crystal structure model by fitting a profile to a 1D plot of observed intensity vs angle. Rietveld refinement requires a crystal structure model, and offers no way to come up with such a model on its own. However, it can be used to find structural details missing from a partial or complete ab initio structure solution, such as unit cell dimensions, phase quantities, crystallite sizes/shapes, atomic coordinates/bond lengths, micro strain in crystal lattice, texture, and vacancies. [3] [4]
Before exploring Rietveld refinement, it is necessary to establish a greater understanding of powder diffraction data and what information is encoded therein in order to establish a notion of how to create a model of a diffraction pattern, which is of course necessary in Rietveld refinement. A typical diffraction pattern can be described by the positions, shapes, and intensities of multiple Bragg reflections. Each of the three mentioned properties encodes some information relating to the crystal structure, the properties of the sample, and the properties of the instrumentation. Some of these contributions are shown in Table 1, below.
Pattern component | Crystal structure | Specimen property | Instrumental parameter |
---|---|---|---|
Peak position | Unit cell parameters (a, b, c, α, β, γ) |
|
|
Peak intensity | Atomic parameters (x, y, z, B, etc.) |
|
|
Peak shape |
|
|
|
The structure of a powder pattern is essentially defined by instrumental parameters and two crystallographic parameters: unit cell dimensions, and atomic content and coordination. So, a powder pattern model can be constructed as follows:
It is easy to model a powder pattern given the crystal structure of a material. The opposite, determining the crystal structure from a powder pattern, is much more complicated. A brief explanation of the process follows, though it is not the focus of this article.
To determine structure from a powder diffraction pattern the following steps should be taken. First, Bragg peak positions and intensities should be found by fitting to a peak shape function including background. Next, peak positions should be indexed and used to determine unit cell parameters, symmetry, and content. Third, peak intensities determine space group symmetry and atomic coordination. Finally, the model is used to refine all crystallographic and peak shape function parameters. To do this successfully, there is a requirement for excellent data which means good resolution, low background, and a large angular range.
For general application of the Rietveld method, irrespective of the software used, the observed Bragg peaks in a powder diffraction pattern are best described by the so-called peak shape function (PSF). The PSF is a convolution of three functions: the instrumental broadening , wavelength dispersion , and the specimen function , with the addition of a background function, . It is represented as follows:
where denotes convolution, which is defined for two functions and as an integral:
The instrumental function depends on the location and geometry of the source, monochromator, and sample. Wavelength function accounts for the distribution of the wavelengths in the source, and varies with the nature of the source and monochromatizing technique. The specimen function depends on several things. First is dynamic scattering, and secondly the physical properties of the sample such as crystallite size, and microstrain.
A short aside: unlike the other contributions, those from the specimen function can be interesting in materials characterization. As such, average crystallite size, , and microstrain, , effects on Bragg peak broadening, (in radians), can be described as follows, where is a constant:
Returning to the peak shape function, the goal is to correctly model the Bragg peaks which exist in the observed powder diffraction data. In the most general form, the intensity, , of the point (, where is the number of measured points) is the sum of the contributions from the overlapped Bragg peaks (), and the background, , and can be described as follows:
where is the intensity of the Bragg peak, and . Since is a multiplier, it is possible to analyze the behaviour of different normalized peak functions independently of peak intensity, under the condition that the integral over infinity of the PSF is unity. There are various functions that can be chosen to do this with varying degrees of complexity. The most basic functions used in this way to represent Bragg reflections are the Gauss, and Lorentzian functions. Most commonly though, is the pseudo-Voigt function, a weighted sum of the former two (the full Voigt profile is a convolution of the two, but is computationally more demanding). The pseudo-Voigt profile is the most common and is the basis for most other PSF's. The pseudo-Voigt function can be represented as:
where
and
are the Gaussian and Lorentzian contributions, respectively.
Thus,
where:
The pseudo-Voigt function, like the Gaussian and Lorentz functions, is a centrosymmetric function, and as such does not model asymmetry. This can be problematic for non-ideal powder XRD data, such as those collected at synchrotron radiation sources, which generally exhibit asymmetry due to the use of multiple focusing optics.
The Finger–Cox–Jephcoat function is similar to the pseudo-Voigt, but has better handling of asymmetry, which is treated in terms of axial divergence. The function is a convolution of pseudo-Voigt with the intersection of the diffraction cone and a finite receiving slit length using two geometrical parameters, , and , where and are the sample and the detector slit dimensions in the direction parallel to the goniometer axis, and is the goniometer radius. [7]
The shape of a powder diffraction reflection is influenced by the characteristics of the beam, the experimental arrangement, and the sample size and shape. In the case of monochromatic neutron sources the convolution of the various effects has been found to result in a reflex almost exactly Gaussian in shape. If this distribution is assumed then the contribution of a given reflection to the profile at position is:
where is the full width at half peak height (full-width half-maximum), is the center of the reflex, and is the calculated intensity of the reflex (determined from the structure factor, the Lorentz factor, and multiplicity of the reflection).
At very low diffraction angles the reflections may acquire an asymmetry due to the vertical divergence of the beam. Rietveld used a semi-empirical correction factor, to account for this asymmetry:
where is the asymmetry factor and is , , or depending on the difference being positive, zero, or negative respectively.
At a given position more than one diffraction peak may contribute to the profile. The intensity is simply the sum of all reflections contributing at the point .
For a Bragg peak , the observed integrated intensity, , as determined from numerical integration is
where is the total number of data points in the range of the Bragg peak. The integrated intensity depends on multiple factors, and can be expressed as the following product:
where:
The width of the diffraction peaks are found to broaden at higher Bragg angles. This angular dependency was originally represented by
where , , and are the half-width parameters and may be refined during the fit.
In powder samples there is a tendency for plate- or rod-like crystallites to align themselves along the axis of a cylindrical sample holder. In solid polycrystalline samples the production of the material may result in greater volume fraction of certain crystal orientations (commonly referred to as texture). In such cases the reflex intensities will vary from that predicted for a completely random distribution. Rietveld allowed for moderate cases of the former by introducing a correction factor:
where is the intensity expected for a random sample, is the preferred orientation parameter and is the acute angle between the scattering vector and the normal of the crystallites.
The principle of the Rietveld method is to minimize a function which analyzes the difference between a calculated profile and the observed data . Rietveld defined such an equation as:
where is the statistical weight and is an overall scale factor such that .
The fitting method used in Rietveld refinement is the non-linear least squares approach. A detailed derivation of non-linear least squares fitting will not be given here. Further detail can be found in Chapter 6 of Pecharsky and Zavalij's text 12 . There are a few things to note however. First, non-linear least squares fitting has an iterative nature for which convergence may be difficult to achieve if the initial approximation is too far from correct, or when the minimized function is poorly defined. The latter occurs when correlated parameters are being refined at the same time, which may result in divergence and instability of the minimization. This iterative nature also means that convergence to a solution does not occur immediately for the method is not exact. Each iteration depends on the results of the last which dictate the new set of parameters used for refinement. Thus, multiple refinement iterations are required to eventually converge to a possible solution.
Using non-linear least squares minimization, the following system is solved:
where is the calculated intensity and is the observed intensity of a point in the powder pattern, , is a scale factor, and is the number of measured data points. The minimized function is given by:
where is the weight, and from the previous equation is unity (since is usually absorbed in the phase scale factor). The summation extends to all data points. Considering the peak shape functions and accounting for the overlapping of Bragg peaks because of the one-dimensionality of XRD data, the expanded form of the above equation for the case of a single phase measured with a single wavelength becomes:
where:
For a material that contains several phases (), the contribution from each is accounted for by modifying the above equation as follows:
It can easily be seen from the above equations that experimentally minimizing the background, which holds no useful structural information, is paramount for a successful profile fitting. For a low background, the functions are defined by contributions from the integrated intensities and peak shape parameters. But with a high background, the function being minimized depends on the adequacy of the background and not integrated intensities or peak shapes. Thus, a structure refinement cannot adequately yield structural information in the presence of a large background.
It is also worth noting the increased complexity brought forth by the presence of multiple phases. Each additional phase adds to the fitting, more Bragg peaks, and another scale factor tied to corresponding structural parameters, and peak shape. Mathematically they are easily accounted for, but practically, due to the finite accuracy and limited resolution of experimental data, each new phase can lower the quality and stability of the refinement. It is advantageous to use single phase materials when interested in finding precise structural parameters of a material. However, since the scale factors of each phase are determined independently, Rietveld refinement of multi phase materials can quantitatively examine the mixing ratio of each phase in the material.
Generally, the background is calculated as a Chebyshev polynomial. In GSAS and GSAS-II they appear as follows. Again, background is treated as a Chebyshev polynomial of the first kind ("Handbook of Mathematical Functions", M. Abramowitz and IA. Stegun, Ch. 22), with intensity given by:
where are the coefficients of the Chebyshev polynomial taken from Table 22.3, pg. 795 of the Handbook. The coefficients have the form:
and the values for are found in the Handbook. The angular range () is converted to to make the Chebyshev polynomial orthogonal by
And, the orthogonal range for this function is –1 to +1.
Now, given the considerations of background, peak shape functions, integrated intensity, and non-linear least squares minimization, the parameters used in the Rietveld refinement which put these things together can be introduced. Below are the groups of independent least squares parameters generally refined in a Rietveld refinement.
Each Rietveld refinement is unique and there is no prescribed sequence of parameters to include in a refinement. It is up to the user to determine and find the best sequence of parameters for refinement. It is worth noting that it is rarely possible to refine all relevant variables simultaneously from the beginning of a refinement, nor near the end since the least squares fitting will be destabilized or lead to a false minimum. It is important for the user to determine a stopping point for a given refinement. Given the complexity of Rietveld refinement it is important to have a clear grasp of the system being studied (sample, and instrumentation) to ensure that results are accurate, realistic, and meaningful. High data quality, a large enough range, and a good model – to serve as the initial approximation in the least squares fitting – are necessary for a successful, reliable, and meaningful Rietveld refinement.
Since refinement depends on finding the best fit between a calculated and experimental pattern, it is important to have a numerical figure of merit quantifying the quality of the fit. Below are the figures of merit generally used to characterize the quality of a refinement. They provide insight to how well the model fits the observed data.
Profile residual (reliability factor):
Weighted profile residual:
Bragg residual:
Expected profile residual:
Goodness of fit:
It is worth mentioning that all but one () figure of merit include a contribution from the background. There are some concerns about the reliability of these figures, as well there is no threshold or accepted value which dictates what represents a good fit. [8] The most popular and conventional figure of merit used is the goodness of fit which should approach unity given a perfect fit, though this is rarely the case. In practice, the best way to assess quality is a visual analysis of the fit by plotting the difference between the observed and calculated data plotted on the same scale.
Diffraction is the interference or bending of waves around the corners of an obstacle or through an aperture into the region of geometrical shadow of the obstacle/aperture. The diffracting object or aperture effectively becomes a secondary source of the propagating wave. Italian scientist Francesco Maria Grimaldi coined the word diffraction and was the first to record accurate observations of the phenomenon in 1660.
A likelihood function measures how well a statistical model explains observed data by calculating the probability of seeing that data under different parameter values of the model. It is constructed from the joint probability distribution of the random variable that (presumably) generated the observations. When evaluated on the actual data points, it becomes a function solely of the model parameters.
In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed statistical model, the observed data is most probable. The point in the parameter space that maximizes the likelihood function is called the maximum likelihood estimate. The logic of maximum likelihood is both intuitive and flexible, and as such the method has become a dominant means of statistical inference.
In probability theory and statistics, the gamma distribution is a versatile two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-squared distribution are special cases of the gamma distribution. There are two equivalent parameterizations in common use:
In mathematics, a Gaussian function, often simply referred to as a Gaussian, is a function of the base form and with parametric extension for arbitrary real constants a, b and non-zero c. It is named after the mathematician Carl Friedrich Gauss. The graph of a Gaussian is a characteristic symmetric "bell curve" shape. The parameter a is the height of the curve's peak, b is the position of the center of the peak, and c controls the width of the "bell".
In statistics, the Lehmann–Scheffé theorem is a prominent statement, tying together the ideas of completeness, sufficiency, uniqueness, and best unbiased estimation. The theorem states that any estimator that is unbiased for a given unknown quantity and that depends on the data only through a complete, sufficient statistic is the unique best unbiased estimator of that quantity. The Lehmann–Scheffé theorem is named after Erich Leo Lehmann and Henry Scheffé, given their two early papers.
In many areas of science, Bragg's law, Wulff–Bragg's condition, or Laue–Bragg interference are a special case of Laue diffraction, giving the angles for coherent scattering of waves from a large crystal lattice. It describes how the superposition of wave fronts scattered by lattice planes leads to a strict relation between the wavelength and scattering angle. This law was initially formulated for X-rays, but it also applies to all types of matter waves including neutron and electron waves if there are a large number of atoms, as well as visible light with artificial periodic microscale lattices.
In optics, the Airy disk and Airy pattern are descriptions of the best-focused spot of light that a perfect lens with a circular aperture can make, limited by the diffraction of light. The Airy disk is of importance in physics, optics, and astronomy.
In optics, the Fraunhofer diffraction equation is used to model the diffraction of waves when plane waves are incident on a diffracting object, and the diffraction pattern is viewed at a sufficiently long distance from the object, and also when it is viewed at the focal plane of an imaging lens. In contrast, the diffraction pattern created near the diffracting object and is given by the Fresnel diffraction equation.
In mathematics, the Helmholtz equation is the eigenvalue problem for the Laplace operator. It corresponds to the elliptic partial differential equation: where ∇2 is the Laplace operator, k2 is the eigenvalue, and f is the (eigen)function. When the equation is applied to waves, k is known as the wave number. The Helmholtz equation has a variety of applications in physics and other sciences, including the wave equation, the diffusion equation, and the Schrödinger equation for a free particle.
The Stokes parameters are a set of values that describe the polarization state of electromagnetic radiation. They were defined by George Gabriel Stokes in 1852, as a mathematically convenient alternative to the more common description of incoherent or partially polarized radiation in terms of its total intensity (I), (fractional) degree of polarization (p), and the shape parameters of the polarization ellipse. The effect of an optical system on the polarization of light can be determined by constructing the Stokes vector for the input light and applying Mueller calculus, to obtain the Stokes vector of the light leaving the system. They can be determined from directly observable phenomena. The original Stokes paper was discovered independently by Francis Perrin in 1942 and by Subrahamanyan Chandrasekhar in 1947, who named it as the Stokes parameters.
An acousto-optic modulator (AOM), also called a Bragg cell or an acousto-optic deflector (AOD), uses the acousto-optic effect to diffract and shift the frequency of light using sound waves. They are used in lasers for Q-switching, telecommunications for signal modulation, and in spectroscopy for frequency control. A piezoelectric transducer is attached to a material such as glass. An oscillating electric signal drives the transducer to vibrate, which creates sound waves in the material. These can be thought of as moving periodic planes of expansion and compression that change the index of refraction. Incoming light scatters off the resulting periodic index modulation and interference occurs similar to Bragg diffraction. The interaction can be thought of as a three-wave mixing process resulting in sum-frequency generation or difference-frequency generation between phonons and photons.
Powder diffraction is a scientific technique using X-ray, neutron, or electron diffraction on powder or microcrystalline samples for structural characterization of materials. An instrument dedicated to performing such powder measurements is called a powder diffractometer.
The Debye–Waller factor (DWF), named after Peter Debye and Ivar Waller, is used in condensed matter physics to describe the attenuation of x-ray scattering or coherent neutron scattering caused by thermal motion. It is also called the B factor, atomic B factor, or temperature factor. Often, "Debye–Waller factor" is used as a generic term that comprises the Lamb–Mössbauer factor of incoherent neutron scattering and Mössbauer spectroscopy.
In condensed matter physics and crystallography, the static structure factor is a mathematical description of how a material scatters incident radiation. The structure factor is a critical tool in the interpretation of scattering patterns obtained in X-ray, electron and neutron diffraction experiments.
Diffraction processes affecting waves are amenable to quantitative description and analysis. Such treatments are applied to a wave passing through one or more slits whose width is specified as a proportion of the wavelength. Numerical approximations may be used, including the Fresnel and Fraunhofer approximations.
The Scherrer equation, in X-ray diffraction and crystallography, is a formula that relates the size of sub-micrometre crystallites in a solid to the broadening of a peak in a diffraction pattern. It is often referred to, incorrectly, as a formula for particle size measurement or analysis. It is named after Paul Scherrer. It is used in the determination of size of crystals in the form of powder.
In optics, the Fraunhofer diffraction equation is used to model the diffraction of waves when the diffraction pattern is viewed at a long distance from the diffracting object, and also when it is viewed at the focal plane of an imaging lens.
Le Bail analysis is a whole diffraction pattern profile fitting technique used to characterize the properties of crystalline materials, such as structure. It was invented by Armel Le Bail around 1988.
In X-ray diffraction, the Rachinger correction is a method for accounting for the effect of an undesired K-alpha 2 peak in the energy spectrum. Ideally, diffraction measurements are made with X-rays of a single wavelength. Practically, the x-rays for a measurement are usually generated in an X-ray tube from a metal's K-alpha line. This generation creates x-rays at a variety of wavelengths, but most of the non K-alpha X-rays can be blocked from reaching the sample by filters. However, the K-alpha line is actually two x-ray lines close together: the stronger K-alpha 1 peak, and the weaker K-alpha 2 peak. Compared to other radiation such as the Bremsstrahlung, the K-alpha two peak is more difficult to filter mechanically. The Rachinger correction is a recursive method suggested by William Albert Rachinger (1927) to eliminate the disturbing peak.