In mathematics, the Zernike polynomials are a sequence of polynomials that are orthogonal on the unit disk. Named after optical physicist Frits Zernike, laureate of the 1953 Nobel Prize in Physics and the inventor of phase-contrast microscopy, they play important roles in various optics branches such as beam optics and imaging. [1] [2]
There are even and odd Zernike polynomials. The even Zernike polynomials are defined as
(even function over the azimuthal angle ), and the odd Zernike polynomials are defined as
(odd function over the azimuthal angle ) where m and n are nonnegative integers with n ≥ m ≥ 0 (m = 0 for spherical Zernike polynomials), is the azimuthal angle, ρ is the radial distance , and are the radial polynomials defined below. Zernike polynomials have the property of being limited to a range of −1 to +1, i.e. . The radial polynomials are defined as
for an even number of n − m, while it is 0 for an odd number of n − m. A special value is
Rewriting the ratios of factorials in the radial part as products of binomials shows that the coefficients are integer numbers:
A notation as terminating Gaussian hypergeometric functions is useful to reveal recurrences, to demonstrate that they are special cases of Jacobi polynomials, to write down the differential equations, etc.:
for n − m even.
The inverse relation expands for fixed into
with rational coefficients [3]
for even .
The factor in the radial polynomial may be expanded in a Bernstein basis of for even or times a function of for odd in the range . The radial polynomial may therefore be expressed by a finite number of Bernstein Polynomials with rational coefficients:
Applications often involve linear algebra, where an integral over a product of Zernike polynomials and some other factor builds a matrix elements. To enumerate the rows and columns of these matrices by a single index, a conventional mapping of the two indices n and l to a single index j has been introduced by Noll. [4] The table of this association starts as follows (sequence A176988 in the OEIS ).
n,l | 0,0 | 1,1 | 1,−1 | 2,0 | 2,−2 | 2,2 | 3,−1 | 3,1 | 3,−3 | 3,3 |
---|---|---|---|---|---|---|---|---|---|---|
j | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
n,l | 4,0 | 4,2 | 4,−2 | 4,4 | 4,−4 | 5,1 | 5,−1 | 5,3 | 5,−3 | 5,5 |
j | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 |
The rule is the following.
OSA [5] and ANSI single-index Zernike polynomials using:
n,l | 0,0 | 1,−1 | 1,1 | 2,−2 | 2,0 | 2,2 | 3,−3 | 3,−1 | 3,1 | 3,3 |
---|---|---|---|---|---|---|---|---|---|---|
j | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
n,l | 4,−4 | 4,−2 | 4,0 | 4,2 | 4,4 | 5,−5 | 5,−3 | 5,−1 | 5,1 | 5,3 |
j | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 |
The Fringe indexing scheme is used in commercial optical design software and optical testing in, e.g., photolithography. [6] [7]
where is the sign or signum function. The first 20 fringe numbers are listed below.
n,l | 0,0 | 1,1 | 1,−1 | 2,0 | 2,2 | 2,−2 | 3,1 | 3,−1 | 4,0 | 3,3 |
---|---|---|---|---|---|---|---|---|---|---|
j | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
n,l | 3,−3 | 4,2 | 4,−2 | 5,1 | 5,−1 | 6,0 | 4,4 | 4,−4 | 5,3 | 5,−3 |
j | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 |
James C. Wyant uses the "Fringe" indexing scheme except it starts at 0 instead of 1 (subtract 1). [8] This method is commonly used including interferogram analysis software in Zygo interferometers and the open source software DFTFringe.
They satisfy the Rodrigues' formula
and can be related to the Jacobi polynomials as
The orthogonality in the radial part reads [9]
or
Orthogonality in the angular part is represented by the elementary
where (sometimes called the Neumann factor because it frequently appears in conjunction with Bessel functions) is defined as 2 if and 1 if . The product of the angular and radial parts establishes the orthogonality of the Zernike functions with respect to both indices if integrated over the unit disk,
where is the Jacobian of the circular coordinate system, and where and are both even.
Any sufficiently smooth real-valued phase field over the unit disk can be represented in terms of its Zernike coefficients (odd and even), just as periodic functions find an orthogonal representation with the Fourier series. We have
where the coefficients can be calculated using inner products. On the space of functions on the unit disk, there is an inner product defined by
The Zernike coefficients can then be expressed as follows:
Alternatively, one can use the known values of phase function G on the circular grid to form a system of equations. The phase function is retrieved by the unknown-coefficient weighted product with (known values) of Zernike polynomial across the unit grid. Hence, coefficients can also be found by solving a linear system, for instance by matrix inversion. Fast algorithms to calculate the forward and inverse Zernike transform use symmetry properties of trigonometric functions, separability of radial and azimuthal parts of Zernike polynomials, and their rotational symmetries.
The reflections of trigonometric functions result that the parity with respect to reflection along the x axis is
The π shifts of trigonometric functions result that the parity with respect to point reflection at the center of coordinates is
where could as well be written because as even numbers are only cases to get non-vanishing Zernike polynomials. (If n is even then l is also even. If n is odd, then l is also odd.) This property is sometimes used to categorize Zernike polynomials into even and odd polynomials in terms of their angular dependence. (it is also possible to add another category with l = 0 since it has a special property of no angular dependence.)
The radial polynomials are also either even or odd, depending on order n or m:
These equalities are easily seen since with an odd (even) m contains only odd (even) powers to ρ (see examples of below).
The periodicity of the trigonometric functions results in invariance if rotated by multiples of radian around the center:
The Zernike polynomials satisfy the following recurrence relation which depends neither on the degree nor on the azimuthal order of the radial polynomials: [10]
From the definition of it can be seen that and . The following three-term recurrence relation [11] then allows to calculate all other :
The above relation is especially useful since the derivative of can be calculated from two radial Zernike polynomials of adjacent degree: [11]
The differential equation of the Gaussian Hypergeometric Function is equivalent to
The first few radial polynomials are:
The first few Zernike modes, at various indices, are shown below. They are normalized such that: , which is equivalent to .
OSA/ANSI index () | Noll index () | Wyant index () | Fringe/UA index () | Radial degree () | Azimuthal degree () | Classical name | ||
---|---|---|---|---|---|---|---|---|
0 | 1 | 0 | 1 | 0 | 0 | Piston (see, Wigner semicircle distribution) | ||
1 | 3 | 2 | 3 | 1 | −1 | Tilt (Y-Tilt, vertical tilt) | ||
2 | 2 | 1 | 2 | 1 | +1 | Tilt (X-Tilt, horizontal tilt) | ||
3 | 5 | 5 | 6 | 2 | −2 | Oblique astigmatism | ||
4 | 4 | 3 | 4 | 2 | 0 | Defocus (longitudinal position) | ||
5 | 6 | 4 | 5 | 2 | +2 | Vertical astigmatism | ||
6 | 9 | 10 | 11 | 3 | −3 | Vertical trefoil | ||
7 | 7 | 7 | 8 | 3 | −1 | Vertical coma | ||
8 | 8 | 6 | 7 | 3 | +1 | Horizontal coma | ||
9 | 10 | 9 | 10 | 3 | +3 | Oblique trefoil | ||
10 | 15 | 17 | 18 | 4 | −4 | Oblique quadrafoil | ||
11 | 13 | 12 | 13 | 4 | −2 | Oblique secondary astigmatism | ||
12 | 11 | 8 | 9 | 4 | 0 | Primary spherical | ||
13 | 12 | 11 | 12 | 4 | +2 | Vertical secondary astigmatism | ||
14 | 14 | 16 | 17 | 4 | +4 | Vertical quadrafoil |
The functions are a basis defined over the circular support area, typically the pupil planes in classical optical imaging at visible and infrared wavelengths through systems of lenses and mirrors of finite diameter. Their advantages are the simple analytical properties inherited from the simplicity of the radial functions and the factorization in radial and azimuthal functions; this leads, for example, to closed-form expressions of the two-dimensional Fourier transform in terms of Bessel functions. [12] [13] Their disadvantage, in particular if high n are involved, is the unequal distribution of nodal lines over the unit disk, which introduces ringing effects near the perimeter , which often leads attempts to define other orthogonal functions over the circular disk. [14] [15] [16]
In precision optical manufacturing, Zernike polynomials are used to characterize higher-order errors observed in interferometric analyses. In wavefront slope sensors like the Shack-Hartmann, Zernike coefficients of the wavefront can be obtained by fitting measured slopes with Zernike polynomial derivatives averaged over the sampling subapertures. [17] In optometry and ophthalmology, Zernike polynomials are used to describe wavefront aberrations of the cornea or lens from an ideal spherical shape, which result in refraction errors. They are also commonly used in adaptive optics, where they can be used to characterize atmospheric distortion. Obvious applications for this are IR or visual astronomy and satellite imagery.
Another application of the Zernike polynomials is found in the Extended Nijboer–Zernike theory of diffraction and aberrations.
Zernike polynomials are widely used as basis functions of image moments. Since Zernike polynomials are orthogonal to each other, Zernike moments can represent properties of an image with no redundancy or overlap of information between the moments. Although Zernike moments are significantly dependent on the scaling and the translation of the object in a region of interest (ROI), their magnitudes are independent of the rotation angle of the object. [18] Thus, they can be utilized to extract features from images that describe the shape characteristics of an object. For instance, Zernike moments are utilized as shape descriptors to classify benign and malignant breast masses [19] or the surface of vibrating disks. [20] Zernike Moments also have been used to quantify shape of osteosarcoma cancer cell lines in single cell level. [21] Moreover, Zernike Moments have been used for early detection of Alzheimer's disease by extracting discriminative information from the MR images of Alzheimer's disease, Mild cognitive impairment, and Healthy groups. [22]
The concept translates to higher dimensions D if multinomials in Cartesian coordinates are converted to hyperspherical coordinates, , multiplied by a product of Jacobi polynomials of the angular variables. In dimensions, the angular variables are spherical harmonics, for example. Linear combinations of the powers define an orthogonal basis satisfying
(Note that a factor is absorbed in the definition of R here, whereas in the normalization is chosen slightly differently. This is largely a matter of taste, depending on whether one wishes to maintain an integer set of coefficients or prefers tighter formulas if the orthogonalization is involved.) The explicit representation is [3]
for even , else identical to zero.
In mathematics and physics, Laplace's equation is a second-order partial differential equation named after Pierre-Simon Laplace, who first studied its properties. This is often written as or where is the Laplace operator, is the divergence operator, is the gradient operator, and is a twice-differentiable real-valued function. The Laplace operator therefore maps a scalar function to another scalar function.
The Navier–Stokes equations are partial differential equations which describe the motion of viscous fluid substances. They were named after French engineer and physicist Claude-Louis Navier and the Irish physicist and mathematician George Gabriel Stokes. They were developed over several decades of progressively building the theories, from 1822 (Navier) to 1842–1850 (Stokes).
A cylindrical coordinate system is a three-dimensional coordinate system that specifies point positions by the distance from a chosen reference axis (axis L in the image opposite), the direction from the axis relative to a chosen reference direction (axis A), and the distance from a chosen reference plane perpendicular to the axis (plane containing the purple section). The latter distance is given as a positive or negative number depending on which side of the reference plane faces the point.
In mathematics and physical science, spherical harmonics are special functions defined on the surface of a sphere. They are often employed in solving partial differential equations in many scientific fields. The table of spherical harmonics contains a list of common spherical harmonics.
In electromagnetism, the Mie solution to Maxwell's equations describes the scattering of an electromagnetic plane wave by a homogeneous sphere. The solution takes the form of an infinite series of spherical multipole partial waves. It is named after German physicist Gustav Mie.
In geometry, a cardioid is a plane curve traced by a point on the perimeter of a circle that is rolling around a fixed circle of the same radius. It can also be defined as an epicycloid having a single cusp. It is also a type of sinusoidal spiral, and an inverse curve of the parabola with the focus as the center of inversion. A cardioid can also be defined as the set of points of reflections of a fixed point on a circle through all tangents to the circle.
In the theory of stochastic processes, the Karhunen–Loève theorem, also known as the Kosambi–Karhunen–Loève theorem states that a stochastic process can be represented as an infinite linear combination of orthogonal functions, analogous to a Fourier series representation of a function on a bounded interval. The transformation is also known as Hotelling transform and eigenvector transform, and is closely related to principal component analysis (PCA) technique widely used in image processing and in data analysis in many fields.
In mathematics, the Hankel transform expresses any given function f(r) as the weighted sum of an infinite number of Bessel functions of the first kind Jν(kr). The Bessel functions in the sum are all of the same order ν, but differ in a scaling factor k along the r axis. The necessary coefficient Fν of each Bessel function in the sum, as a function of the scaling factor k constitutes the transformed function. The Hankel transform is an integral transform and was first developed by the mathematician Hermann Hankel. It is also known as the Fourier–Bessel transform. Just as the Fourier transform for an infinite interval is related to the Fourier series over a finite interval, so the Hankel transform over an infinite interval is related to the Fourier–Bessel series over a finite interval.
In mathematics (specifically multivariable calculus), a multiple integral is a definite integral of a function of several real variables, for instance, f(x, y) or f(x, y, z).
In physics, the Einstein relation is a previously unexpected connection revealed independently by William Sutherland in 1904, Albert Einstein in 1905, and by Marian Smoluchowski in 1906 in their works on Brownian motion. The more general form of the equation in the classical case is
A ratio distribution is a probability distribution constructed as the distribution of the ratio of random variables having two other known distributions. Given two random variables X and Y, the distribution of the random variable Z that is formed as the ratio Z = X/Y is a ratio distribution.
In mathematics, the secondary measure associated with a measure of positive density ρ when there is one, is a measure of positive density μ, turning the secondary polynomials associated with the orthogonal polynomials for ρ into an orthogonal system.
The Timoshenko–Ehrenfest beam theory was developed by Stephen Timoshenko and Paul Ehrenfest early in the 20th century. The model takes into account shear deformation and rotational bending effects, making it suitable for describing the behaviour of thick beams, sandwich composite beams, or beams subject to high-frequency excitation when the wavelength approaches the thickness of the beam. The resulting equation is of 4th order but, unlike Euler–Bernoulli beam theory, there is also a second-order partial derivative present. Physically, taking into account the added mechanisms of deformation effectively lowers the stiffness of the beam, while the result is a larger deflection under a static load and lower predicted eigenfrequencies for a given set of boundary conditions. The latter effect is more noticeable for higher frequencies as the wavelength becomes shorter, and thus the distance between opposing shear forces decreases.
In mathematics, the cylindrical harmonics are a set of linearly independent functions that are solutions to Laplace's differential equation, , expressed in cylindrical coordinates, ρ (radial coordinate), φ (polar angle), and z (height). Each function Vn(k) is the product of three terms, each depending on one coordinate alone. The ρ-dependent term is given by Bessel functions (which occasionally are also called cylindrical harmonics).
In mathematics, vector spherical harmonics (VSH) are an extension of the scalar spherical harmonics for use with vector fields. The components of the VSH are complex-valued functions expressed in the spherical coordinate basis vectors.
The Mehler kernel is a complex-valued function found to be the propagator of the quantum harmonic oscillator.
In mathematics, infinite compositions of analytic functions (ICAF) offer alternative formulations of analytic continued fractions, series, products and other infinite expansions, and the theory evolving from such compositions may shed light on the convergence/divergence of these expansions. Some functions can actually be expanded directly as infinite compositions. In addition, it is possible to use ICAF to evaluate solutions of fixed point equations involving infinite expansions. Complex dynamics offers another venue for iteration of systems of functions rather than a single function. For infinite compositions of a single function see Iterated function. For compositions of a finite number of functions, useful in fractal theory, see Iterated function system.
In quantum probability, the Belavkin equation, also known as Belavkin-Schrödinger equation, quantum filtering equation, stochastic master equation, is a quantum stochastic differential equation describing the dynamics of a quantum system undergoing observation in continuous time. It was derived and henceforth studied by Viacheslav Belavkin in 1988.
The removal of heat from nuclear reactors is an essential step in the generation of energy from nuclear reactions. In nuclear engineering there are a number of empirical or semi-empirical relations used for quantifying the process of removing heat from a nuclear reactor core so that the reactor operates in the projected temperature interval that depends on the materials used in the construction of the reactor. The effectiveness of removal of heat from the reactor core depends on many factors, including the cooling agents used and the type of reactor. Common liquid coolants for nuclear reactors include: deionized water, heavy water, the lighter alkaline metals, lead or lead-based eutectic alloys like lead-bismuth, and NaK, a eutectic alloy of sodium and potassium. Gas cooled reactors operate with coolants like carbon dioxide, helium or nitrogen but some very low powered research reactors have even been air-cooled with Chicago Pile 1 relying on natural convection of the surrounding air to remove the negligible thermal power output. There is ongoing research into using supercritical fluids as reactor coolants but thus far neither the supercritical water reactor nor a reactor cooled with supercritical Carbon Dioxide nor any other kind of supercritical-fluid-cooled reactor has ever been built.
In mathematics, a conical spiral, also known as a conical helix, is a space curve on a right circular cone, whose floor projection is a plane spiral. If the floor projection is a logarithmic spiral, it is called conchospiral.