Generalized structure tensor

In image analysis, the generalized structure tensor (GST) is an extension of the Cartesian structure tensor to curvilinear coordinates. [1] It is mainly used to detect and to represent the "direction" parameters of curves, just as the Cartesian structure tensor detects and represents the direction in Cartesian coordinates. Curve families generated by pairs of locally orthogonal functions have been the best studied.

The GST is a widely known method in image and video processing and in computer vision, with applications such as biometric identification by fingerprints [2] and studies of human tissue sections. [3] [4]

GST in 2D and locally orthogonal bases

Let the term image represent a function $f(\xi(x,y), \eta(x,y))$, where $x, y$ are real variables and $\xi$ and $\eta$ are real-valued functions. The GST represents the direction along which the image $f$ can undergo an infinitesimal translation with minimal (total least squares) error, along the "lines" fulfilling the following conditions:

1. The "lines" are ordinary lines in the curvilinear coordinate basis $\xi, \eta$,

$$\cos(\theta)\,\xi(x,y) + \sin(\theta)\,\eta(x,y) = \text{constant},$$

which are curves in Cartesian coordinates, as the equation above shows. The error is measured in the $L^2$ sense, so the minimality of the error refers to the $L^2$ norm.

2. The functions $\xi(x,y), \eta(x,y)$ constitute a harmonic pair, i.e. they fulfill the Cauchy–Riemann equations,

$$\frac{\partial \xi}{\partial x} = \frac{\partial \eta}{\partial y}, \qquad \frac{\partial \xi}{\partial y} = -\frac{\partial \eta}{\partial x}.$$

Accordingly, such curvilinear coordinates $\xi, \eta$ are locally orthogonal.
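As a minimal numerical sketch of these two conditions, the check below takes $\xi$ and $\eta$ to be the real and imaginary parts of $\log(x+iy)$ (an illustrative choice), estimates the partial derivatives by central differences, and evaluates the two Cauchy–Riemann residuals together with the inner product of the two gradients. The function names are hypothetical and are not taken from the cited papers.

```python
import cmath

def xi(x, y):
    """Real part of log(x + iy), i.e. the logarithm of the radius."""
    return cmath.log(complex(x, y)).real

def eta(x, y):
    """Imaginary part of log(x + iy), i.e. the polar angle."""
    return cmath.log(complex(x, y)).imag

def partials(func, x, y, eps=1e-6):
    """Central-difference estimates of (d/dx, d/dy) of func at (x, y)."""
    dx = (func(x + eps, y) - func(x - eps, y)) / (2 * eps)
    dy = (func(x, y + eps) - func(x, y - eps)) / (2 * eps)
    return dx, dy

def cauchy_riemann_residuals(x, y):
    """Return (xi_x - eta_y, xi_y + eta_x, grad(xi) . grad(eta)).

    All three values are close to zero for a harmonic pair, away from the
    origin and from the branch cut of the angle (negative x axis).
    """
    xi_x, xi_y = partials(xi, x, y)
    eta_x, eta_y = partials(eta, x, y)
    return xi_x - eta_y, xi_y + eta_x, xi_x * eta_x + xi_y * eta_y

residuals = cauchy_riemann_residuals(1.5, 0.7)
```

The third residual is the local-orthogonality statement: the gradients of $\xi$ and $\eta$ are perpendicular wherever the pair is harmonic.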

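The GST then consists of

$$\mathrm{GST} = \iint w(\xi,\eta) \begin{pmatrix} f_\xi^2 & f_\xi f_\eta \\ f_\xi f_\eta & f_\eta^2 \end{pmatrix} d\xi\, d\eta = (\lambda_{\max} - \lambda_{\min}) \begin{pmatrix} \cos\theta \\ \sin\theta \end{pmatrix} \begin{pmatrix} \cos\theta & \sin\theta \end{pmatrix} + \lambda_{\min} I,$$

where $f_\xi = \partial f/\partial \xi$ and $f_\eta = \partial f/\partial \eta$, and $\lambda_{\min}$ and $\lambda_{\max}$, with $0 \le \lambda_{\min} \le \lambda_{\max}$, are the errors of (infinitesimal) translation in the best direction (designated by the angle $\theta$) and the worst direction (designated by $\theta + \pi/2$). The function $w(\xi,\eta)$ is the window function defining the "outer scale" within which the detection of $\theta$ is carried out; it can be omitted if it is already included in $f$ or if $f$ is the full image (rather than local). The matrix $I$ is the identity matrix. Using the chain rule, it can be shown that the integration above can be implemented as convolutions in Cartesian coordinates applied to the ordinary structure tensor when $\xi, \eta$ are the real and imaginary parts of an analytic function $g(z)$,

$$g(z) = \xi(x,y) + i\,\eta(x,y),$$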
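The decomposition of the tensor into an angle and two errors can be sketched in closed form. The example assumes the GST is available as a symmetric, positive semidefinite 2×2 matrix [[a, b], [b, c]] in the curvilinear coordinates and writes it as $(\lambda_{\max} - \lambda_{\min})\,u u^T + \lambda_{\min} I$ with $u = (\cos\theta, \sin\theta)$; in this form $u$ is the eigenvector belonging to $\lambda_{\max}$, i.e. the normal of the "lines". The function names are hypothetical.

```python
import math

def decompose_symmetric_2x2(a, b, c):
    """Decompose [[a, b], [b, c]] into (theta, lam_max, lam_min).

    theta is the angle of the eigenvector that belongs to lam_max.
    """
    mean = (a + c) / 2.0
    half_gap = math.hypot((a - c) / 2.0, b)
    # The double angle 2*theta is recovered first and then halved.
    theta = 0.5 * math.atan2(2.0 * b, a - c)
    return theta, mean + half_gap, mean - half_gap

def rebuild(theta, lam_max, lam_min):
    """Return (a, b, c) of (lam_max - lam_min) u u^T + lam_min I."""
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    gap = lam_max - lam_min
    return (gap * cos_t * cos_t + lam_min,
            gap * cos_t * sin_t,
            gap * sin_t * sin_t + lam_min)

theta, lam_max, lam_min = decompose_symmetric_2x2(3.0, 1.0, 2.0)
a, b, c = rebuild(theta, lam_max, lam_min)  # reproduces (3.0, 1.0, 2.0) up to rounding
```

Because only $2\theta$ enters the matrix, $\theta$ and $\theta + \pi$ describe the same tensor, which is why the angle is recovered through `atan2` of the double-angle components.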

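where $z = x + iy$. [5] Examples of analytic functions include $g(z) = \log z = \log(x+iy)$, as well as the monomials $g(z) = z^n = (x+iy)^n$ and $g(z) = z^{n/2} = (x+iy)^{n/2}$, where $n$ is an arbitrary positive or negative integer. The monomials $g(z) = z^n$ are also referred to as harmonic functions in computer vision and image processing.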

Thereby, the Cartesian structure tensor is a special case of the GST where $\xi = x$ and $\eta = y$, i.e. the harmonic function is simply $g(z) = z = x + iy$. Thus, by choosing a harmonic function $g$, one can detect all curves that are linear combinations of its real and imaginary parts by convolutions on (rectangular) image grids only, even if $\xi, \eta$ are non-Cartesian. Furthermore, the convolutions can be computed with complex filters applied to the complex version of the structure tensor. For this reason, GST implementations have frequently used the complex version of the structure tensor rather than the (1,1) tensor.

Complex version of GST

Just as there is a complex version of the ordinary structure tensor, there is also a complex version of the GST,

$$\kappa_{20} = (\lambda_{\max} - \lambda_{\min})\, e^{i 2\theta} = w * (h * f)^2, \qquad \kappa_{11} = \lambda_{\max} + \lambda_{\min} = |w| * |h * f|^2,$$

which is identical to its cousin except that $w$ is a complex filter. Here $h * f$ denotes the complex gradient image $\partial f/\partial x + i\,\partial f/\partial y$, obtained by filtering $f$ with a complex derivative filter $h$. It should be recalled that in the ordinary structure tensor $w$ is a real filter, usually defined by a sampled and scaled Gaussian that delineates the neighborhood, also known as the outer scale. This simplicity is one reason why GST implementations have predominantly used the complex version above. For curve families $\xi, \eta$ defined by analytic functions $g$, it can be shown [1] that the neighborhood-defining function is complex valued,

$$w = (x + iy)^n \exp\!\left(-\frac{x^2 + y^2}{2\sigma^2}\right),$$

a so-called symmetry derivative of a Gaussian. Thus, the orientation-wise variation of the pattern to be looked for is directly incorporated into the neighborhood-defining function, and the detection occurs in the space of the (ordinary) structure tensor.
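A sampled version of such a complex neighborhood function can be sketched as follows, assuming the filter has the form $(x+iy)^n \exp(-(x^2+y^2)/(2\sigma^2))$ with integer $n$. The function name, the dictionary layout, and the choice to set the singular origin tap to zero for negative $n$ are assumptions of this sketch.

```python
import math

def symmetry_filter(n, sigma, radius):
    """Sample (x + iy)**n * exp(-(x^2 + y^2) / (2 sigma^2)) on a square grid.

    Returns a dict that maps integer (x, y) offsets to complex weights.
    For negative n the expression is singular at the origin, so that tap
    is set to zero (a choice made for this sketch only).
    """
    taps = {}
    for x in range(-radius, radius + 1):
        for y in range(-radius, radius + 1):
            if x == 0 and y == 0 and n < 0:
                taps[(x, y)] = 0j
                continue
            envelope = math.exp(-(x * x + y * y) / (2.0 * sigma * sigma))
            taps[(x, y)] = complex(x, y) ** n * envelope
    return taps

taps = symmetry_filter(2, 3.0, 4)
# Rotating the sample point by 90 degrees, (2, 1) -> (-1, 2), multiplies the
# weight by i**n, which shows how the angular variation is built into w.
ratio = taps[(-1, 2)] / taps[(2, 1)]
```

The Gaussian envelope depends only on the radius, so all of the orientation dependence of the filter sits in the factor $(x+iy)^n$.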

Basic concept for its use in image processing and computer vision

Efficient detection of $\theta$ in images is possible by image processing for a pair $\xi$, $\eta$. Complex convolutions (or the corresponding matrix operations) and point-wise non-linear mappings are the basic computational elements of GST implementations. A total least squares error estimate of $2\theta$ is then obtained along with the two errors, $\lambda_{\max}$ and $\lambda_{\min}$. In analogy with the Cartesian structure tensor, the estimated angle is in double-angle representation, i.e. $2\theta$ is delivered by the computations and can be used as a shape feature, whereas $\lambda_{\max} - \lambda_{\min}$ alone or in combination with $\lambda_{\max} + \lambda_{\min}$ can be used as a quality (confidence, certainty) measure for the angle estimation.
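The point-wise mapping from the complex outputs to these quantities can be sketched as below. It assumes the complex pair is stored as `kappa20` (modulus $\lambda_{\max} - \lambda_{\min}$, argument $2\theta$) and `kappa11` (equal to $\lambda_{\max} + \lambda_{\min}$). The function name and the ratio used as a certainty value are illustrative choices, not taken from the cited papers.

```python
import cmath
import math

def gst_features(kappa20, kappa11):
    """Map (kappa20, kappa11) to (theta, lam_max, lam_min, certainty)."""
    gap = abs(kappa20)                    # lam_max - lam_min
    theta = 0.5 * cmath.phase(kappa20)    # halve the double angle
    lam_max = 0.5 * (kappa11 + gap)
    lam_min = 0.5 * (kappa11 - gap)
    # Assumed quality measure: 1 for a perfect fit, 0 for no dominant direction.
    certainty = gap / kappa11 if kappa11 > 0 else 0.0
    return theta, lam_max, lam_min, certainty

theta, lam_max, lam_min, certainty = gst_features(cmath.rect(2.0, math.pi / 2), 3.0)
```

Keeping the angle as the argument of a complex number avoids the discontinuity between $\theta$ and $\theta + \pi$, which describe the same curve family.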

Logarithmic spirals, including circles, can for instance be detected by (complex) convolutions and non-linear mappings. [1] The spirals can be in gray-valued images or in binary images, i.e. the locations of edge elements of the patterns concerned, such as contours of circles or spirals, need not be known or marked beforehand.
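A minimal sketch of such a detection at a single location is given below. It assumes the log-polar pair $\xi + i\eta = \log(x+iy)$ centered at the origin, takes the image as a callable, estimates gradients by central differences, and weights the squared complex gradient by a Gaussian window times the unit-modulus phase of $g'(z)^2 = z^{-2}$. This is one output sample of the complex convolution, not a full filter bank, and all names and parameter values are hypothetical.

```python
import cmath
import math

def log_polar_gst(image, radius=20, sigma=8.0, eps=1e-4):
    """Return (kappa20, kappa11) of image(x, y) around the origin in log-polar coordinates."""
    kappa20, kappa11 = 0j, 0.0
    for ix in range(-radius, radius + 1):
        for iy in range(-radius, radius + 1):
            if ix == 0 and iy == 0:
                continue  # log z is singular at the origin
            x, y = float(ix), float(iy)
            fx = (image(x + eps, y) - image(x - eps, y)) / (2 * eps)
            fy = (image(x, y + eps) - image(x, y - eps)) / (2 * eps)
            grad = complex(fx, fy)                # complex gradient f_x + i f_y
            z = complex(x, y)
            window = math.exp(-(x * x + y * y) / (2 * sigma * sigma))
            phase = (abs(z) / z) ** 2             # unit-modulus phase of z**-2
            kappa20 += window * phase * grad * grad
            kappa11 += window * abs(grad) ** 2
    return kappa20, kappa11

def spiral(a, b):
    """Test pattern cos(a*log r + b*phi); b must be an integer for continuity."""
    def f(x, y):
        return math.cos(a * math.log(math.hypot(x, y)) + b * math.atan2(y, x))
    return f

kappa20, kappa11 = log_polar_gst(spiral(3.0, 3))
two_theta = cmath.phase(kappa20)  # estimate of 2*theta, where tan(theta) = b / a
```

For this noise-free pattern the modulus of `kappa20` equals `kappa11` up to numerical error, which corresponds to $\lambda_{\min} \approx 0$. Circles (`b = 0`) and spoke patterns (`a = 0`) are the two extreme members of the same family. The gray values are used directly, with no edge marking or binarization.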

The generalized structure tensor can be used as an alternative to the Hough transform in image processing and computer vision to detect patterns whose local orientations can be modelled, for example junction points. The main differences comprise:

Physical and mathematical interpretation

The curvilinear coordinates of the GST can explain physical processes applied to images. A well-known pair of processes consists of rotation and zooming. These are related to the coordinate transformation $\xi = \log\sqrt{x^2+y^2}$ and $\eta = \arg(x+iy)$, the polar angle.

If an image $f(\xi,\eta)$ consists of iso-curves that can be explained by $\xi$ only, i.e. its iso-curves are circles, $f(\xi,\eta) = g(\xi)$, where $g$ is any real-valued differentiable function defined on 1D, the image is invariant to rotations (around the origin).

The zooming (including unzooming) operation is modeled similarly. If the image has iso-curves that look like a "star" or bicycle spokes, i.e. $f(\xi,\eta) = g(\eta)$ for some differentiable 1D function $g$, then the image $f$ is invariant to scaling (with respect to the origin).

In combination,

$$f(\xi,\eta) = g(\cos(\theta)\,\xi + \sin(\theta)\,\eta)$$

is invariant to a certain amount of rotation combined with scaling, where the amount is specified by the parameter $\theta$.
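This invariance can be checked numerically. The sketch below assumes log-polar coordinates ($\xi = \log r$, $\eta$ the polar angle) and a spiral pattern $\cos(a\,\xi + b\,\eta)$ with $\tan\theta = b/a$; it then shifts $(\xi, \eta)$ by an amount along $(-\sin\theta, \cos\theta)$, which in Cartesian terms is a zoom by $e^{-t\sin\theta}$ combined with a rotation by $t\cos\theta$. The function names are hypothetical.

```python
import cmath
import math

def spiral_image(a, b):
    """f(x, y) = cos(a*log r + b*phi); b must be an integer for continuity."""
    def f(x, y):
        return math.cos(a * math.log(math.hypot(x, y)) + b * math.atan2(y, x))
    return f

def rotate_and_zoom(x, y, theta, amount):
    """Shift (xi, eta) by amount * (-sin(theta), cos(theta)); return the new (x, y)."""
    shift = amount * complex(-math.sin(theta), math.cos(theta))
    z = complex(x, y) * cmath.exp(shift)  # log z -> log z + shift
    return z.real, z.imag

a, b = 3.0, 2
theta = math.atan2(b, a)
f = spiral_image(a, b)
x, y = 1.0, 0.5
moved = rotate_and_zoom(x, y, theta, 0.4)
difference = f(x, y) - f(*moved)  # close to 0: the pattern is unchanged
```

Shifting along the orthogonal direction instead, for example by passing `theta - math.pi / 2`, changes the gray value, so the invariance is specific to the combination of rotation and zoom selected by $\theta$.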

Analogously, the Cartesian structure tensor is a representation of a translation. Here the physical process consists of an ordinary translation of a certain amount along $x$ combined with a translation along $y$,

$$\cos(\theta)\,x + \sin(\theta)\,y = \text{constant},$$

where the amount is specified by the parameter $\theta$. Evidently, $\theta$ here represents the direction of the line.

Generally, the estimated $\theta$ represents the direction (in $\xi, \eta$ coordinates) along which infinitesimal translations leave the image invariant, or in practice least variant. With every curvilinear coordinate basis pair there is thus a pair of infinitesimal translators, a linear combination of which is a differential operator. The latter are related to Lie algebras.

Miscellaneous

"Image" in the context of the GST can mean both an ordinary image and an image neighborhood thereof (local image), depending on context. For example, a photograph is an image as is any neighborhood of it.

References

  1. Bigun, J.; Bigun, T.; Nilsson, K. (December 2004). "Recognition by symmetry derivatives and the generalized structure tensor". IEEE Transactions on Pattern Analysis and Machine Intelligence. 26 (12): 1590–1605. doi:10.1109/TPAMI.2004.126. PMID 15573820.
  2. Fronthaler, H.; Kollreider, K.; Bigun, J. (2008). "Local Features for Enhancement and Minutiae Extraction in Fingerprints". IEEE Transactions on Image Processing. 17 (3): 354–363. Bibcode:2008ITIP...17..354F. CiteSeerX 10.1.1.160.6312. doi:10.1109/TIP.2007.916155. PMID 18270124.
  3. Schmitt, O.; Birkholz, H. (2010). "Improvement in cytoarchitectonic mapping by combining electrodynamic modeling with local orientation in high-resolution images of the cerebral cortex". Microsc. Res. Tech. 74 (3): 225–243.
  4. Schmitt, O.; Pakura, M.; Aach, T.; Homke, L.; Bohme, M.; Bock, S.; Preusse, S. (2004). "Analysis of nerve fibers and their distribution in histologic sections of the human brain". Microsc. Res. Tech. 63 (4): 220–243. doi:10.1002/jemt.20033. PMID 14988920.
  5. Bigun, Josef (December 1997). "Pattern Recognition in Images by Symmetries and Coordinate Transformations". Computer Vision and Image Understanding. 68 (3): 290–307. doi:10.1006/cviu.1997.0556.