Harris corner detector

Last updated January 01, 2025

The Harris corner detector is a corner detection operator that is commonly used in computer vision algorithms to extract corners and infer features of an image. It was first introduced by Chris Harris and Mike Stephens in 1988 upon the improvement of Moravec's corner detector.^[1] Compared to its predecessor, Harris' corner detector takes the differential of the corner score into account with reference to direction directly, instead of using shifting patches for every 45 degree angles, and has been proved to be more accurate in distinguishing between edges and corners.^[2] Since then, it has been improved and adopted in many algorithms to preprocess images for subsequent applications.

Introduction

A corner is a point whose local neighborhood stands in two dominant and different edge directions. In other words, a corner can be interpreted as the junction of two edges, where an edge is a sudden change in image brightness.^[3] Corners are the important features in the image, and they are generally termed as interest points which are invariant to translation, rotation and illumination. Although corners are only a small percentage of the image, they contain the most important features in restoring image information, and they can be used to minimize the amount of processed data for motion tracking, image stitching, building 2D mosaics, stereo vision, image representation and other related computer vision areas.

In order to capture the corners from the image, researchers have proposed many different corner detectors including the Kanade-Lucas-Tomasi (KLT) operator and the Harris operator which are most simple, efficient and reliable for use in corner detection. These two popular methodologies are both closely associated with and based on the local structure matrix. Compared to the Kanade-Lucas-Tomasi corner detector, the Harris corner detector provides good repeatability under changing illumination and rotation, and therefore, it is more often used in stereo matching and image database retrieval. Although there still exists drawbacks and limitations, the Harris corner detector is still an important and fundamental technique for many computer vision applications.

Development of Harris corner detection algorithm ^[1]

Without loss of generality, we will assume a grayscale 2-dimensional image is used. Let this image be given by $I$ . Consider taking an image patch $(x,y)\in W$ (window) and shifting it by $(\Delta x,\Delta y)$ . The sum of squared differences (SSD) between these two patches, denoted $f$ , is given by:

f(\Delta x,\Delta y)={\underset {(x_{k},y_{k})\in W}{\sum }}\left(I(x_{k},y_{k})-I(x_{k}+\Delta x,y_{k}+\Delta y)\right)^{2}

$I(x+\Delta x,y+\Delta y)$ can be approximated by a Taylor expansion. Let $I_{x}$ and $I_{y}$ be the partial derivatives of $I$ , such that

I(x+\Delta x,y+\Delta y)\approx I(x,y)+I_{x}(x,y)\Delta x+I_{y}(x,y)\Delta y

This produces the approximation

f(\Delta x,\Delta y)\approx {\underset {(x,y)\in W}{\sum }}\left(I_{x}(x,y)\Delta x+I_{y}(x,y)\Delta y\right)^{2},

which can be written in matrix form:

f(\Delta x,\Delta y)\approx {\begin{pmatrix}\Delta x&\Delta y\end{pmatrix}}M{\begin{pmatrix}\Delta x\\\Delta y\end{pmatrix}},

where M is the structure tensor,

M={\underset {(x,y)\in W}{\sum }}{\begin{bmatrix}I_{x}^{2}&I_{x}I_{y}\\I_{x}I_{y}&I_{y}^{2}\end{bmatrix}}={\begin{bmatrix}{\underset {(x,y)\in W}{\sum }}I_{x}^{2}&{\underset {(x,y)\in W}{\sum }}I_{x}I_{y}\\{\underset {(x,y)\in W}{\sum }}I_{x}I_{y}&{\underset {(x,y)\in W}{\sum }}I_{y}^{2}\end{bmatrix}}

Process of Harris corner detection algorithm^[4]^[5]^[6]

Commonly, Harris corner detector algorithm can be divided into five steps.

Color to grayscale
Spatial derivative calculation
Structure tensor setup
Harris response calculation
Non-maximum suppression

Color to grayscale

If we use Harris corner detector in a color image, the first step is to convert it into a grayscale image, which will enhance the processing speed.

The value of the gray scale pixel can be computed as a weighted sums of the values R, B and G of the color image,

\sum _{C\,\in \,\{R,G,B\}}w_{C}\cdot C

,

where, e.g.,

w_{R}=0.299,\ w_{G}=0.587,\ w_{B}=1-(w_{R}+w_{G})=0.114.

Spatial derivative calculation

Next, we are going to find the derivative with respect to x and the derivative with respect to y, $I_{x}(x,y)$ and $I_{y}(x,y)$ . This can be approximated by applying Sobel operators.

Structure tensor setup

With $I_{x}(x,y)$ , $I_{y}(x,y)$ , we can construct the structure tensor $M$ .

Harris response calculation

For $x\ll y$ , one has ${\tfrac {x\cdot y}{x+y}}=x{\tfrac {1}{1+x/y}}\approx x.$ In this step, we compute the smallest eigenvalue of the structure tensor using that approximation:

\lambda _{\min }\approx {\frac {\lambda _{1}\lambda _{2}}{(\lambda _{1}+\lambda _{2})}}={\frac {\det(M)}{\operatorname {tr} (M)}}

with the trace $\mathrm {tr} (M)=m_{11}+m_{22}$ .

Another commonly used Harris response calculation is shown as below,

$R=\lambda _{1}\lambda _{2}-k(\lambda _{1}+\lambda _{2})^{2}=\det(M)-k\operatorname {tr} (M)^{2}$

where $k$ is an empirically determined constant; $k\in [0.04,0.06]$ .

Non-maximum suppression

In order to pick up the optimal values to indicate corners, we find the local maxima as corners within the window which is a 3 by 3 filter.

Improvement^[7]^[8]

Harris-Laplace Corner Detector ^[9]
Differential Morphological Decomposition Based Corner Detector^[10]
Multi-scale Bilateral Structure Tensor Based Corner Detector^[11]

Applications

Image Alignment, Stitching and Registration ^[12]
2D Mosaics Creation^[13]
3D Scene Modeling and Reconstruction ^[14]
Motion Detection ^[15]
Object Recognition ^[16]
Image Indexing and Content-based Retrieval ^[17]
Video Tracking ^[18]

Related Research Articles

In mathematical physics and mathematics, the Pauli matrices are a set of three $2 \times 2$ complex matrices that are traceless, Hermitian, involutory and unitary. Usually indicated by the Greek letter sigma, they are occasionally denoted by tau when used in connection with isospin symmetries.

Simultaneous equations models are a type of statistical model in which the dependent variables are functions of other dependent variables, rather than just independent variables. This means some of the explanatory variables are jointly determined with the dependent variable, which in economics usually is the consequence of some underlying equilibrium mechanism. Take the typical supply and demand model: whilst typically one would determine the quantity supplied and demanded to be a function of the price set by the market, it is also possible for the reverse to be true, where producers observe the quantity that consumers demand and then set the price.

Linear elasticity is a mathematical model as to how solid objects deform and become internally stressed by prescribed loading conditions. It is a simplification of the more general nonlinear theory of elasticity and a branch of continuum mechanics.

In mathematics, the spectral radius of a square matrix is the maximum of the absolute values of its eigenvalues. More generally, the spectral radius of a bounded linear operator is the supremum of the absolute values of the elements of its spectrum. The spectral radius is often denoted by $ρ(\cdot)$ .

In mathematics, the Hessian matrix, Hessian or Hesse matrix is a square matrix of second-order partial derivatives of a scalar-valued function, or scalar field. It describes the local curvature of a function of many variables. The Hessian matrix was developed in the 19th century by the German mathematician Ludwig Otto Hesse and later named after him. Hesse originally used the term "functional determinants". The Hessian is sometimes denoted by H or, ambiguously, by ∇².

In mathematics, a Casimir element is a distinguished element of the center of the universal enveloping algebra of a Lie algebra. A prototypical example is the squared angular momentum operator, which is a Casimir element of the three-dimensional rotation group.

In mathematics and computing, the Levenberg–Marquardt algorithm, also known as the damped least-squares (DLS) method, is used to solve non-linear least squares problems. These minimization problems arise especially in least squares curve fitting. The LMA interpolates between the Gauss–Newton algorithm (GNA) and the method of gradient descent. The LMA is more robust than the GNA, which means that in many cases it finds a solution even if it starts very far off the final minimum. For well-behaved functions and reasonable starting parameters, the LMA tends to be slower than the GNA. LMA can also be viewed as Gauss–Newton using a trust region approach.

Neutrino oscillation is a quantum mechanical phenomenon in which a neutrino created with a specific lepton family number can later be measured to have a different lepton family number. The probability of measuring a particular flavor for a neutrino varies between three known states, as it propagates through space.

In mathematics, the discrete Laplace operator is an analog of the continuous Laplace operator, defined so that it has meaning on a graph or a discrete grid. For the case of a finite-dimensional graph, the discrete Laplace operator is more commonly called the Laplacian matrix.

Mehrotra's predictor–corrector method in optimization is a specific interior point method for linear programming. It was proposed in 1989 by Sanjay Mehrotra.

In differential geometry, a tensor density or relative tensor is a generalization of the tensor field concept. A tensor density transforms as a tensor field when passing from one coordinate system to another, except that it is additionally multiplied or weighted by a power W of the Jacobian determinant of the coordinate transition function or its absolute value. A tensor density with a single index is called a vector density. A distinction is made among (authentic) tensor densities, pseudotensor densities, even tensor densities and odd tensor densities. Sometimes tensor densities with a negative weight W are called tensor capacity. A tensor density can also be regarded as a section of the tensor product of a tensor bundle with a density bundle.

In continuum mechanics, the finite strain theory—also called large strain theory, or large deformation theory—deals with deformations in which strains and/or rotations are large enough to invalidate assumptions inherent in infinitesimal strain theory. In this case, the undeformed and deformed configurations of the continuum are significantly different, requiring a clear distinction between them. This is commonly the case with elastomers, plastically deforming materials and other fluids and biological soft tissue.

In queueing theory, a discipline within the mathematical theory of probability, a Jackson network is a class of queueing network where the equilibrium distribution is particularly simple to compute as the network has a product-form solution. It was the first significant development in the theory of networks of queues, and generalising and applying the ideas of the theorem to search for similar product-form solutions in other networks has been the subject of much research, including ideas used in the development of the Internet. The networks were first identified by James R. Jackson and his paper was re-printed in the journal Management Science’s ‘Ten Most Influential Titles of Management Sciences First Fifty Years.’

In computer vision, the Lucas–Kanade method is a widely used differential method for optical flow estimation developed by Bruce D. Lucas and Takeo Kanade. It assumes that the flow is essentially constant in a local neighbourhood of the pixel under consideration, and solves the basic optical flow equations for all the pixels in that neighbourhood, by the least squares criterion.

<span class="mw-page-title-main">Corner detection</span> Approach used in computer vision systems

Corner detection is an approach used within computer vision systems to extract certain kinds of features and infer the contents of an image. Corner detection is frequently used in motion detection, image registration, video tracking, image mosaicing, panorama stitching, 3D reconstruction and object recognition. Corner detection overlaps with the topic of interest point detection.

In mathematics, the Weyl character formula in representation theory describes the characters of irreducible representations of compact Lie groups in terms of their highest weights. It was proved by Hermann Weyl. There is a closely related formula for the character of an irreducible representation of a semisimple Lie algebra. In Weyl's approach to the representation theory of connected compact Lie groups, the proof of the character formula is a key step in proving that every dominant integral element actually arises as the highest weight of some irreducible representation. Important consequences of the character formula are the Weyl dimension formula and the Kostant multiplicity formula.

In mathematics, the structure tensor, also referred to as the second-moment matrix, is a matrix derived from the gradient of a function. It describes the distribution of the gradient in a specified neighborhood around a point and makes the information invariant to the observing coordinates. The structure tensor is often used in image processing and computer vision.

In mathematics, the Schur orthogonality relations, which were proven by Issai Schur through Schur's lemma, express a central fact about representations of finite groups. They admit a generalization to the case of compact groups in general, and in particular compact Lie groups, such as the rotation group SO(3).

In the fields of computer vision and image analysis, the Harris affine region detector belongs to the category of feature detection. Feature detection is a preprocessing step of several algorithms that rely on identifying characteristic points or interest points so to make correspondences between images, recognize textures, categorize objects or build panoramas.

Geometric feature learning is a technique combining machine learning and computer vision to solve visual tasks. The main goal of this method is to find a set of representative features of geometric form to represent an object by collecting geometric features from images and learning them using efficient machine learning methods. Humans solve visual tasks and can give fast response to the environment by extracting perceptual information from what they see. Researchers simulate humans' ability of recognizing objects to solve computer vision problems. For example, M. Mata et al.(2002) applied feature learning techniques to the mobile robot navigation tasks in order to avoid obstacles. They used genetic algorithms for learning features and recognizing objects (figures). Geometric feature learning methods can not only solve recognition problems but also predict subsequent actions by analyzing a set of sequential input sensory images, usually some extracting features of images. Through learning, some hypothesis of the next action are given and according to the probability of each hypothesis give a most probable action. This technique is widely used in the area of artificial intelligence.

References

1 2 Chris Harris and Mike Stephens (1988). "A Combined Corner and Edge Detector". Alvey Vision Conference. Vol. 15.
↑ Dey, Nilanjan; et al. (2012). "A Comparative Study between Moravec and Harris Corner Detection of Noisy Images Using Adaptive Wavelet Thresholding Technique". arXiv: 1209.1558 [cs.CV].
↑ Konstantinos G. Derpanis (2004). The harris corner detector. York University.
↑ "Harris Operator Corner Detection using Sliding Window Method - Google Scholar". scholar.google.com. Retrieved 2015-11-29.
↑ "The Comparison and Application of Corner Detection Algorithms - Google Scholar". scholar.google.com. Retrieved 2015-11-29.
↑ Javier Sánchez, Nelson Monzón and Agustín Salgado (2018). "An Analysis and Implementation of the Harris Corner Detector". Image Processing on Line. 8: 305–328. doi: 10.5201/ipol.2018.229 . hdl: 10553/43499 .
↑ Bellavia, F.; Tegolo, D.; Valenti, C. (2011-03-01). "Improving Harris corner selection strategy". IET Computer Vision. 5 (2): 87. doi:10.1049/iet-cvi.2009.0127.
↑ Rosten, Edward; Drummond, Tom (2006-05-07). Leonardis, Aleš; Bischof, Horst; Pinz, Axel (eds.). Machine Learning for High-Speed Corner Detection. Lecture Notes in Computer Science. Springer Berlin Heidelberg. pp. 430–443. CiteSeerX 10.1.1.64.8513 . doi:10.1007/11744023_34. ISBN 978-3-540-33832-1. S2CID 1388140.
↑ "A Comparison of Affine Region Detectors - Google Scholar". scholar.google.com. Retrieved 2015-11-29.
↑ Gueguen, L.; Pesaresi, M. (2011). "Multi scale Harris corner detector based on Differential Morphological Decomposition". Pattern Recognition Letters. 32 (14): 1714–1719. Bibcode:2011PaReL..32.1714G. doi:10.1016/j.patrec.2011.07.021.
↑ "A Multi-scale Bilateral Structure Tensor Based Corner Detector - Google Scholar". scholar.google.com. Retrieved 2015-11-29.
↑ Kang, Juan; Xiao, Chuangbai; Deng, M.; Yu, Jing; Liu, Haifeng (2011-08-01). "Image registration based on harris corner and mutual information". Proceedings of 2011 International Conference on Electronic & Mechanical Engineering and Information Technology. Vol. 7. pp. 3434–3437. doi:10.1109/EMEIT.2011.6023066. ISBN 978-1-61284-087-1. S2CID 17367248.
↑ "Underwater Mosaic Creation using Video sequences from Different Altitudes - Google Scholar". scholar.google.com. Retrieved 2015-12-02.
↑ "Automated reconstruction of 3D scenes from sequences of images - Google Scholar". scholar.google.com. Retrieved 2015-12-02.
↑ Liu, Meng; Wu, Chengdong; Zhang, Yunzhou (2008-07-01). "Multi-resolution optical flow tracking algorithm based on multi-scale Harris corner points feature". 2008 Chinese Control and Decision Conference. pp. 5287–5291. doi:10.1109/CCDC.2008.4598340. ISBN 978-1-4244-1733-9. S2CID 8085227.
↑ "Object Recognition from Local Scale-Invariant Features - Google Scholar". scholar.google.com. Retrieved 2015-11-29.
↑ "Salient Points for Content Based Retrieval - Google Scholar". scholar.google.com. Retrieved 2015-12-02.
↑ "Tracking and Recognition of Objects using SURF Descriptor and Harris Corner Detection - Google Scholar". scholar.google.com. Retrieved 2015-12-02.

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[harris-1] 1 2 Chris Harris and Mike Stephens (1988). "A Combined Corner and Edge Detector". Alvey Vision Conference. Vol. 15.

[dey-2] Dey, Nilanjan; et al. (2012). "A Comparative Study between Moravec and Harris Corner Detection of Noisy Images Using Adaptive Wavelet Thresholding Technique". arXiv: 1209.1558 [cs.CV].

[derpanis-3] Konstantinos G. Derpanis (2004). The harris corner detector. York University.

[4] "Harris Operator Corner Detection using Sliding Window Method - Google Scholar". scholar.google.com. Retrieved 2015-11-29.

[5] "The Comparison and Application of Corner Detection Algorithms - Google Scholar". scholar.google.com. Retrieved 2015-11-29.

[sanchez-6] Javier Sánchez, Nelson Monzón and Agustín Salgado (2018). "An Analysis and Implementation of the Harris Corner Detector". Image Processing on Line. 8: 305–328. doi: 10.5201/ipol.2018.229 . hdl: 10553/43499 .

[7] Bellavia, F.; Tegolo, D.; Valenti, C. (2011-03-01). "Improving Harris corner selection strategy". IET Computer Vision. 5 (2): 87. doi:10.1049/iet-cvi.2009.0127.

[8] Rosten, Edward; Drummond, Tom (2006-05-07). Leonardis, Aleš; Bischof, Horst; Pinz, Axel (eds.). Machine Learning for High-Speed Corner Detection. Lecture Notes in Computer Science. Springer Berlin Heidelberg. pp. 430–443. CiteSeerX 10.1.1.64.8513 . doi:10.1007/11744023_34. ISBN 978-3-540-33832-1. S2CID 1388140.

[9] "A Comparison of Affine Region Detectors - Google Scholar". scholar.google.com. Retrieved 2015-11-29.

[10] Gueguen, L.; Pesaresi, M. (2011). "Multi scale Harris corner detector based on Differential Morphological Decomposition". Pattern Recognition Letters. 32 (14): 1714–1719. Bibcode:2011PaReL..32.1714G. doi:10.1016/j.patrec.2011.07.021.

[11] "A Multi-scale Bilateral Structure Tensor Based Corner Detector - Google Scholar". scholar.google.com. Retrieved 2015-11-29.

[12] Kang, Juan; Xiao, Chuangbai; Deng, M.; Yu, Jing; Liu, Haifeng (2011-08-01). "Image registration based on harris corner and mutual information". Proceedings of 2011 International Conference on Electronic & Mechanical Engineering and Information Technology. Vol. 7. pp. 3434–3437. doi:10.1109/EMEIT.2011.6023066. ISBN 978-1-61284-087-1. S2CID 17367248.

[13] "Underwater Mosaic Creation using Video sequences from Different Altitudes - Google Scholar". scholar.google.com. Retrieved 2015-12-02.

[14] "Automated reconstruction of 3D scenes from sequences of images - Google Scholar". scholar.google.com. Retrieved 2015-12-02.

[15] Liu, Meng; Wu, Chengdong; Zhang, Yunzhou (2008-07-01). "Multi-resolution optical flow tracking algorithm based on multi-scale Harris corner points feature". 2008 Chinese Control and Decision Conference. pp. 5287–5291. doi:10.1109/CCDC.2008.4598340. ISBN 978-1-4244-1733-9. S2CID 8085227.

[16] "Object Recognition from Local Scale-Invariant Features - Google Scholar". scholar.google.com. Retrieved 2015-11-29.

[17] "Salient Points for Content Based Retrieval - Google Scholar". scholar.google.com. Retrieved 2015-12-02.

[18] "Tracking and Recognition of Objects using SURF Descriptor and Harris Corner Detection - Google Scholar". scholar.google.com. Retrieved 2015-12-02.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

Harris corner detector

Contents

Introduction

Development of Harris corner detection algorithm ^[1]