Steerable filter

Last updated December 26, 2025

In image processing, a steerable filter is an orientation-selective filter that can be computationally rotated to any direction. Rather than designing a new filter for each orientation, a steerable filter is synthesized from a linear combination of a small, fixed set of "basis filters". This approach is efficient and is widely used for tasks that involve directionality, such as edge detection, texture analysis, and shape-from-shading.^[1]^[2]

Example

A common example of a steerable filter is the first derivative of a two-dimensional Gaussian function. This filter responds strongly to oriented image features like edges. It is constructed from two basis filters: the partial derivative of the Gaussian with respect to the horizontal direction ( $x$ ) and the vertical direction ( $y$ ).

If $G(x,y)$ is the Gaussian function, and $G_{x}$ and $G_{y}$ are its partial derivatives (which measure the rate of change in the $x$ and $y$ directions, respectively), a new filter $G_{\theta }$ oriented at an angle $\theta$ can be synthesized with the formula: $G_{\theta }=\cos(\theta )G_{x}+\sin(\theta )G_{y}$

Here, the basis filters $G_{x}$ and $G_{y}$ are weighted by $\cos(\theta )$ and $\sin(\theta )$ to "steer" the filter's sensitivity to the desired orientation. This is equivalent to taking the dot product of the direction vector $(\cos \theta ,\sin \theta )$ with the filter's gradient, $(G_{x},G_{y})$ .^[1]

Generalization in deep learning: Equivariant neural networks

The concept of steerability is foundational to equivariant neural networks, a class of models in deep learning designed to understand symmetries in data.^[5] A network is considered equivariant to a transformation (like a rotation) if transforming the input and then passing it through the network produces the same result as passing the input through the network first and then transforming the output. Formally, for a transformation $T$ and a network $f$ , this property is defined as $f(T({\text{input}}))=T(f({\text{input}}))$ .

This built-in understanding of geometry makes models more data-efficient. For example, a network equivariant to rotation does not need to be shown an object in multiple orientations to learn to recognize it; it inherently understands that a rotated object is still the same object. This leads to better generalization and performance, particularly in scientific applications.^[3]

Mathematical foundation

Equivariant neural networks use principles from group theory to create operations that respect geometric symmetries, such as the SO(3) group for 3D rotations or the E(3) group for rotations and translations.^[3]

Instead of learning standard filter kernels, these networks learn how to combine a fixed set of basis kernels. These basis functions are chosen so that they have well-defined behaviors under transformation groups.

Spherical harmonics are frequently used as basis functions because they form a complete set of functions that behave predictably under rotation, making them ideal for creating steerable 3D kernels.^[6]
Features within the network are treated as geometric tensors, which are mathematical objects (like scalars or vectors) that are "typed" by their behavior under transformations. These types correspond to the irreducible representations (irreps) of the group.^[3]
The tensor product is the fundamental operation used to combine these typed features in a way that preserves equivariance, guaranteeing that the network as a whole respects the desired symmetry.^[3]

Frameworks like e3nn simplify the construction of these networks by automating the complex mathematics of irreducible representations and tensor products.^[3]

Applications

Steerable and equivariant models are highly effective for problems with inherent geometric symmetries. Examples include:

Protein structure analysis: SE(3)-equivariant networks can process 3D molecular structures while respecting their rotational and translational symmetries.^[6]
3D Point cloud processing: Rotation-equivariant filters built from steerable spherical functions can perform tasks like 3D shape classification.^[7]
Computational chemistry : E(3)-equivariant graph neural networks are used to model interatomic potentials for molecular dynamics simulations, creating highly accurate and data-efficient models of physical systems.^[8]

References

1 2 Freeman, W. T. & Adelson, E. H. (1991). "The design and use of steerable filters" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 13 (9): 891–906. doi:10.1109/34.93808.
↑ Perona, P. (1995). "Deformable kernels for early vision" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 17 (5): 488–499. Bibcode:1995ITPAM..17..488P. doi:10.1109/34.391394.
1 2 3 4 5 6 Geiger, Mario; Smidt, Tess (18 July 2022). "e3nn: Euclidean Neural Networks". arXiv: 2207.09453 [cs.LG].
↑ Zhdanov, Maksim; Hoffmann, Nico; Cesa, Gabriele (2023). "Implicit Convolutional Kernels for Steerable CNNs" (PDF). Advances in Neural Information Processing Systems 36. Curran Associates, Inc.
↑ Cohen, Taco S.; Welling, Max (27 December 2016). "Steerable CNNs". arXiv: 1612.08498 [cs.LG].
1 2 Weiler, Maurice; Geiger, Mario; Welling, Max; Boomsma, Wouter; Cohen, Taco S. (2018). "3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data" (PDF). Advances in Neural Information Processing Systems 31. Curran Associates, Inc.
↑ Melnyk, Pavlo; Felsberg, Michael; Wadenbäck, Mårten (2022). "Steerable 3D Spherical Neurons". Proceedings of the 39th International Conference on Machine Learning. Vol. 162. PMLR.
↑ Batzner, Simon; Musaelian, Albert; Sun, Lixin; et al. (4 May 2022). "E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials". Nature Communications. 13 (1): 2453. Bibcode:2022NatCo..13.2453B. doi:10.1038/s41467-022-29939-5. PMC 9068367 . PMID 35513421.

This applied mathematics–related article is a stub. You can help Wikipedia by expanding it.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[Freeman-Adelson-1991-1] 1 2 Freeman, W. T. & Adelson, E. H. (1991). "The design and use of steerable filters" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 13 (9): 891–906. doi:10.1109/34.93808.

[Perona-1995-2] Perona, P. (1995). "Deformable kernels for early vision" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 17 (5): 488–499. Bibcode:1995ITPAM..17..488P. doi:10.1109/34.391394.

[Geiger-Smidt-2022-3] 1 2 3 4 5 6 Geiger, Mario; Smidt, Tess (18 July 2022). "e3nn: Euclidean Neural Networks". arXiv: 2207.09453 [cs.LG].

[Zhdanov-2023-4] Zhdanov, Maksim; Hoffmann, Nico; Cesa, Gabriele (2023). "Implicit Convolutional Kernels for Steerable CNNs" (PDF). Advances in Neural Information Processing Systems 36. Curran Associates, Inc.

[Cohen-Welling-2016-5] Cohen, Taco S.; Welling, Max (27 December 2016). "Steerable CNNs". arXiv: 1612.08498 [cs.LG].

[Weiler-2018-6] 1 2 Weiler, Maurice; Geiger, Mario; Welling, Max; Boomsma, Wouter; Cohen, Taco S. (2018). "3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data" (PDF). Advances in Neural Information Processing Systems 31. Curran Associates, Inc.

[Melnyk-2022-7] Melnyk, Pavlo; Felsberg, Michael; Wadenbäck, Mårten (2022). "Steerable 3D Spherical Neurons". Proceedings of the 39th International Conference on Machine Learning. Vol. 162. PMLR.

[Batzner-2022-8] Batzner, Simon; Musaelian, Albert; Sun, Lixin; et al. (4 May 2022). "E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials". Nature Communications. 13 (1): 2453. Bibcode:2022NatCo..13.2453B. doi:10.1038/s41467-022-29939-5. PMC 9068367 . PMID 35513421.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]