Algebraic signal processing

Algebraic signal processing (ASP) is an emerging area of theoretical signal processing (SP). In the algebraic theory of signal processing, a set of filters is treated as an (abstract) algebra, a set of signals is treated as a module or vector space, and convolution is treated as an algebra representation. The advantage of algebraic signal processing is its generality and portability.

History

In the original formulation of algebraic signal processing by Puschel and Moura, the signals are collected in an $\mathcal{A}$-module for some algebra $\mathcal{A}$ of filters, and filtering is given by the action of $\mathcal{A}$ on the $\mathcal{A}$-module.[1]

Definitions

Let $K$ be a field, for instance the complex numbers, and let $\mathcal{A}$ be a $K$-algebra (i.e. a vector space over $K$ with a multiplication $\mathcal{A}\times\mathcal{A}\to\mathcal{A}$ that is linear in both arguments) treated as a set of filters. Suppose $\mathcal{M}$ is a vector space representing a set of signals. A representation of $\mathcal{A}$ consists of an algebra homomorphism $\rho:\mathcal{A}\to\operatorname{End}(\mathcal{M})$, where $\operatorname{End}(\mathcal{M})$ is the algebra of linear transformations $\mathcal{M}\to\mathcal{M}$ under composition (equivalent, in the finite-dimensional case, to matrix multiplication). For convenience, we write $\rho_a$ for the endomorphism $\rho(a)$. To be an algebra homomorphism, $\rho$ must not only be a linear transformation, but also satisfy the property

$$\rho_{ab} = \rho_a \circ \rho_b \quad \text{for all } a, b \in \mathcal{A}.$$

Given a signal $s\in\mathcal{M}$, convolution of the signal by a filter $a\in\mathcal{A}$ yields a new signal $\rho_a(s)$. Some additional terminology is needed from the representation theory of algebras. A subset $G\subseteq\mathcal{A}$ is said to generate the algebra if every element of $\mathcal{A}$ can be written as a polynomial in the elements of $G$. The image $\rho_g$ of a generator $g\in G$ is called a shift operator. In practically all examples, convolutions are formed as polynomials in the shift operators. However, this is not necessarily the case for a representation of an arbitrary algebra.
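To make these definitions concrete, the sketch below (an illustration, not part of any standard formulation; the names `rho`, `S`, and the coefficient lists are chosen for this example) takes $\mathcal{A}=\mathbb{R}[x]$ acting on $\mathcal{M}=\mathbb{R}^n$ by sending a polynomial $p$ to the matrix $p(S)$ for a fixed cyclic shift matrix $S$, and checks the homomorphism property numerically.

```python
import numpy as np

n = 8
# Cyclic shift matrix S: (S s)[i] = s[i - 1 mod n]; a single generator of the algebra.
S = np.roll(np.eye(n), 1, axis=0)

def rho(coeffs, S):
    """Represent the polynomial sum_k coeffs[k] x^k as the matrix sum_k coeffs[k] S^k."""
    out = np.zeros_like(S)
    P = np.eye(len(S))          # S^0
    for c in coeffs:
        out += c * P
        P = P @ S               # next power of the generator
    return out

p = [1.0, -2.0, 0.5]            # p(x) = 1 - 2x + 0.5x^2
q = [0.0, 3.0]                  # q(x) = 3x

# Homomorphism property: rho(p*q) == rho(p) @ rho(q).
pq = np.polymul(p[::-1], q[::-1])[::-1]   # polynomial product (numpy orders coefficients highest-degree first)
assert np.allclose(rho(pq, S), rho(p, S) @ rho(q, S))

# Convolution of a signal by the filter p is the action rho_p(s).
s = np.random.randn(n)
print(rho(p, S) @ s)
```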

Examples

Discrete Signal Processing

In discrete signal processing (DSP), the signal space is the set of complex-valued functions $s:\mathbb{Z}\to\mathbb{C}$ with bounded energy (i.e. square-summable sequences). This means the infinite series $\sum_{n\in\mathbb{Z}}|s(n)|^2$ converges, where $|\cdot|$ is the modulus of a complex number. The shift operator is given by the linear endomorphism $(\mathcal{S}s)(n)=s(n-1)$. The filter space is the algebra $\mathbb{C}[x]$ of polynomials with complex coefficients, and convolution is given by $\rho_h(s)=\sum_k h_k\,\mathcal{S}^k s$, where $h=\sum_k h_k x^k$ is an element of the algebra. Filtering a signal $s$ by $h$ then yields $\rho_h(s)(n)=\sum_k h_k\, s(n-k)$ because $(\mathcal{S}^k s)(n)=s(n-k)$.
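A short sketch of this computation, assuming a finite-length signal with circular boundary conditions in place of the doubly infinite index set (the helper names `shift` and `filter_signal` are illustrative), applies $\rho_h$ by accumulating powers of the shift and checks the result against circular convolution computed with the FFT.

```python
import numpy as np

def shift(s):
    """Circular delay: (S s)(n) = s(n - 1), with wrap-around at the boundary."""
    return np.roll(s, 1)

def filter_signal(h, s):
    """Apply the filter h = sum_k h[k] x^k to s as rho_h(s) = sum_k h[k] S^k s."""
    out = np.zeros_like(s, dtype=float)
    shifted = s.astype(float)
    for hk in h:
        out += hk * shifted
        shifted = shift(shifted)     # next power of the shift operator
    return out

s = np.random.randn(16)
h = np.array([0.5, 0.25, 0.25])      # FIR filter coefficients h_0, h_1, h_2

# rho_h(s)(n) = sum_k h_k s(n - k): circular convolution of h and s.
direct = filter_signal(h, s)
via_fft = np.real(np.fft.ifft(np.fft.fft(h, 16) * np.fft.fft(s)))
assert np.allclose(direct, via_fft)
```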

Graph Signal Processing

A weighted graph is an undirected graph $G=(V,E)$ with a pseudometric on the node set $V$, written $w:V\times V\to\mathbb{R}_{\geq 0}$. A graph signal is simply a real-valued function on the set of nodes of the graph. In graph neural networks, graph signals are sometimes called features. The signal space is the set $\mathbb{R}^{V}$ of all graph signals, where $V$ is the set of nodes of $G$. The filter algebra is the algebra $\mathbb{R}[x]$ of polynomials in one indeterminate $x$. There are a few possible choices for a graph shift operator (GSO). The (un)normalized weighted adjacency matrix $A$ of $G$ is a popular choice, as is the (un)normalized graph Laplacian $L=D-A$, where $D$ is the diagonal degree matrix. The choice depends on performance and design considerations. If $S$ is the GSO, then a graph convolution is the linear transformation $\sum_k h_k S^k$ for coefficients $h_k\in\mathbb{R}$, and convolution of a graph signal $f$ by a filter $h=\sum_k h_k x^k$ yields the new graph signal $\rho_h(f)=\sum_k h_k S^k f$.
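The following is a minimal sketch of a graph convolution, assuming the unnormalized graph Laplacian $L=D-A$ as the GSO and illustrative helper names; it simply accumulates the powers $S^k f$ weighted by the filter coefficients.

```python
import numpy as np

def graph_laplacian(A):
    """Unnormalized Laplacian L = D - A of a weighted adjacency matrix A."""
    return np.diag(A.sum(axis=1)) - A

def graph_convolution(h, S, f):
    """Apply the filter h(x) = sum_k h[k] x^k to the graph signal f: sum_k h[k] S^k f."""
    out = np.zeros_like(f, dtype=float)
    Skf = f.astype(float)                # S^0 f
    for hk in h:
        out += hk * Skf
        Skf = S @ Skf                    # S^{k+1} f
    return out

# A small weighted path graph on 4 nodes.
A = np.array([[0, 1, 0, 0],
              [1, 0, 2, 0],
              [0, 2, 0, 1],
              [0, 0, 1, 0]], dtype=float)
S = graph_laplacian(A)                   # choice of graph shift operator
f = np.array([1.0, 0.0, 0.0, 0.0])       # graph signal: an impulse at node 0
h = [0.5, 0.3, 0.2]                      # filter coefficients h_0, h_1, h_2

print(graph_convolution(h, S, f))        # the new graph signal sum_k h_k L^k f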

Other Examples

Other mathematical objects with their own proposed signal-processing frameworks give rise to algebraic signal models. These objects include quivers,[2] graphons,[3] semilattices,[4] finite groups, and Lie groups,[5] among others.

Intertwining Maps

In the framework of representation theory, relationships between two representations of the same algebra are described with intertwining maps, which in the context of signal processing translate to transformations of signals that respect the algebra structure. Suppose $(\mathcal{M},\rho)$ and $(\mathcal{N},\sigma)$ are two different representations of $\mathcal{A}$. An intertwining map is a linear transformation $\phi:\mathcal{M}\to\mathcal{N}$ such that

$$\phi\circ\rho_a = \sigma_a\circ\phi \quad \text{for all } a\in\mathcal{A}.$$

Intuitively, this means that filtering a signal by $a$ and then transforming it with $\phi$ is equivalent to first transforming the signal with $\phi$ and then filtering by $a$. The z-transform[1] is a prototypical example of an intertwining map.
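In the finite, circular analogue of the time-domain model above, the discrete Fourier transform plays this role: it intertwines the representation in which the generator acts as the cyclic shift with the representation in which it acts as multiplication by the twiddle factors. The check below is a sketch of this fact; the variable names are illustrative.

```python
import numpy as np

n = 8
F = np.fft.fft(np.eye(n))                     # DFT matrix: the intertwining map phi
S = np.roll(np.eye(n), 1, axis=0)             # cyclic shift, generator of the time-domain representation
D = np.diag(np.exp(-2j * np.pi * np.arange(n) / n))  # its frequency-domain counterpart

# Intertwining property for the generator: F @ S == D @ F.
# Polynomials in the generator then intertwine automatically.
assert np.allclose(F @ S, D @ F)

# Filtering then transforming equals transforming then filtering (in the other representation).
s = np.random.randn(n)
h = np.array([0.5, 0.25, 0.25])
rho_h   = sum(hk * np.linalg.matrix_power(S, k) for k, hk in enumerate(h))
sigma_h = sum(hk * np.linalg.matrix_power(D, k) for k, hk in enumerate(h))
assert np.allclose(F @ (rho_h @ s), sigma_h @ (F @ s))
```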

Algebraic Neural Networks

Inspired by the recent perspective that popular graph neural network (GNN) architectures are in fact convolutional neural networks (CNNs),[6] recent work has focused on developing novel neural network architectures from the algebraic point of view.[7][8] An algebraic neural network is a composition of algebraic convolutions, possibly with multiple features and feature aggregations, and pointwise nonlinearities.
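A minimal sketch of a single-feature algebraic neural network on a graph, assuming polynomial filters in a graph shift operator followed by pointwise ReLU nonlinearities; the function names and the fixed (untrained) coefficients are illustrative, whereas in practice the coefficients are learned from data.

```python
import numpy as np

def poly_filter(coeffs, S, f):
    """Algebraic convolution: apply sum_k coeffs[k] S^k to the signal f."""
    out = np.zeros_like(f, dtype=float)
    Skf = f.astype(float)
    for c in coeffs:
        out += c * Skf
        Skf = S @ Skf
    return out

def algebraic_nn(layers, S, f):
    """An algebraic neural network: alternate algebraic convolutions with pointwise ReLUs."""
    x = f
    for coeffs in layers:
        x = np.maximum(poly_filter(coeffs, S, x), 0.0)   # convolution followed by nonlinearity
    return x

# Example: two layers of order-2 filters on a small cycle graph.
n = 6
A = np.roll(np.eye(n), 1, axis=0) + np.roll(np.eye(n), -1, axis=0)  # cycle adjacency, used as the GSO
layers = [[0.2, 0.5, 0.1], [1.0, -0.3, 0.05]]                       # (untrained) filter coefficients per layer
f = np.random.randn(n)
print(algebraic_nn(layers, A, f))
```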

References

  1. Puschel, M.; Moura, J. (2008). "Algebraic Signal Processing Theory: Foundation and 1-D Time". IEEE Transactions on Signal Processing. 56 (8): 3572–3585. Bibcode:2008ITSP...56.3572P. doi:10.1109/TSP.2008.925261. ISSN 1053-587X. S2CID 206797175.
  2. Parada-Mayorga, Alejandro; Riess, Hans; Ribeiro, Alejandro; Ghrist, Robert (2020-10-22). "Quiver Signal Processing (QSP)". arXiv:2010.11525 [eess.SP].
  3. Ruiz, Luana; Chamon, Luiz F. O.; Ribeiro, Alejandro (2021). "Graphon Signal Processing". IEEE Transactions on Signal Processing. 69: 4961–4976. arXiv:2003.05030. Bibcode:2021ITSP...69.4961R. doi:10.1109/TSP.2021.3106857. ISSN 1053-587X. S2CID 212657497.
  4. Puschel, Markus; Seifert, Bastian; Wendler, Chris (2021). "Discrete Signal Processing on Meet/Join Lattices". IEEE Transactions on Signal Processing. 69: 3571–3584. arXiv:2012.04358. Bibcode:2021ITSP...69.3571P. doi:10.1109/TSP.2021.3081036. ISSN 1053-587X. S2CID 227736440.
  5. Bernardini, Riccardo; Rinaldo, Roberto (2021). "Demystifying Lie Group Methods for Signal Processing: A Tutorial". IEEE Signal Processing Magazine. 38 (2): 45–64. Bibcode:2021ISPM...38b..45B. doi:10.1109/MSP.2020.3023540. ISSN 1053-5888. S2CID 232071730.
  6. Gama, Fernando; Isufi, Elvin; Leus, Geert; Ribeiro, Alejandro (2020). "Graphs, Convolutions, and Neural Networks: From Graph Filters to Graph Neural Networks". IEEE Signal Processing Magazine. 37 (6): 128–138. arXiv:2003.03777. Bibcode:2020ISPM...37f.128G. doi:10.1109/MSP.2020.3016143. ISSN 1053-5888. S2CID 226292855.
  7. Parada-Mayorga, Alejandro; Ribeiro, Alejandro (2021). "Algebraic Neural Networks: Stability to Deformations". IEEE Transactions on Signal Processing. 69: 3351–3366. arXiv:2009.01433. Bibcode:2021ITSP...69.3351P. doi:10.1109/TSP.2021.3084537. ISSN 1053-587X. S2CID 221517145.
  8. Parada-Mayorga, Alejandro; Butler, Landon; Ribeiro, Alejandro (2022). "Convolutional Filtering and Neural Networks with Non Commutative Algebras". arXiv:2108.09923 [cs.LG].