Multidimensional discrete convolution

Last updated November 27, 2024

In signal processing, multidimensional discrete convolution refers to the mathematical operation between two functions f and g on an n-dimensional lattice that produces a third function, also of n-dimensions. Multidimensional discrete convolution is the discrete analog of the multidimensional convolution of functions on Euclidean space. It is also a special case of convolution on groups when the group is the group of n-tuples of integers.

Definition
Problem statement and basics
Motivation and applications
Row-column decomposition with separable signals
Separable signals
Row-column decomposition
Computational speedup from row-column decomposition
Circular convolution of discrete-valued multidimensional signals
Convolution theorem in multiple dimensions
Circular convolution approach
Choosing DFT size to avoid aliasing
Summary of procedure using DFTs
Overlap and add
Decomposition into smaller convolution blocks
Breakdown of procedure
Pictorial method of operation
Overlap and save
Comparison to overlap and add
Breakdown of procedure 2
The helix transform
Multidimensional convolution with one-dimensional convolution methods
Filtering on a helix
Applications
Gaussian convolution
Approximation by FIR filter
Approximation by box filter
Applications 2
See also
References

Definition

Problem statement and basics

Similar to the one-dimensional case, an asterisk is used to represent the convolution operation. The number of dimensions in the given operation is reflected in the number of asterisks. For example, an M-dimensional convolution would be written with M asterisks. The following represents a M-dimensional convolution of discrete signals:

$y(n_{1},n_{2},...,n_{M})=x(n_{1},n_{2},...,n_{M})*{\overset {M}{\cdots }}*h(n_{1},n_{2},...,n_{M})$

For discrete-valued signals, this convolution can be directly computed via the following:

$\sum _{k_{1}=-\infty }^{\infty }\sum _{k_{2}=-\infty }^{\infty }...\sum _{k_{M}=-\infty }^{\infty }h(k_{1},k_{2},...,k_{M})x(n_{1}-k_{1},n_{2}-k_{2},...,n_{M}-k_{M})$

The resulting output region of support of a discrete multidimensional convolution will be determined based on the size and regions of support of the two input signals.

Listed are several properties of the two-dimensional convolution operator. Note that these can also be extended for signals of $N$ -dimensions.

Commutative Property:

$x**h=h**x$

Associate Property:

$(x**h)**g=x**(h**g)$

Distributive Property:

$x**(h+g)=(x**h)+(x**g)$

These properties are seen in use in the figure below. Given some input $x(n_{1},n_{2})$ that goes into a filter with impulse response $h(n_{1},n_{2})$ and then another filter with impulse response $g(n_{1},n_{2})$ , the output is given by $y(n_{1},n_{2})$ . Assume that the output of the first filter is given by $w(n_{1},n_{2})$ , this means that:

$w=x**h$

Further, that intermediate function is then convolved with the impulse response of the second filter, and thus the output can be represented by:

$y=w**g=(x**h)**g$

Using the associative property, this can be rewritten as follows:

$y=x**(h**g)$

meaning that the equivalent impulse response for a cascaded system is given by:

$h_{eq}=h**g$

Both figures represent cascaded systems. Note that the order of the filters does not affect the output. Cascaded.png — Both figures represent cascaded systems. Note that the order of the filters does not affect the output.

A similar analysis can be done on a set of parallel systems illustrated below.

In this case, it is clear that:

$y=(x**h)+(x**g)$

Using the distributive law, it is demonstrated that:

$y=x**(h+g)$

This means that in the case of a parallel system, the equivalent impulse response is provided by:

$h_{eq}=h+g$

The equivalent impulse responses in both cascaded systems and parallel systems can be generalized to systems with $N$ -number of filters.^[1]

Motivation and applications

Convolution in one dimension was a powerful discovery that allowed the input and output of a linear shift-invariant (LSI) system (see LTI system theory) to be easily compared so long as the impulse response of the filter system was known. This notion carries over to multidimensional convolution as well, as simply knowing the impulse response of a multidimensional filter too allows for a direct comparison to be made between the input and output of a system. This is profound since several of the signals that are transferred in the digital world today are of multiple dimensions including images and videos. Similar to the one-dimensional convolution, the multidimensional convolution allows the computation of the output of an LSI system for a given input signal.

For example, consider an image that is sent over some wireless network subject to electro-optical noise. Possible noise sources include errors in channel transmission, the analog to digital converter, and the image sensor. Usually noise caused by the channel or sensor creates spatially-independent, high-frequency signal components that translates to arbitrary light and dark spots on the actual image. In order to rid the image data of the high-frequency spectral content, it can be multiplied by the frequency response of a low-pass filter, which based on the convolution theorem, is equivalent to convolving the signal in the time/spatial domain by the impulse response of the low-pass filter. Several impulse responses that do so are shown below.^[2]

Impulse Responses of Typical Multidimensional Low Pass Filters Screen Shot 2015-11-11 at 11.18.23 PM.png — Impulse Responses of Typical Multidimensional Low Pass Filters

In addition to filtering out spectral content, the multidimensional convolution can implement edge detection and smoothing. This once again is wholly dependent on the values of the impulse response that is used to convolve with the input image. Typical impulse responses for edge detection are illustrated below.

Typical Impulse Responses for Edge Detection Screen Shot 2015-11-11 at 11.21.00 PM.png — Typical Impulse Responses for Edge Detection

In addition to image processing, multidimensional convolution can be implemented to enable a variety of other applications. Since filters are widespread in digital communication systems, any system that must transmit multidimensional data is assisted by filtering techniques It is used in real-time video processing, neural network analysis, digital geophysical data analysis, and much more.^[3]

One typical distortion that occurs during image and video capture or transmission applications is blur that is caused by a low-pass filtering process. The introduced blur can be modeled using Gaussian low-pass filtering.

Row-column decomposition with separable signals

Separable signals

A signal is said to be separable if it can be written as the product of multiple one-dimensional signals.^[1] Mathematically, this is expressed as the following:

$x(n_{1},n_{2},...,n_{M})=x(n_{1})x(n_{2})...x(n_{M})$

Some readily recognizable separable signals include the unit step function, and the dirac-delta impulse function.

$u(n_{1},n_{2},...,n_{M})=u(n_{1})u(n_{2})...u(n_{M})$ (unit step function)

$\delta (n_{1},n_{2},...,n_{M})=\delta (n_{1})\delta (n_{2})...\delta (n_{M})$ (dirac-delta impulse function)

Convolution is a linear operation. It then follows that the multidimensional convolution of separable signals can be expressed as the product of many one-dimensional convolutions. For example, consider the case where x and h are both separable functions.

$x(n_{1},n_{2})**h(n_{1},n_{2})=\sum _{k_{1}=-\infty }^{\infty }\sum _{k_{2}=-\infty }^{\infty }h(k_{1},k_{2})x(n_{1}-k_{1},n_{2}-k_{2})$

By applying the properties of separability, this can then be rewritten as the following:

$x(n_{1},n_{2})**h(n_{1},n_{2})={\bigg (}\sum _{k_{1}=-\infty }^{\infty }h(k_{1})x(n_{1}-k_{1}){\bigg )}{\bigg (}\sum _{k_{2}=-\infty }^{\infty }h(k_{2})x(n_{2}-k_{2}){\bigg )}$

It is readily seen then that this reduces to the product of one-dimensional convolutions:

$x(n_{1},n_{2})**h(n_{1},n_{2})={\bigg [}x(n_{1})*h(n_{1}){\bigg ]}{\bigg [}x(n_{2})*h(n_{2}){\bigg ]}$

This conclusion can then be extended to the convolution of two separable M-dimensional signals as follows:

$x(n_{1},n_{2},...,n_{M})*{\overset {M}{\cdots }}*h(n_{1},n_{2},...,n_{M})={\bigg [}x(n_{1})*h(n_{1}){\bigg ]}{\bigg [}x(n_{2})*h(n_{2}){\bigg ]}...{\bigg [}x(n_{M})*h(n_{M}){\bigg ]}$

So, when the two signals are separable, the multidimensional convolution can be computed by computing $n_{M}$ one-dimensional convolutions.

Row-column decomposition

The row-column method can be applied when one of the signals in the convolution is separable. The method exploits the properties of separability in order to achieve a method of calculating the convolution of two multidimensional signals that is more computationally efficient than direct computation of each sample (given that one of the signals are separable).^[4] The following shows the mathematical reasoning behind the row-column decomposition approach (typically $h(n_{1},n_{2})$ is the separable signal):

${\begin{aligned}y(n_{1},n_{2})&=\sum _{k_{1}=-\infty }^{\infty }\sum _{k_{2}=-\infty }^{\infty }h(k_{1},k_{2})x(n_{1}-k_{1},n_{2}-k_{2})\\&=\sum _{k_{1}=-\infty }^{\infty }\sum _{k_{2}=-\infty }^{\infty }h_{1}(k_{1})h_{2}(k_{2})x(n_{1}-k_{1},n_{2}-k_{2})\\&=\sum _{k_{1}=-\infty }^{\infty }h_{1}(k_{1}){\Bigg [}\sum _{k_{2}=-\infty }^{\infty }h_{2}(k_{2})x(n_{1}-k_{1},n_{2}-k_{2}){\Bigg ]}\end{aligned}}$

The value of $\sum _{k_{2}=-\infty }^{\infty }h_{2}(k_{2})x(n_{1}-k_{1},n_{2}-k_{2})$ can now be re-used when evaluating other $y$ values with a shared value of $n_{2}$ :

${\begin{aligned}y(n_{1}+\delta ,n_{2})&=\sum _{k_{1}=-\infty }^{\infty }h_{1}(k_{1}){\Bigg [}\sum _{k_{2}=-\infty }^{\infty }h_{2}(k_{2})x(n_{1}-[k_{1}-\delta ],n_{2}-k_{2}){\Bigg ]}\\&=\sum _{k_{1}=-\infty }^{\infty }h_{1}(k_{1}+\delta ){\Bigg [}\sum _{k_{2}=-\infty }^{\infty }h_{2}(k_{2})x(n_{1}-k_{1},n_{2}-k_{2}){\Bigg ]}\end{aligned}}$

Thus, the resulting convolution can be effectively calculated by first performing the convolution operation on all of the rows of $x(n_{1},n_{2})$ , and then on all of its columns. This approach can be further optimized by taking into account how memory is accessed within a computer processor.

A processor will load in the signal data needed for the given operation. For modern processors, data will be loaded from memory into the processors cache, which has faster access times than memory. The cache itself is partitioned into lines. When a cache line is loaded from memory, multiple data operands are loaded at once. Consider the optimized case where a row of signal data can fit entirely within the processor's cache. This particular processor would be able to access the data row-wise efficiently, but not column-wise since different data operands in the same column would lie on different cache lines.^[5] In order to take advantage of the way in which memory is accessed, it is more efficient to transpose the data set and then access it row-wise rather than attempt to access it column-wise. The algorithm then becomes:

Separate the separable two-dimensional signal $h(n_{1},n_{2})$ into two one-dimensional signals $h_{1}(n_{1})$ and $h_{2}(n_{2})$
Perform row-wise convolution on the horizontal components of the signal $x(n_{1},n_{2})$ using $h_{1}(n_{1})$ to obtain $g(n_{1},n_{2})$
Transpose the vertical components of the signal $g(n_{1},n_{2})$ resulting from Step 2.
Perform row-wise convolution on the transposed vertical components of $g(n_{1},n_{2})$ to get the desired output $y(n_{1},n_{2})$

Computational speedup from row-column decomposition

Examine the case where an image of size $X\times Y$ is being passed through a separable filter of size $J\times K$ . The image itself is not separable. If the result is calculated using the direct convolution approach without exploiting the separability of the filter, this will require approximately $XYJK$ multiplications and additions. If the separability of the filter is taken into account, the filtering can be performed in two steps. The first step will have $XYJ$ multiplications and additions and the second step will have $XYK$ , resulting in a total of $XYJ+XYK$ or $XY(J+K)$ multiplications and additions.^[6] A comparison of the computational complexity between direct and separable convolution is given in the following image:

Circular convolution of discrete-valued multidimensional signals

The premise behind the circular convolution approach on multidimensional signals is to develop a relation between the Convolution theorem and the Discrete Fourier transform (DFT) that can be used to calculate the convolution between two finite-extent, discrete-valued signals.^[7]

Convolution theorem in multiple dimensions

For one-dimensional signals, the Convolution Theorem states that the Fourier transform of the convolution between two signals is equal to the product of the Fourier Transforms of those two signals. Thus, convolution in the time domain is equal to multiplication in the frequency domain. Mathematically, this principle is expressed via the following: $y(n)=h(n)*x(n)\longleftrightarrow Y(\omega )=H(\omega )X(\omega )$ This principle is directly extendable to dealing with signals of multiple dimensions. $y(n_{1},n_{2},...,n_{M})=h(n_{1},n_{2},...,n_{M})*{\overset {M}{\cdots }}*x(n_{1},n_{2},...,n_{M})\longleftrightarrow Y(\omega _{1},\omega _{2},...,\omega _{M})=H(\omega _{1},\omega _{2},...,\omega _{M})X(\omega _{1},\omega _{2},...,\omega _{M})$ This property is readily extended to the usage with the Discrete Fourier transform (DFT) as follows (note that linear convolution is replaced with circular convolution where $\otimes$ is used to denote the circular convolution operation of size $N$ ):

$y(n)=h(n)\otimes x(n)\longleftrightarrow Y(k)=H(k)X(k)$

When dealing with signals of multiple dimensions: $y(n_{1},n_{2},...,n_{M})=h(n_{1},n_{2},...,n_{M})\otimes {\overset {M}{\cdots }}\otimes x(n_{1},n_{2},...,n_{M})\longleftrightarrow Y(k_{1},k_{2},...,k_{M})=H(k_{1},k_{2},...,k_{M})X(k_{1},k_{2},...,k_{M})$ The circular convolutions here will be of size $N_{1},N_{2},...,N_{M}$ .

Circular convolution approach

The motivation behind using the circular convolution approach is that it is based on the DFT. The premise behind circular convolution is to take the DFTs of the input signals, multiply them together, and then take the inverse DFT. Care must be taken such that a large enough DFT is used such that aliasing does not occur. The DFT is numerically computable when dealing with signals of finite-extent. One advantage this approach has is that since it requires taking the DFT and inverse DFT, it is possible to utilize efficient algorithms such as the Fast Fourier transform (FFT). Circular convolution can also be computed in the time/spatial domain and not only in the frequency domain.

Choosing DFT size to avoid aliasing

Consider the following case where two finite-extent signals x and h are taken. For both signals, there is a corresponding DFT as follows:

$x(n_{1},n_{2})\longleftrightarrow X(k_{1},k_{2})$ and $h(n_{1},n_{2})\longleftrightarrow H(k_{1},k_{2})$

The region of support of $x(n_{1},n_{2})$ is $0\leq n_{1}\leq P_{1}-1$ and $0\leq n_{2}\leq P_{2}-1$ and the region of support of $h(n_{1},n_{2})$ is $0\leq n_{1}\leq Q_{1}-1$ and $0\leq n_{2}\leq Q_{2}-1$ .

The linear convolution of these two signals would be given as: $y_{linear}(n_{1},n_{2})=\sum _{m_{1}}\sum _{m_{2}}h(m_{1},m_{2})x(n_{1}-m_{1},n_{2}-m_{2})$ Given the regions of support of $x(n_{1},n_{2})$ and $h(n_{1},n_{2})$ , the region of support of $y_{linear}(n_{1},n_{2})$ will then be given as the following:

$0\leq n_{1}\leq P_{1}+Q_{1}-1$ $0\leq n_{2}\leq P_{2}+Q_{2}-1$ Based on the regions of support of the two signals, a DFT of size $N_{1}\times N_{2}$ must be used where $N_{1}\geq \max(P_{1},Q_{1})$ and $N_{2}\geq \max(P_{2},Q_{2})$ since the same size DFT must be used on both signals. In the event where a DFT size larger than the extent of a signal is needed, the signal is zero-padded until it reaches the required length. After multiplying the DFTs and taking the inverse DFT on the result, the resulting circular convolution is then given by:

$y_{circular}(n_{1},n_{2})=\sum _{r_{1}}\sum _{r_{2}}{\Bigg [}\sum _{m_{1}=0}^{Q_{1}-1}\sum _{m_{2}=0}^{Q_{2}-1}h(m_{1},m_{2})x(n_{1}-m_{1}-r_{1}N_{1},n_{2}-m_{2}-r_{2}N_{2}){\Bigg ]}$ for $(n_{1},n_{2})\in R_{N_{1}N_{2}}$

$R_{N_{1}N_{2}}\triangleq \{(n_{1},n_{2}):0\leq n_{1}\leq N_{1}-1,0\leq n_{2}\leq N_{2}-1\}$

The result will be that $y_{circular}(n_{1},n_{2})$ will be a spatially aliased version of the linear convolution result $y_{linear}(n_{1},n_{2})$ . This can be expressed as the following:

$y_{circular}(n_{1},n_{2})=\sum _{r_{1}}\sum _{r_{2}}y_{linear}(n_{1}-r_{1}N_{1},n_{2}-r_{2}N_{2}){\mathrm {\,\,\,for\,\,\,} }(n_{1},n_{2})\in R_{N_{1}N_{2}}$

Then, in order to avoid aliasing between the spatially aliased replicas, $N_{1}$ and $N_{2}$ must be chosen to satisfy the following conditions:

$N_{1}\geq P_{1}+Q_{1}-1$

$N_{2}\geq P_{2}+Q_{2}-1$

If these conditions are satisfied, then the results of the circular convolution will equal that of the linear convolution (taking the main period of the circular convolution as the region of support). That is:

$y_{circular}(n_{1},n_{2})=y_{linear}(n_{1},n_{2})$ for $(n_{1},n_{2})\in R_{N_{1}N_{2}}$

Summary of procedure using DFTs

The Convolution theorem and circular convolution can thus be used in the following manner to achieve a result that is equal to performing the linear convolution:^[8]

Choose $N_{1}$ and $N_{2}$ to satisfy $N_{1}\geq P_{1}+Q_{1}-1$ and $N_{2}\geq P_{2}+Q_{2}-1$
Zero pad the signals $h(n_{1},n_{2})$ and $x(n_{1},n_{2})$ such that they are both $N_{1}\times N_{2}$ in size
Compute the DFTs of both $h(n_{1},n_{2})$ and $x(n_{1},n_{2})$
Multiple the results of the DFTs to obtain $Y(k_{1},k_{2})=H(k_{1},k_{2})X(k_{1},k_{2})$
The result of the IDFT of $Y(k_{1},k_{2})$ will then be equal to the result of performing linear convolution on the two signals

Overlap and add

Another method to perform multidimensional convolution is the overlap and add approach. This method helps reduce the computational complexity often associated with multidimensional convolutions due to the vast amounts of data inherent in modern-day digital systems.^[9] For sake of brevity, the two-dimensional case is used as an example, but the same concepts can be extended to multiple dimensions.

Consider a two-dimensional convolution using a direct computation:

$y(n_{1},n_{2})=\sum _{k_{1}=-\infty }^{\infty }\sum _{k_{2}=-\infty }^{\infty }x(n_{1}-k_{1},n_{2}-k_{2})h(k_{1},k_{2})$

Assuming that the output signal $y(n_{1},n_{2})$ has N nonzero coefficients, and the impulse response has M nonzero samples, this direct computation would need MN multiplies and MN - 1 adds in order to compute. Using an FFT instead, the frequency response of the filter and the Fourier transform of the input would have to be stored in memory.^[10] Massive amounts of computations and excessive use of memory storage space pose a problematic issue as more dimensions are added. This is where the overlap and add convolution method comes in.

Decomposition into smaller convolution blocks

Instead of performing convolution on the blocks of information in their entirety, the information can be broken up into smaller blocks of dimensions $L_{1}$ x $L_{2}$ resulting in smaller FFTs, less computational complexity, and less storage needed. This can be expressed mathematically as follows:

$x(n_{1},n_{2})=\sum _{i=1}^{P_{1}}\sum _{j=1}^{P_{2}}x_{ij}(n_{1},n_{2})$

where $x(n_{1},n_{2})$ represents the $N_{1}$ x $N_{2}$ input signal, which is a summation of $P_{1}P_{2}$ block segments, with $P_{1}=N_{1}/L_{1}$ and $P_{2}=N_{2}/L_{2}$ .

To produce the output signal, a two-dimensional convolution is performed:

$y(n_{1},n_{2})=x(n_{1},n_{2})**h(n_{1},n_{2})$

Substituting in for $x(n_{1},n_{2})$ results in the following:

$y(n_{1},n_{2})=\sum _{i=1}^{P_{1}}\sum _{j=1}^{P_{2}}x_{ij}(n_{1},n_{2})**h(n_{1},n_{2})$

This convolution adds more complexity than doing a direct convolution; however, since it is integrated with an FFT fast convolution, overlap-add performs faster and is a more memory-efficient method, making it practical for large sets of multidimensional data.

Breakdown of procedure

Let $h(n_{1},n_{2})$ be of size $M_{1}\times M_{2}$ :

Break input $x(n_{1},n_{2})$ into non-overlapping blocks of dimensions $L_{1}\times L_{2}$ .
Zero pad $h(n_{1},n_{2})$ such that it has dimensions ( $L_{1}+M_{1}-1$ ) $\times$ ( $L_{2}+M_{2}-1$ ).
Use DFT to get $H(k_{1},k_{2})$ .
For each input block:
1. Zero pad $x_{ij}(n_{1},n_{2})$ to be of dimensions ( $L_{1}+M_{1}-1$ ) $\times$ ( $L_{2}+M_{2}-1$ ).
2. Take discrete Fourier transform of each block to give $X_{ij}(k_{1},k_{2})$ .
3. Multiply to get $Y_{ij}(k_{1},k_{2})=X_{ij}(k_{1},k_{2})H(k_{1},k_{2})$ .
4. Take inverse discrete Fourier transform of $Y_{ij}(k_{1},k_{2})$ to get $y_{ij}(n_{1},n_{2})$ .
Find $y(n_{1},n_{2})$ by overlap and adding the last $(M_{1}-1)$ $\times$ $(M_{2}-1)$ samples of $y_{ij}(n_{1},n_{2})$ with the first $(M_{1}-1)$ $\times$ $(M_{2}-1)$ samples of $y_{i+1,j+1}(n_{1},n_{2})$ to get the result.^[11]

Pictorial method of operation

In order to visualize the overlap-add method more clearly, the following illustrations examine the method graphically. Assume that the input $x(n_{1},n_{2})$ has a square region support of length N in both vertical and horizontal directions as shown in the figure below. It is then broken up into four smaller segments in such a way that it is now composed of four smaller squares. Each block of the aggregate signal has dimensions $(N/2)$ $\times$ $(N/2)$ .

Then, each component is convolved with the impulse response of the filter. Note that an advantage for an implementation such as this can be visualized here since each of these convolutions can be parallelized on a computer, as long as the computer has sufficient memory and resources to store and compute simultaneously.

In the figure below, the first graph on the left represents the convolution corresponding to the component of the input $x_{0,0}$ with the corresponding impulse response $h(n_{1},n_{2})$ . To the right of that, the input $x_{1,0}$ is then convolved with the impulse response $h(n_{1},n_{2})$ .

The same process is done for the other two inputs respectively, and they are accumulated together in order to form the convolution. This is depicted to the left.

Assume that the filter impulse response $h(n_{1},n_{2})$ has a region of support of $(N/8)$ in both dimensions. This entails that each convolution convolves signals with dimensions $(N/2)$ $\times$ $(N/8)$ in both $n_{1}$ and $n_{2}$ directions, which leads to overlap (highlighted in blue) since the length of each individual convolution is equivalent to:

$(N/2)$ $+$ $(N/8)$ $-$ $1$ = $(5/8)N-1$

in both directions. The lighter blue portion correlates to the overlap between two adjacent convolutions, whereas the darker blue portion correlates to overlap between all four convolutions. All of these overlap portions are added together in addition to the convolutions in order to form the combined convolution $y(n_{1},n_{2})$ .^[12]

Overlap and save

The overlap and save method, just like the overlap and add method, is also used to reduce the computational complexity associated with discrete-time convolutions. This method, coupled with the FFT, allows for massive amounts of data to be filtered through a digital system while minimizing the necessary memory space used for computations on massive arrays of data.

Comparison to overlap and add

The overlap and save method is very similar to the overlap and add methods with a few notable exceptions. The overlap-add method involves a linear convolution of discrete-time signals, whereas the overlap-save method involves the principle of circular convolution. In addition, the overlap and save method only uses a one-time zero padding of the impulse response, while the overlap-add method involves a zero-padding for every convolution on each input component. Instead of using zero padding to prevent time-domain aliasing like its overlap-add counterpart, overlap-save simply discards all points of aliasing, and saves the previous data in one block to be copied into the convolution for the next block.

In one dimension, the performance and storage metric differences between the two methods is minimal. However, in the multidimensional convolution case, the overlap-save method is preferred over the overlap-add method in terms of speed and storage abilities.^[13] Just as in the overlap and add case, the procedure invokes the two-dimensional case but can easily be extended to all multidimensional procedures.

Breakdown of procedure

Let $h(n_{1},n_{2})$ be of size $M_{1}\times M_{2}$ :

Insert $(M_{1}-1)$ columns and $(M_{2}-1)$ rows of zeroes at the beginning of the input signal $x(n_{1},n_{2})$ in both dimensions.
Split the corresponding signal into overlapping segments of dimensions ( $L_{1}+M_{1}-1$ ) $\times$ ( $L_{2}+M_{2}-1$ ) in which each two-dimensional block will overlap by $(M_{1}-1)$ $\times$ $(M_{2}-1)$ .
Zero pad $h(n_{1},n_{2})$ such that it has dimensions ( $L_{1}+M_{1}-1$ ) $\times$ ( $L_{2}+M_{2}-1$ ).
Use DFT to get $H(k_{1},k_{2})$ .
For each input block:
1. Take discrete Fourier transform of each block to give $X_{ij}(k_{1},k_{2})$ .
2. Multiply to get $Y_{ij}(k_{1},k_{2})=X_{ij}(k_{1},k_{2})H(k_{1},k_{2})$ .
3. Take inverse discrete Fourier transform of $Y_{ij}(k_{1},k_{2})$ to get $y_{ij}(n_{1},n_{2})$ .
4. Get rid of the first $(M_{1}-1)$ $\times$ $(M_{2}-1)$ for each output block $y_{ij}(n_{1},n_{2})$ .
Find $y(n_{1},n_{2})$ by attaching the last $(L_{1}\times L_{2})$ samples for each output block $y_{ij}(n_{1},n_{2})$ .^[11]

The helix transform

Similar to row-column decomposition, the helix transform computes the multidimensional convolution by incorporating one-dimensional convolutional properties and operators. Instead of using the separability of signals, however, it maps the Cartesian coordinate space to a helical coordinate space allowing for a mapping from a multidimensional space to a one-dimensional space.

Multidimensional convolution with one-dimensional convolution methods

To understand the helix transform, it is useful to first understand how a multidimensional convolution can be broken down into a one-dimensional convolution. Assume that the two signals to be convolved are $X_{M\times N}$ and $Y_{K\times L}$ , which results in an output $Z_{(M-K+1)\times (N-L+1)}$ . This is expressed as follows:

$Z(i,j)=\sum _{m=0}^{M-1}\sum _{n=0}^{N-1}X(m,n)Y(i-m,j-n)$

Next, two matrices are created that zero pad each input in both dimensions such that each input has equivalent dimensions, i.e.

$\mathbf {X'} ={\begin{bmatrix}X&0\\0&0\\\end{bmatrix}}$ and $\mathbf {Y'} ={\begin{bmatrix}Y&0\\0&0\\\end{bmatrix}}$

where each of the input matrices are now of dimensions $(M+K-1)$ $\times$ $(N+L-1)$ . It is then possible to implement column-wise lexicographic ordering in order to convert the modified matrices into vectors, $X''$ and $Y''$ . In order to minimize the number of unimportant samples in each vector, each vector is truncated after the last sample in the original matrices $X$ and $Y$ respectively. Given this, the length of vector $X''$ and $Y''$ are given by:

$l_{X''}=$ $(M+K-1)$ $\times$ $(N-1)$ + $M$

$l_{Y''}=$ $(M+K-1)$ $\times$ $(L-1)$ + $K$

The length of the convolution of these two vectors, $Z''$ , can be derived and shown to be:

$l_{Z''}=$ $l_{Y''}+$ $l_{X''}$ $=(M+K-1)$ $\times$ $(N+L-1)$

This vector length is equivalent to the dimensions of the original matrix output $Z$ , making converting back to a matrix a direct transformation. Thus, the vector, $Z''$ , is converted back to matrix form, which produces the output of the two-dimensional discrete convolution.^[14]

Filtering on a helix

When working on a two-dimensional Cartesian mesh, a Fourier transform along either axes will result in the two-dimensional plane becoming a cylinder as the end of each column or row attaches to its respective top forming a cylinder. Filtering on a helix behaves in a similar fashion, except in this case, the bottom of each column attaches to the top of the next column, resulting in a helical mesh. This is illustrated below. The darkened tiles represent the filter coefficients.

If this helical structure is then sliced and unwound into a one-dimensional strip, the same filter coefficients on the 2-d Cartesian plane will match up with the same input data, resulting in an equivalent filtering scheme. This ensures that a two-dimensional convolution will be able to be performed by a one-dimensional convolution operator as the 2D filter has been unwound to a 1D filter with gaps of zeroes separating the filter coefficients.

Assuming that some-low pass two-dimensional filter was used, such as:

0	-1	0
-1	4	-1
0	-1	0

Then, once the two-dimensional space was converted into a helix, the one-dimensional filter would look as follows:

$h(n)=-1,0,...,0,-1,4,-1,0,...,0,-1,0,...$

Notice in the one-dimensional filter that there are no leading zeroes as illustrated in the one-dimensional filtering strip after being unwound. The entire one-dimensional strip could have been convolved with; however, it is less computationally expensive to simply ignore the leading zeroes. In addition, none of these backside zero values will need to be stored in memory, preserving precious memory resources.^[15]

Applications

Helix transformations to implement recursive filters via convolution are used in various areas of signal processing. Although frequency domain Fourier analysis is effective when systems are stationary, with constant coefficients and periodically-sampled data, it becomes more difficult in unstable systems. The helix transform enables three-dimensional post-stack migration processes that can process data for three-dimensional variations in velocity.^[15] In addition, it can be applied to assist with the problem of implicit three-dimensional wavefield extrapolation.^[16] Other applications include helpful algorithms in seismic data regularization, prediction error filters, and noise attenuation in geophysical digital systems.^[14]

Gaussian convolution

One application of multidimensional convolution that is used within signal and image processing is Gaussian convolution. This refers to convolving an input signal with the Gaussian distribution function.

The Gaussian distribution sampled at discrete values in one dimension is given by the following (assuming $\mu =0$ ): $G(n)={\frac {1}{\sqrt {2\pi \sigma ^{2}}}}e^{-{\frac {n^{2}}{2\sigma ^{2}}}}$ This is readily extended to a signal of M dimensions (assuming $\sigma$ stays constant for all dimensions and $\mu _{1}=\mu _{2}=...=\mu _{M}=0$ ): $G(n_{1},n_{2},...,n_{M})={\frac {1}{(2\pi )^{M/2}\sigma ^{M}}}e^{-{\frac {({n_{1}}^{2}+{n_{2}}^{2}+...+{n_{M}}^{2})}{2\sigma ^{2}}}}$ One important property to recognize is that the M dimensional signal is separable such that: $G(n_{1},n_{2},...,n_{M})=G(n_{1})G(n_{2})...G(n_{M})$ Then, Gaussian convolution with discrete-valued signals can be expressed as the following:

$y(n)=x(n)*G(n)$

$y(n_{1},n_{2},...,n_{M})=x(n_{1},n_{2},...,n_{M})*...*G(n_{1},n_{2},...,n_{M})$

Approximation by FIR filter

Gaussian convolution can be effectively approximated via implementation of a Finite impulse response (FIR) filter. The filter will be designed with truncated versions of the Gaussian. For a two-dimensional filter, the transfer function of such a filter would be defined as the following:^[17]

$H(z_{1},z_{2})={\frac {1}{s(r_{1},r_{2})}}\sum _{n_{1}=-r_{1}}^{r_{1}}\sum _{n_{2}=-r_{2}}^{r_{2}}G(n_{1},n_{2}){z_{1}}^{-n_{1}}{z_{2}}^{-n_{2}}$

where

$s(r_{1},r_{2})=\sum _{n_{1}=-r_{1}}^{r_{1}}\sum _{n_{2}=-r_{2}}^{r_{2}}G(n_{1},n_{2})$

Choosing lower values for $r_{1}$ and $r_{2}$ will result in performing less computations, but will yield a less accurate approximation while choosing higher values will yield a more accurate approximation, but will require a greater number of computations.

Approximation by box filter

Another method for approximating Gaussian convolution is via recursive passes through a box filter. For approximating one-dimensional convolution, this filter is defined as the following:^[17]

$H(z)={\frac {1}{2r+1}}{\frac {z^{r}-z^{-r-1}}{1-z^{-}1}}$

Typically, recursive passes 3, 4, or 5 times are performed in order to obtain an accurate approximation.^[17] A suggested method for computing r is then given as the following:^[18]

$\sigma ^{2}={\frac {1}{12}}K((2r+1)^{2}-1)$ where K is the number of recursive passes through the filter.

Then, since the Gaussian distribution is separable across different dimensions, it follows that recursive passes through one-dimensional filters (isolating each dimension separately) will thus yield an approximation of the multidimensional Gaussian convolution. That is, M-dimensional Gaussian convolution could be approximated via recursive passes through the following one-dimensional filters:

$H(z_{1})={\frac {1}{2r_{1}+1}}{\frac {{z_{1}}^{r_{1}}-{z_{1}}^{-r_{1}-1}}{1-{z_{1}}^{-}1}}$

$H(z_{2})={\frac {1}{2r_{2}+1}}{\frac {{z_{2}}^{r_{2}}-{z_{2}}^{-r_{2}-1}}{1-{z_{2}}^{-}1}}$

$\vdots$

$H(z_{M})={\frac {1}{2r_{M}+1}}{\frac {{z_{M}}^{r_{M}}-{z_{M}}^{-r_{M}-1}}{1-{z_{M}}^{-}1}}$

Applications

Gaussian convolutions are used extensively in signal and image processing. For example, image-blurring can be accomplished with Gaussian convolution where the $\sigma$ parameter will control the strength of the blurring. Higher values would thus correspond to a more blurry end result.^[19] It is also commonly used in Computer vision applications such as Scale-invariant feature transform (SIFT) feature detection.^[20]

Related Research Articles

In mathematics, convolution is a mathematical operation on two functions that produces a third function. The term convolution refers to both the result function and to the process of computing it. It is defined as the integral of the product of the two functions after one is reflected about the y-axis and shifted. The integral is evaluated for all values of shift, producing the convolution function. The choice of which function is reflected and shifted before the integral does not change the integral result. Graphically, it expresses how the 'shape' of one function is modified by the other.

In mathematics, the discrete Fourier transform (DFT) converts a finite sequence of equally-spaced samples of a function into a same-length sequence of equally-spaced samples of the discrete-time Fourier transform (DTFT), which is a complex-valued function of frequency. The interval at which the DTFT is sampled is the reciprocal of the duration of the input sequence. An inverse DFT (IDFT) is a Fourier series, using the DTFT samples as coefficients of complex sinusoids at the corresponding DTFT frequencies. It has the same sample-values as the original input sequence. The DFT is therefore said to be a frequency domain representation of the original input sequence. If the original sequence spans all the non-zero values of a function, its DTFT is continuous, and the DFT provides discrete samples of one cycle. If the original sequence is one cycle of a periodic function, the DFT provides all the non-zero values of one DTFT cycle.

A discrete Hartley transform (DHT) is a Fourier-related transform of discrete, periodic data similar to the discrete Fourier transform (DFT), with analogous applications in signal processing and related fields. Its main distinction from the DFT is that it transforms real inputs to real outputs, with no intrinsic involvement of complex numbers. Just as the DFT is the discrete analogue of the continuous Fourier transform (FT), the DHT is the discrete analogue of the continuous Hartley transform (HT), introduced by Ralph V. L. Hartley in 1942.

In mathematics and signal processing, the Hilbert transform is a specific singular integral that takes a function, $u (t)$ of a real variable and produces another function of a real variable $H(u)(t)$ . The Hilbert transform is given by the Cauchy principal value of the convolution with the function $(see § Definition). The Hilbert transform has a particularly simple representation in the frequency domain: It imparts a phase shift of \pm90° (π /2 radians) to every frequency component of a function, the sign of the shift depending on the sign of the frequency (see § Relationship with the Fourier transform). The Hilbert transform is important in signal processing, where it is a component of the analytic representation of a real-valued signal u (t) . The Hilbert transform was first introduced by David Hilbert in this setting, to solve a special case of the Riemann-Hilbert problem for analytic functions.$

In numerical analysis and functional analysis, a discrete wavelet transform (DWT) is any wavelet transform for which the wavelets are discretely sampled. As with other wavelet transforms, a key advantage it has over Fourier transforms is temporal resolution: it captures both frequency and location information.

In mathematics, the discrete-time Fourier transform (DTFT) is a form of Fourier analysis that is applicable to a sequence of discrete values.

In system analysis, among other fields of study, a linear time-invariant (LTI) system is a system that produces an output signal from any input signal subject to the constraints of linearity and time-invariance; these terms are briefly defined in the overview below. These properties apply (exactly or approximately) to many important physical systems, in which case the response $y (t)$ of the system to an arbitrary input $x (t)$ can be found directly using convolution: $y (t) = (x * h)(t)$ where $h (t)$ is called the system's impulse response and ∗ represents convolution (not to be confused with multiplication). What's more, there are systematic methods for solving any such system (determining $h (t)$ ), whereas systems not meeting both properties are generally more difficult (or impossible) to solve analytically. A good example of an LTI system is any electrical circuit consisting of resistors, capacitors, inductors and linear amplifiers.

In signal processing, a filter bank is an array of bandpass filters that separates the input signal into multiple components, each one carrying a sub-band of the original signal. One application of a filter bank is a graphic equalizer, which can attenuate the components differently and recombine them into a modified version of the original signal. The process of decomposition performed by the filter bank is called analysis ; the output of analysis is referred to as a subband signal with as many subbands as there are filters in the filter bank. The reconstruction process is called synthesis, meaning reconstitution of a complete signal resulting from the filtering process.

In the areas of computer vision, image analysis and signal processing, the notion of scale-space representation is used for processing measurement data at multiple scales, and specifically enhance or suppress image features over different ranges of scale. A special type of scale-space representation is provided by the Gaussian scale space, where the image data in N dimensions is subjected to smoothing by Gaussian convolution. Most of the theory for Gaussian scale space deals with continuous images, whereas one when implementing this theory will have to face the fact that most measurement data are discrete. Hence, the theoretical problem arises concerning how to discretize the continuous theory while either preserving or well approximating the desirable theoretical properties that lead to the choice of the Gaussian kernel. This article describes basic approaches for this that have been developed in the literature, see also for an in-depth treatment regarding the topic of approximating the Gaussian smoothing operation and the Gaussian derivative computations in scale-space theory.

In signal processing, the overlap–add method is an efficient way to evaluate the discrete convolution of a very long signal $with a finite impulse response (FIR) filter :$

In signal processing, overlap–save is the traditional name for an efficient way to evaluate the discrete convolution between a very long signal $and a finite impulse response (FIR) filter :$

Algebraic signal processing (ASP) is an emerging area of theoretical signal processing (SP). In the algebraic theory of signal processing, a set of filters is treated as an (abstract) algebra, a set of signals is treated as a module or vector space, and convolution is treated as an algebra representation. The advantage of algebraic signal processing is its generality and portability.

In mathematical analysis and applications, multidimensional transforms are used to analyze the frequency content of signals in a domain of two or more dimensions.

In signal processing, multidimensional signal processing covers all signal processing done using multidimensional signals and systems. While multidimensional signal processing is a subset of signal processing, it is unique in the sense that it deals specifically with data that can only be adequately detailed using more than one dimension. In m-D digital signal processing, useful data is sampled in more than one dimension. Examples of this are image processing and multi-sensor radar detection. Both of these examples use multiple sensors to sample signals and form images based on the manipulation of these multiple signals. Processing in multi-dimension (m-D) requires more complex algorithms, compared to the 1-D case, to handle calculations such as the fast Fourier transform due to more degrees of freedom. In some cases, m-D signals and systems can be simplified into single dimension signal processing methods, if the considered systems are separable.

Multidimension spectral estimation is a generalization of spectral estimation, normally formulated for one-dimensional signals, to multidimensional signals or multivariate data, such as wave vectors.

Multidimensional Multirate systems find applications in image compression and coding. Several applications such as conversion between progressive video signals require usage of multidimensional multirate systems. In multidimensional multirate systems, the basic building blocks are decimation matrix (M), expansion matrix(L) and Multidimensional digital filters. The decimation and expansion matrices have dimension of D x D, where D represents the dimension. To extend the one dimensional (1-D) multirate results, there are two different ways which are based on the structure of decimation and expansion matrices. If these matrices are diagonal, separable approaches can be used, which are separable operations in each dimension. Although separable approaches might serve less complexity, non-separable methods, with non-diagonal expansion and decimation matrices, provide much better performance. The difficult part in non-separable methods is to create results in MD case by extend the 1-D case. Polyphase decomposition and maximally decimated reconstruction systems are already carried out.

Similar to 1-D Digital signal processing in case of the Multidimensional signal processing we have Efficient algorithms. The efficiency of an Algorithm can be evaluated by the amount of computational resources it takes to compute output or the quantity of interest. In this page, two of the very efficient algorithms for multidimensional signals are explained. For the sake of simplicity and description it is explained for 2-D Signals. However, same theory holds good for M-D signals. The exact computational savings for each algorithm is also mentioned.

In signal processing, nonlinear multidimensional signal processing (NMSP) covers all signal processing using nonlinear multidimensional signals and systems. Nonlinear multidimensional signal processing is a subset of signal processing (multidimensional signal processing). Nonlinear multi-dimensional systems can be used in a broad range such as imaging, teletraffic, communications, hydrology, geology, and economics. Nonlinear systems cannot be treated as linear systems, using Fourier transformation and wavelet analysis. Nonlinear systems will have chaotic behavior, limit cycle, steady state, bifurcation, multi-stability and so on. Nonlinear systems do not have a canonical representation, like impulse response for linear systems. But there are some efforts to characterize nonlinear systems, such as Volterra and Wiener series using polynomial integrals as the use of those methods naturally extend the signal into multi-dimensions. Another example is the Empirical mode decomposition method using Hilbert transform instead of Fourier Transform for nonlinear multi-dimensional systems. This method is an empirical method and can be directly applied to data sets. Multi-dimensional nonlinear filters (MDNF) are also an important part of NMSP, MDNF are mainly used to filter noise in real data. There are nonlinear-type hybrid filters used in color image processing, nonlinear edge-preserving filters use in magnetic resonance image restoration. Those filters use both temporal and spatial information and combine the maximum likelihood estimate with the spatial smoothing algorithm.

Multidimensional seismic data processing forms a major component of seismic profiling, a technique used in geophysical exploration. The technique itself has various applications, including mapping ocean floors, determining the structure of sediments, mapping subsurface currents and hydrocarbon exploration. Since geophysical data obtained in such techniques is a function of both space and time, multidimensional signal processing techniques may be better suited for processing such data.

Parallelmultidimensionaldigitalsignal processing (mD-DSP) is defined as the application of parallel programming and multiprocessing to digital signal processing techniques to process digital signals that have more than a single dimension. The use of mD-DSP is fundamental to many application areas such as digital image and video processing, medical imaging, geophysical signal analysis, sonar, radar, lidar, array processing, computer vision, computational photography, and augmented and virtual reality. However, as the number of dimensions of a signal increases the computational complexity to operate on the signal increases rapidly. This relationship between the number of dimensions and the amount of complexity, related to both time and space, as studied in the field of algorithm analysis, is analogues to the concept of the curse of dimensionality. This large complexity generally results in an extremely long execution run-time of a given mD-DSP application rendering its usage to become impractical for many applications; especially for real-time applications. This long run-time is the primary motivation of applying parallel algorithmic techniques to mD-DSP problems.

References

1 2 Dudgeon, Dan; Mersereau, Russell (1983), Multidimensional Digital Signal Processing, Prentice-Hall, pp. 21–22
↑ "MARBLE: Interactive Vision". homepages.inf.ed.ac.uk. Retrieved 2015-11-12.
↑ "Digital Geophysical Analysis Redesign". www-rohan.sdsu.edu. Retrieved 2015-11-12.
↑ Sihvo, Tero; Niittylahti, Jarkko (5 June 2005). "Row-Column Decomposition Based 2D Transform Optimization on Subword Parallel Processors". International Symposium on Signals, Circuits and Systems, 2005. ISSCS 2005. Vol. 1. pp. 99–102. doi:10.1109/ISSCS.2005.1509860. ISBN 978-0-7803-9029-4.
↑ "Introduction to Caches". Computer Science University of Maryland. Retrieved 10 November 2015.
↑ Eddins, Steve. "Separable Convolution". Mathwords. Retrieved 10 November 2015.
↑ Dudgeon, Dan; Mersereau, Russell (1983), Multidimensional Digital Signal Processing, Prentice-Hall, p. 70
↑ Dudgeon, Dan; Mersereau, Russell (1983), Multidimensional Digital Signal Processing, Prentice-Hall, p. 72
↑ Fernandez, Joseph; Kumar, Vijaya (2013). Multidimensional Overlap-Add and Overlap-Save for Correlation and Convolution. pp. 509–513. doi:10.1109/ICIP.2013.6738105. ISBN 978-1-4799-2341-0.
↑ "2D Signal Processing" (PDF). EE502: Digital Signal Processing. Dublin City University. p. 24. Retrieved November 11, 2015.
1 2 Kundur, Deepa. "Overlap-Save and Overlap-Add" (PDF). University of Toronto. Retrieved November 12, 2015.
↑ "2D Signal Processing" (PDF). EE502: Digital Signal Processing. Dublin City University. p. 26. Retrieved November 11, 2015.
↑ Kim, Chang; Strintzis, Michael (May 1980). "High-Speed Multidimensional Convolution". IEEE Transactions on Pattern Analysis and Machine Intelligence. PAMI-2 (3): 269–273. doi:10.1109/tpami.1980.4767017.
1 2 Naghizadeh, Mostafa; Sacchi, Mauricio (November 2009). "Multidimensional convolution via a 1D convolution algorithm". The Leading Edge.
1 2 Claerbout, Jon (September 1998). "Multidimensional recursive filters via a helix". Geophysics. 63 (5): 9. Bibcode:1998Geop...63.1532C. CiteSeerX 10.1.1.76.1193 . doi:10.1190/1.1444449.
↑ Fomel, Sergey; Claerbout, Jon (1997). "Exploring three-dimensional implicit wavefield extrapolation with the helix transform" (PDF). SEP Report: 43–60. Archived from the original (PDF) on 2019-01-04.
1 2 3 Getreuer, Pascal (2013). "A Survey of Gaussian Convolution Algorithms". Image Processing on Line. 3: 286–310. doi: 10.5201/ipol.2013.87 .
↑ Wells, W.M. (1986). "Efficient synthesis of Gaussian filters by cascaded uniform filters". IEEE Transactions on Pattern Analysis and Machine Intelligence. PAMI-8 (2): 234–239. doi:10.1109/TPAMI.1986.4767776.
↑ "Gaussian Blur - Image processing for scientists and engineers, Part 4". patrick-fuller.com. Retrieved 2015-11-12.
↑ Lowe, D.G. (1999). "Object recognition from local scale-invariant features" (PDF). Proceedings of the International Conference on Computer Vision. 2: 1150–1157.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[:4-1] 1 2 Dudgeon, Dan; Mersereau, Russell (1983), Multidimensional Digital Signal Processing, Prentice-Hall, pp. 21–22

[2] "MARBLE: Interactive Vision". homepages.inf.ed.ac.uk. Retrieved 2015-11-12.

[3] "Digital Geophysical Analysis Redesign". www-rohan.sdsu.edu. Retrieved 2015-11-12.

[4] Sihvo, Tero; Niittylahti, Jarkko (5 June 2005). "Row-Column Decomposition Based 2D Transform Optimization on Subword Parallel Processors". International Symposium on Signals, Circuits and Systems, 2005. ISSCS 2005. Vol. 1. pp. 99–102. doi:10.1109/ISSCS.2005.1509860. ISBN 978-0-7803-9029-4.

[5] "Introduction to Caches". Computer Science University of Maryland. Retrieved 10 November 2015.

[6] Eddins, Steve. "Separable Convolution". Mathwords. Retrieved 10 November 2015.

[7] Dudgeon, Dan; Mersereau, Russell (1983), Multidimensional Digital Signal Processing, Prentice-Hall, p. 70

[8] Dudgeon, Dan; Mersereau, Russell (1983), Multidimensional Digital Signal Processing, Prentice-Hall, p. 72

[9] Fernandez, Joseph; Kumar, Vijaya (2013). Multidimensional Overlap-Add and Overlap-Save for Correlation and Convolution. pp. 509–513. doi:10.1109/ICIP.2013.6738105. ISBN 978-1-4799-2341-0.

[10] "2D Signal Processing" (PDF). EE502: Digital Signal Processing. Dublin City University. p. 24. Retrieved November 11, 2015.

[:3-11] 1 2 Kundur, Deepa. "Overlap-Save and Overlap-Add" (PDF). University of Toronto. Retrieved November 12, 2015.

[12] "2D Signal Processing" (PDF). EE502: Digital Signal Processing. Dublin City University. p. 26. Retrieved November 11, 2015.

[13] Kim, Chang; Strintzis, Michael (May 1980). "High-Speed Multidimensional Convolution". IEEE Transactions on Pattern Analysis and Machine Intelligence. PAMI-2 (3): 269–273. doi:10.1109/tpami.1980.4767017.

[:1-14] 1 2 Naghizadeh, Mostafa; Sacchi, Mauricio (November 2009). "Multidimensional convolution via a 1D convolution algorithm". The Leading Edge.

[:2-15] 1 2 Claerbout, Jon (September 1998). "Multidimensional recursive filters via a helix". Geophysics. 63 (5): 9. Bibcode:1998Geop...63.1532C. CiteSeerX 10.1.1.76.1193 . doi:10.1190/1.1444449.

[16] Fomel, Sergey; Claerbout, Jon (1997). "Exploring three-dimensional implicit wavefield extrapolation with the helix transform" (PDF). SEP Report: 43–60. Archived from the original (PDF) on 2019-01-04.

[:0-17] 1 2 3 Getreuer, Pascal (2013). "A Survey of Gaussian Convolution Algorithms". Image Processing on Line. 3: 286–310. doi: 10.5201/ipol.2013.87 .

[18] Wells, W.M. (1986). "Efficient synthesis of Gaussian filters by cascaded uniform filters". IEEE Transactions on Pattern Analysis and Machine Intelligence. PAMI-8 (2): 234–239. doi:10.1109/TPAMI.1986.4767776.

[19] "Gaussian Blur - Image processing for scientists and engineers, Part 4". patrick-fuller.com. Retrieved 2015-11-12.

[20] Lowe, D.G. (1999). "Object recognition from local scale-invariant features" (PDF). Proceedings of the International Conference on Computer Vision. 2: 1150–1157.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

Multidimensional discrete convolution

Contents

Definition

Problem statement and basics

Motivation and applications

Row-column decomposition with separable signals

Separable signals

Row-column decomposition

Computational speedup from row-column decomposition

Circular convolution of discrete-valued multidimensional signals

Convolution theorem in multiple dimensions

Circular convolution approach

Choosing DFT size to avoid aliasing

Summary of procedure using DFTs

Overlap and add

Decomposition into smaller convolution blocks

Breakdown of procedure

Pictorial method of operation

Overlap and save

Comparison to overlap and add

Breakdown of procedure

The helix transform

Multidimensional convolution with one-dimensional convolution methods

Filtering on a helix

Applications

Gaussian convolution

Approximation by FIR filter

Approximation by box filter

Applications

See also

Related Research Articles

References