Matched filter

In signal processing, the output of the matched filter is given by correlating a known delayed signal, or template, with an unknown signal to detect the presence of the template in the unknown signal. [1] [2] This is equivalent to convolving the unknown signal with a conjugated time-reversed version of the template. The matched filter is the optimal linear filter for maximizing the signal-to-noise ratio (SNR) in the presence of additive stochastic noise.
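To make the correlation–convolution equivalence concrete, here is a minimal NumPy sketch (an illustration, not from the original article; the template, noise level, and offset are invented):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical template and a noisy signal containing it at offset 30.
template = np.array([1.0, 2.0, 3.0, 2.0, 1.0])
x = 0.1 * rng.standard_normal(100)
x[30:35] += template

# Matched filtering: correlate the unknown signal with the template ...
corr = np.correlate(x, template, mode="valid")

# ... which equals convolving with the conjugated, time-reversed template.
conv = np.convolve(x, np.conj(template)[::-1], mode="valid")
assert np.allclose(corr, conv)

# The matched-filter output peaks where the template sits in the signal.
offset = int(np.argmax(corr))
```

The peak of the matched-filter output marks the location of the template in the unknown signal.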


Matched filters are commonly used in radar, in which a known signal is sent out, and the reflected signal is examined for common elements of the out-going signal. Pulse compression is an example of matched filtering. It is so called because the impulse response is matched to input pulse signals. Two-dimensional matched filters are commonly used in image processing, e.g., to improve the SNR of X-ray observations.

Matched filtering is a demodulation technique with LTI (linear time invariant) filters to maximize SNR. [3] It was originally also known as a North filter. [4]

Derivation

Derivation via matrix algebra

The following section derives the matched filter for a discrete-time system. The derivation for a continuous-time system is similar, with summations replaced with integrals.

The matched filter is the linear filter, $h$, that maximizes the output signal-to-noise ratio:

$$y[n] = \sum_{k=-\infty}^{\infty} h[n-k]\, x[k],$$

where $x[k]$ is the input as a function of the independent variable $k$, and $y[n]$ is the filtered output. Though we most often express filters as the impulse response of convolution systems, as above (see LTI system theory), it is easiest to think of the matched filter in the context of the inner product, which we will see shortly.

We can derive the linear filter that maximizes output signal-to-noise ratio by invoking a geometric argument. The intuition behind the matched filter relies on correlating the received signal (a vector) with a filter (another vector) that is parallel with the signal, maximizing the inner product. This enhances the signal. When we consider the additive stochastic noise, we have the additional challenge of minimizing the output due to noise by choosing a filter that is orthogonal to the noise.

Let us formally define the problem. We seek a filter, $h$, such that we maximize the output signal-to-noise ratio, where the output is the inner product of the filter and the observed signal $x$.

Our observed signal consists of the desirable signal $s$ and additive noise $v$:

$$x = s + v.$$

Let us define the auto-correlation matrix of the noise, reminding ourselves that this matrix has Hermitian symmetry, a property that will become useful in the derivation:

$$R_v = E\{v v^H\},$$

where $v^H$ denotes the conjugate transpose of $v$, and $E$ denotes expectation (note that in case the noise $v$ has zero mean, its auto-correlation matrix $R_v$ is equal to its covariance matrix).

Let us call our output, $y$, the inner product of our filter and the observed signal such that

$$y = \sum_k h^*[k]\, x[k] = h^H x = h^H s + h^H v = y_s + y_v.$$

We now define the signal-to-noise ratio, which is our objective function, to be the ratio of the power of the output due to the desired signal to the power of the output due to the noise:

$$\mathrm{SNR} = \frac{|y_s|^2}{E\{|y_v|^2\}}.$$

We rewrite the above:

$$\mathrm{SNR} = \frac{|h^H s|^2}{E\{|h^H v|^2\}}.$$

We wish to maximize this quantity by choosing $h$. Expanding the denominator of our objective function, we have

$$E\{|h^H v|^2\} = E\{(h^H v)(h^H v)^H\} = h^H E\{v v^H\}\, h = h^H R_v h.$$

Now, our $\mathrm{SNR}$ becomes

$$\mathrm{SNR} = \frac{|h^H s|^2}{h^H R_v h}.$$

We will rewrite this expression with some matrix manipulation. The reason for this seemingly counterproductive measure will become evident shortly. Exploiting the Hermitian symmetry of the auto-correlation matrix $R_v$, we can write

$$\mathrm{SNR} = \frac{\left|(R_v^{1/2} h)^H (R_v^{-1/2} s)\right|^2}{(R_v^{1/2} h)^H (R_v^{1/2} h)}.$$

We would like to find an upper bound on this expression. To do so, we first recognize a form of the Cauchy–Schwarz inequality:

$$|a^H b|^2 \le (a^H a)(b^H b),$$

which is to say that the square of the inner product of two vectors can only be as large as the product of the individual inner products of the vectors. This concept returns to the intuition behind the matched filter: this upper bound is achieved when the two vectors $a = R_v^{1/2} h$ and $b = R_v^{-1/2} s$ are parallel. We resume our derivation by expressing the upper bound on our $\mathrm{SNR}$ in light of the geometric inequality above:

$$\mathrm{SNR} = \frac{\left|(R_v^{1/2} h)^H (R_v^{-1/2} s)\right|^2}{(R_v^{1/2} h)^H (R_v^{1/2} h)} \le \frac{\left[(R_v^{1/2} h)^H (R_v^{1/2} h)\right]\left[(R_v^{-1/2} s)^H (R_v^{-1/2} s)\right]}{(R_v^{1/2} h)^H (R_v^{1/2} h)}.$$

Our valiant matrix manipulation has now paid off. We see that the expression for our upper bound can be greatly simplified:

$$\mathrm{SNR} = \frac{|h^H s|^2}{h^H R_v h} \le s^H R_v^{-1} s.$$

We can achieve this upper bound if we choose

$$h = \alpha R_v^{-1} s,$$

where $\alpha$ is an arbitrary real number. To verify this, we plug $h$ into our expression for the output $\mathrm{SNR}$:

$$\mathrm{SNR} = \frac{|h^H s|^2}{h^H R_v h} = \frac{\alpha^2 \left|s^H R_v^{-1} s\right|^2}{\alpha^2\, (s^H R_v^{-1})\, R_v\, (R_v^{-1} s)} = s^H R_v^{-1} s.$$

Thus, our optimal matched filter is

$$h = \alpha R_v^{-1} s.$$

We often choose to normalize the expected value of the power of the filter output due to the noise to unity. That is, we constrain

$$E\{|y_v|^2\} = h^H R_v h = 1.$$

This constraint implies a value of $\alpha$, for which we can solve:

$$h^H R_v h = \alpha^2\, s^H R_v^{-1} s = 1,$$

yielding

$$\alpha = \frac{1}{\sqrt{s^H R_v^{-1} s}},$$

giving us our normalized filter,

$$h = \frac{1}{\sqrt{s^H R_v^{-1} s}}\, R_v^{-1} s.$$

If we care to write the impulse response $h$ of the filter for the convolution system, it is simply the complex conjugate time reversal of the input $s$.

Though we have derived the matched filter in discrete time, we can extend the concept to continuous-time systems if we replace $R_v$ with the continuous-time autocorrelation function of the noise, assuming a continuous signal $s(t)$, continuous noise $v(t)$, and a continuous filter $h(t)$.
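The closed-form filter above is easy to verify numerically. The following sketch (an illustration; the positive-definite noise autocorrelation matrix and the real template are invented) builds $h = R_v^{-1}s/\sqrt{s^H R_v^{-1} s}$ and checks the unit-noise-power constraint and the SNR bound:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 8

s = rng.standard_normal(n)              # known (real) signal template
A = rng.standard_normal((n, n))
R = A @ A.T + n * np.eye(n)             # assumed noise autocorrelation R_v

def snr(h):
    """Output SNR |h^T s|^2 / (h^T R h) of a linear filter h."""
    return (h @ s) ** 2 / (h @ R @ h)

R_inv_s = np.linalg.solve(R, s)
bound = s @ R_inv_s                     # upper bound s^T R^{-1} s
h_opt = R_inv_s / np.sqrt(bound)        # normalized matched filter

assert np.isclose(h_opt @ R @ h_opt, 1.0)   # unit output noise power
assert np.isclose(snr(h_opt), bound)        # the bound is achieved
assert snr(s) <= bound + 1e-12              # naive h = s does no better
```

The last assertion illustrates that the choice $h = s$ (optimal only for white noise) cannot beat the matched filter when the noise is colored.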

Derivation via Lagrangian

Alternatively, we may solve for the matched filter by solving our maximization problem with a Lagrangian. Again, the matched filter endeavors to maximize the output signal-to-noise ratio ($\mathrm{SNR}$) of a filtered deterministic signal in stochastic additive noise. The observed sequence, again, is

$$x = s + v,$$

with the noise auto-correlation matrix,

$$R_v = E\{v v^H\}.$$

The signal-to-noise ratio is

$$\mathrm{SNR} = \frac{|y_s|^2}{E\{|y_v|^2\}},$$

where $y_s = h^H s$ and $y_v = h^H v$.

Evaluating the expression in the numerator, we have

$$|y_s|^2 = y_s^H y_s = h^H s\, s^H h,$$

and in the denominator,

$$E\{|y_v|^2\} = E\{y_v^H y_v\} = E\{h^H v\, v^H h\} = h^H R_v h.$$

The signal-to-noise ratio becomes

$$\mathrm{SNR} = \frac{h^H s\, s^H h}{h^H R_v h}.$$

If we now constrain the denominator to be 1, the problem of maximizing $\mathrm{SNR}$ is reduced to maximizing the numerator. We can then formulate the problem using a Lagrange multiplier:

$$\mathcal{L} = h^H s\, s^H h + \lambda \left(1 - h^H R_v h\right),$$

which we recognize as a generalized eigenvalue problem:

$$s\, s^H h = \lambda R_v h.$$

Since $s\, s^H$ is of unit rank, it has only one nonzero eigenvalue. It can be shown that this eigenvalue equals

$$\lambda_{\max} = s^H R_v^{-1} s,$$

yielding the following optimal matched filter:

$$h = \frac{1}{\sqrt{s^H R_v^{-1} s}}\, R_v^{-1} s.$$

This is the same result found in the previous subsection.
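The unit-rank eigenvalue claim can be checked directly: with $h = R_v^{-1}s$ and $\lambda = s^H R_v^{-1} s$, the pair should satisfy the generalized eigenvalue problem $s s^H h = \lambda R_v h$. A small numerical sketch (invented values):

```python
import numpy as np

rng = np.random.default_rng(2)
n = 6

s = rng.standard_normal(n)
A = rng.standard_normal((n, n))
R = A @ A.T + n * np.eye(n)     # positive-definite noise autocorrelation

h = np.linalg.solve(R, s)       # candidate eigenvector  h = R^{-1} s
lam = s @ h                     # claimed eigenvalue     s^T R^{-1} s

# Generalized eigenvalue problem: (s s^T) h = lambda * R h
assert np.allclose(np.outer(s, s) @ h, lam * (R @ h))
```

Any $h$ orthogonal to $s$ gives $(s s^H)h = 0$, confirming that all remaining generalized eigenvalues are zero.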

Interpretation as a least-squares estimator

Derivation

Matched filtering can also be interpreted as a least-squares estimator for the optimal location and scaling of a given model or template. Once again, let the observed sequence be defined as

$$x_k = s_k + v_k,$$

where $v_k$ is uncorrelated zero-mean noise. The signal $s_k$ is assumed to be a scaled and shifted version of a known model sequence $f_k$:

$$s_k = \mu_0\, f_{k - j_0}.$$

We want to find optimal estimates $j^*$ and $\mu^*$ for the unknown shift $j_0$ and scaling $\mu_0$ by minimizing the least-squares residual between the observed sequence $x_k$ and a "probing sequence" $h_{j-k}$:

$$j^*, \mu^* = \arg\min_{j,\mu} \sum_k \left(x_k - \mu\, h_{j-k}\right)^2.$$

The appropriate $h_{j-k}$ will later turn out to be the matched filter, but is as yet unspecified. Expanding $x_k$ and the square within the sum yields

$$\sum_k \left(x_k - \mu h_{j-k}\right)^2 = \left[\sum_k (s_k + v_k)^2\right] - 2\mu \sum_k s_k h_{j-k} + \mu^2 \sum_k h_{j-k}^2 - 2\mu \sum_k v_k h_{j-k}.$$

The first term in brackets is a constant (since the observed signal is given) and has no influence on the optimal solution. The last term has constant expected value because the noise is uncorrelated and has zero mean. We can therefore drop both terms from the optimization. After reversing the sign, we obtain the equivalent optimization problem

$$\max_{j,\mu} \left[\, 2\mu \sum_k s_k h_{j-k} - \mu^2 \sum_k h_{j-k}^2 \,\right].$$

Setting the derivative w.r.t. $\mu$ to zero gives an analytic solution for $\mu^*$:

$$\mu^* = \frac{\sum_k s_k h_{j-k}}{\sum_k h_{j-k}^2}.$$

Inserting this into our objective function yields a reduced maximization problem for just $j^*$:

$$j^* = \arg\max_j \frac{\left(\sum_k s_k h_{j-k}\right)^2}{\sum_k h_{j-k}^2}.$$

The numerator can be upper-bounded by means of the Cauchy–Schwarz inequality:

$$\frac{\left(\sum_k s_k h_{j-k}\right)^2}{\sum_k h_{j-k}^2} \le \sum_k s_k^2 = \mu_0^2 \sum_k f_{k - j_0}^2.$$

The optimization problem assumes its maximum when equality holds in this expression. According to the properties of the Cauchy–Schwarz inequality, this is only possible when

$$h_{j-k} = \kappa\, s_k = \kappa\, \mu_0\, f_{k - j_0}$$

for an arbitrary non-zero constant $\kappa$, and the optimal solution is obtained at $j^* = j_0$ as desired. Thus, our "probing sequence" must be proportional to the signal model $f_k$, and the convenient choice $\kappa = 1/\mu_0$ yields the matched filter

$$h_k = f_{-k}.$$

Note that the filter $h_k = f_{-k}$ is the mirrored signal model $f_k$. This ensures that the operation to be applied in order to find the optimum is indeed the convolution between the observed sequence $x_k$ and the matched filter $h_k$. The filtered sequence assumes its maximum at the position where the observed sequence $x_k$ best matches (in a least-squares sense) the signal model $f_k$.
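As a sketch of this estimator (the model sequence, true scaling, and true shift below are invented), a sliding correlation with the model recovers both the least-squares shift and the least-squares scaling:

```python
import numpy as np

rng = np.random.default_rng(3)

f = np.array([0.0, 1.0, 2.0, 1.0, 0.0])   # hypothetical model sequence
mu0, j0 = 2.5, 40                          # true (unknown) scaling and shift
x = 0.05 * rng.standard_normal(100)
x[j0:j0 + f.size] += mu0 * f               # observed sequence x = s + v

# Convolving with the mirrored model (the matched filter) is the same as
# correlating with the model itself.
corr = np.correlate(x, f, mode="valid")    # sum_k x[j + k] f[k] for each j

j_hat = int(np.argmax(corr))               # least-squares shift estimate
mu_hat = corr[j_hat] / np.sum(f ** 2)      # least-squares scaling estimate
```

The scaling estimate divides the peak correlation by the model's energy, exactly as in the analytic solution for the optimal scaling above.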

Implications

The matched filter may be derived in a variety of ways, [2] but as a special case of a least-squares procedure it may also be interpreted as a maximum likelihood method in the context of a (coloured) Gaussian noise model and the associated Whittle likelihood. [5] If the transmitted signal possessed no unknown parameters (like time-of-arrival, amplitude, ...), then the matched filter would, according to the Neyman–Pearson lemma, minimize the error probability. However, since the exact signal generally is determined by unknown parameters that effectively are estimated (or fitted) in the filtering process, the matched filter constitutes a generalized maximum likelihood (test) statistic. [6] The filtered time series may then be interpreted as (proportional to) the profile likelihood, the maximized conditional likelihood as a function of the time parameter. [7] This implies in particular that the error probability (in the sense of Neyman and Pearson, i.e., concerning maximization of the detection probability for a given false-alarm probability [8]) is not necessarily optimal. What is commonly referred to as the signal-to-noise ratio (SNR), which is supposed to be maximized by a matched filter, in this context corresponds to $\sqrt{2\log(\mathcal{L})}$, where $\mathcal{L}$ is the (conditionally) maximized likelihood ratio. [7] [nb 1]

The construction of the matched filter is based on a known noise spectrum. In reality, however, the noise spectrum is usually estimated from data and hence only known up to a limited precision. For the case of an uncertain spectrum, the matched filter may be generalized to a more robust iterative procedure with favourable properties also in non-Gaussian noise. [7]

Frequency-domain interpretation

When viewed in the frequency domain, it is evident that the matched filter applies the greatest weighting to spectral components exhibiting the greatest signal-to-noise ratio (i.e., large weight where noise is relatively low, and vice versa). In general this requires a non-flat frequency response, but the associated "distortion" is no cause for concern in situations such as radar and digital communications, where the original waveform is known and the objective is the detection of this signal against the background noise. On the technical side, the matched filter is a weighted least-squares method based on the (heteroscedastic) frequency-domain data (where the "weights" are determined via the noise spectrum, see also previous section), or equivalently, a least-squares method applied to the whitened data.
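A minimal numerical sketch of this weighting (the template and noise spectrum below are invented): for colored noise, the frequency-domain matched filter is proportional to the conjugate template spectrum divided by the noise power spectrum, so it places its largest weight on the high-SNR bins:

```python
import numpy as np

N = 128
k0 = 10                                    # template's frequency bin
n = np.arange(N)
s = np.cos(2 * np.pi * k0 * n / N)         # narrowband template

# Assumed (known) noise power, rising with |frequency|.
freqs = np.fft.fftfreq(N)
noise_psd = 0.1 + np.abs(freqs)

# Frequency-domain matched filter: conjugate template spectrum divided
# by the noise PSD -- large weight where the noise is relatively weak.
H = np.conj(np.fft.fft(s)) / noise_psd

# The weighting concentrates on the template's own (high-SNR) bin.
best_bin = int(np.argmax(np.abs(H[:N // 2])))
```

Dividing by the noise PSD is the frequency-domain counterpart of applying $R_v^{-1}$ in the time-domain derivation: it whitens the data before correlating.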

Examples

Radar and sonar

Matched filters are often used in signal detection. [1] As an example, suppose that we wish to judge the distance of an object by reflecting a signal off it. We may choose to transmit a pure-tone sinusoid at 1 Hz. We assume that our received signal is an attenuated and phase-shifted form of the transmitted signal with added noise.

To judge the distance of the object, we correlate the received signal with a matched filter, which, in the case of white (uncorrelated) noise, is another pure-tone 1-Hz sinusoid. When the output of the matched filter system exceeds a certain threshold, we conclude with high probability that the received signal has been reflected off the object. Using the speed of propagation and the time that we first observe the reflected signal, we can estimate the distance of the object. If we change the shape of the pulse in a specially designed way, the signal-to-noise ratio and the distance resolution can be improved even further after matched filtering: this is a technique known as pulse compression.

Additionally, matched filters can be used in parameter estimation problems (see estimation theory). To return to our previous example, we may desire to estimate the speed of the object, in addition to its position. To exploit the Doppler effect, we would like to estimate the frequency of the received signal. To do so, we may correlate the received signal with several matched filters of sinusoids at varying frequencies. The matched filter with the highest output will reveal, with high probability, the frequency of the reflected signal and help us determine the radial velocity of the object, i.e. the relative speed either directly towards or away from the observer. This method is, in fact, a simple version of the discrete Fourier transform (DFT). The DFT takes an $N$-valued complex input and correlates it with $N$ matched filters, corresponding to complex exponentials at $N$ different frequencies, to yield $N$ complex-valued numbers corresponding to the relative amplitudes and phases of the sinusoidal components (see Moving target indication).
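The filter-bank view of the DFT can be sketched as follows (a toy example; the frequency bin and noise level are invented):

```python
import numpy as np

rng = np.random.default_rng(4)

N = 64
n = np.arange(N)
k_true = 5                                 # unknown frequency bin (Doppler)
x = np.exp(2j * np.pi * k_true * n / N)
x = x + 0.3 * (rng.standard_normal(N) + 1j * rng.standard_normal(N))

# Bank of N matched filters: correlate the received signal with a complex
# exponential at each candidate frequency -- exactly the DFT.
bank = np.array([np.sum(x * np.exp(-2j * np.pi * k * n / N))
                 for k in range(N)])
assert np.allclose(bank, np.fft.fft(x))

k_hat = int(np.argmax(np.abs(bank)))       # strongest matched-filter output
```

The index of the strongest output estimates the received frequency, from which the radial velocity follows via the Doppler relation.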

Digital communications

The matched filter is also used in communications. In the context of a communication system that sends binary messages from the transmitter to the receiver across a noisy channel, a matched filter can be used to detect the transmitted pulses in the noisy received signal.

[Figure: matched filter total system.]

Imagine we want to send the sequence "0101100100" coded in polar non-return-to-zero (NRZ) through a certain channel.

Mathematically, a sequence in NRZ code can be described as a sequence of unit pulses or shifted rect functions, each pulse being weighted by +1 if the bit is "1" and by -1 if the bit is "0". Formally, the scaling factor for the $k^{\mathrm{th}}$ bit is

$$a_k = \begin{cases} +1, & \text{if bit } k \text{ is 1}, \\ -1, & \text{if bit } k \text{ is 0}. \end{cases}$$

We can represent our message, $M(t)$, as the sum of shifted unit pulses:

$$M(t) = \sum_{k=-\infty}^{\infty} a_k\, \Pi\!\left(\frac{t - kT}{T}\right),$$

where $T$ is the time length of one bit and $\Pi(\cdot)$ is the rectangular function.

Thus, the signal to be sent by the transmitter is $M(t)$.

[Figure: the original message $M(t)$.]

If we model our noisy channel as an AWGN channel, white Gaussian noise is added to the signal. At the receiver end, for a signal-to-noise ratio of 3 dB, this may look like:

[Figure: the received message after the noisy channel.]

A first glance will not reveal the original transmitted sequence. There is a high power of noise relative to the power of the desired signal (i.e., there is a low signal-to-noise ratio). If the receiver were to sample this signal at the correct moments, the resulting binary message could be incorrect.

To increase our signal-to-noise ratio, we pass the received signal through a matched filter. In this case, the filter should be matched to an NRZ pulse (equivalent to a "1" coded in NRZ code). Precisely, the impulse response of the ideal matched filter, assuming white (uncorrelated) noise, should be a time-reversed complex-conjugated scaled version of the signal that we are seeking. We choose

$$h(t) = \Pi\!\left(\frac{t}{T}\right).$$

In this case, due to symmetry, the time-reversed complex conjugate of $h(t)$ is in fact $h(t)$, allowing us to call $h(t)$ the impulse response of our matched filter convolution system.

After convolving with the correct matched filter, the resulting signal, $y(t)$, is

$$y(t) = x(t) * h(t),$$

where $*$ denotes convolution.

[Figure: the filtered message $y(t)$.]

The filtered signal can now be safely sampled by the receiver at the correct sampling instants and compared to an appropriate threshold, resulting in a correct interpretation of the binary message.

[Figure: the filtered message with the decision threshold.]
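The whole chain can be simulated in a few lines of NumPy (a sketch under the stated assumptions; the bit duration and noise seed are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(5)

bits = [0, 1, 0, 1, 1, 0, 0, 1, 0, 0]      # the message "0101100100"
T = 32                                      # samples per bit (assumed)

# Polar NRZ: bit 1 -> +1, bit 0 -> -1, held for one bit period.
levels = np.array([1.0 if b else -1.0 for b in bits])
sent = np.repeat(levels, T)

# AWGN channel at roughly 3 dB SNR (noise power = half the signal power).
received = sent + np.sqrt(0.5) * rng.standard_normal(sent.size)

# Matched filter for the rectangular NRZ pulse: its time reverse is itself.
h = np.ones(T)
filtered = np.convolve(received, h)

# Sample at the end of each bit period and threshold at zero.
samples = filtered[T - 1 :: T][: len(bits)]
decoded = [1 if y > 0 else 0 for y in samples]
assert decoded == bits                      # message recovered correctly
```

Integrating over each bit period averages away much of the noise, which is why sampling the matched-filter output at the bit boundaries recovers the message reliably at this SNR.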

Gravitational-wave astronomy

Matched filters play a central role in gravitational-wave astronomy. [9] The first observation of gravitational waves was based on large-scale filtering of each detector's output for signals resembling the expected shape, followed by subsequent screening for coincident and coherent triggers between both instruments. [10] False-alarm rates, and with that, the statistical significance of the detection were then assessed using resampling methods. [11] [12] Inference on the astrophysical source parameters was completed using Bayesian methods based on parameterized theoretical models for the signal waveform and (again) on the Whittle likelihood. [13] [14]

Biology

Animals living in relatively static environments would have relatively fixed features of the environment to perceive. This allows the evolution of filters that match the expected signal with the highest signal-to-noise ratio, the matched filter. [15] Sensors that perceive the world "through such a 'matched filter' severely limits the amount of information the brain can pick up from the outside world, but it frees the brain from the need to perform more intricate computations to extract the information finally needed for fulfilling a particular task." [16]

Notes

  1. The common reference to SNR has in fact been criticized as somewhat misleading: "The interesting feature of this approach is that theoretical perfection is attained without aiming consciously at a maximum signal/noise ratio. As the matter of quite incidental interest, it happens that the operation [...] does maximize the peak signal/noise ratio, but this fact plays no part whatsoever in the present theory. Signal/noise ratio is not a measure of information [...]." (Woodward, 1953; [1] Sec.5.1).


References

  1. Woodward, P. M. (1953). Probability and information theory with applications to radar. London: Pergamon Press.
  2. Turin, G. L. (1960). "An introduction to matched filters". IRE Transactions on Information Theory. 6 (3): 311–329. doi:10.1109/TIT.1960.1057571. S2CID 5128742.
  3. "Demodulation". OpenStax CNX. Retrieved 2017-04-18.
  4. After D.O. North who was among the first to introduce the concept: North, D. O. (1943). "An analysis of the factors which determine signal/noise discrimination in pulsed carrier systems". Report PPR-6C, RCA Laboratories, Princeton, NJ.
    Re-print: North, D. O. (1963). "An analysis of the factors which determine signal/noise discrimination in pulsed-carrier systems". Proceedings of the IEEE. 51 (7): 1016–1027. doi:10.1109/PROC.1963.2383.
    See also: Jaynes, E. T. (2003). "14.6.1 The classical matched filter". Probability theory: The logic of science. Cambridge: Cambridge University Press.
  5. Choudhuri, N.; Ghosal, S.; Roy, A. (2004). "Contiguity of the Whittle measure for a Gaussian time series". Biometrika. 91 (1): 211–218. doi:10.1093/biomet/91.1.211.
  6. Mood, A. M.; Graybill, F. A.; Boes, D. C. (1974). "IX. Tests of hypotheses". Introduction to the theory of statistics (3rd ed.). New York: McGraw-Hill.
  7. Röver, C. (2011). "Student-t based filter for robust signal detection". Physical Review D. 84 (12): 122004. arXiv:1109.0442. Bibcode:2011PhRvD..84l2004R. doi:10.1103/PhysRevD.84.122004.
  8. Neyman, J.; Pearson, E. S. (1933). "On the problem of the most efficient tests of statistical hypotheses". Philosophical Transactions of the Royal Society of London A. 231 (694–706): 289–337. Bibcode:1933RSPTA.231..289N. doi: 10.1098/rsta.1933.0009 .
  9. Schutz, B. F. (1999). "Gravitational wave astronomy". Classical and Quantum Gravity. 16 (12A): A131–A156. arXiv: gr-qc/9911034 . Bibcode:1999CQGra..16A.131S. doi:10.1088/0264-9381/16/12A/307. S2CID   19021009.
  10. "LIGO: How We Searched For Merging Black Holes And Found GW150914". A technique known as matched filtering is used to see if there are any signals contained within our data. The aim of matched filtering is to see if the data contains any signals similar to a template bank member. Since our templates should describe the gravitational waveforms for the range of different merging systems that we expect to be able to see, any sufficiently loud signal should be found by this method.
  11. Usman, Samantha A. (2016). "The PyCBC search for gravitational waves from compact binary coalescence". Class. Quantum Grav. 33 (21): 215004. arXiv: 1508.02357 . Bibcode:2016CQGra..33u5004U. doi:10.1088/0264-9381/33/21/215004. S2CID   53979477.
  12. Abbott, B. P.; et al. (The LIGO Scientific Collaboration, the Virgo Collaboration) (2016). "GW150914: First results from the search for binary black hole coalescence with Advanced LIGO". Physical Review D. 93 (12): 122003. arXiv: 1602.03839 . Bibcode:2016PhRvD..93l2003A. doi:10.1103/PhysRevD.93.122003. PMC   7430253 . PMID   32818163.
  13. Abbott, B. P.; et al. (The LIGO Scientific Collaboration, the Virgo Collaboration) (2016). "Properties of the binary black hole merger GW150914". Physical Review Letters. 116 (24): 241102. arXiv: 1602.03840 . Bibcode:2016PhRvL.116x1102A. doi:10.1103/PhysRevLett.116.241102. PMID   27367378. S2CID   217406416.
  14. Meyer, R.; Christensen, N. (2016). "Gravitational waves: A statistical autopsy of a black hole merger". Significance. 13 (2): 20–25. doi: 10.1111/j.1740-9713.2016.00896.x .
  15. Warrant, Eric J. (October 2016). "Sensory matched filters". Current Biology. 26 (20): R976–R980. doi: 10.1016/j.cub.2016.05.042 . ISSN   0960-9822. PMID   27780072.
  16. Wehner, Rüdiger (1987). "'Matched filters': neural models of the external world". Journal of Comparative Physiology A. 161 (4): 511–531. doi:10.1007/bf00603659. ISSN   0340-7594. S2CID   32779686.
