Signal averaging

Last updated November 29, 2021

Signal averaging is a signal processing technique applied in the time domain, intended to increase the strength of a signal relative to noise that is obscuring it. By averaging a set of replicate measurements, the signal-to-noise ratio (SNR) will be increased, ideally in proportion to the square root of the number of measurements.

Deriving the SNR for averaged signals

Assumed that

Signal $s(t)$ is uncorrelated to noise, and noise $z(t)$ is uncorrelated : $E[z(t)z(t-\tau )]=0=E[z(t)s(t-\tau )]\forall t,\tau$ .
Signal power $P_{signal}=E[s^{2}]$ is constant in the replicate measurements.
Noise is random, with a mean of zero and constant variance in the replicate measurements: $E[z]=0=\mu$ and $0<E[\left(z-\mu \right)^{2}]=E[z^{2}]=P_{noise}=\sigma ^{2}$ .
We (canonically) define Signal-to-Noise ratio as $SNR={\frac {P_{signal}}{P_{noise}}}={\frac {E[s^{2}]}{\sigma ^{2}}}$ .

Noise power for sampled signals

Assuming we sample the noise, we get a per-sample variance of

$\mathrm {Var} (z)=E[z^{2}]=\sigma ^{2}$ .

Averaging a random variable leads to the following variance:

$\mathrm {Var} \left({\frac {1}{n}}\sum _{i=1}^{n}z_{i}\right)={\frac {1}{n^{2}}}\mathrm {Var} \left(\sum _{i=1}^{n}z_{i}\right)={\frac {1}{n^{2}}}\sum _{i=1}^{n}\mathrm {Var} \left(z_{i}\right)$ .

Since noise variance is constant $\sigma ^{2}$ :

$\mathrm {Var} (N_{\text{avg}})=\mathrm {Var} \left({\frac {1}{n}}\sum _{i=1}^{n}z_{i}\right)={\frac {1}{n^{2}}}n\sigma ^{2}={\frac {1}{n}}\sigma ^{2}$ ,

demonstrating that averaging $n$ realizations of the same, uncorrelated noise reduces noise power by a factor of $n$ , and reduces noise level by a factor of ${\sqrt {n}}$ .

Signal power for sampled signals

Considering $n$ vectors $V_{i},\,i\in \{1,\ldots ,n\}$ of signal samples of length $T$ :

$V_{i}=\left[s_{i,1},\ldots ,s_{i,T}\right],\quad s_{i,k}\in \mathbb {K} ^{T}$ ,

the power $P_{i}$ of such a vector simply is

$P_{i}=\sum _{k=1}^{T}{s_{i,k}^{2}}=\left|V_{i}\right|^{2}$ .

Again, averaging the $n$ vectors $V_{i},\,i=1,\ldots ,n$ , yields the following averaged vector

$V_{\text{avg}}={\frac {1}{n}}\sum _{k=1}^{T}\sum _{i=1}^{n}s_{i,k}={\frac {1}{n}}\sum _{i=1}^{n}\sum _{k=1}^{T}s_{i,k}$ .

In the case where $V_{n}\equiv V_{m}\forall m,n\in \{1,\ldots ,n\}$ , we see that $V_{\text{avg}}$ reaches a maximum of

$V_{\text{avg, identical signals}}=P_{i}$ .

In this case, the ratio of signal to noise also reaches a maximum,

${\text{SNR}}_{\text{avg, identical signals}}={\frac {V_{\text{avg, identical signals}}}{N_{\text{avg}}}}=n{\text{SNR}}$ .

This is the oversampling case, where the observed signal is correlated (because oversampling implies that the signal observations are strongly correlated).

Time-locked signals

Averaging is applied to enhance a time-locked signal component in noisy measurements; time-locking implies that the signal is observation-periodic, so we end up in the maximum case above.

Averaging odd and even trials

A specific way of obtaining replicates is to average all the odd and even trials in separate buffers. This has the advantage of allowing for comparison of even and odd results from interleaved trials. An average of odd and even averages generates the completed averaged result, while the difference between the odd and even averages, divided by two, constitutes an estimate of the noise.

Algorithmic implementation

The following is a MATLAB simulation of the averaging process:

N=1000;% signal lengtheven=zeros(N,1);% even bufferodd=even;% odd bufferactual_noise=even;% keep track of noise levelx=sin(linspace(0,4*pi,N))';% tracked signalforii=1:256% number of replicatesn=randn(N,1);% random noiseactual_noise=actual_noise+n;if(mod(ii,2))even=even+n+x;elseodd=odd+n+x;endendeven_avg=even/(ii/2);% even buffer average odd_avg=odd/(ii/2);% odd buffer averageact_avg=actual_noise/ii;% actual noise leveldb(rms(act_avg))db(rms((even_avg-odd_avg)/2))plot((odd_avg+even_avg));holdon;plot((even_avg-odd_avg)/2)

The averaging process above, and in general, results in an estimate of the signal. When compared with the raw trace, the averaged noise component is reduced with every averaged trial. When averaging real signals, the underlying component may not always be as clear, resulting in repeated averages in a search for consistent components in two or three replicates. It is unlikely that two or more consistent results will be produced by chance alone.

Correlated noise

Signal averaging typically relies heavily on the assumption that the noise component of a signal is random, having zero mean, and being unrelated to the signal. However, there are instances in which the noise is not uncorrelated. A common example of correlated noise is quantization noise (e.g. the noise created when converting from an analog to a digital signal).

Related Research Articles

In probability theory and statistics, variance is the expectation of the squared deviation of a random variable from its population mean or sample mean. Variance is a measure of dispersion, meaning it is a measure of how far a set of numbers is spread out from their average value. Variance has a central role in statistics, where some ideas that use it include descriptive statistics, statistical inference, hypothesis testing, goodness of fit, and Monte Carlo sampling. Variance is an important tool in the sciences, where statistical analysis of data is common. The variance is the square of the standard deviation, the second central moment of a distribution, and the covariance of the random variable with itself, and it is often represented by $,,,, or .$

The weighted arithmetic mean is similar to an ordinary arithmetic mean, except that instead of each of the data points contributing equally to the final average, some data points contribute more than others. The notion of weighted mean plays a role in descriptive statistics and also occurs in a more general form in several other areas of mathematics.

In probability theory, the central limit theorem (CLT) establishes that, in many situations, when independent random variables are summed up, their properly normalized sum tends toward a normal distribution even if the original variables themselves are not normally distributed. The theorem is a key concept in probability theory because it implies that probabilistic and statistical methods that work for normal distributions can be applicable to many problems involving other types of distributions. This theorem has seen many changes during the formal development of probability theory. Previous versions of the theorem date back to 1811, but in its modern general form, this fundamental result in probability theory was precisely stated as late as 1920, thereby serving as a bridge between classical and modern probability theory.

Signal-to-noise ratio is a measure used in science and engineering that compares the level of a desired signal to the level of background noise. SNR is defined as the ratio of signal power to the noise power, often expressed in decibels. A ratio higher than 1:1 indicates more signal than noise.

Multivariate normal distribution Generalization of the one-dimensional normal distribution to higher dimensions

In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional (univariate) normal distribution to higher dimensions. One definition is that a random vector is said to be k-variate normally distributed if every linear combination of its k components has a univariate normal distribution. Its importance derives mainly from the multivariate central limit theorem. The multivariate normal distribution is often used to describe, at least approximately, any set of (possibly) correlated real-valued random variables each of which clusters around a mean value.

In probability theory and statistics, covariance is a measure of the joint variability of two random variables. If the greater values of one variable mainly correspond with the greater values of the other variable, and the same holds for the lesser values, the covariance is positive. In the opposite case, when the greater values of one variable mainly correspond to the lesser values of the other,, the covariance is negative. The sign of the covariance therefore shows the tendency in the linear relationship between the variables. The magnitude of the covariance is not easy to interpret because it is not normalized and hence depends on the magnitudes of the variables. The normalized version of the covariance, the correlation coefficient, however, shows by its magnitude the strength of the linear relation.

In probability theory and statistics, the Rayleigh distribution is a continuous probability distribution for nonnegative-valued random variables. Up to rescaling, it coincides with the chi distribution with two degrees of freedom.

In statistics, propagation of uncertainty is the effect of variables' uncertainties on the uncertainty of a function based on them. When the variables are the values of experimental measurements they have uncertainties due to measurement limitations which propagate due to the combination of variables in the function.

In probability theory and statistics, the coefficient of variation (CV), also known as relative standard deviation (RSD), is a standardized measure of dispersion of a probability distribution or frequency distribution. It is often expressed as a percentage, and is defined as the ratio of the standard deviation $to the mean . The CV or RSD is widely used in analytical chemistry to express the precision and repeatability of an assay. It is also commonly used in fields such as engineering or physics when doing quality assurance studies and ANOVA gauge R&R. In addition, CV is utilized by economists and investors in economic models.$

In signal processing, a matched filter is obtained by correlating a known delayed signal, or template, with an unknown signal to detect the presence of the template in the unknown signal. This is equivalent to convolving the unknown signal with a conjugated time-reversed version of the template. The matched filter is the optimal linear filter for maximizing the signal-to-noise ratio (SNR) in the presence of additive stochastic noise.

In signal processing, oversampling is the process of sampling a signal at a sampling frequency significantly higher than the Nyquist rate. Theoretically, a bandwidth-limited signal can be perfectly reconstructed if sampled at the Nyquist rate or above it. The Nyquist rate is defined as twice the bandwidth of the signal. Oversampling is capable of improving resolution and signal-to-noise ratio, and can be helpful in avoiding aliasing and phase distortion by relaxing anti-aliasing filter performance requirements.

In statistics, econometrics and signal processing, an autoregressive (AR) model is a representation of a type of random process; as such, it is used to describe certain time-varying processes in nature, economics, etc. The autoregressive model specifies that the output variable depends linearly on its own previous values and on a stochastic term ; thus the model is in the form of a stochastic difference equation. Together with the moving-average (MA) model, it is a special case and key component of the more general autoregressive–moving-average (ARMA) and autoregressive integrated moving average (ARIMA) models of time series, which have a more complicated stochastic structure; it is also a special case of the vector autoregressive model (VAR), which consists of a system of more than one interlocking stochastic difference equation in more than one evolving random variable.

Estimation theory is a branch of statistics that deals with estimating the values of parameters based on measured empirical data that has a random component. The parameters describe an underlying physical setting in such a way that their value affects the distribution of the measured data. An estimator attempts to approximate the unknown parameters using the measurements. In estimation theory, two approaches are generally considered:

In statistics, ordinary least squares (OLS) is a type of linear least squares method for estimating the unknown parameters in a linear regression model. OLS chooses the parameters of a linear function of a set of explanatory variables by the principle of least squares: minimizing the sum of the squares of the differences between the observed dependent variable in the given dataset and those predicted by the linear function of the independent variable.

In probability theory, the Rice distribution or Rician distribution is the probability distribution of the magnitude of a circularly-symmetric bivariate normal random variable, possibly with non-zero mean (noncentral). It was named after Stephen O. Rice (1907–1986).

A cyclostationary process is a signal having statistical properties that vary cyclically with time. A cyclostationary process can be viewed as multiple interleaved stationary processes. For example, the maximum daily temperature in New York City can be modeled as a cyclostationary process: the maximum temperature on July 21 is statistically different from the temperature on December 20; however, it is a reasonable approximation that the temperature on December 20 of different years has identical statistics. Thus, we can view the random process composed of daily maximum temperatures as 365 interleaved stationary processes, each of which takes on a new value once per year.

Signal-to-noise ratio (SNR) is used in imaging to characterize image quality. The sensitivity of a imaging system is typically described in the terms of the signal level that yields a threshold level of SNR.

In probability theory and statistics, the generalized chi-squared distribution is the distribution of a quadratic form of a multinormal variable, or a linear combination of different normal variables and squares of normal variables. Equivalently, it is also a linear sum of independent noncentral chi-square variables and a normal variable. There are several other such generalizations for which the same term is sometimes used; some of them are special cases of the family discussed here, for example the gamma distribution.

In statistics, inverse-variance weighting is a method of aggregating two or more random variables to minimize the variance of the weighted average. Each random variable is weighted in inverse proportion to its variance, i.e. proportional to its precision.

In physics, and especially scattering theory, the momentum-transfer cross section is an effective scattering cross section useful for describing the average momentum transferred from a particle when it collides with a target. Essentially, it contains all the information about a scattering process necessary for calculating average momentum transfers but ignores other details about the scattering angle.

References

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

v t e Digital signal processing
Theory	Detection theory Discrete signal Estimation theory Nyquist–Shannon sampling theorem
Sub-fields	Audio signal processing Digital image processing Speech processing Statistical signal processing
Techniques	Z-transform Advanced z-transform Matched Z-transform method Bilinear transform Constant-Q transform Discrete cosine transform (DCT) Discrete Fourier transform (DFT) Discrete-time Fourier transform (DTFT) Impulse invariance Integral transform Laplace transform Post's inversion formula Starred transform Zak transform
Sampling	Aliasing Anti-aliasing filter Downsampling Nyquist rate / frequency Oversampling Quantization Sampling rate Undersampling Upsampling