Space-time adaptive processing

Last updated
Doppler-Bearing response of a 2-dimensional beam-former Space-Time Beamformer Response.jpg
Doppler-Bearing response of a 2-dimensional beam-former

Space-time adaptive processing (STAP) is a signal processing technique most commonly used in radar systems. It involves adaptive array processing algorithms to aid in target detection. Radar signal processing benefits from STAP in areas where interference is a problem (i.e. ground clutter, jamming, etc.). Through careful application of STAP, it is possible to achieve order-of-magnitude sensitivity improvements in target detection.

Contents

STAP involves a two-dimensional filtering technique using a phased-array antenna with multiple spatial channels. Coupling multiple spatial channels with pulse-Doppler waveforms lends to the name "space-time." Applying the statistics of the interference environment, an adaptive STAP weight vector is formed. This weight vector is applied to the coherent samples received by the radar.

History

The theory of STAP was first published by Lawrence E. Brennan and Irving S. Reed in the early 1970s. At the time of publication, both Brennan and Reed were at Technology Service Corporation (TSC). While it was formally introduced in 1973, [1] it has theoretical roots dating back to 1959. [2]

Motivation and applications

For ground-based radar, cluttered returns tend to be at DC, making them easily discriminated by Moving Target Indication (MTI). [3] Thus, a notch filter at the zero-Doppler bin can be used. [2] Airborne platforms with ownship motion experience relative ground clutter motion dependent on the angle, resulting in angle-Doppler coupling at the input. [2] In this case, 1D filtering is not sufficient, since clutter can overlap the desired target's Doppler from multiple directions. [2] The resulting interference is typically called a "clutter ridge," since it forms a line in the angle-Doppler domain. [2] Narrowband jamming signals are also a source of interference, and exhibit significant spatial correlation. [4] Thus receiver noise and interference must be considered, and detection processors must attempt to maximize the signal-to-interference and noise ratio (SINR).

While primarily developed for radar, STAP techniques have applications for communications systems. [5]

Basic theory

Top-level diagram for the STAP 2-D adaptive filter STAP Block Diagram.jpg
Top-level diagram for the STAP 2-D adaptive filter

STAP is essentially filtering in the space-time domain. [2] This means that we are filtering over multiple dimensions, and multi-dimensional signal processing techniques must be employed. [6] The goal is to find the optimal space-time weights in -dimensional space, where is the number of antenna elements (our spatial degrees of freedom) and is the number of pulse-repetition interval (PRI) taps (our time degrees of freedom), to maximize the signal-to-interference and noise ratio (SINR). [2] Thus, the goal is to suppress noise, clutter, jammers, etc., while keeping the desired radar return. It can be thought of as a 2-D finite-impulse response (FIR) filter, with a standard 1-D FIR filter for each channel (steered spatial channels from an electronically steered array or individual elements), and the taps of these 1-D FIR filters corresponding to multiple returns (spaced at PRI time). [1] Having degrees of freedom in both the spatial domain and time domain is crucial, as clutter can be correlated in time and space, while jammers tend to be correlated spatially (along a specific bearing). [1]

A simple, trivial example of STAP is shown in the first figure, for . This is an idealized example of a steering pattern, where the response of the array has been steered to the ideal target response, . [2] Unfortunately, in practice, this is oversimplified, as the interference to be overcome by steering the nulls shown is not deterministic, but statistical in nature. [2] This is what requires STAP to be an adaptive technique. Note that even in this idealized example, in general, we must steer over the 2-D angle-Doppler plane at discrete points to detect potential targets (moving the location of the 2-D sinc main lobe shown in the figure), and do so for each of the range bins in our system.

The basic functional diagram is shown to the right. For each antenna, a down conversion and analog-to-digital conversion step is typically completed. Then, a 1-D FIR filter with PRI length delay elements is used for each steered antenna channel. The lexicographically ordered weights to are the degrees of freedom to be solved in the STAP problem. That is, STAP aims to find the optimal weights for the antenna array. It can be shown, that for a given interference covariance matrix, , the optimal weights maximizing the SINR are calculated as

where is a scalar that does not affect the SINR. [2] The optimal detector input is given by:

where is a space-time snap-shot of the input data. The main difficulty of STAP is solving for and inverting the typically unknown interference covariance matrix, . [1] Other difficulties arise when the interference covariance matrix is ill-conditioned, making the inversion numerically unstable. [5] In general, this adaptive filtering must be performed for each of the unambiguous range bins in the system, for each target of interest (angle-Doppler coordinates), making for a massive computational burden. [4] Steering losses can occur when true target returns do not fall exactly on one of the points in our 2-D angle-Doppler plane that we've sampled with our steering vector . [1]

Approaches

The various approaches can be broken down by processing taxonomy, [7] or by simplifying the data space / data sources. [2]

Direct methods

The optimum solution is using all degrees of freedom by processing the adaptive filter on the antenna elements. For adaptive direct methods, Sample Matrix Inversion (SMI) uses the estimated (sample) interference covariance matrix in place of the actual interference covariance matrix. [8] This is because the actual interference covariance matrix is not known in practice. [1] If it is known by some means, then it need not be estimated, and the optimal weights are fixed. This is sometimes called the data-independent variation. The data-dependent variation estimates the interference covariance matrix from the data. In MIMO communications systems, this can be done via a training sequence. [5] The clairvoyant detector is given when the covariance matrix is known perfectly and defined as:

where is the space-time snapshot statistic for the range cell under the interference only hypothesis, . [1] For SMI, the interference covariance matrix for the range cell consisting of the statistics from interfering noise, clutter, and jammers is estimated as follows: [4]

where is the training data obtained from the input processor for the range cell. Therefore, space-time snapshots surrounding the desired range cell are averaged. Note that the desired range cell space-time snapshot is typically excluded (as well as a number of additional cells, or "guard cells") to prevent whitening of the statistics. [1]

The main problem with direct methods is the great computational complexity associated with the estimation and inversion of matrices formed from many degrees of freedom (large number of elements and or pulses). [1] In addition, for methods where must be estimated using data samples, the number of samples required to achieve a particular error is heavily dependent on the dimensionality of the interference covariance matrix. [4] As a result, for high dimensional systems, this may require an unachievable number of unambiguous range cells. [1] Further, these adjacent data cells must contain stationary statistics as a function of range which is rarely a good assumption for the large number of cells required ( for 3 dB SINR degradation from optimal, clairvoyant STAP). [2] [1]

Reduced rank methods

Reduced rank methods aim to overcome the computational burdens of the direct method by reducing the dimensionality of the data or the rank of the interference covariance matrix. [2] This can be accomplished by forming beams and performing STAP on the beamspace. [7] Both pre and post Doppler methods can be used in the beamspace. Post Doppler methods may also be used on the full antenna element input as well to reduce the data in this dimension only. A popular example is displaced phase center antenna (DPCA), which is a form of data-independent STAP in the beamspace, pre-Doppler. [7] The goal is to perform beamforming such that the beam appears stationary as the airborne radar is in motion over discrete time periods so the clutter appears without Doppler. [2] However, phase errors can cause significant degradation since the algorithm is not adaptive to the returned data. [2] Many other methods may be used to reduce the rank of the interference covariance matrix, and so all methods in the reduced rank category can be thought of as simplifying the covariance matrix to be inverted:

Post-Doppler methods decompose the STAP problem from an adaptive filtering problem to individual adaptive filters of length (an adaptive filter problem). [2] By performing fixed Doppler processing, the adaptive filters become spatial only. [2] Since the target response is already steered to a specified angle-Doppler location, the dimensionality can be reduced by pre-processing multiple Doppler bins and angles surrounding this point. [4] In addition to reducing the dimensionality of the adaptive processor, this in turn reduces the number of required training data frames when estimating the interference covariance matrix since this quantity is dimension dependent. [2]

Since these methods reduce the data dimensionality, they are inherently sub-optimal. [1] There are a number of techniques to compare the performance of reduced-rank methods and estimated direct methods to clairvoyant STAP (direct with perfect knowledge of interference covariance matrix and target steering vector), mostly based around SINR loss. [1] One such example is

where we've taken the ratio of the SINR evaluated with the sub-optimal weights and the SINR evaluated with the optimal weights . [1] Note in general this quantity is statistical and the expectation must be taken to find the average SINR loss. The clairvoyant SINR loss may also be calculated by taking the ratio of the optimal SINR to the system SNR, indicating the loss due to interference. [1]

Model based methods

There are also model based methods that attempt to force or exploit the structure of the covariance interference matrix. The more generally applicable of these methods is the covariance taper matrix structure. [2] The goal is to compactly model the interference, at which point it can then be processed using principal component techniques or diagonal-loading SMI (where a small magnitude, random diagonal matrix is added to attempt to stabilize the matrix prior to inverting). [2] This modeling has an added benefit of decorrelating interference subspace leakage (ISL), and is resistant to internal clutter motion (ICM). [2] The principal component method firsts applies principal component analysis to estimate the dominant eigenvalues and eigenvectors, and then applies a covariance taper and adds an estimated noise floor:

where is the eigenvalue estimated using PCA, is the associated eigenvector estimated using PCA, implies element-by-element multiplication of matrices and , is the estimated covariance matrix taper, and is the estimated noise floor. [2] The estimation of the covariance taper can be complicated, depending on the complexity of the underlying model attempting to emulate the interference environment. The reader is encouraged to see [2] for more information on this particular subject. Once this taper is sufficiently modeled, it may also be applied to the more simple SMI adaptation of CMT as follows:

where is the typical SMI estimated matrix seen in the approximate direct method, is the diagonal loading factor, and is the identity matrix of the appropriate size. It should be seen that this is meant to improve the standard SMI method where SMI uses a smaller number of range bins in its average than the standard SMI technique. Since fewer samples are used in the training data, the matrix often requires stabilization in the form of diagonal loading. [2]

More restrictive examples involve modeling the interference to force Toeplitz structures, and can greatly reduce the computational complexity associated with the processing by exploiting this structure. [2] However, these methods can suffer due to model-mismatch, or the computational savings may be undone by the problem of model fitting (such as the nonlinear problem of fitting to a Toeplitz or block-Toeplitz matrix) and order estimation. [2]

Modern applications

Despite nearly 40 years of existence, STAP has modern applications.

MIMO communications

For dispersive channels, multiple-input multiple-output communications can formulate STAP solutions. Frequency-selective channel compensation can be used to extend traditional equalization techniques for SISO systems using STAP. [5] To estimate the transmitted signal at a MIMO receiver, we can linearly weight our space-time input with weighting matrix as follows

to minimize the mean squared error (MSE). [5] Using STAP with a training sequence , the estimated optimal weighting matrix (STAP coefficients) is given by: [5]

MIMO radar

STAP has been extended for MIMO radar to improve spatial resolution for clutter, using modified SIMO radar STAP techniques. [9] New algorithms and formulations are required that depart from the standard technique due to the large rank of the jammer-clutter subspace created by MIMO radar virtual arrays, [9] which typically involves exploiting the block diagonal structure of the MIMO interference covariance matrix to break the large matrix inversion problem into smaller ones. In comparison with SIMO radar systems, which will have transmit degrees of freedom, and receive degrees of freedom, for a total of , MIMO radar systems have degrees of freedom, allowing for much greater adaptive spatial resolution for clutter mitigation. [9]

See also

Related Research Articles

<span class="mw-page-title-main">Principal component analysis</span> Method of data analysis

Principal component analysis (PCA) is a linear dimensionality reduction technique with applications in exploratory data analysis, visualization and data preprocessing.

<span class="mw-page-title-main">Kalman filter</span> Algorithm that estimates unknowns from a series of measurements over time

For statistics and control theory, Kalman filtering, also known as linear quadratic estimation (LQE), is an algorithm that uses a series of measurements observed over time, including statistical noise and other inaccuracies, and produces estimates of unknown variables that tend to be more accurate than those based on a single measurement alone, by estimating a joint probability distribution over the variables for each timeframe. The filter is named after Rudolf E. Kálmán, who was one of the primary developers of its theory.

<span class="mw-page-title-main">Synthetic-aperture radar</span> Form of radar used to create images of landscapes

Synthetic-aperture radar (SAR) is a form of radar that is used to create two-dimensional images or three-dimensional reconstructions of objects, such as landscapes. SAR uses the motion of the radar antenna over a target region to provide finer spatial resolution than conventional stationary beam-scanning radars. SAR is typically mounted on a moving platform, such as an aircraft or spacecraft, and has its origins in an advanced form of side looking airborne radar (SLAR). The distance the SAR device travels over a target during the period when the target scene is illuminated creates the large synthetic antenna aperture. Typically, the larger the aperture, the higher the image resolution will be, regardless of whether the aperture is physical or synthetic – this allows SAR to create high-resolution images with comparatively small physical antennas. For a fixed antenna size and orientation, objects which are further away remain illuminated longer – therefore SAR has the property of creating larger synthetic apertures for more distant objects, which results in a consistent spatial resolution over a range of viewing distances.

<span class="mw-page-title-main">Adaptive equalizer</span>

An adaptive equalizer is an equalizer that automatically adapts to time-varying properties of the communication channel. It is frequently used with coherent modulations such as phase-shift keying, mitigating the effects of multipath propagation and Doppler spreading.

<span class="mw-page-title-main">Array processing</span>

Array processing is a wide area of research in the field of signal processing that extends from the simplest form of 1 dimensional line arrays to 2 and 3 dimensional array geometries. Array structure can be defined as a set of sensors that are spatially separated, e.g. radio antenna and seismic arrays. The sensors used for a specific problem may vary widely, for example microphones, accelerometers and telescopes. However, many similarities exist, the most fundamental of which may be an assumption of wave propagation. Wave propagation means there is a systemic relationship between the signal received on spatially separated sensors. By creating a physical model of the wave propagation, or in machine learning applications a training data set, the relationships between the signals received on spatially separated sensors can be leveraged for many applications.

Beamforming or spatial filtering is a signal processing technique used in sensor arrays for directional signal transmission or reception. This is achieved by combining elements in an antenna array in such a way that signals at particular angles experience constructive interference while others experience destructive interference. Beamforming can be used at both the transmitting and receiving ends in order to achieve spatial selectivity. The improvement compared with omnidirectional reception/transmission is known as the directivity of the array.

Least mean squares (LMS) algorithms are a class of adaptive filter used to mimic a desired filter by finding the filter coefficients that relate to producing the least mean square of the error signal. It is a stochastic gradient descent method in that the filter is only adapted based on the error at the current time. It was invented in 1960 by Stanford University professor Bernard Widrow and his first Ph.D. student, Ted Hoff, based on their research in single-layer neural networks (ADALINE). Specifically, they used gradient descent to train ADALINE to recognize patterns, and called the algorithm "delta rule". They then applied the rule to filters, resulting in the LMS algorithm.

Recursive least squares (RLS) is an adaptive filter algorithm that recursively finds the coefficients that minimize a weighted linear least squares cost function relating to the input signals. This approach is in contrast to other algorithms such as the least mean squares (LMS) that aim to reduce the mean square error. In the derivation of the RLS, the input signals are considered deterministic, while for the LMS and similar algorithms they are considered stochastic. Compared to most of its competitors, the RLS exhibits extremely fast convergence. However, this benefit comes at the cost of high computational complexity.

An adaptive beamformer is a system that performs adaptive spatial signal processing with an array of transmitters or receivers. The signals are combined in a manner which increases the signal strength to/from a chosen direction. Signals to/from other directions are combined in a benign or destructive manner, resulting in degradation of the signal to/from the undesired direction. This technique is used in both radio frequency and acoustic arrays, and provides for directional sensitivity without physically moving an array of receivers or transmitters.

In machine learning, kernel machines are a class of algorithms for pattern analysis, whose best known member is the support-vector machine (SVM). These methods involve using linear classifiers to solve nonlinear problems. The general task of pattern analysis is to find and study general types of relations in datasets. For many algorithms that solve these tasks, the data in raw representation have to be explicitly transformed into feature vector representations via a user-specified feature map: in contrast, kernel methods require only a user-specified kernel, i.e., a similarity function over all pairs of data points computed using inner products. The feature map in kernel machines is infinite dimensional but only requires a finite dimensional matrix from user-input according to the Representer theorem. Kernel machines are slow to compute for datasets larger than a couple of thousand examples without parallel processing.

In wireless communications, channel state information (CSI) is the known channel properties of a communication link. This information describes how a signal propagates from the transmitter to the receiver and represents the combined effect of, for example, scattering, fading, and power decay with distance. The method is called channel estimation. The CSI makes it possible to adapt transmissions to current channel conditions, which is crucial for achieving reliable communication with high data rates in multiantenna systems.

In the field of wireless communication, macrodiversity is a kind of space diversity scheme using several receiver or transmitter antennas for transferring the same signal. The distance between the transmitters is much longer than the wavelength, as opposed to microdiversity where the distance is in the order of or shorter than the wavelength.

<span class="mw-page-title-main">Spatial multiplexing</span>

Spatial multiplexing or space-division multiplexing is a multiplexing technique in MIMO wireless communication, fibre-optic communication and other communications technologies used to transmit independent channels separated in space.

The ensemble Kalman filter (EnKF) is a recursive filter suitable for problems with a large number of variables, such as discretizations of partial differential equations in geophysical models. The EnKF originated as a version of the Kalman filter for large problems, and it is now an important data assimilation component of ensemble forecasting. EnKF is related to the particle filter but the EnKF makes the assumption that all probability distributions involved are Gaussian; when it is applicable, it is much more efficient than the particle filter.

Precoding is a generalization of beamforming to support multi-stream transmission in multi-antenna wireless communications. In conventional single-stream beamforming, the same signal is emitted from each of the transmit antennas with appropriate weighting such that the signal power is maximized at the receiver output. When the receiver has multiple antennas, single-stream beamforming cannot simultaneously maximize the signal level at all of the receive antennas. In order to maximize the throughput in multiple receive antenna systems, multi-stream transmission is generally required.

<span class="mw-page-title-main">MIMO</span> Use of multiple antennas in radio

In radio, multiple-input and multiple-output (MIMO) is a method for multiplying the capacity of a radio link using multiple transmission and receiving antennas to exploit multipath propagation. MIMO has become an essential element of wireless communication standards including IEEE 802.11n, IEEE 802.11ac, HSPA+ (3G), WiMAX, and Long Term Evolution (LTE). More recently, MIMO has been applied to power-line communication for three-wire installations as part of the ITU G.hn standard and of the HomePlug AV2 specification.

<span class="mw-page-title-main">Singular spectrum analysis</span> Nonparametric spectral estimation method

In time series analysis, singular spectrum analysis (SSA) is a nonparametric spectral estimation method. It combines elements of classical time series analysis, multivariate statistics, multivariate geometry, dynamical systems and signal processing. Its roots lie in the classical Karhunen (1946)–Loève spectral decomposition of time series and random fields and in the Mañé (1981)–Takens (1981) embedding theorem. SSA can be an aid in the decomposition of time series into a sum of components, each having a meaningful interpretation. The name "singular spectrum analysis" relates to the spectrum of eigenvalues in a singular value decomposition of a covariance matrix, and not directly to a frequency domain decomposition.

In estimation theory, the extended Kalman filter (EKF) is the nonlinear version of the Kalman filter which linearizes about an estimate of the current mean and covariance. In the case of well defined transition models, the EKF has been considered the de facto standard in the theory of nonlinear state estimation, navigation systems and GPS.

In applied statistics and geostatistics, regression-kriging (RK) is a spatial prediction technique that combines a regression of the dependent variable on auxiliary variables with interpolation (kriging) of the regression residuals. It is mathematically equivalent to the interpolation method variously called universal kriging and kriging with external drift, where auxiliary predictors are used directly to solve the kriging weights.

SAMV is a parameter-free superresolution algorithm for the linear inverse problem in spectral estimation, direction-of-arrival (DOA) estimation and tomographic reconstruction with applications in signal processing, medical imaging and remote sensing. The name was coined in 2013 to emphasize its basis on the asymptotically minimum variance (AMV) criterion. It is a powerful tool for the recovery of both the amplitude and frequency characteristics of multiple highly correlated sources in challenging environments. Applications include synthetic-aperture radar, computed tomography scan, and magnetic resonance imaging (MRI).

References

  1. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Melvin, W.L., A STAP Overview, IEEE Aerospace and Electronic Systems Magazine – Special Tutorials Issue, Vol. 19, No. 1, January 2004, pp. 19–35.
  2. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 Guerci, J.R., Space-Time Adaptive Processing for Radar, Artech House Publishers, 2003. ISBN   1-58053-377-9.
  3. Richards, M.A., Scheer, J.A., and Holm, W.A., Principles of Modern Radar, SciTech Publishing, 2010. ISBN   1-89112-152-9.
  4. 1 2 3 4 5 Richards, M.A., Fundamentals of Radar Signal Processing, McGraw-Hill Education, 2014. ISBN   0-07179-832-3.
  5. 1 2 3 4 5 6 Bliss, D.W. and Govindasamy, S., Adaptive Wireless Communications: MIMO Channels and Networks, Cambridge University Press, 2013. ISBN   1-10703-320-9.
  6. Dudgeon, D.E. and Mersereau, R.M., Multidimensional Digital Signal Processing, Prentice-Hall Signal Processing Series, 1984. ISBN   0-13604-959-1.
  7. 1 2 3 Ward, J., Space-Time Adaptive Processing for Airborne Radar, IEE Colloquium on Space-Time Adaptive Processing (Ref. No. 1998/241), April 1998, pp. 2/1–2/6.
  8. Van Trees, H. L., Optimum Array Processing, Wiley, NY, 2002.
  9. 1 2 3 Li, J. and Stoica, P., MIMO Radar Signal Processing, John Wiley & Sons, 2009. ISBN   0-47017-898-1.

Further reading