State observer

Last updated December 18, 2024

In control theory, a state observer, state estimator, or Luenberger observer is a system that provides an estimate of the internal state of a given real system, from measurements of the input and output of the real system. It is typically computer-implemented, and provides the basis of many practical applications.

Knowing the system state is necessary to solve many control theory problems; for example, stabilizing a system using state feedback. In most practical cases, the physical state of the system cannot be determined by direct observation. Instead, indirect effects of the internal state are observed by way of the system outputs. A simple example is that of vehicles in a tunnel: the rates and velocities at which vehicles enter and leave the tunnel can be observed directly, but the exact state inside the tunnel can only be estimated. If a system is observable, it is possible to fully reconstruct the system state from its output measurements using the state observer.

Typical observer model

Linear, delayed, sliding mode, high gain, Tau, homogeneity-based, extended and cubic observers are among several observer structures used for state estimation of linear and nonlinear systems. A linear observer structure is described in the following sections.

Discrete-time case

The state of a linear, time-invariant discrete-time system is assumed to satisfy

x(k+1)=Ax(k)+Bu(k)

y(k)=Cx(k)+Du(k)

where, at time $k$ , $x(k)$ is the plant's state, $u(k)$ is its input, and $y(k)$ is its output. These equations say that the plant's current output and its next state are both determined solely by its current state and current input. (Although these equations are expressed in terms of discrete time steps, very similar equations hold for continuous-time systems.) If this system is observable, then the output of the plant, $y(k)$ , can be used to steer the state of the state observer.

The observer model of the physical system is then typically derived from the above equations. Additional terms may be included in order to ensure that, on receiving successive measured values of the plant's inputs and outputs, the model's state converges to that of the plant. In particular, the output of the observer may be subtracted from the output of the plant and then multiplied by a matrix $L$ ; this is then added to the equations for the state of the observer to produce a so-called Luenberger observer, defined by the equations below. Note that the variables of a state observer are commonly denoted by a "hat": ${\hat {x}}(k)$ and ${\hat {y}}(k)$ to distinguish them from the variables of the equations satisfied by the physical system.

{\hat {x}}(k+1)=A{\hat {x}}(k)+L\left[y(k)-{\hat {y}}(k)\right]+Bu(k)

{\hat {y}}(k)=C{\hat {x}}(k)+Du(k)

The observer is called asymptotically stable if the observer error $e(k)={\hat {x}}(k)-x(k)$ converges to zero when $k\to \infty$ . For a Luenberger observer, the observer error satisfies $e(k+1)=(A-LC)e(k)$ . The Luenberger observer for this discrete-time system is therefore asymptotically stable when the matrix $A-LC$ has all the eigenvalues inside the unit circle.
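The convergence condition can be checked numerically. The following is a minimal sketch, not a reference implementation: the plant matrices (a sampled double integrator), the gain $L$ and the helper name `simulate_observer` are all assumptions chosen for illustration. The gain was picked by hand so that $A-LC$ has eigenvalues 0.2 and 0.3.

```python
import numpy as np

# Illustrative plant (an assumption, not from the text): a double integrator
# sampled at 0.1 s, with only the position measured.
A = np.array([[1.0, 0.1],
              [0.0, 1.0]])
B = np.array([[0.005],
              [0.1]])
C = np.array([[1.0, 0.0]])
D = np.array([[0.0]])

# Hand-picked gain that places the eigenvalues of A - L C at 0.2 and 0.3.
L = np.array([[1.5],
              [5.6]])


def simulate_observer(x0, xhat0, inputs):
    """Run plant and observer side by side; return the error norm at each step."""
    x = np.array(x0, dtype=float).reshape(-1, 1)
    xhat = np.array(xhat0, dtype=float).reshape(-1, 1)
    error_norms = []
    for u_k in inputs:
        u = np.array([[u_k]])
        y = C @ x + D @ u            # measured plant output
        yhat = C @ xhat + D @ u      # output predicted by the observer
        error_norms.append(float(np.linalg.norm(xhat - x)))
        # The observer update uses only u and y, never the true state x.
        xhat = A @ xhat + L @ (y - yhat) + B @ u
        x = A @ x + B @ u
    return error_norms


inputs = np.sin(0.3 * np.arange(40))
errors = simulate_observer(x0=[1.0, -0.5], xhat0=[0.0, 0.0], inputs=inputs)
spectral_radius = max(abs(np.linalg.eigvals(A - L @ C)))
print(f"spectral radius of A - LC: {spectral_radius:.3f}")
print(f"initial error: {errors[0]:.3e}, final error: {errors[-1]:.3e}")
```

Because the spectral radius of $A-LC$ is below one, the error norm shrinks geometrically regardless of the input sequence applied to the plant.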

For control purposes, the output of the observer system is fed back to the input of both the observer and the plant through the gain matrix $K$ .

u(k)=-K{\hat {x}}(k)

The observer equations then become:

{\hat {x}}(k+1)=A{\hat {x}}(k)+L\left(y(k)-{\hat {y}}(k)\right)-BK{\hat {x}}(k)

{\hat {y}}(k)=C{\hat {x}}(k)-DK{\hat {x}}(k)

or, more simply,

{\hat {x}}(k+1)=\left(A-BK\right){\hat {x}}(k)+L\left(y(k)-{\hat {y}}(k)\right)

{\hat {y}}(k)=\left(C-DK\right){\hat {x}}(k)

By the separation principle, $K$ and $L$ can be chosen independently without harming the stability of the overall system. As a rule of thumb, the poles of the observer $A-LC$ are usually chosen to converge about 10 times faster than the poles of the system $A-BK$ .
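The separation principle can be illustrated numerically: the characteristic polynomial of the combined plant and observer loop factors into those of $A-BK$ and $A-LC$. The sketch below uses assumed matrices and hand-picked gains (none of them from the text) and stacks the closed loop in the coordinates $(x,{\hat {x}})$.

```python
import numpy as np

# Assumed example system and gains, chosen only for illustration.
A = np.array([[1.0, 0.1],
              [0.0, 1.0]])
B = np.array([[0.005],
              [0.1]])
C = np.array([[1.0, 0.0]])
K = np.array([[10.0, 5.0]])   # state-feedback gain
L = np.array([[1.5],
              [5.6]])         # observer gain

# Closed loop with u(k) = -K xhat(k), in the stacked coordinates (x, xhat):
#   x(k+1)    = A x - B K xhat
#   xhat(k+1) = L C x + (A - L C - B K) xhat
# The D u terms cancel in y - yhat, so D does not appear.
closed_loop = np.block([[A, -B @ K],
                        [L @ C, A - L @ C - B @ K]])

# np.poly of a square matrix returns its characteristic polynomial.
poly_closed_loop = np.poly(closed_loop)
poly_product = np.convolve(np.poly(A - B @ K), np.poly(A - L @ C))

print("closed-loop eigenvalues:", np.linalg.eigvals(closed_loop))
print("eig(A - BK):            ", np.linalg.eigvals(A - B @ K))
print("eig(A - LC):            ", np.linalg.eigvals(A - L @ C))
```

In this example the observer eigenvalues (0.2 and 0.3) decay faster than the controller eigenvalues (modulus about 0.74), in the spirit of the rule of thumb above.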

Continuous-time case

The previous example was for an observer implemented in a discrete-time LTI system. However, the process is similar for the continuous-time case; the observer gains $L$ are chosen to make the continuous-time error dynamics converge to zero asymptotically (i.e., when $A-LC$ is a Hurwitz matrix).

For a continuous-time linear system

{\dot {x}}=Ax+Bu,

y=Cx+Du,

where $x\in \mathbb {R} ^{n},u\in \mathbb {R} ^{m},y\in \mathbb {R} ^{r}$ , the observer looks similar to the discrete-time case described above:

{\dot {\hat {x}}}=A{\hat {x}}+Bu+L\left(y-{\hat {y}}\right),

{\hat {y}}=C{\hat {x}}+Du.

The observer error $e=x-{\hat {x}}$ satisfies the equation

{\dot {e}}=(A-LC)e.

The eigenvalues of the matrix $A-LC$ can be chosen arbitrarily by an appropriate choice of the observer gain $L$ when the pair $[A,C]$ is observable, i.e., when the observability condition holds. In particular, $A-LC$ can be made Hurwitz, so the observer error $e(t)\to 0$ when $t\to \infty$ .
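One way to compute such a gain for a single-output system is the observer form of Ackermann's formula, a technique not described in this section and used here only as an illustration. The sketch below first checks observability through the rank of the observability matrix; the example matrices, the desired poles and the function names are assumptions.

```python
import numpy as np


def observability_matrix(A, C):
    """Stack C, CA, ..., CA^(n-1)."""
    n = A.shape[0]
    return np.vstack([C @ np.linalg.matrix_power(A, i) for i in range(n)])


def ackermann_observer_gain(A, C, poles):
    """Gain L such that eig(A - L C) equals `poles` (single-output systems only)."""
    n = A.shape[0]
    obs = observability_matrix(A, C)
    if np.linalg.matrix_rank(obs) < n:
        raise ValueError("the pair (A, C) is not observable")
    coeffs = np.poly(poles)                      # [1, a_{n-1}, ..., a_0]
    # Desired characteristic polynomial evaluated at the matrix A.
    phi_A = sum(c * np.linalg.matrix_power(A, n - i) for i, c in enumerate(coeffs))
    e_n = np.zeros((n, 1))
    e_n[-1, 0] = 1.0
    return phi_A @ np.linalg.solve(obs, e_n)


# Assumed example: a stable second-order system with the first state measured.
A = np.array([[0.0, 1.0],
              [-2.0, -3.0]])
C = np.array([[1.0, 0.0]])
L = ackermann_observer_gain(A, C, poles=[-5.0, -6.0])
print("L =", L.ravel())
print("eig(A - LC) =", np.linalg.eigvals(A - L @ C))
```

Ackermann's formula is known to be numerically fragile for larger systems, so dedicated pole-placement routines are usually preferred in practice.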

Peaking and other observer methods

When the observer gain $L$ is high, the linear Luenberger observer converges to the system states very quickly. However, high observer gain leads to a peaking phenomenon in which the initial estimator error can be prohibitively large (i.e., impractical or unsafe to use).^[1] As a consequence, nonlinear high-gain observer methods are available that converge quickly without the peaking phenomenon. For example, sliding mode control can be used to design an observer that brings one estimated state's error to zero in finite time even in the presence of measurement error; the other states have error that behaves similarly to the error in a Luenberger observer after peaking has subsided. Sliding mode observers also have attractive noise resilience properties that are similar to those of a Kalman filter.^[2]^[3] Another approach is to apply a multi-observer, which significantly improves transients and reduces observer overshoot. A multi-observer can be adapted to every system where a high-gain observer is applicable.^[4]
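The peaking phenomenon can be reproduced with a small simulation. The sketch below makes several assumptions that are not in the text: a double-integrator error model, the common high-gain parametrization $L=[2/\varepsilon ,\ 1/\varepsilon ^{2}]^{T}$ (which puts both observer poles at $-1/\varepsilon$), a unit initial error in the measured state, and forward-Euler integration. For this model the second error component peaks at roughly $1/(e\,\varepsilon )$, so a tenfold increase in gain speed gives a tenfold larger transient.

```python
import numpy as np


def peak_estimation_error(eps, steps_per_eps=1000, horizon_in_eps=10.0):
    """Largest |e_2| seen during the transient of de/dt = (A - L C) e."""
    A = np.array([[0.0, 1.0],
                  [0.0, 0.0]])
    C = np.array([[1.0, 0.0]])
    L = np.array([[2.0 / eps],
                  [1.0 / eps**2]])       # both observer poles at -1/eps
    F = A - L @ C
    dt = eps / steps_per_eps             # step scaled with the time constant
    e = np.array([[1.0],
                  [0.0]])                # initial error only in the measured state
    peak = 0.0
    for _ in range(int(horizon_in_eps * steps_per_eps)):
        e = e + dt * (F @ e)             # forward-Euler step
        peak = max(peak, float(abs(e[1, 0])))
    return peak


peaks = {eps: peak_estimation_error(eps) for eps in (0.1, 0.01)}
for eps, peak in peaks.items():
    print(f"eps = {eps}: peak |e_2| = {peak:.2f}")
```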

State observers for nonlinear systems

High gain, sliding mode and extended observers are the most common observers for nonlinear systems. To illustrate the application of sliding mode observers for nonlinear systems, first consider the no-input non-linear system:

{\dot {x}}=f(x)

where $x\in \mathbb {R} ^{n}$ . Also assume that there is a measurable output $y\in \mathbb {R}$ given by

y=h(x).

There are several non-approximate approaches for designing an observer. The two observers given below also apply to the case when the system has an input. That is,

{\dot {x}}=f(x)+B(x)u

y=h(x).

Linearizable error dynamics

One suggestion by Krener and Isidori^[5] and Krener and Respondek^[6] can be applied in a situation when there exists a linearizing transformation (i.e., a diffeomorphism, like the one used in feedback linearization) $z=\Phi (x)$ such that in new variables the system equations read

{\dot {z}}=Az+\phi (y),

y=Cz.

The Luenberger observer is then designed as

{\dot {\hat {z}}}=A{\hat {z}}+\phi (y)-L\left(C{\hat {z}}-y\right).

The observer error for the transformed variable $e={\hat {z}}-z$ satisfies the same equation as in the classical linear case:

{\dot {e}}=(A-LC)e.
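The cancellation of the nonlinearity can be seen in a short simulation. The sketch below assumes a pendulum-like system that is already in the transformed form, with $\phi (y)=[0,\ -\sin y]^{T}$, a hand-picked gain and forward-Euler integration; none of these choices come from the cited papers. Because $\phi (y)$ enters the plant and the observer identically, the simulated error follows the linear recursion exactly.

```python
import numpy as np

# Assumed system in the form dz/dt = A z + phi(y), y = C z (pendulum with unit constants).
A = np.array([[0.0, 1.0],
              [0.0, 0.0]])
C = np.array([[1.0, 0.0]])
L = np.array([[3.0],
              [2.0]])                    # eigenvalues of A - L C are -1 and -2


def phi(y):
    """Output injection term; depends on the measured output only."""
    return np.array([[0.0],
                     [-np.sin(y)]])


dt, steps = 1e-3, 5000
z = np.array([[1.0], [0.0]])             # true state
zhat = np.array([[0.0], [0.0]])          # observer state
e0 = zhat - z

for _ in range(steps):
    y = float((C @ z)[0, 0])             # the same measurement feeds both updates
    z_next = z + dt * (A @ z + phi(y))
    zhat = zhat + dt * (A @ zhat + phi(y) - L @ (C @ zhat - y))
    z = z_next

e_final = zhat - z
# Linear prediction: e(k+1) = (I + dt (A - L C)) e(k), with no trace of phi.
e_predicted = np.linalg.matrix_power(np.eye(2) + dt * (A - L @ C), steps) @ e0
print("simulated error:", e_final.ravel())
print("linear prediction:", e_predicted.ravel())
```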

As shown by Gauthier, Hammouri, and Othman^[7] and Hammouri and Kinnaert,^[8] if there exists a transformation $z=\Phi (x)$ such that the system can be transformed into the form

{\dot {z}}=A(u(t))z+\phi (y,u(t)),

y=Cz,

then the observer is designed as

{\dot {\hat {z}}}=A(u(t)){\hat {z}}+\phi (y,u(t))-L(t)\left(C{\hat {z}}-y\right),

where $L(t)$ is a time-varying observer gain.

Ciccarella, Dalla Mora, and Germani^[9] obtained more advanced and general results, removing the need for a nonlinear transform and proving global asymptotic convergence of the estimated state to the true state using only simple assumptions on regularity.

Switched observers

As discussed for the linear case above, the peaking phenomenon present in Luenberger observers justifies the use of switched observers. A switched observer encompasses a relay or binary switch that acts upon detecting minute changes in the measured output. Some common types of switched observers include the sliding mode observer, nonlinear extended state observer,^[10] fixed time observer,^[11] switched high gain observer^[12] and uniting observer.^[13] The sliding mode observer uses non-linear high-gain feedback to drive estimated states to a hypersurface where there is no difference between the estimated output and the measured output. The non-linear gain used in the observer is typically implemented with a scaled switching function, like the signum (i.e., sgn) of the error between the estimated and measured outputs. Hence, due to this high-gain feedback, the vector field of the observer has a crease in it so that observer trajectories slide along a curve where the estimated output matches the measured output exactly. So, if the system is observable from its output, the observer states will all be driven to the actual system states. Additionally, by using the sign of the error to drive the sliding mode observer, the observer trajectories become insensitive to many forms of noise. Hence, some sliding mode observers have attractive properties similar to the Kalman filter but with simpler implementation.^[2]^[3]

As suggested by Drakunov,^[14] a sliding mode observer can also be designed for a class of non-linear systems. Such an observer can be written in terms of the original variable estimate ${\hat {x}}$ and has the form

{\dot {\hat {x}}}=\left[{\frac {\partial H({\hat {x}})}{\partial x}}\right]^{-1}M({\hat {x}})\operatorname {sgn}(V(t)-H({\hat {x}}))

where:

The $\operatorname {sgn}({\mathord {\cdot }})$ vector extends the scalar signum function to $n$ dimensions. That is,
$\operatorname {sgn}(z)={\begin{bmatrix}\operatorname {sgn}(z_{1})\\\operatorname {sgn}(z_{2})\\\vdots \\\operatorname {sgn}(z_{i})\\\vdots \\\operatorname {sgn}(z_{n})\end{bmatrix}}$
for the vector $z\in \mathbb {R} ^{n}$ .
The vector $H(x)$ has components that are the output function $h(x)$ and its repeated Lie derivatives. In particular,
$H(x)\triangleq {\begin{bmatrix}h_{1}(x)\\h_{2}(x)\\h_{3}(x)\\\vdots \\h_{n}(x)\end{bmatrix}}\triangleq {\begin{bmatrix}h(x)\\L_{f}h(x)\\L_{f}^{2}h(x)\\\vdots \\L_{f}^{n-1}h(x)\end{bmatrix}}$
where $L_{f}^{i}h$ is the $i$-th Lie derivative of the output function $h$ along the vector field $f$ (i.e., along $x$ trajectories of the non-linear system). In the special case where the system has no input or has a relative degree of $n$ , $H(x(t))$ is a collection of the output $y(t)=h(x(t))$ and its $n-1$ derivatives. Because the inverse of the Jacobian linearization of $H(x)$ must exist for this observer to be well defined, the transformation $H(x)$ is guaranteed to be a local diffeomorphism.
The diagonal matrix $M({\hat {x}})$ of gains is such that
$M({\hat {x}})\triangleq \operatorname {diag} (m_{1}({\hat {x}}),m_{2}({\hat {x}}),\ldots ,m_{n}({\hat {x}}))={\begin{bmatrix}m_{1}({\hat {x}})&&&&&\\&m_{2}({\hat {x}})&&&&\\&&\ddots &&&\\&&&m_{i}({\hat {x}})&&\\&&&&\ddots &\\&&&&&m_{n}({\hat {x}})\end{bmatrix}}$
where, for each $i\in \{1,2,\dots ,n\}$ , element $m_{i}({\hat {x}})>0$ and suitably large to ensure reachability of the sliding mode.
The observer vector $V(t)$ is such that
$V(t)\triangleq {\begin{bmatrix}v_{1}(t)\\v_{2}(t)\\v_{3}(t)\\\vdots \\v_{i}(t)\\\vdots \\v_{n}(t)\end{bmatrix}}\triangleq {\begin{bmatrix}y(t)\\\{m_{1}({\hat {x}})\operatorname {sgn}(v_{1}(t)-h_{1}({\hat {x}}(t)))\}_{\text{eq}}\\\{m_{2}({\hat {x}})\operatorname {sgn}(v_{2}(t)-h_{2}({\hat {x}}(t)))\}_{\text{eq}}\\\vdots \\\{m_{i-1}({\hat {x}})\operatorname {sgn}(v_{i-1}(t)-h_{i-1}({\hat {x}}(t)))\}_{\text{eq}}\\\vdots \\\{m_{n-1}({\hat {x}})\operatorname {sgn}(v_{n-1}(t)-h_{n-1}({\hat {x}}(t)))\}_{\text{eq}}\end{bmatrix}}$
where $\operatorname {sgn}({\mathord {\cdot }})$ here is the normal signum function defined for scalars, and $\{\ldots \}_{\text{eq}}$ denotes an "equivalent value operator" of a discontinuous function in sliding mode.

The idea can be briefly explained as follows. According to the theory of sliding modes, in order to describe the system behavior once the sliding mode starts, the function $\operatorname {sgn}(v_{i}(t)\!-\!h_{i}({\hat {x}}(t)))$ should be replaced by equivalent values (see equivalent control in the theory of sliding modes). In practice, it switches (chatters) at high frequency, with its slow component being equal to the equivalent value. By applying an appropriate lowpass filter to remove the high-frequency component, one can obtain the value of the equivalent control, which contains more information about the state of the estimated system. The observer described above uses this method several times to obtain the state of the nonlinear system, ideally in finite time.

The modified observation error can be written in the transformed states $e=H(x)-H({\hat {x}})$ . In particular,

{\begin{aligned}{\dot {e}}&={\frac {\mathrm {d} }{\mathrm {d} t}}H(x)-{\frac {\mathrm {d} }{\mathrm {d} t}}H({\hat {x}})\\&={\frac {\mathrm {d} }{\mathrm {d} t}}H(x)-M({\hat {x}})\,\operatorname {sgn}(V(t)-H({\hat {x}}(t))),\end{aligned}}

and so

{\begin{aligned}{\begin{bmatrix}{\dot {e}}_{1}\\{\dot {e}}_{2}\\\vdots \\{\dot {e}}_{i}\\\vdots \\{\dot {e}}_{n-1}\\{\dot {e}}_{n}\end{bmatrix}}&={\mathord {\overbrace {\begin{bmatrix}{\dot {h}}_{1}(x)\\{\dot {h}}_{2}(x)\\\vdots \\{\dot {h}}_{i}(x)\\\vdots \\{\dot {h}}_{n-1}(x)\\{\dot {h}}_{n}(x)\end{bmatrix}} ^{{\tfrac {\mathrm {d} }{\mathrm {d} t}}H(x)}}}-{\mathord {\overbrace {M({\hat {x}})\,\operatorname {sgn}(V(t)-H({\hat {x}}(t)))} ^{{\tfrac {\mathrm {d} }{\mathrm {d} t}}H({\hat {x}})}}}={\begin{bmatrix}h_{2}(x)\\h_{3}(x)\\\vdots \\h_{i+1}(x)\\\vdots \\h_{n}(x)\\L_{f}^{n}h(x)\end{bmatrix}}-{\begin{bmatrix}m_{1}\operatorname {sgn}(v_{1}(t)-h_{1}({\hat {x}}(t)))\\m_{2}\operatorname {sgn}(v_{2}(t)-h_{2}({\hat {x}}(t)))\\\vdots \\m_{i}\operatorname {sgn}(v_{i}(t)-h_{i}({\hat {x}}(t)))\\\vdots \\m_{n-1}\operatorname {sgn}(v_{n-1}(t)-h_{n-1}({\hat {x}}(t)))\\m_{n}\operatorname {sgn}(v_{n}(t)-h_{n}({\hat {x}}(t)))\end{bmatrix}}\\&={\begin{bmatrix}h_{2}(x)-m_{1}({\hat {x}})\operatorname {sgn}({\mathord {\overbrace {{\mathord {\overbrace {v_{1}(t)} ^{v_{1}(t)=y(t)=h_{1}(x)}}}-h_{1}({\hat {x}}(t))} ^{e_{1}}}})\\h_{3}(x)-m_{2}({\hat {x}})\operatorname {sgn}(v_{2}(t)-h_{2}({\hat {x}}(t)))\\\vdots \\h_{i+1}(x)-m_{i}({\hat {x}})\operatorname {sgn}(v_{i}(t)-h_{i}({\hat {x}}(t)))\\\vdots \\h_{n}(x)-m_{n-1}({\hat {x}})\operatorname {sgn}(v_{n-1}(t)-h_{n-1}({\hat {x}}(t)))\\L_{f}^{n}h(x)-m_{n}({\hat {x}})\operatorname {sgn}(v_{n}(t)-h_{n}({\hat {x}}(t)))\end{bmatrix}}.\end{aligned}}

So:

As long as $m_{1}({\hat {x}})\geq |h_{2}(x(t))|$ , the first row of the error dynamics, ${\dot {e}}_{1}=h_{2}(x)-m_{1}({\hat {x}})\operatorname {sgn}(e_{1})$ , will meet sufficient conditions to enter the $e_{1}=0$ sliding mode in finite time.
Along the $e_{1}=0$ surface, the corresponding $v_{2}(t)=\{m_{1}({\hat {x}})\operatorname {sgn}(e_{1})\}_{\text{eq}}$ equivalent control will be equal to $h_{2}(x)$ , and so $v_{2}(t)-h_{2}({\hat {x}})=h_{2}(x)-h_{2}({\hat {x}})=e_{2}$ . Hence, so long as $m_{2}({\hat {x}})\geq |h_{3}(x(t))|$ , the second row of the error dynamics, ${\dot {e}}_{2}=h_{3}(x)-m_{2}({\hat {x}})\operatorname {sgn}(e_{2})$ , will enter the $e_{2}=0$ sliding mode in finite time.
Along the $e_{i}=0$ surface, the corresponding $v_{i+1}(t)=\{\ldots \}_{\text{eq}}$ equivalent control will be equal to $h_{i+1}(x)$ . Hence, so long as $m_{i+1}({\hat {x}})\geq |h_{i+2}(x(t))|$ , the $(i+1)$-th row of the error dynamics, ${\dot {e}}_{i+1}=h_{i+2}(x)-m_{i+1}({\hat {x}})\operatorname {sgn}(e_{i+1})$ , will enter the $e_{i+1}=0$ sliding mode in finite time.

So, for sufficiently large $m_{i}$ gains, all observer estimated states reach the actual states in finite time. In fact, increasing $m_{i}$ allows for convergence in any desired finite time so long as each $|h_{i}(x(0))|$ function can be bounded with certainty. Hence, the requirement that the map $H:\mathbb {R} ^{n}\to \mathbb {R} ^{n}$ is a diffeomorphism (i.e., that its Jacobian linearization is invertible) asserts that convergence of the estimated output implies convergence of the estimated state. That is, the requirement is an observability condition.
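The cascade described above can be sketched for a two-state example. Everything specific in the code is an assumption made for illustration: the plant is a harmonic oscillator (${\dot {x}}_{1}=x_{2}$, ${\dot {x}}_{2}=-x_{1}$, $y=x_{1}$), so that $H(x)=x$ and its Jacobian is the identity; the gains are $m_{1}=m_{2}=2$; and the equivalent value is approximated by a first-order lowpass filter with time constant `tau`. In discrete time the estimates chatter around the true state instead of reaching it exactly.

```python
import math


def sgn(value):
    """Scalar signum: -1, 0 or +1."""
    return (value > 0) - (value < 0)


def run_sliding_mode_observer(t_end=5.0, dt=1e-4, m1=2.0, m2=2.0, tau=0.02):
    """Estimate (x1, x2) of the oscillator from y = x1 = sin(t) alone."""
    x1_hat, x2_hat = 0.5, -0.5     # deliberately wrong initial estimate
    v2 = 0.0                       # filtered approximation of {m1 sgn(e1)}_eq
    n_steps = int(round(t_end / dt))
    for k in range(n_steps):
        y = math.sin(k * dt)       # measurement; the true x2 = cos(t) is never used
        s1 = sgn(y - x1_hat)       # first row: v1 = y
        # The lowpass filter extracts the slow component of the switching signal.
        v2 += dt / tau * (m1 * s1 - v2)
        s2 = sgn(v2 - x2_hat)      # second row uses the equivalent value v2
        x1_hat += dt * m1 * s1
        x2_hat += dt * m2 * s2
    t_final = n_steps * dt
    return x1_hat, x2_hat, math.sin(t_final), math.cos(t_final)


x1_hat, x2_hat, x1_true, x2_true = run_sliding_mode_observer()
print(f"x1: estimate {x1_hat:.3f}, true {x1_true:.3f}")
print(f"x2: estimate {x2_hat:.3f}, true {x2_true:.3f}")
```

The residual error in the unmeasured state is set mainly by the filter lag and the chattering ripple, so it shrinks as `dt` and `tau` are reduced together.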

In the case of the sliding mode observer for the system with the input, additional conditions are needed for the observation error to be independent of the input. For example, that

{\frac {\partial H(x)}{\partial x}}B(x)

does not depend on time. The observer is then

{\dot {\hat {x}}}=\left[{\frac {\partial H({\hat {x}})}{\partial x}}\right]^{-1}M({\hat {x}})\operatorname {sgn}(V(t)-H({\hat {x}}))+B({\hat {x}})u.

Multi-observer

Multi-observer extends the high-gain observer structure from single to multi observer, with many models working simultaneously. This has two layers: the first consists of multiple high-gain observers with different estimation states, and the second determines the importance weights of the first layer observers. The algorithm is simple to implement and does not contain any risky operations like differentiation.^[4] The idea of multiple models was previously applied to obtain information in adaptive control.^[15]

Multi-observer schema

Assuming that the number of high-gain observers equals $n+1$ ,

{\dot {\hat {x}}}_{k}(t)=A{\hat {x}}_{k}(t)+B\phi _{0}({\hat {x}}(t),u(t))-L({\hat {y}}_{k}(t)-y(t))

{\hat {y}}_{k}(t)=C{\hat {x}}_{k}(t)

where $k=1,\dots ,n+1$ is the observer index. The first-layer observers share the same gain $L$ but differ in their initial states ${\hat {x}}_{k}(0)$ . In the second layer, the estimates ${\hat {x}}_{k}(t)$ from all $k=1,\dots ,n+1$ observers are combined into a single state vector estimate

{\hat {x}}(t)=\sum \limits _{k=1}^{n+1}\alpha _{k}(t){\hat {x}}_{k}(t)

where $\alpha _{k}\in \mathbb {R}$ are weight factors. These factors are changed to provide the estimation in the second layer and to improve the observation process.

Assume that

\sum \limits _{k=1}^{n+1}\alpha _{k}(t)\xi _{k}(t)=0

and

\sum \limits _{k=1}^{n+1}\alpha _{k}(t)=1

where $\xi _{k}\in \mathbb {R} ^{n\times 1}$ is some vector that depends on the $k$-th observer error $e_{k}(t)$ .

Eliminating $\alpha _{n+1}(t)$ with the second constraint yields a linear regression problem

[-\xi _{n+1}(t)]=[\xi _{1}(t)-\xi _{n+1}(t)\dots \xi _{k}(t)-\xi _{n+1}(t)\dots \xi _{n}(t)-\xi _{n+1}(t)]^{T}{\begin{bmatrix}\alpha _{1}(t)\\\vdots \\\alpha _{k}(t)\\\vdots \\\alpha _{n}(t)\end{bmatrix}}

This formula makes it possible to estimate $\alpha _{k}(t)$ . To construct the manifold, a mapping $m:\mathbb {R} ^{n}\to \mathbb {R} ^{n}$ with $\xi _{k}(t)=m(e_{k}(t))$ is needed, together with assurance that $\xi _{k}(t)$ can be calculated from measurable signals. The first step is to eliminate the peaking phenomenon for $\alpha _{k}(t)$ from the observer error

e_{\sigma }(t)=\sum \limits _{k=1}^{n+1}\alpha _{k}(t)e_{k}(t).

Differentiating $\eta _{k}(t)={\hat {y}}_{k}(t)-y(t)$ $n$ times gives the mapping $m$ , which leads to $\xi _{k}(t)$ defined as

\xi _{k}(t)={\begin{bmatrix}1&0&0&\cdots &0\\CL&1&0&\cdots &0\\CAL&CL&1&\cdots &0\\CA^{2}L&CAL&CL&\cdots &0\\\vdots &\vdots &\vdots &\ddots \\CA^{n-2}L&CA^{n-3}L&CA^{n-4}L&\cdots &1\end{bmatrix}}{\begin{bmatrix}\int \limits _{t-t_{d}}^{t}{{n-1} \atop \cdots }\int \limits _{t-t_{d}}^{t}\eta _{k}(\tau )d\tau \\\vdots \\\eta _{k}(t)-\eta _{k}(t-(n-1)t_{d})\end{bmatrix}}

where $t_{d}>0$ is some time constant. Note that $\xi _{k}(t)$ relies on both $\eta _{k}(t)$ and its integrals, hence it is easily available in the control system. Furthermore, $\alpha _{k}(t)$ is specified by an estimation law, which shows that the manifold is measurable. In the second layer, ${\hat {\alpha }}_{k}(t)$ for $k=1,\dots ,n+1$ are introduced as estimates of the $\alpha _{k}(t)$ coefficients. The mapping error is specified as

e_{\xi }(t)=\sum \limits _{k=1}^{n+1}{\hat {\alpha }}_{k}(t)\xi _{k}(t)

where $e_{\xi }(t)\in \mathbb {R} ^{n\times 1},{\hat {\alpha }}_{k}(t)\in \mathbb {R}$ . If the coefficients ${\hat {\alpha }}_{k}(t)$ are equal to $\alpha _{k}(t)$ , then the mapping error $e_{\xi }(t)=0$ . It is then possible to calculate ${\hat {x}}$ from the above equation, and hence the peaking phenomenon is reduced thanks to the properties of the manifold. The created mapping gives a lot of flexibility in the estimation process. It is even possible to estimate the value of $x(t)$ in the second layer and to calculate the state $x$ .^[4]
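The algebra behind the second layer can be illustrated with a static example. The sketch below solves the linear regression problem given earlier for the weights under a strong simplifying assumption: the mapping $m$ is taken to be the identity, so $\xi _{k}=e_{k}$ is formed directly from the true state. This is not available in practice, which is why the method builds $\xi _{k}$ from measurable signals. The numbers and the function name `combination_weights` are hypothetical.

```python
import numpy as np


def combination_weights(xi):
    """Solve sum_k alpha_k xi_k = 0 with sum_k alpha_k = 1.

    xi has shape (n, n + 1); column k holds the vector xi_k.
    """
    n = xi.shape[0]
    last = xi[:, -1]
    regressor = xi[:, :n] - last[:, None]        # columns xi_k - xi_{n+1}
    alpha_head = np.linalg.solve(regressor, -last)
    return np.append(alpha_head, 1.0 - alpha_head.sum())


# Hypothetical snapshot: true state (n = 2) and three first-layer estimates.
x_true = np.array([1.0, -2.0])
x_hats = np.array([[2.0, 0.0, 1.5],
                   [-1.0, -2.5, 0.0]])           # column k is xhat_k

# Illustration only: with m equal to the identity, xi_k is the estimation error e_k.
xi = x_hats - x_true[:, None]

alpha = combination_weights(xi)
x_combined = x_hats @ alpha                      # second-layer estimate
print("alpha =", alpha)
print("combined estimate =", x_combined)
```

With exact weights the weighted combination reproduces the true state even though every individual estimate is wrong, which is the property the second layer exploits.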

Bounding observers

Bounding^[16] or interval observers^[17]^[18] constitute a class of observers that provide two estimations of the state simultaneously: one of the estimations provides an upper bound on the real value of the state, whereas the second one provides a lower bound. The real value of the state is then known to be always within these two estimations.

These bounds are very important in practical applications,^[19]^[20] as they make it possible to know the precision of the estimation at each time.

Mathematically, two Luenberger observers can be used, if $L$ is properly selected, using, for example, positive systems properties:^[21] one for the upper bound ${\hat {x}}_{U}(k)$ (that ensures that $e(k)={\hat {x}}_{U}(k)-x(k)$ converges to zero from above when $k\to \infty$ , in the absence of noise and uncertainty), and one for the lower bound ${\hat {x}}_{L}(k)$ (that ensures that $e(k)={\hat {x}}_{L}(k)-x(k)$ converges to zero from below). That is, ${\hat {x}}_{U}(k)\geq x(k)\geq {\hat {x}}_{L}(k)$ always holds.
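A minimal sketch of this construction follows, under assumptions chosen for illustration: the matrices are hypothetical, there is no noise or model uncertainty, and $L$ is picked so that $A-LC$ is elementwise nonnegative with all eigenvalues inside the unit circle. A nonnegative matrix maps nonnegative error vectors to nonnegative error vectors, so two copies of the same Luenberger observer, started above and below the true state, keep their ordering.

```python
import numpy as np

# Hypothetical system; L is chosen so that F = A - L C has no negative entries.
A = np.array([[0.5, 0.2],
              [0.1, 0.6]])
B = np.array([[1.0],
              [0.5]])
C = np.array([[1.0, 0.0]])
L = np.array([[0.3],
              [0.05]])
F = A - L @ C


def observer_step(xhat, y, u):
    """One Luenberger update (D = 0 assumed)."""
    return A @ xhat + L @ (y - C @ xhat) + B * u


x = np.array([[1.0], [2.0]])          # true state, unknown to the observers
upper = np.array([[3.0], [4.0]])      # initial guess known to lie above x
lower = np.array([[-1.0], [0.0]])     # initial guess known to lie below x

upper_margins, lower_margins = [], []
for k in range(40):
    u = np.cos(0.2 * k)
    y = C @ x
    upper_margins.append((upper - x).min())
    lower_margins.append((x - lower).min())
    upper, lower = observer_step(upper, y, u), observer_step(lower, y, u)
    x = A @ x + B * u

final_width = float((upper - lower).max())
print("F nonnegative:", bool((F >= 0).all()))
print("smallest upper margin:", min(upper_margins))
print("smallest lower margin:", min(lower_margins))
print("final interval width:", final_width)
```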

References

In-line references

[1] Khalil, H.K. (2002), Nonlinear Systems (3rd ed.), Upper Saddle River, NJ: Prentice Hall, ISBN 978-0-13-067389-3
[2] Utkin, Vadim; Guldner, Jürgen; Shi, Jingxin (1999), Sliding Mode Control in Electromechanical Systems, Philadelphia, PA: Taylor & Francis, Inc., ISBN 978-0-7484-0116-1
[3] Drakunov, S.V. (1983), "An adaptive quasioptimal filter with discontinuous parameters", Automation and Remote Control, 44 (9): 1167–1175
[4] Bernat, J.; Stepien, S. (2015), "Multi modelling as new estimation schema for High Gain Observers", International Journal of Control, 88 (6): 1209–1222, Bibcode:2015IJC....88.1209B, doi:10.1080/00207179.2014.1000380, S2CID 8599596
[5] Krener, A.J.; Isidori, Alberto (1983), "Linearization by output injection and nonlinear observers", Systems & Control Letters, 3: 47–52, doi:10.1016/0167-6911(83)90037-3
[6] Krener, A.J.; Respondek, W. (1985), "Nonlinear observers with linearizable error dynamics", SIAM Journal on Control and Optimization, 23 (2): 197–216, doi:10.1137/0323016
[7] Gauthier, J.P.; Hammouri, H.; Othman, S. (1992), "A simple observer for nonlinear systems applications to bioreactors", IEEE Transactions on Automatic Control, 37 (6): 875–880, doi:10.1109/9.256352
[8] Hammouri, H.; Kinnaert, M. (1996), "A New Procedure for Time-Varying Linearization up to Output Injection", Systems & Control Letters, 28 (3): 151–157, doi:10.1016/0167-6911(96)00022-9
[9] Ciccarella, G.; Dalla Mora, M.; Germani, A. (1993), "A Luenberger-like observer for nonlinear systems", International Journal of Control, 57 (3): 537–556, doi:10.1080/00207179308934406
[10] Guo, Bao-Zhu; Zhao, Zhi-Liang (January 2011). "Extended State Observer for Nonlinear Systems with Uncertainty". IFAC Proceedings Volumes. 44 (1). International Federation of Automatic Control: 1855–1860. doi:10.3182/20110828-6-IT-1002.00399. Retrieved 8 August 2023.
[11] "The Wayback Machine has not archived that URL". Retrieved 8 August 2023. [dead link]
[12] Kumar, Sunil; Kumar Pal, Anil; Kamal, Shyam; Xiong, Xiaogang (19 May 2023). "Design of switched high-gain observer for nonlinear systems". International Journal of Systems Science. 54 (7). Science Publishing Group: 1471–1483. Bibcode:2023IJSS...54.1471K. doi:10.1080/00207721.2023.2178863. S2CID 257145897. Retrieved 8 August 2023.
[13] "Registration". IEEE Xplore. Retrieved 8 August 2023.
[14] Drakunov, S.V. (1992). "Sliding-mode observers based on equivalent control method". [1992] Proceedings of the 31st IEEE Conference on Decision and Control. pp. 2368–2370. doi:10.1109/CDC.1992.371368. ISBN 978-0-7803-0872-5. S2CID 120072463.
[15] Narendra, K.S.; Han, Z. (August 2012). "A new approach to adaptive control using multiple models". International Journal of Adaptive Control and Signal Processing. 26 (8): 778–799. doi:10.1002/acs.2269. ISSN 1099-1115. S2CID 60482210.
[16] Combastel, C. (2003). "A state bounding observer based on zonotopes" (PDF). 2003 European Control Conference (ECC). pp. 2589–2594. doi:10.23919/ECC.2003.7085991. ISBN 978-3-9524173-7-9. S2CID 13790057.
[17] Rami, M. Ait; Cheng, C. H.; De Prada, C. (2008). "Tight robust interval observers: An LP approach" (PDF). 2008 47th IEEE Conference on Decision and Control. pp. 2967–2972. doi:10.1109/CDC.2008.4739280. ISBN 978-1-4244-3123-6. S2CID 288928.
[18] Efimov, D.; Raïssi, T. (2016). "Design of interval observers for uncertain dynamical systems". Automation and Remote Control. 77 (2): 191–225. doi:10.1134/S0005117916020016. hdl:20.500.12210/25069. S2CID 49322177.
[19] "Selection of Time-after-injection in Bone Scanning using Compartmental Observers" (PDF). Archived from the original (PDF) on 13 December 2013.
[20] Hadj-Sadok, M.Z.; Gouzé, J.L. (2001). "Estimation of uncertain models of activated sludge processes with interval observers". Journal of Process Control. 11 (3): 299–310. doi:10.1016/S0959-1524(99)00074-8.
[21] Rami, Mustapha Ait; Tadeo, Fernando; Helmke, Uwe (2011). "Positive observers for linear positive systems, and their implications". International Journal of Control. 84 (4): 716–725. Bibcode:2011IJC....84..716A. doi:10.1080/00207179.2011.573000. S2CID 21211012.

General references

Sontag, Eduardo (1998), Mathematical Control Theory: Deterministic Finite Dimensional Systems. Second Edition, Springer, ISBN 978-0-387-98489-6

External links

Kalman Filter Explained Simply, Step-by-Step Tutorial of the Kalman Filter with Equations

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
