Controllability is an important property of a control system and plays a crucial role in many regulation problems, such as the stabilization of unstable systems using feedback, tracking problems, obtaining optimal control strategies, or, simply prescribing an input that has a desired effect on the state.
Controllability and observability are dual notions. Controllability pertains to regulating the state by a choice of a suitable input, while observability pertains to being able to know the state by observing the output (assuming that the input is also being observed).
Broadly speaking, the concept of controllability relates to the ability to steer a system around in its configuration space using only certain admissible manipulations. The exact definition varies depending on the framework or the type of models dealt with.
The following are examples of variants of notions of controllability that have been introduced in the systems and control literature:
The state of a deterministic system, which is the set of values of all the system's state variables (those variables characterized by dynamic equations), completely describes the system at any given time. In particular, no information on the past of a system is needed to help in predicting the future, if the states at the present time are known and all current and future values of the control variables (those whose values can be chosen) are known.
Complete state controllability (or simply controllability if no other context is given) describes the ability of an external input (the vector of control variables) to move the internal state of a system from any initial state to any final state in a finite time interval. [1] : 737
That is, we can informally define controllability as follows: If for any initial state x0 and any final state xf there exists an input sequence to transfer the system state from x0 to xf in a finite time interval, then the system modeled by the state-space representation is controllable. For the simplest example of a continuous, linear time-invariant (LTI) system, the row dimension of the state space expression ẋ = Ax(t) + Bu(t) determines the interval; each row contributes a vector in the state space of the system. If there are not enough such vectors to span the state space of x, then the system cannot achieve controllability. It may be necessary to modify A and B to better approximate the underlying differential relationships it estimates to achieve controllability.
Controllability does not mean that a reached state can be maintained, merely that any state can be reached.
Controllability does not mean that arbitrary paths can be made through state space, only that there exists a path within a finite time interval. When the time interval can also be specified, the dynamical system is often referred to as being strongly controllable.
Consider the continuous linear system [note 1]
There exists a control u from state x0 at time t0 to state x1 at time t1 > t0 if and only if x1 − ϕ(t0, t1)x0 is in the column space of where ϕ(t0, t) is the state-transition matrix, and W(t0, t1) is the Controllability Gramian.
In fact, if η0 is a solution to W(t0, t1)η = x1 − ϕ(t0, t1)x0 then a control given by u(t) = −B(t)Tϕ(t0, t)Tη0 would make the desired transfer.
Note that the matrix W(t0, t1) defined as above has the following properties:
The Controllability Gramian involves integration of the state-transition matrix of a system. A simpler condition for controllability is a rank condition analogous to the Kalman rank condition for time-invariant systems.
Consider a continuous-time linear system Σ smoothly varying in an interval [t0, t]:
The state-transition matrix ϕ is also smooth. Introduce the n × m matrix-valued function M0(t) = ϕ(t0, t)B(t) and define
Consider the matrix of matrix-valued functions obtained by listing all the columns of the matrix M (Mi, for i = 0, 1, ... , k):
If there exists a t̅ ∈ [t0, t] and a nonnegative integer k such that rank M(k)(t̅) = n, then Σ is controllable. [3]
If Σ is also analytically varying in an interval [t0, t], then Σ is controllable on every nontrivial subinterval of [t0, t] if and only if there exists a t̅ ∈ [t0, t] and a nonnegative integer k such that rank M(k)(ti) = n. [3]
The above methods can still be complex to check, since it involves the computation of the state-transition matrix ϕ. Another equivalent condition is defined as follow. Let B0(t) = B(t), and for each i ≥ 0, define In this case, each Bi is obtained directly from the data (A(t), B(t)). The system is controllable if there exists a t̅ ∈ [t0, t] and a nonnegative integer k such that rank([B0(t̅), B1(t̅), ... , Bk(t̅)]) = n. [3]
Consider a system varying analytically in (−∞, ∞) and matrices Then and since this matrix has rank 3, the system is controllable on every nontrivial interval of .
Consider the continuous linear time-invariant system where
The n × nr controllability matrix is given by The system is controllable if the controllability matrix has full row rank (i.e. rank(R) = n.
For a discrete-time linear state-space system (i.e. time variable ) the state equation is where A is an n × n matrix and B is a n × r matrix (i.e. u is r inputs collected in a r × 1 vector). The test for controllability is that the n × nr matrix has full row rank (i.e., rank(C) = n). That is, if the system is controllable, C will have n columns that are linearly independent; if n columns of C are linearly independent, each of the n states is reachable by giving the system proper inputs through the variable u(k).
Given the state x(0) at an initial time, arbitrarily denoted as k = 0, the state equation gives x(1) = Ax(0) + Bu(0), then x(2) = Ax(1) + Bu(1) =A2x(0) + ABu(0) + Bu(1), and so on with repeated back-substitutions of the state variable, eventually yielding or equivalently
Imposing any desired value of the state vector x(n) on the left side, this can always be solved for the stacked vector of control vectors if and only if the matrix of matrices at the beginning of the right side has full row rank.
For example, consider the case when n = 2 and r = 1 (i.e. only one control input). Thus, B and AB are 2 × 1 vectors. If [BAB] has rank 2 (full rank), and so B and AB are linearly independent and span the entire plane. If the rank is 1, then B and AB are collinear and do not span the plane.
Assume that the initial state is zero.
At time k = 0: At time k = 1:
At time k = 0 all of the reachable states are on the line formed by the vector B.
At time k = 1 all of the reachable states are linear combinations of AB and B. If the system is controllable then these two vectors can span the entire plane and can be done so for time k = 2.
The assumption made that the initial state is zero is merely for convenience. Clearly if all states can be reached from the origin then any state can be reached from another state (merely a shift in coordinates).
This example holds for all positive n, but the case of n = 2 is easier to visualize.
Consider an analogy to the previous example system. You are sitting in your car on an infinite, flat plane and facing north. The goal is to reach any point in the plane by driving a distance in a straight line, come to a full stop, turn, and driving another distance, again, in a straight line.
If your car has no steering then you can only drive straight, which means you can only drive on a line (in this case the north-south line since you started facing north). The lack of steering case would be analogous to when the rank of C is 1 (the two distances you drove are on the same line).
Now, if your car did have steering then you could easily drive to any point in the plane and this would be the analogous case to when the rank of C is 2.
If you change this example to n = 3 then the analogy would be flying in space to reach any position in 3D space (ignoring the orientation of the aircraft).
You are allowed to:
Although the 3-dimensional case is harder to visualize, the concept of controllability is still analogous.
Nonlinear systems in the control-affine form
are locally accessible about if the accessibility distribution spans space, when equals the dimension of and R is given by: [4]
Here, is the repeated Lie bracket operation defined by
The controllability matrix for linear systems in the previous section can in fact be derived from this equation.
When control authority on a linear dynamical system is exerted through a choice of a time-varying feedback gain matrix , the system
is nonlinear, in that products of control parameters and states are present. The accessibility distribution is, as before,
It is clear that for the system to be controllable, it is necessary that has full column rank. It turns out that this condition is also sufficient. However, the (optimal) control strategy explained earlier needs to be modified so that the trajectory when applying an optimal input to steer the system between the specified states, does not pass through the origin, else the regulating input cannot be written in feedback form . The controllability, as well as the strong controllability of this bilinear system was proven in [5] .
Collective controllability represents the ability to steer linear dynamical systems that obey identical dynamics
where equals the dimension of , between specified starting and ending configurations by way of a common state feedback gain matrix , and thereby, each instantiating a control input
for , respectively.
The accessibility distribution having full column rank is trivially a necessary condition. It is also sufficient, and in fact, the collective is strongly controllable, in that it can be steered from an initial configuration
to any specified terminal configuration
provided , over any specified time interval through a choice of a common time-varying feedback gain matrix provided has full column rank [5] .
If a discrete control system is null-controllable, it means that there exists a controllable so that for some initial state . In other words, it is equivalent to the condition that there exists a matrix such that is nilpotent.
This can be easily shown by controllable-uncontrollable decomposition.
Output controllability is the related notion for the output of the system (denoted y in the previous equations); the output controllability describes the ability of an external input to move the output from any initial condition to any final condition in a finite time interval. It is not necessary that there is any relationship between state controllability and output controllability. In particular:
For a linear continuous-time system, like the example above, described by matrices A, B, C, and D, the m × (n+1)routput controllability matrix has full row rank (i.e. rank m) if and only if the system is output controllable. [1] : 742
In systems with limited control authority, it is often no longer possible to move any initial state to any final state inside the controllable subspace. This phenomenon is caused by constraints on the input that could be inherent to the system (e.g. due to saturating actuator) or imposed on the system for other reasons (e.g. due to safety-related concerns). The controllability of systems with input and state constraints is studied in the context of reachability [6] and viability theory. [7]
In the so-called behavioral system theoretic approach due to Willems (see people in systems and control), models considered do not directly define an input–output structure. In this framework systems are described by admissible trajectories of a collection of variables, some of which might be interpreted as inputs or outputs.
A system is then defined to be controllable in this setting, if any past part of a behavior (trajectory of the external variables) can be concatenated with any future trajectory of the behavior in such a way that the concatenation is contained in the behavior, i.e. is part of the admissible system behavior. [8] : 151
A slightly weaker notion than controllability is that of stabilizability. A system is said to be stabilizable when all uncontrollable state variables can be made to have stable dynamics. Thus, even though some of the state variables cannot be controlled (as determined by the controllability test above) all the state variables will still remain bounded during the system's behavior. [9]
Let T ∈ Т and x ∈ X (where X is the set of all possible states and Т is an interval of time). The reachable set from x in time T is defined as: [3] where xz denotes that there exists a state transition from x to z in time T.
For autonomous systems the reachable set is given by: where R is the controllability matrix.
Claim—In terms of the reachable set, the system is controllable if and only if .
We have the following equalities: Considering that the system is controllable, the columns of R should be linearly independent. So:
A related set to the reachable set is the controllable set, defined by: The relation between reachability and controllability is presented by Sontag: [3]
Let the system be an n dimensional discrete-time-invariant system from the formula: where ϕ (final time, initial time, state variable, restrictions) is defined as the transition matrix of a state variable x from an initial time 0 to a final time n with some restrictions w).
It follows that the future state is in Rk(0) if and only if it is in , the image of the linear map R, defined as: which maps
When u = Km and X = Kn we identify R(A, B) with a n × nm matrix whose columns are B, AB, ... , An−1B in that order. If the system is controllable the rank of [BAB ... An−1B] is n. If this is true, the image of the linear map R is all of X. Based on that, we have: with