![]() | This article provides insufficient context for those unfamiliar with the subject.(October 2009) |
Sequential quadratic programming (SQP) is an iterative method for constrained nonlinear optimization, also known as Lagrange-Newton method. SQP methods are used on mathematical problems for which the objective function and the constraints are twice continuously differentiable, but not necessarily convex.
SQP methods solve a sequence of optimization subproblems, each of which optimizes a quadratic model of the objective subject to a linearization of the constraints. If the problem is unconstrained, then the method reduces to Newton's method for finding a point where the gradient of the objective vanishes. If the problem has only equality constraints, then the method is equivalent to applying Newton's method to the first-order optimality conditions, or Karush–Kuhn–Tucker conditions, of the problem.
Consider a nonlinear programming problem of the form:
where , , and .
The Lagrangian for this problem is [1]
where and are Lagrange multipliers.
If the problem does not have inequality constrained (that is, ), the first-order optimality conditions (aka KKT conditions) are a set of nonlinear equations that may be iteratively solved with Newton's Method. Newton's method linearizes the KKT conditions at the current iterate , which provides the following expression for the Newton step :
,
where denotes the Hessian matrix of the Lagrangian, and and are the primal and dual displacements, respectively. Note that the Lagrangian Hessian is not explicitly inverted and a linear system is solved instead.
When the Lagrangian Hessian is not positive definite, the Newton step may not exist or it may characterize a stationary point that is not a local minimum (but rather, a local maximum or a saddle point). In this case, the Lagrangian Hessian must be regularized, for example one can add a multiple of the identity to it such that the resulting matrix is positive definite.
An alternative view for obtaining the primal-dual displacements is to construct and solve a local quadratic model of the original problem at the current iterate:
The optimality conditions of this quadratic problem correspond to the linearized KKT conditions of the original problem. Note that the term in the expression above may be left out, since it is constant under the operator.
In the presence of inequality constraints (), we can naturally extend the definition of the local quadratic model introduced in the previous section:
The SQP algorithm starts from the initial iterate . At each iteration, the QP subproblem is built and solved; the resulting Newton step direction is used to update current iterate:
This process is repeated for until some convergence criterion is satisfied.
Practical implementations of the SQP algorithm are significantly more complex than its basic version above. To adapt SQP for real-world applications, the following challenges must be addressed:
To overcome these challenges, various strategies are typically employed:
These strategies can be combined in numerous ways, resulting in a diverse range of SQP methods.
SQP methods have been implemented in well known numerical environments such as MATLAB and GNU Octave. There also exist numerous software libraries, including open source:
and commercial