Preconditioner

In mathematics, preconditioning is the application of a transformation, called the preconditioner, that conditions a given problem into a form that is more suitable for numerical solving methods. Preconditioning is typically related to reducing a condition number of the problem. The preconditioned problem is then usually solved by an iterative method.

Preconditioning for linear systems

In linear algebra and numerical analysis, a preconditioner $P$ of a matrix $A$ is a matrix such that $P^{-1}A$ has a smaller condition number than $A$. It is also common to call $T = P^{-1}$ the preconditioner, rather than $P$, since $P$ itself is rarely explicitly available. In modern preconditioning, the application of $T = P^{-1}$, i.e., multiplication of a column vector, or a block of column vectors, by $T = P^{-1}$, is commonly performed in a matrix-free fashion, i.e., where neither $P$, nor $P^{-1}$ (and often not even $A$) are explicitly available in matrix form.

Preconditioners are useful in iterative methods to solve a linear system $Ax = b$ for $x$, since the rate of convergence of most iterative linear solvers improves as the condition number of the matrix decreases as a result of preconditioning. Preconditioned iterative solvers typically outperform direct solvers, e.g., Gaussian elimination, for large, especially for sparse, matrices. Iterative solvers can be used as matrix-free methods, i.e., they become the only choice if the coefficient matrix $A$ is not stored explicitly but is accessed by evaluating matrix-vector products.

Description

Instead of solving the original linear system $Ax = b$ for $x$, one may consider the right preconditioned system

$$AP^{-1}Px = b$$

and solve

$$AP^{-1}y = b$$

for $y$ and

$$Px = y$$

for $x$.

Alternatively, one may solve the left preconditioned system

$$P^{-1}(Ax - b) = 0.$$

Both systems give the same solution as the original system as long as the preconditioner matrix $P$ is nonsingular. The left preconditioning is more traditional.

The two-sided preconditioned system

$$Q^{-1}AP^{-1}(Px) = Q^{-1}b$$

may be beneficial, e.g., to preserve the matrix symmetry: if the original matrix $A$ is real symmetric and the real preconditioners $Q$ and $P$ satisfy $Q^{\mathsf{T}} = P$, then the preconditioned matrix $Q^{-1}AP^{-1}$ is also symmetric. The two-sided preconditioning is common for diagonal scaling, where the preconditioners $Q$ and $P$ are diagonal and scaling is applied both to the columns and to the rows of the original matrix $A$, e.g., in order to decrease the dynamic range of the entries of the matrix.

The goal of preconditioning is reducing the condition number, e.g., of the left or right preconditioned system matrix $P^{-1}A$ or $AP^{-1}$. Small condition numbers benefit fast convergence of iterative solvers and improve stability of the solution with respect to perturbations in the system matrix and the right-hand side, e.g., allowing for more aggressive quantization of the matrix entries using lower computer precision.

The preconditioned matrix $P^{-1}A$ or $AP^{-1}$ is rarely explicitly formed. Only the action of applying the preconditioner solve operation $P^{-1}$ to a given vector may need to be computed.

Typically there is a trade-off in the choice of $P$. Since the operator $P^{-1}$ must be applied at each step of the iterative linear solver, it should have a small cost (computing time) of applying the $P^{-1}$ operation. The cheapest preconditioner would therefore be $P = I$, since then $P^{-1} = I$; clearly, this results in the original linear system and the preconditioner does nothing. At the other extreme, the choice $P = A$ gives $P^{-1}A = AP^{-1} = I$, which has the optimal condition number of 1 and requires a single iteration for convergence; however, in this case $P^{-1} = A^{-1}$, and applying the preconditioner is as difficult as solving the original system. One therefore chooses $P$ somewhere between these two extremes, in an attempt to achieve a minimal number of linear iterations while keeping the operator $P^{-1}$ as simple as possible. Some examples of typical preconditioning approaches are detailed below.
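
As a rough numerical illustration of this trade-off (a sketch with an assumed, randomly generated test matrix, not part of the original exposition), one can compare the condition numbers obtained for the two extremes and for a cheap diagonal choice of $P$:

    # Sketch: condition numbers for P = I, P = diag(A), and P = A (assumed test matrix).
    import numpy as np

    rng = np.random.default_rng(0)
    n = 200
    B = rng.standard_normal((n, n))
    A = np.diag(np.logspace(0, 4, n)) + 0.01 * (B + B.T)   # widely spread diagonal

    P_diag = np.diag(np.diag(A))                       # cheap choice in between: P = diag(A)
    print(np.linalg.cond(A))                           # P = I: condition number unchanged
    print(np.linalg.cond(np.linalg.solve(P_diag, A)))  # P = diag(A): cheap, much smaller kappa here
    print(np.linalg.cond(np.linalg.solve(A, A)))       # P = A: kappa = 1, but as costly as solving Ax = b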

Preconditioned iterative methods

Preconditioned iterative methods for $Ax - b = 0$ are, in most cases, mathematically equivalent to standard iterative methods applied to the preconditioned system $P^{-1}(Ax - b) = 0$. For example, the standard Richardson iteration for solving $Ax - b = 0$ is

$$x_{n+1} = x_n - \gamma_n (Ax_n - b), \quad n \ge 0.$$

Applied to the preconditioned system $P^{-1}(Ax - b) = 0$, it turns into a preconditioned method

$$x_{n+1} = x_n - \gamma_n P^{-1}(Ax_n - b), \quad n \ge 0.$$
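
A minimal sketch of the preconditioned Richardson iteration in Python (the test matrix, the Jacobi choice $P = \operatorname{diag}(A)$, and the fixed step size $\gamma_n = 0.5$ are illustrative assumptions):

    # Preconditioned Richardson iteration: x <- x - gamma * P^{-1} (A x - b).
    import numpy as np

    def preconditioned_richardson(A, b, apply_Pinv, gamma, x0=None, tol=1e-8, maxiter=1000):
        """apply_Pinv(r) returns P^{-1} r; the preconditioner is used only through its action."""
        x = np.zeros_like(b) if x0 is None else x0.copy()
        for _ in range(maxiter):
            r = A @ x - b                        # residual of the original system
            if np.linalg.norm(r) <= tol * np.linalg.norm(b):
                break
            x -= gamma * apply_Pinv(r)           # preconditioned update
        return x

    # Example use with P = diag(A) on an assumed nearly diagonal SPD test matrix;
    # gamma must satisfy gamma < 2 / lambda_max(P^{-1}A) for convergence.
    rng = np.random.default_rng(1)
    n = 100
    A = np.diag(np.linspace(1.0, 100.0, n)) + 0.01 * rng.standard_normal((n, n))
    A = (A + A.T) / 2
    b = rng.standard_normal(n)
    d = np.diag(A)
    x = preconditioned_richardson(A, b, apply_Pinv=lambda r: r / d, gamma=0.5)
    print(np.linalg.norm(A @ x - b))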

Examples of popular preconditioned iterative methods for linear systems include the preconditioned conjugate gradient method, the biconjugate gradient method, and the generalized minimal residual method. Iterative methods, which use scalar products to compute the iterative parameters, require corresponding changes in the scalar product together with substituting $P^{-1}(Ax - b) = 0$ for $Ax - b = 0$.

Matrix splitting

A stationary iterative method is determined by the matrix splitting $A = M - N$ and the iteration matrix $C = I - M^{-1}A$. Assuming that the system matrix $A$ and the splitting matrix $M$ are symmetric positive-definite and that the stationary iterative method is convergent, i.e., $\rho(C) < 1$,

the condition number $\kappa(M^{-1}A)$ is bounded above by

$$\kappa(M^{-1}A) \le \frac{1 + \rho(C)}{1 - \rho(C)}.$$
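
The bound can be checked numerically; the sketch below (an assumed example) uses the Jacobi splitting $M = \operatorname{diag}(A)$ of the one-dimensional discrete Laplacian, for which the bound happens to be attained:

    # Check kappa(M^{-1}A) <= (1 + rho(C)) / (1 - rho(C)) for the Jacobi splitting of the 1-D Laplacian.
    import numpy as np

    n = 50
    A = 2 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)   # SPD tridiagonal test matrix
    M = np.diag(np.diag(A))                                 # splitting matrix (Jacobi)
    C = np.eye(n) - np.linalg.solve(M, A)                   # iteration matrix C = I - M^{-1}A

    rho = max(abs(np.linalg.eigvals(C)))                    # spectral radius, must be < 1
    kappa = np.linalg.cond(np.linalg.solve(M, A))           # condition number of M^{-1}A
    print(rho, kappa, (1 + rho) / (1 - rho))                # bound holds (with equality here)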

Geometric interpretation

For a symmetric positive definite matrix $A$ the preconditioner $P$ is typically chosen to be symmetric positive definite as well. The preconditioned operator $P^{-1}A$ is then also symmetric positive definite, but with respect to the $P$-based scalar product. In this case, the desired effect in applying a preconditioner is to make the quadratic form of the preconditioned operator $P^{-1}A$ with respect to the $P$-based scalar product nearly spherical. [1]

Variable and non-linear preconditioning

Denoting $T = P^{-1}$, we highlight that preconditioning is practically implemented as multiplying some vector $r$ by $T$, i.e., computing the product $Tr$. In many applications, $T$ is not given as a matrix, but rather as an operator $T(r)$ acting on the vector $r$. Some popular preconditioners, however, change with $r$, and the dependence on $r$ may not be linear. Typical examples involve using non-linear iterative methods, e.g., the conjugate gradient method, as a part of the preconditioner construction. Such preconditioners may be practically very efficient; however, their behavior is hard to predict theoretically.

Random preconditioning

One interesting particular case of variable preconditioning is random preconditioning, e.g., multigrid preconditioning on random coarse grids. [2] If used in gradient descent methods, random preconditioning can be viewed as an implementation of stochastic gradient descent and can lead to faster convergence, compared to fixed preconditioning, since it breaks the asymptotic "zig-zag" pattern of gradient descent.

Spectrally equivalent preconditioning

The most common use of preconditioning is for the iterative solution of linear systems resulting from approximations of partial differential equations. The better the approximation quality, the larger the matrix size. In such a case, the goal of optimal preconditioning is, on the one hand, to make the spectral condition number of $P^{-1}A$ bounded from above by a constant independent of the matrix size, which is called spectrally equivalent preconditioning by D'yakonov. On the other hand, the cost of applying $P^{-1}$ should ideally be proportional (also independently of the matrix size) to the cost of multiplying $A$ by a vector.

Examples

Jacobi (or diagonal) preconditioner

The Jacobi preconditioner is one of the simplest forms of preconditioning, in which the preconditioner is chosen to be the diagonal of the matrix, $P = \operatorname{diag}(A)$. Assuming $A_{ii} \neq 0$ for all $i$, we get $P^{-1}_{ij} = \frac{\delta_{ij}}{A_{ii}}$. It is efficient for diagonally dominant matrices $A$. It is used in analysis software for beam problems or 1-D problems (e.g., STAAD.Pro).
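
A minimal sketch of Jacobi preconditioning with SciPy's conjugate gradient solver (the sparse test matrix is an assumed example; scipy.sparse.linalg.cg expects its preconditioner argument M to approximate $A^{-1}$, here supplied as the action of $\operatorname{diag}(A)^{-1}$ via a LinearOperator):

    # Jacobi (diagonal) preconditioning with SciPy's CG on an assumed sparse SPD test matrix.
    import numpy as np
    import scipy.sparse as sp
    import scipy.sparse.linalg as spla

    n = 1000
    diag = np.linspace(1.0, 1e4, n)                       # strongly varying diagonal
    A = sp.diags([diag, -np.ones(n - 1), -np.ones(n - 1)], [0, 1, -1], format="csr")
    b = np.ones(n)

    Pinv = spla.LinearOperator((n, n), matvec=lambda r: r / diag)   # action of diag(A)^{-1}

    x_plain, info_plain = spla.cg(A, b)           # unpreconditioned
    x_prec, info_prec = spla.cg(A, b, M=Pinv)     # Jacobi-preconditioned
    print(info_plain, info_prec)                  # 0 means the solver reports convergence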

SPAI

The sparse approximate inverse (SPAI) preconditioner minimises $\|AT - I\|_F$, where $\|\cdot\|_F$ is the Frobenius norm and $T = P^{-1}$ is from some suitably constrained set of sparse matrices. Under the Frobenius norm, this reduces to solving numerous independent least-squares problems (one for every column). The entries in $T$ must be restricted to some sparsity pattern or the problem remains as difficult and time-consuming as finding the exact inverse of $A$. The method was introduced by M. J. Grote and T. Huckle together with an approach to selecting sparsity patterns. [3]
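
The following rough sketch illustrates the column-wise least-squares idea on an assumed toy example, using dense NumPy arrays and the sparsity pattern of $A$ itself; practical SPAI codes work with sparse data structures and more elaborate pattern selection:

    # SPAI-style sketch: minimize ||A T - I||_F column by column over a fixed sparsity pattern.
    import numpy as np

    def spai_like(A):
        """Return an approximate inverse T whose sparsity pattern matches that of A."""
        n = A.shape[0]
        T = np.zeros_like(A)
        for j in range(n):
            pattern = np.nonzero(A[:, j])[0]     # allowed nonzero positions in column j of T
            Asub = A[:, pattern]                 # columns of A hitting that pattern
            e_j = np.zeros(n)
            e_j[j] = 1.0
            # Independent least-squares problem for column j: min ||Asub m - e_j||_2
            m, *_ = np.linalg.lstsq(Asub, e_j, rcond=None)
            T[pattern, j] = m
        return T

    A = np.array([[ 4.0, -1.0,  0.0],
                  [-1.0,  4.0, -1.0],
                  [ 0.0, -1.0,  4.0]])
    T = spai_like(A)
    print(np.linalg.norm(A @ T - np.eye(3)))     # small residual in the Frobenius norm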

Other preconditioners

Preconditioning for eigenvalue problems

Eigenvalue problems can be framed in several alternative ways, each leading to its own preconditioning. The traditional preconditioning is based on the so-called spectral transformations. Knowing (approximately) the targeted eigenvalue, one can compute the corresponding eigenvector by solving the related homogeneous linear system, thus allowing the use of preconditioning for linear systems. Finally, formulating the eigenvalue problem as optimization of the Rayleigh quotient brings preconditioned optimization techniques to the scene. [4]

Spectral transformations

By analogy with linear systems, for an eigenvalue problem $Ax = \lambda x$ one may be tempted to replace the matrix $A$ with the matrix $P^{-1}A$ using a preconditioner $P$. However, this makes sense only if the sought eigenvectors of $A$ and $P^{-1}A$ are the same. This is the case for spectral transformations.

The most popular spectral transformation is the so-called shift-and-invert transformation, where for a given scalar $\alpha$, called the shift, the original eigenvalue problem $Ax = \lambda x$ is replaced with the shift-and-invert problem $(A - \alpha I)^{-1}x = \mu x$. The eigenvectors are preserved, and one can solve the shift-and-invert problem by an iterative solver, e.g., the power iteration. This gives the inverse iteration, which normally converges to the eigenvector corresponding to the eigenvalue closest to the shift $\alpha$. The Rayleigh quotient iteration is a shift-and-invert method with a variable shift.
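
A minimal sketch of inverse iteration, i.e., power iteration applied to the shift-and-invert operator (the test matrix and the shift are assumed for illustration; the shifted matrix is factored once and the factorization is reused):

    # Inverse iteration: converges to the eigenpair whose eigenvalue is closest to the shift alpha.
    import numpy as np
    from scipy.linalg import lu_factor, lu_solve

    def inverse_iteration(A, alpha, iters=50):
        n = A.shape[0]
        lu = lu_factor(A - alpha * np.eye(n))      # factor the shifted matrix once
        x = np.random.default_rng(0).standard_normal(n)
        for _ in range(iters):
            x = lu_solve(lu, x)                    # apply (A - alpha I)^{-1}
            x /= np.linalg.norm(x)
        return x @ A @ x, x                        # Rayleigh quotient as the eigenvalue estimate

    A = np.diag([1.0, 3.0, 7.0, 10.0]) + 0.01      # small symmetric test matrix (assumed example)
    lam, x = inverse_iteration(A, alpha=6.5)
    print(lam)                                     # close to the eigenvalue nearest the shift 6.5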

Spectral transformations are specific for eigenvalue problems and have no analogs for linear systems. They require accurate numerical calculation of the transformation involved, which becomes the main bottleneck for large problems.

General preconditioning

To make a close connection to linear systems, let us suppose that the targeted eigenvalue $\lambda_\star$ is known (approximately). Then one can compute the corresponding eigenvector from the homogeneous linear system $(A - \lambda_\star I)x = 0$. Using the concept of left preconditioning for linear systems, we obtain $T(A - \lambda_\star I)x = 0$, where $T$ is the preconditioner, which we can try to solve using the Richardson iteration

$$x_{n+1} = x_n - \gamma_n T(A - \lambda_\star I)x_n, \quad n \ge 0.$$

The ideal preconditioning [4]

The Moore–Penrose pseudoinverse $T = (A - \lambda_\star I)^{+}$ is the preconditioner which makes the Richardson iteration above converge in one step with $\gamma_n = 1$, since $I - (A - \lambda_\star I)^{+}(A - \lambda_\star I)$, denoted by $P_{\lambda_\star}$, is the orthogonal projector onto the eigenspace corresponding to $\lambda_\star$. The choice $T = (A - \lambda_\star I)^{+}$ is impractical for three independent reasons. First, $\lambda_\star$ is actually not known, although it can be replaced with its approximation $\tilde\lambda_\star$. Second, the exact Moore–Penrose pseudoinverse requires the knowledge of the eigenvector, which we are trying to find. This can be somewhat circumvented by the use of the Jacobi–Davidson preconditioner $T = (I - \tilde{x}\tilde{x}^{*})(A - \tilde\lambda_\star I)^{-1}(I - \tilde{x}\tilde{x}^{*})$, where $\tilde{x}$ approximates the sought eigenvector. Last, but not least, this approach requires accurate numerical solution of a linear system with the system matrix $A - \tilde\lambda_\star I$, which becomes as expensive for large problems as the shift-and-invert method above. If the solution is not accurate enough, step two may be redundant.

Practical preconditioning

Let us first replace the theoretical value $\lambda_\star$ in the Richardson iteration above with its current approximation $\lambda_n$ to obtain a practical algorithm

$$x_{n+1} = x_n - \gamma_n T(Ax_n - \lambda_n x_n), \quad n \ge 0.$$

A popular choice is $\lambda_n = \rho(x_n)$, using the Rayleigh quotient function $\rho(\cdot)$. Practical preconditioning may be as trivial as just using $T = (\operatorname{diag}(A))^{-1}$ or $T = (\operatorname{diag}(A - \lambda_n I))^{-1}$. For some classes of eigenvalue problems the efficiency of $T \approx A^{-1}$ has been demonstrated, both numerically and theoretically. The choice $T \approx A^{-1}$ allows one to easily utilize for eigenvalue problems the vast variety of preconditioners developed for linear systems.
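
A minimal sketch of such a practical iteration with $T = (\operatorname{diag}(A))^{-1}$, the Rayleigh quotient as $\lambda_n$, and $\gamma_n = 1$ (the test matrix and these particular choices are illustrative assumptions):

    # Practical preconditioned iteration x <- x - T (A x - rho(x) x) with T = diag(A)^{-1}.
    import numpy as np

    def preconditioned_eigen_richardson(A, iters=200):
        n = A.shape[0]
        d = np.diag(A)                             # T = diag(A)^{-1}, applied entrywise
        x = np.random.default_rng(0).standard_normal(n)
        x /= np.linalg.norm(x)
        for _ in range(iters):
            rho = x @ A @ x                        # Rayleigh quotient (x is unit-norm)
            r = A @ x - rho * x                    # eigenvalue residual
            x = x - r / d                          # preconditioned update, gamma_n = 1
            x /= np.linalg.norm(x)
        return x @ A @ x, x

    # SPD test matrix with dominant diagonal (assumed example); converges to an extreme eigenpair.
    A = np.diag(np.arange(1.0, 101.0))
    A[0, 1] = A[1, 0] = 0.1
    lam, x = preconditioned_eigen_richardson(A)
    print(lam)                                     # approximately the smallest eigenvalue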

Due to the changing value $\lambda_n$, a comprehensive theoretical convergence analysis is much more difficult, compared to the linear-systems case, even for the simplest methods, such as the Richardson iteration.

Preconditioning in optimization

[Figure: Illustration of gradient descent]

In optimization, preconditioning is typically used to accelerate first-order optimization algorithms.

Description

For example, to find a local minimum of a real-valued function $F(x)$ using gradient descent, one takes steps proportional to the negative of the gradient (or of the approximate gradient) of the function at the current point:

$$x_{n+1} = x_n - \gamma_n \nabla F(x_n), \quad n \ge 0.$$

The preconditioner is applied to the gradient:

$$x_{n+1} = x_n - \gamma_n P^{-1}\nabla F(x_n), \quad n \ge 0.$$

Preconditioning here can be viewed as changing the geometry of the vector space with the goal of making the level sets look like circles. [5] In this case the preconditioned gradient points closer to the extremum, as in the figure, which speeds up the convergence.

Connection to linear systems

The minimum of a quadratic function

$$F(x) = \tfrac{1}{2}x^{\mathsf{T}}Ax - x^{\mathsf{T}}b,$$

where $x$ and $b$ are real column-vectors and $A$ is a real symmetric positive-definite matrix, is exactly the solution of the linear equation $Ax = b$. Since $\nabla F(x) = Ax - b$, the preconditioned gradient descent method of minimizing $F(x)$ is

$$x_{n+1} = x_n - \gamma_n P^{-1}(Ax_n - b), \quad n \ge 0.$$

This is the preconditioned Richardson iteration for solving a system of linear equations.
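
The equivalence can be seen directly in a short sketch (the SPD test matrix, the Jacobi preconditioner, and the fixed step size are assumed for illustration): the preconditioned gradient step and the preconditioned Richardson step are literally the same update.

    # Preconditioned gradient descent on F(x) = 1/2 x^T A x - x^T b is preconditioned Richardson.
    import numpy as np

    rng = np.random.default_rng(2)
    n = 50
    B = rng.standard_normal((n, n))
    A = np.diag(np.linspace(1.0, 1e3, n)) + 0.02 * (B + B.T)   # SPD test matrix (assumed example)
    b = rng.standard_normal(n)
    Pinv = 1.0 / np.diag(A)                  # Jacobi preconditioner P = diag(A), applied entrywise

    x = np.zeros(n)
    gamma = 0.5                              # fixed step size, small enough for convergence here
    for _ in range(200):
        grad = A @ x - b                     # gradient of F, which is also the linear-system residual
        x -= gamma * Pinv * grad             # preconditioned gradient step = preconditioned Richardson step
    print(np.linalg.norm(A @ x - b))         # small residual: both viewpoints give the same iterates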

Connection to eigenvalue problems

The minimum of the Rayleigh quotient

$$\rho(x) = \frac{x^{\mathsf{T}}Ax}{x^{\mathsf{T}}x},$$

where $x$ is a real non-zero column-vector and $A$ is a real symmetric positive-definite matrix, is the smallest eigenvalue of $A$, while the minimizer is the corresponding eigenvector. Since $\nabla\rho(x)$ is proportional to $Ax - \rho(x)x$, the preconditioned gradient descent method of minimizing $\rho(x)$ is

$$x_{n+1} = x_n - \gamma_n P^{-1}(Ax_n - \rho(x_n)x_n), \quad n \ge 0.$$

This is an analog of preconditioned Richardson iteration for solving eigenvalue problems.

Variable preconditioning

In many cases, it may be beneficial to change the preconditioner at some or even every step of an iterative algorithm in order to accommodate a changing shape of the level sets, as in

$$x_{n+1} = x_n - \gamma_n P_n^{-1}\nabla F(x_n), \quad n \ge 0.$$

One should keep in mind, however, that constructing an efficient preconditioner is very often computationally expensive. The increased cost of updating the preconditioner can easily outweigh the positive effect of faster convergence. If $P_n^{-1} = H_n$, a BFGS approximation of the inverse Hessian matrix, this method is referred to as a quasi-Newton method.
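
As a hedged illustration of the quasi-Newton viewpoint (relying only on SciPy's stock optimizers, with the Rosenbrock test function as an assumed example), one may compare a method without curvature information to BFGS, which maintains a variable approximation $H_n$ of the inverse Hessian:

    # BFGS viewed as gradient descent with a variable preconditioner H_n (approximate inverse Hessian).
    import numpy as np
    from scipy.optimize import minimize, rosen, rosen_der

    x0 = np.full(5, 1.3)
    result_cg = minimize(rosen, x0, jac=rosen_der, method="CG")       # no Hessian approximation
    result_bfgs = minimize(rosen, x0, jac=rosen_der, method="BFGS")   # variable preconditioner H_n
    print(result_cg.nit, result_bfgs.nit)      # BFGS typically needs fewer iterations here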

References

  1. Shewchuk, Jonathan Richard (August 4, 1994). "An Introduction to the Conjugate Gradient Method Without the Agonizing Pain" (PDF).
  2. Bouwmeester, Henricus; Dougherty, Andrew; Knyazev, Andrew V. (2015). "Nonsymmetric Preconditioning for Conjugate Gradient and Steepest Descent Methods". Procedia Computer Science. 51: 276–285. doi:10.1016/j.procs.2015.05.241.
  3. Grote, M. J.; Huckle, T. (1997). "Parallel Preconditioning with Sparse Approximate Inverses". SIAM Journal on Scientific Computing. 18 (3): 838–853. doi:10.1137/S1064827594276552.
  4. Knyazev, Andrew V. (1998). "Preconditioned eigensolvers - an oxymoron?". Electronic Transactions on Numerical Analysis. 7: 104–123.
  5. Himmelblau, David M. (1972). Applied Nonlinear Programming. New York: McGraw-Hill. pp. 78–83. ISBN 0-07-028921-2.
