Equation solving

Last updated October 22, 2024

{\overset {}{\underset {}{x={\frac {-b\pm {\sqrt {b^{2}-4ac}}}{2a}}}}}

The quadratic formula, the symbolic solution of the quadratic equation

ax 2 + bx + c = 0

In mathematics, to solve an equation is to find its solutions, which are the values (numbers, functions, sets, etc.) that fulfill the condition stated by the equation, consisting generally of two expressions related by an equals sign. When seeking a solution, one or more variables are designated as unknowns . A solution is an assignment of values to the unknown variables that makes the equality in the equation true. In other words, a solution is a value or a collection of values (one for each unknown) such that, when substituted for the unknowns, the equation becomes an equality. A solution of an equation is often called a root of the equation, particularly but not only for polynomial equations. The set of all solutions of an equation is its solution set.

For example, the equation $x + y = 2 x - 1$ is solved for the unknown $x$ by the expression $x = y + 1$ , because substituting $y + 1$ for $x$ in the equation results in $(y + 1) + y = 2(y + 1) - 1$ , a true statement. It is also possible to take the variable $y$ to be the unknown, and then the equation is solved by $y = x - 1$ . Or $x$ and $y$ can both be treated as unknowns, and then there are many solutions to the equation; a symbolic solution is $(x, y) = (a + 1, a)$ , where the variable $a$ may take any value. Instantiating a symbolic solution with specific numbers gives a numerical solution; for example, $a = 0$ gives $(x, y) = (1, 0)$ (that is, $x = 1, y = 0$ ), and $a = 1$ gives $(x, y) = (2, 1)$ .

The distinction between known variables and unknown variables is generally made in the statement of the problem, by phrases such as "an equation in $x$ and $y$ ", or "solve for $x$ and $y$ ", which indicate the unknowns, here $x$ and $y$ . However, it is common to reserve $x$ , $y$ , $z$ , ... to denote the unknowns, and to use $a$ , $b$ , $c$ , ... to denote the known variables, which are often called parameters. This is typically the case when considering polynomial equations, such as quadratic equations. However, for some problems, all variables may assume either role.

Depending on the context, solving an equation may consist to find either any solution (finding a single solution is enough), all solutions, or a solution that satisfies further properties, such as belonging to a given interval. When the task is to find the solution that is the best under some criterion, this is an optimization problem. Solving an optimization problem is generally not referred to as "equation solving", as, generally, solving methods start from a particular solution for finding a better solution, and repeating the process until finding eventually the best solution.

Overview

One general form of an equation is

f\left(x_{1},\dots ,x_{n}\right)=c,

where $f$ is a function, $x 1, ..., x n$ are the unknowns, and $c$ is a constant. Its solutions are the elements of the inverse image (fiber)

f^{-1}(c)={\bigl \{}(a_{1},\dots ,a_{n})\in D\mid f\left(a_{1},\dots ,a_{n}\right)=c{\bigr \}},

where $D$ is the domain of the function $f$ . The set of solutions can be the empty set (there are no solutions), a singleton (there is exactly one solution), finite, or infinite (there are infinitely many solutions).

For example, an equation such as

3x+2y=21z,

with unknowns $x, y$ and $z$ , can be put in the above form by subtracting $21 z$ from both sides of the equation, to obtain

3x+2y-21z=0

In this particular case there is not just one solution, but an infinite set of solutions, which can be written using set builder notation as

{\bigl \{}(x,y,z)\mid 3x+2y-21z=0{\bigr \}}.

One particular solution is $x = 0, y = 0, z = 0$ . Two other solutions are $x = 3, y = 6, z = 1$ , and $x = 8, y = 9, z = 2$ . There is a unique plane in three-dimensional space which passes through the three points with these coordinates, and this plane is the set of all points whose coordinates are solutions of the equation.

Solution sets

The solution set of a given set of equations or inequalities is the set of all its solutions, a solution being a tuple of values, one for each unknown, that satisfies all the equations or inequalities. If the solution set is empty, then there are no values of the unknowns that satisfy simultaneously all equations and inequalities.

For a simple example, consider the equation

x^{2}=2.

This equation can be viewed as a Diophantine equation, that is, an equation for which only integer solutions are sought. In this case, the solution set is the empty set, since 2 is not the square of an integer. However, if one searches for real solutions, there are two solutions, $\sqrt 2$ and $- \sqrt 2$ ; in other words, the solution set is ${\sqrt 2, - \sqrt 2}$ .

When an equation contains several unknowns, and when one has several equations with more unknowns than equations, the solution set is often infinite. In this case, the solutions cannot be listed. For representing them, a parametrization is often useful, which consists of expressing the solutions in terms of some of the unknowns or auxiliary variables. This is always possible when all the equations are linear.

Such infinite solution sets can naturally be interpreted as geometric shapes such as lines, curves (see picture), planes, and more generally algebraic varieties or manifolds. In particular, algebraic geometry may be viewed as the study of solution sets of algebraic equations.

Methods of solution

The methods for solving equations generally depend on the type of equation, both the kind of expressions in the equation and the kind of values that may be assumed by the unknowns. The variety in types of equations is large, and so are the corresponding methods. Only a few specific types are mentioned below.

In general, given a class of equations, there may be no known systematic method (algorithm) that is guaranteed to work. This may be due to a lack of mathematical knowledge; some problems were only solved after centuries of effort. But this also reflects that, in general, no such method can exist: some problems are known to be unsolvable by an algorithm, such as Hilbert's tenth problem, which was proved unsolvable in 1970.

For several classes of equations, algorithms have been found for solving them, some of which have been implemented and incorporated in computer algebra systems, but often require no more sophisticated technology than pencil and paper. In some other cases, heuristic methods are known that are often successful but that are not guaranteed to lead to success.

Brute force, trial and error, inspired guess

If the solution set of an equation is restricted to a finite set (as is the case for equations in modular arithmetic, for example), or can be limited to a finite number of possibilities (as is the case with some Diophantine equations), the solution set can be found by brute force, that is, by testing each of the possible values (candidate solutions). It may be the case, though, that the number of possibilities to be considered, although finite, is so huge that an exhaustive search is not practically feasible; this is, in fact, a requirement for strong encryption methods.

As with all kinds of problem solving, trial and error may sometimes yield a solution, in particular where the form of the equation, or its similarity to another equation with a known solution, may lead to an "inspired guess" at the solution. If a guess, when tested, fails to be a solution, consideration of the way in which it fails may lead to a modified guess.

Elementary algebra

Equations involving linear or simple rational functions of a single real-valued unknown, say $x$ , such as

8x+7=4x+35\quad {\text{or}}\quad {\frac {4x+9}{3x+4}}=2\,,

can be solved using the methods of elementary algebra.

Systems of linear equations

Smaller systems of linear equations can be solved likewise by methods of elementary algebra. For solving larger systems, algorithms are used that are based on linear algebra. See Gaussian elimination and numerical solution of linear systems.

Polynomial equations

Polynomial equations of degree up to four can be solved exactly using algebraic methods, of which the quadratic formula is the simplest example. Polynomial equations with a degree of five or higher require in general numerical methods (see below) or special functions such as Bring radicals, although some specific cases may be solvable algebraically, for example

4x^{5}-x^{3}-3=0

(by using the rational root theorem), and

x^{6}-5x^{3}+6=0\,,

(by using the substitution $x = z.mw-parser-output .frac{white-space:nowrap}.mw-parser-output .frac .num,.mw-parser-output .frac .den{font-size:80%;line-height:0;vertical-align:super}.mw-parser-output .frac .den{vertical-align:sub}.mw-parser-output .sr-only{border:0;clip:rect(0,0,0,0);clip-path:polygon(0px 0px,0px 0px,0px 0px);height:1px;margin:-1px;overflow:hidden;padding:0;position:absolute;width:1px}1⁄3$ , which simplifies this to a quadratic equation in $z$ ).

Diophantine equations

In Diophantine equations the solutions are required to be integers. In some cases a brute force approach can be used, as mentioned above. In some other cases, in particular if the equation is in one unknown, it is possible to solve the equation for rational-valued unknowns (see Rational root theorem), and then find solutions to the Diophantine equation by restricting the solution set to integer-valued solutions. For example, the polynomial equation

2x^{5}-5x^{4}-x^{3}-7x^{2}+2x+3=0\,

has as rational solutions $x = - ⁠ 1 / 2 ⁠$ and $x = 3$ , and so, viewed as a Diophantine equation, it has the unique solution $x = 3$ .

In general, however, Diophantine equations are among the most difficult equations to solve.

Inverse functions

In the simple case of a function of one variable, say, $h (x)$ , we can solve an equation of the form $h (x) = c$ for some constant $c$ by considering what is known as the inverse function of $h$ .

Given a function $h : A \to B$ , the inverse function, denoted $h -1$ and defined as $h -1 : B \to A$ , is a function such that

h^{-1}{\bigl (}h(x){\bigr )}=h{\bigl (}h^{-1}(x){\bigr )}=x\,.

Now, if we apply the inverse function to both sides of $h (x) = c$ , where $c$ is a constant value in $B$ , we obtain

{\begin{aligned}h^{-1}{\bigl (}h(x){\bigr )}&=h^{-1}(c)\\x&=h^{-1}(c)\\\end{aligned}}

and we have found the solution to the equation. However, depending on the function, the inverse may be difficult to be defined, or may not be a function on all of the set $B$ (only on some subset), and have many values at some point.

If just one solution will do, instead of the full solution set, it is actually sufficient if only the functional identity

h\left(h^{-1}(x)\right)=x

holds. For example, the projection $π 1 : R 2 \to R$ defined by $π 1 (x, y) = x$ has no post-inverse, but it has a pre-inverse $π -1 1$ defined by $π -1 1 (x) = (x, 0)$ . Indeed, the equation $π 1 (x, y) = c$ is solved by

(x,y)=\pi _{1}^{-1}(c)=(c,0).

Examples of inverse functions include the $n$ th root (inverse of $x n$ ); the logarithm (inverse of $a x$ ); the inverse trigonometric functions; and Lambert's $W$ function (inverse of $xe x$ ).

Factorization

If the left-hand side expression of an equation $P = 0$ can be factorized as $P = QR$ , the solution set of the original solution consists of the union of the solution sets of the two equations $Q = 0$ and $R = 0$ . For example, the equation

\tan x+\cot x=2

can be rewritten, using the identity $tan x cot x = 1$ as

{\frac {\tan ^{2}x-2\tan x+1}{\tan x}}=0,

which can be factorized into

{\frac {\left(\tan x-1\right)^{2}}{\tan x}}=0.

The solutions are thus the solutions of the equation $tan x = 1$ , and are thus the set

x={\tfrac {\pi }{4}}+k\pi ,\quad k=0,\pm 1,\pm 2,\ldots .

Numerical methods

With more complicated equations in real or complex numbers, simple methods to solve equations can fail. Often, root-finding algorithms like the Newton–Raphson method can be used to find a numerical solution to an equation, which, for some applications, can be entirely sufficient to solve some problem. There are also numerical methods for systems of linear equations.

Matrix equations

Equations involving matrices and vectors of real numbers can often be solved by using methods from linear algebra.

Differential equations

There is a vast body of methods for solving various kinds of differential equations, both numerically and analytically. A particular class of problem that can be considered to belong here is integration, and the analytic methods for solving this kind of problems are now called symbolic integration.^{[ citation needed ]} Solutions of differential equations can be implicit or explicit.^[1]

Related Research Articles

<span class="mw-page-title-main">Diophantine equation</span> Polynomial equation whose integer solutions are sought

In mathematics, a Diophantine equation is an equation, typically a polynomial equation in two or more unknowns with integer coefficients, for which only integer solutions are of interest. A linear Diophantine equation equates to a constant the sum of two or more monomials, each of degree one. An exponential Diophantine equation is one in which unknowns can appear in exponents.

In mathematics, an equation is a mathematical formula that expresses the equality of two expressions, by connecting them with the equals sign =. The word equation and its cognates in other languages may have subtly different meanings; for example, in French an équation is defined as containing one or more variables, while in English, any well-formed formula consisting of two expressions related with an equals sign is an equation.

<span class="mw-page-title-main">Fourier transform</span> Mathematical transform that expresses a function of time as a function of frequency

In physics, engineering and mathematics, the Fourier transform (FT) is an integral transform that takes a function as input and outputs another function that describes the extent to which various frequencies are present in the original function. The output of the transform is a complex-valued function of frequency. The term Fourier transform refers to both this complex-valued function and the mathematical operation. When a distinction needs to be made, the output of the operation is sometimes called the frequency domain representation of the original function. The Fourier transform is analogous to decomposing the sound of a musical chord into the intensities of its constituent pitches.

<span class="mw-page-title-main">Imaginary unit</span> Principal square root of −1

The imaginary unit or unit imaginary number is a solution to the quadratic equation $x 2 + 1 = 0.$ Although there is no real number with this property, $i$ can be used to extend the real numbers to what are called complex numbers, using addition and multiplication. A simple example of the use of $i$ in a complex number is $2 + 3 i .$

<span class="mw-page-title-main">Partial differential equation</span> Type of differential equation

In mathematics, a partial differential equation (PDE) is an equation which computes a function between various partial derivatives of a multivariable function.

Hilbert's tenth problem is the tenth on the list of mathematical problems that the German mathematician David Hilbert posed in 1900. It is the challenge to provide a general algorithm that, for any given Diophantine equation, can decide whether the equation has a solution with all unknowns taking integer values.

In mathematics, a system of linear equations is a collection of two or more linear equations involving the same variables. For example,

In mathematics, an implicit equation is a relation of the form $where R is a function of several variables. For example, the implicit equation of the unit circle is$

In mathematics, the inverse trigonometric functions are the inverse functions of the trigonometric functions, under suitably restricted domains. Specifically, they are the inverses of the sine, cosine, tangent, cotangent, secant, and cosecant functions, and are used to obtain an angle from any of the angle's trigonometric ratios. Inverse trigonometric functions are widely used in engineering, navigation, physics, and geometry.

In mathematics, a linear differential equation is a differential equation that is defined by a linear polynomial in the unknown function and its derivatives, that is an equation of the form $where a 0 (x), ..., a n (x) and b (x) are arbitrary differentiable functions that do not need to be linear, and y', ..., y (n) are the successive derivatives of an unknown function y of the variable x .$

In mathematics, an algebraic equation or polynomial equation is an equation of the form $, where P is a polynomial with coefficients in some field, often the field of the rational numbers. For example, is an algebraic equation with integer coefficients and$

In mathematics and its applications, a Sturm–Liouville problem is a second-order linear ordinary differential equation of the form $for given functions, and, together with some boundary conditions at extreme values of . The goals of a given Sturm-Liouville problem are:$

In mathematics, a differential equation is an equation that relates one or more unknown functions and their derivatives. In applications, the functions generally represent physical quantities, the derivatives represent their rates of change, and the differential equation defines a relationship between the two. Such relations are common; therefore, differential equations play a prominent role in many disciplines including engineering, physics, economics, and biology.

In mathematics, a stiff equation is a differential equation for which certain numerical methods for solving the equation are numerically unstable, unless the step size is taken to be extremely small. It has proven difficult to formulate a precise definition of stiffness, but the main idea is that the equation includes some terms that can lead to rapid variation in the solution.

In mathematics, the Rogers–Ramanujan identities are two identities related to basic hypergeometric series and integer partitions. The identities were first discovered and proved by Leonard James Rogers, and were subsequently rediscovered by Srinivasa Ramanujan some time before 1913. Ramanujan had no proof, but rediscovered Rogers's paper in 1917, and they then published a joint new proof. Issai Schur independently rediscovered and proved the identities.

In mathematics and computational science, the Euler method is a first-order numerical procedure for solving ordinary differential equations (ODEs) with a given initial value. It is the most basic explicit method for numerical integration of ordinary differential equations and is the simplest Runge–Kutta method. The Euler method is named after Leonhard Euler, who first proposed it in his book Institutionum calculi integralis.

In applied mathematics, a transcendental equation is an equation over the real numbers that is not algebraic, that is, if at least one of its sides describes a transcendental function. Examples include:

In probability and statistics, the quantile function outputs the value of a random variable such that its probability is less than or equal to an input probability value. Intuitively, the quantile function associates with a range at and below a probability input the likelihood that a random variable is realized in that range for some probability distribution. It is also called the percentile function, percent-point function, inverse cumulative distribution function or inverse distribution function.

Algebra is the branch of mathematics that studies certain abstract systems, known as algebraic structures, and the manipulation of statements within those systems. It is a generalization of arithmetic that introduces variables and algebraic operations other than the standard arithmetic operations such as addition and multiplication.

$<span class="mw-page-title-main">Rogers–Ramanujan continued fraction</span> Continued fraction closely related to the Rogers–Ramanujan identities$

The Rogers–Ramanujan continued fraction is a continued fraction discovered by Rogers (1894) and independently by Srinivasa Ramanujan, and closely related to the Rogers–Ramanujan identities. It can be evaluated explicitly for a broad class of values of its argument.

References

↑ Dennis G. Zill (15 March 2012). A First Course in Differential Equations with Modeling Applications. Cengage Learning. ISBN 978-1-285-40110-2.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[Zill2012-1] Dennis G. Zill (15 March 2012). A First Course in Differential Equations with Modeling Applications. Cengage Learning. ISBN 978-1-285-40110-2.

[1]