Newton polygon

Last updated November 15, 2024

In mathematics, the Newton polygon is a tool for understanding the behaviour of polynomials over local fields, or more generally, over ultrametric fields. In the original case, the ultrametric field of interest was essentially the field of formal Laurent series in the indeterminate X, i.e. the field of fractions of the formal power series ring $K[[X]]$ , over $K$ , where $K$ was the real number or complex number field. This is still of considerable utility with respect to Puiseux expansions. The Newton polygon is an effective device for understanding the leading terms $aX^{r}$ of the power series expansion solutions to equations $P(F(X))=0$ where $P$ is a polynomial with coefficients in $K[X]$ , the polynomial ring; that is, implicitly defined algebraic functions. The exponents $r$ here are certain rational numbers, depending on the branch chosen; and the solutions themselves are power series in $K[[Y]]$ with $Y=X^{\frac {1}{d}}$ for a denominator $d$ corresponding to the branch. The Newton polygon gives an effective, algorithmic approach to calculating $d$ .

Definition

Construction of the Newton polygon of the polynomial 1 + 5 X + 1/5 X + 35 X + 25 X + 625 X with respect to the 5-adic valuation. Newton-polygon.gif — Construction of the Newton polygon of the polynomial 1 + 5 X + 1/5 X + 35 X + 25 X + 625 X with respect to the 5-adic valuation.

A priori, given a polynomial over a field, the behaviour of the roots (assuming it has roots) will be unknown. Newton polygons provide one technique for the study of the behaviour of the roots.

Let $K$ be a field endowed with a non-archimedean valuation $v_{K}:K\to \mathbb {R} \cup \{\infty \}$ , and let

f(x)=a_{n}x^{n}+\cdots +a_{1}x+a_{0}\in K[x],

with $a_{0}a_{n}\neq 0$ . Then the Newton polygon of $f$ is defined to be the lower boundary of the convex hull of the set of points $P_{i}=\left(i,v_{K}(a_{i})\right),$ ignoring the points with $a_{i}=0$ .

Restated geometrically, plot all of these points P_i on the xy-plane. Let's assume that the points indices increase from left to right (P₀ is the leftmost point, P_n is the rightmost point). Then, starting at P₀, draw a ray straight down parallel with the y-axis, and rotate this ray counter-clockwise until it hits the point P_k₁ (not necessarily P₁). Break the ray here. Now draw a second ray from P_k₁ straight down parallel with the y-axis, and rotate this ray counter-clockwise until it hits the point P_k₂. Continue until the process reaches the point P_n; the resulting polygon (containing the points P₀, P_k₁, P_k₂, ..., P_{k_m}, P_n) is the Newton polygon.

Another, perhaps more intuitive way to view this process is this : consider a rubber band surrounding all the points P₀, ..., P_n. Stretch the band upwards, such that the band is stuck on its lower side by some of the points (the points act like nails, partially hammered into the xy plane). The vertices of the Newton polygon are exactly those points.

For a neat diagram of this see Ch6 §3 of "Local Fields" by JWS Cassels, LMS Student Texts 3, CUP 1986. It is on p99 of the 1986 paperback edition.

Main theorem

With the notations in the previous section, the main result concerning the Newton polygon is the following theorem,^[1] which states that the valuation of the roots of $f$ are entirely determined by its Newton polygon:

Let $\mu _{1},\mu _{2},\ldots ,\mu _{r}$ be the slopes of the line segments of the Newton polygon of $f(x)$ (as defined above) arranged in increasing order, and let $\lambda _{1},\lambda _{2},\ldots ,\lambda _{r}$ be the corresponding lengths of the line segments projected onto the x-axis (i.e. if we have a line segment stretching between the points $P_{i}$ and $P_{j}$ then the length is $j-i$ ).

The $\mu _{i}$ are distinct;
$\sum _{i}\lambda _{i}=n$ ;
if $\alpha$ is a root of $f$ in $K$ , $v(\alpha )\in \{-\mu _{1},\ldots ,-\mu _{r}\}$ ;
for every $i$ , the number of roots of $f$ whose valuations are equal to $-\mu _{i}$ (counting multiplicities) is at most $\lambda _{i}$ , with equality if $f$ splits into the product of linear factors over $K$ .

Corollaries and applications

With the notation of the previous sections, we denote, in what follows, by $L$ the splitting field of $f$ over $K$ , and by $v_{L}$ an extension of $v_{K}$ to $L$ .

Newton polygon theorem is often used to show the irreducibility of polynomials, as in the next corollary for example:

Suppose that the valuation $v$ is discrete and normalized, and that the Newton polynomial of $f$ contains only one segment whose slope is $\mu$ and projection on the x-axis is $\lambda$ . If $\mu =a/n$ , with $a$ coprime to $n$ , then $f$ is irreducible over $K$ . In particular, since the Newton polygon of an Eisenstein polynomial consists of a single segment of slope $-{\frac {1}{n}}$ connecting $(0,1)$ and $(n,0)$ , Eisenstein criterion follows.

Indeed, by the main theorem, if $\alpha$ is a root of $f$ , $v_{L}(\alpha )=-a/n.$ If $f$ were not irreducible over $K$ , then the degree $d$ of $\alpha$ would be $<n$ , and there would hold $v_{L}(\alpha )\in {1 \over d}\mathbb {Z}$ . But this is impossible since $v_{L}(\alpha )=-a/n$ with $a$ coprime to $n$ .

Another simple corollary is the following:

Assume that $(K,v_{K})$ is Henselian. If the Newton polygon of $f$ fulfills $\lambda _{i}=1$ for some $i$ , then $f$ has a root in $K$ .

Proof: By the main theorem, $f$ must have a single root $\alpha$ whose valuation is $v_{L}(\alpha )=-\mu _{i}.$ In particular, $\alpha$ is separable over $K$ . If $\alpha$ does not belong to $K$ , $\alpha$ has a distinct Galois conjugate $\alpha '$ over $K$ , with $v_{L}(\alpha ')=v_{L}(\alpha )$ ,^[2] and $\alpha '$ is a root of $f$ , a contradiction.

More generally, the following factorization theorem holds:

Assume that $(K,v_{K})$ is Henselian. Then $f=A\,f_{1}\,f_{2}\cdots f_{r},$ , where $A\in K$ , $f_{i}\in K[X]$ is monic for every $i$ , the roots of $f_{i}$ are of valuation $-\mu _{i}$ , and $\deg(f_{i})=\lambda _{i}$ .^[3]

Moreover, $\mu _{i}=v_{K}(f_{i}(0))/\lambda _{i}$ , and if $v_{K}(f_{i}(0))$ is coprime to $\lambda _{i}$ , $f_{i}$ is irreducible over $K$ .

Proof: For every $i$ , denote by $f_{i}$ the product of the monomials $(X-\alpha )$ such that $\alpha$ is a root of $f$ and $v_{L}(\alpha )=-\mu _{i}$ . We also denote $f=AP_{1}^{k_{1}}P_{2}^{k_{2}}\cdots P_{s}^{k_{s}}$ the factorization of $f$ in $K[X]$ into prime monic factors $(A\in K).$ Let $\alpha$ be a root of $f_{i}$ . We can assume that $P_{1}$ is the minimal polynomial of $\alpha$ over $K$ . If $\alpha '$ is a root of $P_{1}$ , there exists a K-automorphism $\sigma$ of $L$ that sends $\alpha$ to $\alpha '$ , and we have $v_{L}(\sigma \alpha )=v_{L}(\alpha )$ since $K$ is Henselian. Therefore $\alpha '$ is also a root of $f_{i}$ . Moreover, every root of $P_{1}$ of multiplicity $\nu$ is clearly a root of $f_{i}$ of multiplicity $k_{1}\nu$ , since repeated roots share obviously the same valuation. This shows that $P_{1}^{k_{1}}$ divides $f_{i}.$ Let $g_{i}=f_{i}/P_{1}^{k_{1}}$ . Choose a root $\beta$ of $g_{i}$ . Notice that the roots of $g_{i}$ are distinct from the roots of $P_{1}$ . Repeat the previous argument with the minimal polynomial of $\beta$ over $K$ , assumed w.l.g. to be $P_{2}$ , to show that $P_{2}^{k_{2}}$ divides $g_{i}$ . Continuing this process until all the roots of $f_{i}$ are exhausted, one eventually arrives to $f_{i}=P_{1}^{k_{1}}\cdots P_{m}^{k_{m}}$ , with $m\leq s$ . This shows that $f_{i}\in K[X]$ , $f_{i}$ monic. But the $f_{i}$ are coprime since their roots have distinct valuations. Hence clearly $f=Af_{1}\cdot f_{2}\cdots f_{r}$ , showing the main contention. The fact that $\lambda _{i}=\deg(f_{i})$ follows from the main theorem, and so does the fact that $\mu _{i}=v_{K}(f_{i}(0))/\lambda _{i}$ , by remarking that the Newton polygon of $f_{i}$ can have only one segment joining $(0,v_{K}(f_{i}(0))$ to $(\lambda _{i},0=v_{K}(1))$ . The condition for the irreducibility of $f_{i}$ follows from the corollary above. (q.e.d.)

The following is an immediate corollary of the factorization above, and constitutes a test for the reducibility of polynomials over Henselian fields:

Assume that $(K,v_{K})$ is Henselian. If the Newton polygon does not reduce to a single segment $(\mu ,\lambda ),$ then $f$ is reducible over $K$ .

Other applications of the Newton polygon comes from the fact that a Newton Polygon is sometimes a special case of a Newton polytope, and can be used to construct asymptotic solutions of two-variable polynomial equations like $3x^{2}y^{3}-xy^{2}+2x^{2}y^{2}-x^{3}y=0.$

Symmetric function explanation

In the context of a valuation, we are given certain information in the form of the valuations of elementary symmetric functions of the roots of a polynomial, and require information on the valuations of the actual roots, in an algebraic closure. This has aspects both of ramification theory and singularity theory. The valid inferences possible are to the valuations of power sums, by means of Newton's identities.

History

Newton polygons are named after Isaac Newton, who first described them and some of their uses in correspondence from the year 1676 addressed to Henry Oldenburg.^[4]

Related Research Articles

In the mathematical field of representation theory, a weight of an algebra A over a field F is an algebra homomorphism from A to F, or equivalently, a one-dimensional representation of A over F. It is the algebra analogue of a multiplicative character of a group. The importance of the concept, however, stems from its application to representations of Lie algebras and hence also to representations of algebraic and Lie groups. In this context, a weight of a representation is a generalization of the notion of an eigenvalue, and the corresponding eigenspace is called a weight space.

In mathematics, the classical orthogonal polynomials are the most widely used orthogonal polynomials: the Hermite polynomials, Laguerre polynomials, Jacobi polynomials.

In mathematics and theoretical physics, the term quantum group denotes one of a few different kinds of noncommutative algebras with additional structure. These include Drinfeld–Jimbo type quantum groups, compact matrix quantum groups, and bicrossproduct quantum groups. Despite their name, they do not themselves have a natural group structure, though they are in some sense 'close' to a group.

In linear algebra, the Frobenius companion matrix of the monic polynomial $is the square matrix defined as$

<span class="mw-page-title-main">Semisimple Lie algebra</span> Direct sum of simple Lie algebras

In mathematics, a Lie algebra is semisimple if it is a direct sum of simple Lie algebras.

<span class="mw-page-title-main">Interior-point method</span> Algorithms for solving convex optimization problems

Interior-point methods are algorithms for solving linear and non-linear convex optimization problems. IPMs combine two advantages of previously-known algorithms:

Verma modules, named after Daya-Nand Verma, are objects in the representation theory of Lie algebras, a branch of mathematics.

In mathematics, Schur polynomials, named after Issai Schur, are certain symmetric polynomials in n variables, indexed by partitions, that generalize the elementary symmetric polynomials and the complete homogeneous symmetric polynomials. In representation theory they are the characters of polynomial irreducible representations of the general linear groups. The Schur polynomials form a linear basis for the space of all symmetric polynomials. Any product of Schur polynomials can be written as a linear combination of Schur polynomials with non-negative integral coefficients; the values of these coefficients is given combinatorially by the Littlewood–Richardson rule. More generally, skew Schur polynomials are associated with pairs of partitions and have similar properties to Schur polynomials.

In mathematics, Hua's lemma, named for Hua Loo-keng, is an estimate for exponential sums.

In probability theory, the inverse Gaussian distribution is a two-parameter family of continuous probability distributions with support on (0,∞).

In mathematics, the Jack function is a generalization of the Jack polynomial, introduced by Henry Jack. The Jack polynomial is a homogeneous, symmetric polynomial which generalizes the Schur and zonal polynomials, and is in turn generalized by the Heckman–Opdam polynomials and Macdonald polynomials.

A ratio distribution is a probability distribution constructed as the distribution of the ratio of random variables having two other known distributions. Given two random variables X and Y, the distribution of the random variable Z that is formed as the ratio Z = X/Y is a ratio distribution.

In mathematics, Macdonald polynomialsP_λ(x; t,q) are a family of orthogonal symmetric polynomials in several variables, introduced by Macdonald in 1987. He later introduced a non-symmetric generalization in 1995. Macdonald originally associated his polynomials with weights λ of finite root systems and used just one variable t, but later realized that it is more natural to associate them with affine root systems rather than finite root systems, in which case the variable t can be replaced by several different variables t=(t₁,...,t_k), one for each of the k orbits of roots in the affine root system. The Macdonald polynomials are polynomials in n variables x=(x₁,...,x_n), where n is the rank of the affine root system. They generalize many other families of orthogonal polynomials, such as Jack polynomials and Hall–Littlewood polynomials and Askey–Wilson polynomials, which in turn include most of the named 1-variable orthogonal polynomials as special cases. Koornwinder polynomials are Macdonald polynomials of certain non-reduced root systems. They have deep relationships with affine Hecke algebras and Hilbert schemes, which were used to prove several conjectures made by Macdonald about them.

The Jenkins–Traub algorithm for polynomial zeros is a fast globally convergent iterative polynomial root-finding method published in 1970 by Michael A. Jenkins and Joseph F. Traub. They gave two variants, one for general polynomials with complex coefficients, commonly known as the "CPOLY" algorithm, and a more complicated variant for the special case of polynomials with real coefficients, commonly known as the "RPOLY" algorithm. The latter is "practically a standard in black-box polynomial root-finders".

<span class="mw-page-title-main">Normal-inverse-gamma distribution</span>

In probability theory and statistics, the normal-inverse-gamma distribution is a four-parameter family of multivariate continuous probability distributions. It is the conjugate prior of a normal distribution with unknown mean and variance.

<span class="mw-page-title-main">Kostka number</span>

In mathematics, the Kostka number $is a non-negative integer that is equal to the number of semistandard Young tableaux of shape and weight . They were introduced by the mathematician Carl Kostka in his study of symmetric functions.$

In probability theory and statistics, the Poisson distribution is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time if these events occur with a known constant mean rate and independently of the time since the last event. It can also be used for the number of events in other types of intervals than time, and in dimension greater than 1.

A geometric stable distribution or geo-stable distribution is a type of leptokurtic probability distribution. Geometric stable distributions were introduced in Klebanov, L. B., Maniya, G. M., and Melamed, I. A. (1985). A problem of Zolotarev and analogs of infinitely divisible and stable distributions in a scheme for summing a random number of random variables. These distributions are analogues for stable distributions for the case when the number of summands is random, independent of the distribution of summand, and having geometric distribution. The geometric stable distribution may be symmetric or asymmetric. A symmetric geometric stable distribution is also referred to as a Linnik distribution. The Laplace distribution and asymmetric Laplace distribution are special cases of the geometric stable distribution. The Mittag-Leffler distribution is also a special case of a geometric stable distribution.

In theoretical physics, relativistic Lagrangian mechanics is Lagrangian mechanics applied in the context of special relativity and general relativity.

<span class="mw-page-title-main">Lie algebra extension</span> Creating a "larger" Lie algebra from a smaller one, in one of several ways

In the theory of Lie groups, Lie algebras and their representation theory, a Lie algebra extension $e$ is an enlargement of a given Lie algebra $g$ by another Lie algebra $h$ . Extensions arise in several ways. There is the trivial extension obtained by taking a direct sum of two Lie algebras. Other types are the split extension and the central extension. Extensions may arise naturally, for instance, when forming a Lie algebra from projective group representations. Such a Lie algebra will contain central charges.

References

↑ For an interesting demonstration based on hyperfields, see Matthew Baker, Oliver Lorscheid, (2021). Descartes' rule of signs, Newton polygons, and polynomials over hyperfields.Journal of Algebra, Volume 569, p. 416-441.
↑ Recall that in Henselian rings, any valuation extends uniquely to every algebraic extension of the base field. Hence $v_{K}$ extends uniquely to $v_{L}$ . But $v_{L}\circ \sigma$ is an extension of $v_{K}$ for every automorphism $\sigma$ of $L$ , therefore $v_{L}(\alpha ')=v_{L}\circ \sigma (\alpha )=v_{L}(\alpha ).$
↑ J. W. S. Cassels, Local Fields, Chap. 6, thm. 3.1.
↑ Egbert Brieskorn, Horst Knörrer (1986). Plane Algebraic Curves, pp. 370–383.

Goss, David (1996), Basic structures of function field arithmetic, Ergebnisse der Mathematik und ihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)], vol. 35, Berlin, New York: Springer-Verlag, doi:10.1007/978-3-642-61480-4, ISBN 978-3-540-61087-8, MR 1423131
Gouvêa, Fernando: p-adic numbers: An introduction. Springer Verlag 1993. p. 199.

External links

Applet drawing a Newton Polygon

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] For an interesting demonstration based on hyperfields, see Matthew Baker, Oliver Lorscheid, (2021). Descartes' rule of signs, Newton polygons, and polynomials over hyperfields.Journal of Algebra, Volume 569, p. 416-441.

[2] Recall that in Henselian rings, any valuation extends uniquely to every algebraic extension of the base field. Hence $v_{K}$ extends uniquely to $v_{L}$ . But $v_{L}\circ \sigma$ is an extension of $v_{K}$ for every automorphism $\sigma$ of $L$ , therefore $v_{L}(\alpha ')=v_{L}\circ \sigma (\alpha )=v_{L}(\alpha ).$

[3] J. W. S. Cassels, Local Fields, Chap. 6, thm. 3.1.

[4] Egbert Brieskorn, Horst Knörrer (1986). Plane Algebraic Curves, pp. 370–383.

[1]

[2]

[3]

[4]

v t e Sir Isaac Newton
Publications	Fluxions (1671) De Motu (1684) Principia (1687) Opticks (1704) Queries (1704) Arithmetica (1707) De Analysi (1711)
Other writings	Quaestiones (1661–1665) "standing on the shoulders of giants" (1675) Notes on the Jewish Temple (c. 1680) "General Scholium" (1713; "hypotheses non fingo" ) Ancient Kingdoms Amended (1728) Corruptions of Scripture (1754)
Contributions	Calculus fluxion Impact depth Inertia Newton disc Newton polygon Newton–Okounkov body Newton's reflector Newtonian telescope Newton scale Newton's metal Spectrum Structural coloration
Newtonianism	Bucket argument Newton's inequalities Newton's law of cooling Newton's law of universal gravitation post-Newtonian expansion parameterized gravitational constant Newton–Cartan theory Schrödinger–Newton equation Newton's laws of motion Kepler's laws Newtonian dynamics Newton's method in optimization Apollonius's problem truncated Newton method Gauss–Newton algorithm Newton's rings Newton's theorem about ovals Newton–Pepys problem Newtonian potential Newtonian fluid Classical mechanics Corpuscular theory of light Leibniz–Newton calculus controversy Newton's notation Rotating spheres Newton's cannonball Newton–Cotes formulas Newton's method generalized Gauss–Newton method Newton fractal Newton's identities Newton polynomial Newton's theorem of revolving orbits Newton–Euler equations Newton number kissing number problem Newton's quotient Parallelogram of force Newton–Puiseux theorem Absolute space and time Luminiferous aether Newtonian series table
Personal life	Woolsthorpe Manor (birthplace) Cranbury Park (home) Early life Later life Apple tree Religious views Occult studies Scientific Revolution Copernican Revolution
Relations	Catherine Barton (niece) John Conduitt (nephew-in-law) Isaac Barrow (professor) William Clarke (mentor) Benjamin Pulleyn (tutor) Roger Cotes (student) William Whiston (student) John Keill (disciple) William Stukeley (friend) William Jones (friend) Abraham de Moivre (friend)
Depictions	Newton by Blake (monotype) Newton by Paolozzi (sculpture) Isaac Newton Gargoyle Astronomers Monument
Namesake	Newton (unit) Newton's cradle Isaac Newton Institute Isaac Newton Medal Isaac Newton Telescope Isaac Newton Group of Telescopes XMM-Newton Sir Isaac Newton Sixth Form Statal Institute of Higher Education Isaac Newton Newton International Fellowship
Categories	Isaac Newton