Norm (mathematics)

Last updated

In mathematics, a norm is a function from a real or complex vector space to the non-negative real numbers that behaves in certain ways like the distance from the origin: it commutes with scaling, obeys a form of the triangle inequality, and is zero only at the origin. In particular, the Euclidean distance in a Euclidean space is defined by a norm on the associated Euclidean vector space, called the Euclidean norm, the 2-norm, or, sometimes, the magnitude of the vector. This norm can be defined as the square root of the inner product of a vector with itself.

Contents

A seminorm satisfies the first two properties of a norm, but may be zero for vectors other than the origin. [1] A vector space with a specified norm is called a normed vector space. In a similar manner, a vector space with a seminorm is called a seminormed vector space.

The term pseudonorm has been used for several related meanings. It may be a synonym of "seminorm". [1] A pseudonorm may satisfy the same axioms as a norm, with the equality replaced by an inequality "" in the homogeneity axiom. [2] It can also refer to a norm that can take infinite values, [3] or to certain functions parametrised by a directed set. [4]

Definition

Given a vector space over a subfield of the complex numbers a norm on is a real-valued function with the following properties, where denotes the usual absolute value of a scalar : [5]

  1. Subadditivity/Triangle inequality: for all
  2. Absolute homogeneity: for all and all scalars
  3. Positive definiteness/positiveness [6] /Point-separating: for all if then
    • Because property (2.) implies some authors replace property (3.) with the equivalent condition: for every if and only if

A seminorm on is a function that has properties (1.) and (2.) [7] so that in particular, every norm is also a seminorm (and thus also a sublinear functional). However, there exist seminorms that are not norms. Properties (1.) and (2.) imply that if is a norm (or more generally, a seminorm) then and that also has the following property:

  1. Non-negativity: [6] for all

Some authors include non-negativity as part of the definition of "norm", although this is not necessary. Although this article defined "positive" to be a synonym of "positive definite", some authors instead define "positive" to be a synonym of "non-negative"; [8] these definitions are not equivalent.

Equivalent norms

Suppose that and are two norms (or seminorms) on a vector space Then and are called equivalent, if there exist two positive real constants and with such that for every vector

The relation " is equivalent to " is reflexive, symmetric ( implies ), and transitive and thus defines an equivalence relation on the set of all norms on The norms and are equivalent if and only if they induce the same topology on [9] Any two norms on a finite-dimensional space are equivalent but this does not extend to infinite-dimensional spaces. [9]

Notation

If a norm is given on a vector space then the norm of a vector is usually denoted by enclosing it within double vertical lines: Such notation is also sometimes used if is only a seminorm. For the length of a vector in Euclidean space (which is an example of a norm, as explained below), the notation with single vertical lines is also widespread.

Examples

Every (real or complex) vector space admits a norm: If is a Hamel basis for a vector space then the real-valued map that sends (where all but finitely many of the scalars are ) to is a norm on [10] There are also a large number of norms that exhibit additional properties that make them useful for specific problems.

Absolute-value norm

The absolute value

is a norm on the one-dimensional vector space formed by the real or complex numbers.

Any norm on a one-dimensional vector space is equivalent (up to scaling) to the absolute value norm, meaning that there is a norm-preserving isomorphism of vector spaces where is either or and norm-preserving means that This isomorphism is given by sending to a vector of norm which exists since such a vector is obtained by multiplying any non-zero vector by the inverse of its norm.

Euclidean norm

On the -dimensional Euclidean space the intuitive notion of length of the vector is captured by the formula [11]

This is the Euclidean norm, which gives the ordinary distance from the origin to the point X—a consequence of the Pythagorean theorem. This operation may also be referred to as "SRSS", which is an acronym for the square root of the sum of squares. [12]

The Euclidean norm is by far the most commonly used norm on [11] but there are other norms on this vector space as will be shown below. However, all these norms are equivalent in the sense that they all define the same topology on finite-dimensional spaces.

The inner product of two vectors of a Euclidean vector space is the dot product of their coordinate vectors over an orthonormal basis. Hence, the Euclidean norm can be written in a coordinate-free way as

The Euclidean norm is also called the quadratic norm, norm, [13] norm, 2-norm, or square norm; see space. It defines a distance function called the Euclidean length, distance, or distance.

The set of vectors in whose Euclidean norm is a given positive constant forms an -sphere.

Euclidean norm of complex numbers

The Euclidean norm of a complex number is the absolute value (also called the modulus) of it, if the complex plane is identified with the Euclidean plane This identification of the complex number as a vector in the Euclidean plane, makes the quantity (as first suggested by Euler) the Euclidean norm associated with the complex number. For , the norm can also be written as where is the complex conjugate of

Quaternions and octonions

There are exactly four Euclidean Hurwitz algebras over the real numbers. These are the real numbers the complex numbers the quaternions and lastly the octonions where the dimensions of these spaces over the real numbers are respectively. The canonical norms on and are their absolute value functions, as discussed previously.

The canonical norm on of quaternions is defined by

for every quaternion in This is the same as the Euclidean norm on considered as the vector space Similarly, the canonical norm on the octonions is just the Euclidean norm on

Finite-dimensional complex normed spaces

On an -dimensional complex space the most common norm is

In this case, the norm can be expressed as the square root of the inner product of the vector and itself:

where is represented as a column vector and denotes its conjugate transpose.

This formula is valid for any inner product space, including Euclidean and complex spaces. For complex spaces, the inner product is equivalent to the complex dot product. Hence the formula in this case can also be written using the following notation:

Taxicab norm or Manhattan norm

The name relates to the distance a taxi has to drive in a rectangular street grid (like that of the New York borough of Manhattan) to get from the origin to the point

The set of vectors whose 1-norm is a given constant forms the surface of a cross polytope, which has dimension equal to the dimension of the vector space minus 1. The Taxicab norm is also called the norm. The distance derived from this norm is called the Manhattan distance or distance.

The 1-norm is simply the sum of the absolute values of the columns.

In contrast,

is not a norm because it may yield negative results.

p-norm

Let be a real number. The -norm (also called -norm) of vector is [11]

For we get the taxicab norm, for we get the Euclidean norm, and as approaches the -norm approaches the infinity norm or maximum norm:

The -norm is related to the generalized mean or power mean.

For the -norm is even induced by a canonical inner product meaning that for all vectors This inner product can be expressed in terms of the norm by using the polarization identity. On this inner product is the Euclidean inner product defined by

while for the space associated with a measure space which consists of all square-integrable functions, this inner product is

This definition is still of some interest for but the resulting function does not define a norm, [14] because it violates the triangle inequality. What is true for this case of even in the measurable analog, is that the corresponding class is a vector space, and it is also true that the function

(without th root) defines a distance that makes into a complete metric topological vector space. These spaces are of great interest in functional analysis, probability theory and harmonic analysis. However, aside from trivial cases, this topological vector space is not locally convex, and has no continuous non-zero linear forms. Thus the topological dual space contains only the zero functional.

The partial derivative of the -norm is given by

The derivative with respect to therefore, is

where denotes Hadamard product and is used for absolute value of each component of the vector.

For the special case of this becomes

or

Maximum norm (special case of: infinity norm, uniform norm, or supremum norm)

||
x
||
[?]
=
1
{\displaystyle \|x\|_{\infty }=1} Vector norm sup.svg

If is some vector such that then:

The set of vectors whose infinity norm is a given constant, forms the surface of a hypercube with edge length

Zero norm

In probability and functional analysis, the zero norm induces a complete metric topology for the space of measurable functions and for the F-space of sequences with F–norm [15] Here we mean by F-norm some real-valued function on an F-space with distance such that The F-norm described above is not a norm in the usual sense because it lacks the required homogeneity property.

Hamming distance of a vector from zero

In metric geometry, the discrete metric takes the value one for distinct points and zero otherwise. When applied coordinate-wise to the elements of a vector space, the discrete distance defines the Hamming distance , which is important in coding and information theory. In the field of real or complex numbers, the distance of the discrete metric from zero is not homogeneous in the non-zero point; indeed, the distance from zero remains one as its non-zero argument approaches zero. However, the discrete distance of a number from zero does satisfy the other properties of a norm, namely the triangle inequality and positive definiteness. When applied component-wise to vectors, the discrete distance from zero behaves like a non-homogeneous "norm", which counts the number of non-zero components in its vector argument; again, this non-homogeneous "norm" is discontinuous.

In signal processing and statistics, David Donoho referred to the zero"norm" with quotation marks. Following Donoho's notation, the zero "norm" of is simply the number of non-zero coordinates of or the Hamming distance of the vector from zero. When this "norm" is localized to a bounded set, it is the limit of -norms as approaches 0. Of course, the zero "norm" is not truly a norm, because it is not positive homogeneous. Indeed, it is not even an F-norm in the sense described above, since it is discontinuous, jointly and severally, with respect to the scalar argument in scalar–vector multiplication and with respect to its vector argument. Abusing terminology, some engineers[ who? ] omit Donoho's quotation marks and inappropriately call the number-of-non-zeros function the norm, echoing the notation for the Lebesgue space of measurable functions.

Infinite dimensions

The generalization of the above norms to an infinite number of components leads to and spaces for with norms

for complex-valued sequences and functions on respectively, which can be further generalized (see Haar measure). These norms are also valid in the limit as , giving a supremum norm, and are called and

Any inner product induces in a natural way the norm

Other examples of infinite-dimensional normed vector spaces can be found in the Banach space article.

Generally, these norms do not give the same topologies. For example, an infinite-dimensional space gives a strictly finer topology than an infinite-dimensional space when

Composite norms

Other norms on can be constructed by combining the above; for example

is a norm on

For any norm and any injective linear transformation we can define a new norm of equal to

In 2D, with a rotation by 45° and a suitable scaling, this changes the taxicab norm into the maximum norm. Each applied to the taxicab norm, up to inversion and interchanging of axes, gives a different unit ball: a parallelogram of a particular shape, size, and orientation.

In 3D, this is similar but different for the 1-norm (octahedrons) and the maximum norm (prisms with parallelogram base).

There are examples of norms that are not defined by "entrywise" formulas. For instance, the Minkowski functional of a centrally-symmetric convex body in (centered at zero) defines a norm on (see § Classification of seminorms: absolutely convex absorbing sets below).

All the above formulas also yield norms on without modification.

There are also norms on spaces of matrices (with real or complex entries), the so-called matrix norms.

In abstract algebra

Let be a finite extension of a field of inseparable degree and let have algebraic closure If the distinct embeddings of are then the Galois-theoretic norm of an element is the value As that function is homogeneous of degree , the Galois-theoretic norm is not a norm in the sense of this article. However, the -th root of the norm (assuming that concept makes sense) is a norm. [16]

Composition algebras

The concept of norm in composition algebras does not share the usual properties of a norm since null vectors are allowed. A composition algebra consists of an algebra over a field an involution and a quadratic form called the "norm".

The characteristic feature of composition algebras is the homomorphism property of : for the product of two elements and of the composition algebra, its norm satisfies In the case of division algebras and the composition algebra norm is the square of the norm discussed above. In those cases the norm is a definite quadratic form. In the split algebras the norm is an isotropic quadratic form.

Properties

For any norm on a vector space the reverse triangle inequality holds:

If is a continuous linear map between normed spaces, then the norm of and the norm of the transpose of are equal. [17]

For the norms, we have Hölder's inequality [18]

A special case of this is the Cauchy–Schwarz inequality: [18]

Illustrations of unit circles in different norms. Vector norms.svg
Illustrations of unit circles in different norms.

Every norm is a seminorm and thus satisfies all properties of the latter. In turn, every seminorm is a sublinear function and thus satisfies all properties of the latter. In particular, every norm is a convex function.

Equivalence

The concept of unit circle (the set of all vectors of norm 1) is different in different norms: for the 1-norm, the unit circle is a square oriented as a diamond; for the 2-norm (Euclidean norm), it is the well-known unit circle; while for the infinity norm, it is an axis-aligned square. For any -norm, it is a superellipse with congruent axes (see the accompanying illustration). Due to the definition of the norm, the unit circle must be convex and centrally symmetric (therefore, for example, the unit ball may be a rectangle but cannot be a triangle, and for a -norm).

In terms of the vector space, the seminorm defines a topology on the space, and this is a Hausdorff topology precisely when the seminorm can distinguish between distinct vectors, which is again equivalent to the seminorm being a norm. The topology thus defined (by either a norm or a seminorm) can be understood either in terms of sequences or open sets. A sequence of vectors is said to converge in norm to if as Equivalently, the topology consists of all sets that can be represented as a union of open balls. If is a normed space then [19]

Two norms and on a vector space are called equivalent if they induce the same topology, [9] which happens if and only if there exist positive real numbers and such that for all

For instance, if on then [20]

In particular,

That is,

If the vector space is a finite-dimensional real or complex one, all norms are equivalent. On the other hand, in the case of infinite-dimensional vector spaces, not all norms are equivalent.

Equivalent norms define the same notions of continuity and convergence and for many purposes do not need to be distinguished. To be more precise the uniform structure defined by equivalent norms on the vector space is uniformly isomorphic.

Classification of seminorms: absolutely convex absorbing sets

All seminorms on a vector space can be classified in terms of absolutely convex absorbing subsets of To each such subset corresponds a seminorm called the gauge of defined as

where is the infimum, with the property that

Conversely:

Any locally convex topological vector space has a local basis consisting of absolutely convex sets. A common method to construct such a basis is to use a family of seminorms that separates points: the collection of all finite intersections of sets turns the space into a locally convex topological vector space so that every p is continuous.

Such a method is used to design weak and weak* topologies.

norm case:

Suppose now that contains a single since is separating, is a norm, and is its open unit ball. Then is an absolutely convex bounded neighbourhood of 0, and is continuous.
The converse is due to Andrey Kolmogorov: any locally convex and locally bounded topological vector space is normable. Precisely:
If is an absolutely convex bounded neighbourhood of 0, the gauge (so that is a norm.

See also

Related Research Articles

<span class="mw-page-title-main">Absolute value</span> Distance from zero to a number

In mathematics, the absolute value or modulus of a real number , denoted , is the non-negative value of without regard to its sign. Namely, if is a positive number, and if is negative, and . For example, the absolute value of 3 is 3, and the absolute value of −3 is also 3. The absolute value of a number may be thought of as its distance from zero.

In mathematics, more specifically in functional analysis, a Banach space is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vectors and is complete in the sense that a Cauchy sequence of vectors always converges to a well-defined limit that is within the space.

The Hahn–Banach theorem is a central tool in functional analysis. It allows the extension of bounded linear functionals defined on a vector subspace of some vector space to the whole space, and it also shows that there are "enough" continuous linear functionals defined on every normed vector space to make the study of the dual space "interesting". Another version of the Hahn–Banach theorem is known as the Hahn–Banach separation theorem or the hyperplane separation theorem, and has numerous uses in convex geometry.

<span class="mw-page-title-main">Inner product space</span> Generalization of the dot product; used to define Hilbert spaces

In mathematics, an inner product space is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, often denoted with angle brackets such as in . Inner products allow formal definitions of intuitive geometric notions, such as lengths, angles, and orthogonality of vectors. Inner product spaces generalize Euclidean vector spaces, in which the inner product is the dot product or scalar product of Cartesian coordinates. Inner product spaces of infinite dimension are widely used in functional analysis. Inner product spaces over the field of complex numbers are sometimes referred to as unitary spaces. The first usage of the concept of a vector space with an inner product is due to Giuseppe Peano, in 1898.

<span class="mw-page-title-main">Normed vector space</span> Vector space on which a distance is defined

In mathematics, a normed vector space or normed space is a vector space over the real or complex numbers on which a norm is defined. A norm is a generalization of the intuitive notion of "length" in the physical world. If is a vector space over , where is a field equal to or to , then a norm on is a map , typically denoted by , satisfying the following four axioms:

  1. Non-negativity: for every ,.
  2. Positive definiteness: for every , if and only if is the zero vector.
  3. Absolute homogeneity: for every and ,
  4. Triangle inequality: for every and ,

The Cauchy–Schwarz inequality is an upper bound on the inner product between two vectors in an inner product space in terms of the product of the vector norms. It is considered one of the most important and widely used inequalities in mathematics.

In mathematics, the Lp spaces are function spaces defined using a natural generalization of the p-norm for finite-dimensional vector spaces. They are sometimes called Lebesgue spaces, named after Henri Lebesgue, although according to the Bourbaki group they were first introduced by Frigyes Riesz.

<span class="mw-page-title-main">Multivariate normal distribution</span> Generalization of the one-dimensional normal distribution to higher dimensions

In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional (univariate) normal distribution to higher dimensions. One definition is that a random vector is said to be k-variate normally distributed if every linear combination of its k components has a univariate normal distribution. Its importance derives mainly from the multivariate central limit theorem. The multivariate normal distribution is often used to describe, at least approximately, any set of (possibly) correlated real-valued random variables each of which clusters around a mean value.

In mechanics and geometry, the 3D rotation group, often denoted SO(3), is the group of all rotations about the origin of three-dimensional Euclidean space under the operation of composition.

In mathematics, particularly in functional analysis, a seminorm is a vector space norm that need not be positive definite. Seminorms are intimately connected with convex sets: every seminorm is the Minkowski functional of some absorbing disk and, conversely, the Minkowski functional of any such set is a seminorm.

<span class="mw-page-title-main">Standard basis</span> Vectors whose components are all 0 except one that is 1

In mathematics, the standard basis (also called natural basis or canonical basis) of a coordinate vector space (such as or ) is the set of vectors, each of whose components are all zero, except one that equals 1. For example, in the case of the Euclidean plane formed by the pairs (x, y) of real numbers, the standard basis is formed by the vectors

In functional analysis and related areas of mathematics, locally convex topological vector spaces (LCTVS) or locally convex spaces are examples of topological vector spaces (TVS) that generalize normed spaces. They can be defined as topological vector spaces whose topology is generated by translations of balanced, absorbent, convex sets. Alternatively they can be defined as a vector space with a family of seminorms, and a topology can be defined in terms of that family. Although in general such spaces are not necessarily normable, the existence of a convex local base for the zero vector is strong enough for the Hahn–Banach theorem to hold, yielding a sufficiently rich theory of continuous linear functionals.

<span class="mw-page-title-main">Real coordinate space</span> Space formed by the n-tuples of real numbers

In mathematics, the real coordinate space or real coordinate n-space, of dimension n, denoted Rn or , is the set of all ordered n-tuples of real numbers, that is the set of all sequences of n real numbers, also known as coordinate vectors. Special cases are called the real lineR1, the real coordinate planeR2, and the real coordinate three-dimensional spaceR3. With component-wise addition and scalar multiplication, it is a real vector space.

In statistics, the theory of minimum norm quadratic unbiased estimation (MINQUE) was developed by C. R. Rao. MINQUE is a theory alongside other estimation methods in estimation theory, such as the method of moments or maximum likelihood estimation. Similar to the theory of best linear unbiased estimation, MINQUE is specifically concerned with linear regression models. The method was originally conceived to estimate heteroscedastic error variance in multiple linear regression. MINQUE estimators also provide an alternative to maximum likelihood estimators or restricted maximum likelihood estimators for variance components in mixed effects models. MINQUE estimators are quadratic forms of the response variable and are used to estimate a linear function of the variances.

In functional analysis and related areas of mathematics, a sequence space is a vector space whose elements are infinite sequences of real or complex numbers. Equivalently, it is a function space whose elements are functions from the natural numbers to the field K of real or complex numbers. The set of all such functions is naturally identified with the set of all possible infinite sequences with elements in K, and can be turned into a vector space under the operations of pointwise addition of functions and pointwise scalar multiplication. All sequence spaces are linear subspaces of this space. Sequence spaces are typically equipped with a norm, or at least the structure of a topological vector space.

In linear algebra, a sublinear function, also called a quasi-seminorm or a Banach functional, on a vector space is a real-valued function with only some of the properties of a seminorm. Unlike seminorms, a sublinear function does not have to be nonnegative-valued and also does not have to be absolutely homogeneous. Seminorms are themselves abstractions of the more well known notion of norms, where a seminorm has all the defining properties of a norm except that it is not required to map non-zero vectors to non-zero values.

In Euclidean space, the distance from a point to a plane is the distance between a given point and its orthogonal projection on the plane, the perpendicular distance to the nearest point on the plane.

In functional analysis, the dual norm is a measure of size for a continuous linear function defined on a normed vector space.

In mathematical analysis and its applications, a function of several real variables or real multivariate function is a function with more than one argument, with all arguments being real variables. This concept extends the idea of a function of a real variable to several variables. The "input" variables take real values, while the "output", also called the "value of the function", may be real or complex. However, the study of the complex-valued functions may be easily reduced to the study of the real-valued functions, by considering the real and imaginary parts of the complex function; therefore, unless explicitly specified, only real-valued functions will be considered in this article.

In functional analysis and related areas of mathematics, a metrizable topological vector space (TVS) is a TVS whose topology is induced by a metric. An LM-space is an inductive limit of a sequence of locally convex metrizable TVS.

References

  1. 1 2 Knapp, A.W. (2005). Basic Real Analysis. Birkhäuser. p.  . ISBN   978-0-817-63250-2.
  2. "Pseudo-norm - Encyclopedia of Mathematics". encyclopediaofmath.org. Retrieved 2022-05-12.
  3. "Pseudonorm". www.spektrum.de (in German). Retrieved 2022-05-12.
  4. Hyers, D. H. (1939-09-01). "Pseudo-normed linear spaces and Abelian groups". Duke Mathematical Journal. 5 (3). doi:10.1215/s0012-7094-39-00551-x. ISSN   0012-7094.
  5. Pugh, C.C. (2015). Real Mathematical Analysis. Springer. p.  page 28. ISBN   978-3-319-17770-0.Prugovečki, E. (1981). Quantum Mechanics in Hilbert Space. p.  page 20.
  6. 1 2 Kubrusly 2011, p. 200.
  7. Rudin, W. (1991). Functional Analysis. p. 25.
  8. Narici & Beckenstein 2011, pp. 120–121.
  9. 1 2 3 Conrad, Keith. "Equivalence of norms" (PDF). kconrad.math.uconn.edu. Retrieved September 7, 2020.
  10. Wilansky 2013, pp. 20–21.
  11. 1 2 3 Weisstein, Eric W. "Vector Norm". mathworld.wolfram.com. Retrieved 2020-08-24.
  12. Chopra, Anil (2012). Dynamics of Structures, 4th Ed. Prentice-Hall. ISBN   978-0-13-285803-8.
  13. Weisstein, Eric W. "Norm". mathworld.wolfram.com. Retrieved 2020-08-24.
  14. Except in where it coincides with the Euclidean norm, and where it is trivial.
  15. Rolewicz, Stefan (1987), Functional analysis and control theory: Linear systems, Mathematics and its Applications (East European Series), vol. 29 (Translated from the Polish by Ewa Bednarczuk ed.), Dordrecht; Warsaw: D. Reidel Publishing Co.; PWN—Polish Scientific Publishers, pp. xvi, 524, doi:10.1007/978-94-015-7758-8, ISBN   90-277-2186-6, MR   0920371, OCLC   13064804
  16. Lang, Serge (2002) [1993]. Algebra (Revised 3rd ed.). New York: Springer Verlag. p. 284. ISBN   0-387-95385-X.
  17. Trèves 2006, pp. 242–243.
  18. 1 2 Golub, Gene; Van Loan, Charles F. (1996). Matrix Computations (Third ed.). Baltimore: The Johns Hopkins University Press. p. 53. ISBN   0-8018-5413-X.
  19. Narici & Beckenstein 2011, pp. 107–113.
  20. "Relation between p-norms". Mathematics Stack Exchange.

Bibliography