In physics, a variational principle is an alternative method for determining the state or dynamics of a physical system, by identifying it as an extremum (minimum, maximum or saddle point) of a function or functional. Variational methods are exploited in many modern software to simulate matter and light.
Since the development of analytical mechanics in the 18th century, the fundamental equations of physics have usually been established in terms of action principles, where the variational principle is applied to the action of a system in order to recover the fundamental equation of motion.
This article describes the historical development of such action principles and other variational methods applied in physics. See History of physics for an overview and Outline of the history of physics for related histories.
Variational principles are found among earlier ideas in surveying and optics. The rope stretchers of ancient Egypt stretched corded ropes between two points to measure the path which minimized the distance of separation, and Claudius Ptolemy, in his Geographia (Bk 1, Ch 2), emphasized that one must correct for "deviations from a straight course"; in ancient Greece Euclid states in his Catoptrica that, for the path of light reflecting from a mirror, the angle of incidence equals the angle of reflection; and Hero of Alexandria later showed that this path was the shortest length and least time. [1] : 580
In the static analysis of objects under forces but fixed at mechanical equilibrium, the principle of virtual work imagines tiny mathematical shifts away from equilibrium. Each shift does work—energy lost or gained—against the forces, but the sum of all these bits of virtual work must be zero. This principle was developed by Johann Bernoulli in a letter to Pierre Varignon in 1715, but never separately published. [2] : 23 Cornelius Lanczos uses a slightly different definition as the single postulate for all analytic mechanics, showing thereby the power of energy based variational principles over Newtonian mechanics. [2] : 87
In 1743 Jean le Rond d'Alembert generalized the concept we now call virtual work to dynamical systems with rigid constraints, like rods or string under tension, a form that became known as the d'Alembert principle. [3] : 190 In the case of static (in equilibrium) rigid bodies without friction, the principle of virtual work says the net work of all applied forces () under variation of positions () is zero:
A similar condition but valid for dynamics (systems in motion) introduces, for each force, the change in momentum :
which is d'Alembert's principle. [4] : 17
The earlier geometrical ideas in optics were generalized by Pierre de Fermat, who, in the 17th century, refined the principle to "light travels between two given points along the path of shortest time"; now known as the principle of least time or Fermat's principle. Fermat showed that principle predicts the observed law of refraction. His approach was metaphysical, arguing that Nature acts simply and economically. [1] : 580
In 1696 Johann Bernoulli posed a puzzle to European mathematicians: derive a curve for motion of a frictionless bead falling between a higher and a lower point in the least possible time. He named the curve the "brachistochrone", (from brachystos, "shortest", and chronos, "time") [5] : 31 Isaac Newton, Gottfried Wilhelm Leibniz and others contributed solutions, and in 1718 Johann Bernoulli published an analysis based on the solution created by his brother James Bernoulli. All of these works, especially the approach taken by the Bernoullis, involved reasoning about small deviations in the path taken by the falling bead. Thus this became the first application of the variational technique, albeit as a special-case rather than a general principle. [5] : 68
In 1744 [6] and 1746, [7] Pierre Louis Maupertuis generalized Fermat's concept to mechanics, [8] : 97 in the form of a principle of least action. Maupertuis argued metaphysically, he felt that "Nature is thrifty in all its actions", and applied the principle broadly:
The laws of movement and of rest deduced from this principle being precisely the same as those observed in nature, we can admire the application of it to all phenomena. The movement of animals, the vegetative growth of plants ... are only its consequences; and the spectacle of the universe becomes so much the grander, so much more beautiful, the worthier of its Author, when one knows that a small number of laws, most wisely established, suffice for all movements.
— Pierre Louis Maupertuis [9]
This notion of Maupertuis, although somewhat deterministic today, does capture much of the essence of variational mechanics.
In application to physics, Maupertuis suggested that the quantity to be minimized was the product of the duration (time) of movement within a system by the action; his definitions of action varied with the problems he discussed. [1] : 581 One form he used was called "vis viva",
which is the integral of twice what we now call the kinetic energy T of the system.
Leonhard Euler corresponded with Maupertuis from 1740 to 1744; [1] : 582 in 1744 Euler proposed a refined formulation of the least action principle in 1744. [10] He writes [11]
As Euler states, is the integral of the momentum over distance traveled (note that here contrary to usual notation denotes the squared velocity) which, in modern notation, equals the abbreviated action: [4] : 359
In rather general terms he wrote that "Since the fabric of the Universe is most perfect and is the work of a most wise Creator, nothing whatsoever takes place in the Universe in which some relation of maximum and minimum does not appear."
Euler continued to write on the topic; in his Reflexions sur quelques loix generales de la nature (1748), he called the quantity "effort". His expression corresponds to what we would now call potential energy, so that his statement of least action in statics is equivalent to the principle that a system of bodies at rest will adopt a configuration that minimizes total potential energy.
The first use of the term "method of variations" came in 1755 through the work of a young Joseph Louis Lagrange; Euler presented Lagrange's approach to the Berlin Academy in 1756 as the "calculus of variations". Unlike Euler, Lagrange's approach was purely analytic rather than geometrical. Lagrange introduced the idea of variation of entire curves or paths between the endpoints than of individual coordinates. For this he introduced a new form of a differential, written , that acts on integrals rather than acting on coordinates. [5] : 111 His notation continues to be used today. [1] : 583
The variational principle was not used to derive the equations of motion until almost 75 years later, when William Rowan Hamilton in 1834 and 1835 [12] applied the variational principle to the Lagrangian function (where T is the kinetic energy and V the potential energy of an object) to obtain what are now called the Euler–Lagrange equations. Hamilton believed his results were constrained by conservation of energy, which he called conservation of living force. [13] : 163
While few German scientists read English papers in this era, in 1836 the German mathematician Carl Gustav Jacobi read of Hamilton's work and immediately began new mathematical work, publishing ground breaking work on the variational principle in the following year. [14] Among Jacobi's results was the extension of Hamilton's method to time-dependent potentials (or "force functions" as they were known at that time). [13] : 201
Other extremal principles of classical mechanics were formulated, such as Carl Friedrich Gauss's 1829 principle of least constraint and its corollary, Heinrich Hertz's 1896 principle of least curvature.
Action principles were developed by trial and error over three centuries; the names of the principles are not self-describing. [15] Richard Feynman, through his PhD thesis [16] and later through his reinvention of the undergraduate physics course, reinvigorated the field of variational principles in physics. [15] In the process he upended the terminology. Feynman called Hamilton's principal function simply the "action" and Hamilton's principle he called "the principle of least action". [17] The table below summarizes the key terminology found in modern physics literature.
action | principle | ||||
---|---|---|---|---|---|
definition | historical name | modern name | definition | common name | Alternative name |
Hamilton's principle function [4] : 431 | action [15] [4] : 359 | Hamilton's principle [15] [18] [4] | Least action, [17] [19] : 46 Stationary action [20] | ||
action [4] : 359 | abbreviated [15] [4] : 359 or Maupertuis [18] action | Maupertuis's principle [18] [15] | Least action [4] : 356 | ||
The notation means variations on with fixed; means variation with constant energy. [18] The abbreviated action is sometimes labeled . [15] Some authors use "stationary action" or "least action" to mean any variational principle involving action. [2] : viii [21] : 92
In 1915 David Hilbert applied variational principles to derive the gravitational field equations of general relativity in agreement with Albert Einstein's derivation. [22] (Einstein and Hilbert discussed Einstein's work on general relativity in person and letters throughout 1915. [23] ) Hilbert's approach required accepting the variational principle as "axiomatic", a broadly accepted requirement today but questionable to the physicists of 1915. Hilbert's variations were based on what became known as the Einstein–Hilbert action, given by
where κ is Einstein gravitational constant, is the determinant of a spacetime Lorentz metric and is the scalar curvature.
Variational principles played decisive roles at critical times in the development of quantum mechanics.
Following Max Planck's proposal that quantum radiators explain the blackbody radiation spectrum and Albert Einstein hypothesis of quantum radiation to explain the photoelectric effect, Niels Bohr proposed quantized energy levels for the orbits in his model of the atom, thereby explaining the Balmer series for absorption of radiation by atoms. However this hypothesis involved no mechanical model. Arnold Sommerfeld then showed that quantization of the action of orbits for Hydrogen predicted the Balmer series, complete with relativistic corrections leading to fine structure in spectral lines. However, this approach could not be extended to atoms with more electrons and, more fundamentally, the quantum hypothesis itself had no explanation from this classical mechanics solution. [21] : 97
Combining Einstein's relativity and photoelectric effect results, De Broglie suggested that Sommerfeld's quantized action may relate to quantized wave effects; Erwin Schrödinger took up this idea, applying Hamilton's optico-mechanical analogy to connect the quantized action to Hamilton-Jacobi equations for the action. Hamilton's connection between light rays and light waves now became a connection between matter trajectories and de Broglie matter waves. [21] : 119 The resulting Schrödinger equation became the first successful quantum mechanics.
The work that built on Schrödinger's equation relied on analogies to Hamiltonian mechanics. In 1933 Paul Dirac published a paper seeking an alternative formulation based on Lagrangian mechanics. He was motivated by the power of the action principle and the relativistic invariance of the action itself. [24] Dirac was able to show that the wavefunctions probability amplitude at was related to the amplitude at through a complex exponential function of the action. [25] : 1025
In 1942, nearly a decade after Dirac's work, Richard Feynman built a new quantum mechanics formulation on the action principle. Feynman interpreted Dirac's formula as a physical recipe for the probability amplitude contributions from every possible path between and . These possibilities interfere; constructive interference gives the paths with the most amplitude. In the classical limit with large values of action compared to , the single classical path given by the action principle results. [25] : 1027
In 1950, Julian Schwinger revisited Dirac's Lagrangian paper to develop the action principle in a different direction. [25] : 1082 Unlike Feynman's focus on paths, Schwinger's approach was "differential" or local.
The Standard Model is defined in terms of a Lagrangian density that includes all known elementary particles, the Higgs field and three of the fundamental interactions (electromagnetism, weak interaction and strong interaction, not including gravitational interaction). Its formulation started in the 1970s and has successfully explained almost all experimental results related to microscopic physics. [26]
The breadth of physical phenomena subject to study by action principles lead scientists from all centuries to view these concepts as especially fundamental; the connection of two points by paths lead some to suggest a "purpose" to the selection of one particular path. [27] This teleological viewpoint runs from the earliest physics through Fermat, Maupertuis, and on up to Max Planck, without, however, any scientific backing. [21] : 162 The use of colorful language continues in the modern era with phrases like "Nature's command (to) Explore all paths!" [28] or "It isn't that a particle takes the path of least action but that it smells all the paths in the neighborhood...". [17] : II:19
Lord Rayleigh was the first to popularly adapt the variational principles for the search of eigenvalues and eigenvectors for the study of elasticity and classical waves in his 1877 Theory of Sound. [29] The Rayleigh method allows approximation of the fundamental frequencies without full knowledge of the material composition and without the requirement of computational power. [29] From 1903 to 1908, Walther Ritz introduced a series of improved methods for static and free vibration problems based on the optimization of an ansatz or trial function. Ritz demonstrated his use in the Euler–Bernoulli beam theory and the determination of Chladni figures. [29]
For years, Ritz works were poorly cited in Western Europe and would only become popular after Ritz death in 1909. [30] In Russia, physicists like Ivan Bubnov (in 1913) and Boris Galerkin (in 1915) would rediscover and popularize some of Ritz's methods from 1908. In 1940, Georgii I. Petrov improved these approximations. [30] These methods are now known under different names, including Bubnov–Galerkin, Petrov–Galerkin and Ritz–Galerkin methods. [29]
In 1911, Rayleigh complemented Ritz for his method for solving Chladni's problem, but complained for the lack of citation of his earlier work. However the similarity between Rayleigh's and Ritz's method has sometimes been challenged. [29] [31] [30] Ritz's methods are sometimes referred as Rayleigh–Ritz method or simply Ritz method, depending on the procedure. [29] [30] Ritz's method led to the development of finite element method for the numerical solution of partial differential equations in physics. [30]
The variational method of Ritz would found his use quantum mechanics with the development of Hellmann–Feynman theorem. The theorem was first discussed by Schrödinger in 1926, the first proof was given by Paul Güttinger in 1932, and later rediscovered independently by Wolfgang Pauli and Hans Hellmann in 1933, and by Feynman in 1939.[ citation needed ]
In quantum chemistry and condensed matter physics, variational methods were developed to study atoms, molecules, nuclei and solids under a quantum mechanical framework. Some of these include the use of Ritz methods for the determination of the spectra of the helium atom, 1930 Hartree–Fock method, 1964 density functional theory and variational Monte Carlo and 1992 density matrix renormalization group (DMRG).[ citation needed ]
In 2014, variational principles were part of a hybrid strategy, called noisy intermediate-scale quantum (NISQ) computing, to combine powerful but imperfect quantum computers coupled with classical computers. [32] The first proposals included a variational quantum eigensolver [33] to exploit quantum phenomena to simulate atoms and small molecules using variational methods and an approximate optimization algorithm. [34] [35]
Quantization is the systematic transition procedure from a classical understanding of physical phenomena to a newer understanding known as quantum mechanics. It is a procedure for constructing quantum mechanics from classical mechanics. A generalization involving infinite degrees of freedom is field quantization, as in the "quantization of the electromagnetic field", referring to photons as field "quanta". This procedure is basic to theories of atomic physics, chemistry, particle physics, nuclear physics, condensed matter physics, and quantum optics.
Quantum mechanics is a fundamental theory that describes the behavior of nature at and below the scale of atoms. It is the foundation of all quantum physics, which includes quantum chemistry, quantum field theory, quantum technology, and quantum information science.
The calculus of variations is a field of mathematical analysis that uses variations, which are small changes in functions and functionals, to find maxima and minima of functionals: mappings from a set of functions to the real numbers. Functionals are often expressed as definite integrals involving functions and their derivatives. Functions that maximize or minimize functionals may be found using the Euler–Lagrange equation of the calculus of variations.
Scientific laws or laws of science are statements, based on repeated experiments or observations, that describe or predict a range of natural phenomena. The term law has diverse usage in many cases across all fields of natural science. Laws are developed from data and can be further developed through mathematics; in all cases they are directly or indirectly based on empirical evidence. It is generally understood that they implicitly reflect, though they do not explicitly assert, causal relationships fundamental to reality, and are discovered rather than invented.
In physics, action is a scalar quantity that describes how the balance of kinetic versus potential energy of a physical system changes with trajectory. Action is significant because it is an input to the principle of stationary action, an approach to classical mechanics that is simpler for multiple objects. Action and the variational principle are used in Feynman's formulation of quantum mechanics and in general relativity. For systems with small values of action similar to the Planck constant, quantum effects are significant.
The path integral formulation is a description in quantum mechanics that generalizes the stationary action principle of classical mechanics. It replaces the classical notion of a single, unique classical trajectory for a system with a sum, or functional integral, over an infinity of quantum-mechanically possible trajectories to compute a quantum amplitude.
In theoretical physics and mathematical physics, analytical mechanics, or theoretical mechanics is a collection of closely related formulations of classical mechanics. Analytical mechanics uses scalar properties of motion representing the system as a whole—usually its kinetic energy and potential energy. The equations of motion are derived from the scalar quantity by some underlying principle about the scalar's variation.
In science and especially in mathematical studies, a variational principle is one that enables a problem to be solved using calculus of variations, which concerns finding functions that optimize the values of quantities that depend on those functions. For example, the problem of determining the shape of a hanging chain suspended at both ends—a catenary—can be solved using variational calculus, and in this case, the variational principle is the following: The solution is a function that minimizes the gravitational potential energy of the chain.
In theoretical physics, Euclidean quantum gravity is a version of quantum gravity. It seeks to use the Wick rotation to describe the force of gravity according to the principles of quantum mechanics.
Walther Heinrich Wilhelm Ritz was a Swiss theoretical physicist. He is most famous for his work with Johannes Rydberg on the Rydberg–Ritz combination principle. Ritz is also known for the variational method named after him, the Ritz method.
The superposition principle, also known as superposition property, states that, for all linear systems, the net response caused by two or more stimuli is the sum of the responses that would have been caused by each stimulus individually. So that if input A produces response X, and input B produces response Y, then input (A + B) produces response (X + Y).
The Rayleigh–Ritz method is a direct numerical method of approximating eigenvalues, originated in the context of solving physical boundary value problems and named after Lord Rayleigh and Walther Ritz.
In mathematics, in the area of numerical analysis, Galerkin methods are a family of methods for converting a continuous operator problem, such as a differential equation, commonly in a weak formulation, to a discrete problem by applying linear constraints determined by finite sets of basis functions. They are named after the Soviet mathematician Boris Galerkin.
In classical mechanics, Maupertuis's principle states that the path followed by a physical system is the one of least length. It is a special case of the more generally stated principle of least action. Using the calculus of variations, it results in an integral equation formulation of the equations of motion for the system.
In physics, Hamilton's principle is William Rowan Hamilton's formulation of the principle of stationary action. It states that the dynamics of a physical system are determined by a variational problem for a functional based on a single function, the Lagrangian, which may contain all physical information concerning the system and the forces acting on it. The variational problem is equivalent to and allows for the derivation of the differential equations of motion of the physical system. Although formulated originally for classical mechanics, Hamilton's principle also applies to classical fields such as the electromagnetic and gravitational fields, and plays an important role in quantum mechanics, quantum field theory and criticality theories.
Classical mechanics is a physical theory describing the motion of objects such as projectiles, parts of machinery, spacecraft, planets, stars, and galaxies. The development of classical mechanics involved substantial change in the methods and philosophy of physics. The qualifier classical distinguishes this type of mechanics from physics developed after the revolutions in physics of the early 20th century, all of which revealed limitations in classical mechanics.
In general relativity, the Hamilton–Jacobi–Einstein equation (HJEE) or Einstein–Hamilton–Jacobi equation (EHJE) is an equation in the Hamiltonian formulation of geometrodynamics in superspace, cast in the "geometrodynamics era" around the 1960s, by Asher Peres in 1962 and others. It is an attempt to reformulate general relativity in such a way that it resembles quantum theory within a semiclassical approximation, much like the correspondence between quantum mechanics and classical mechanics.
Hamilton's optico-mechanical analogy is a conceptual parallel between trajectories in classical mechanics and wavefronts in optics, introduced by William Rowan Hamilton around 1831. It may be viewed as linking Huygens' principle of optics with Maupertuis' principle of mechanics.
Action principles lie at the heart of fundamental physics, from classical mechanics through quantum mechanics, particle physics, and general relativity. Action principles start with an energy function called a Lagrangian describing the physical system. The accumulated value of this energy function between two states of the system is called the action. Action principles apply the calculus of variation to the action. The action depends on the energy function, and the energy function depends on the position, motion, and interactions in the system: variation of the action allows the derivation of the equations of motion without vector or forces.