Yang–Mills theory

Last updated

Contents

Unsolved problem in physics:

Yang–Mills theory and the mass gap. Quantum particles described by the theory have mass but the classical waves of the field travel at the speed of light. [1]

The phrase Yang–Mills theory means both a quantum field theory for nuclear binding devised by Chen Ning Yang and Robert Mills in 1953 and the class of similar theories. In mathematical physics, Yang–Mills theory is a gauge theory based on a special unitary group SU(n), or more generally any compact Lie group. A Yang–Mills theory seeks to describe the behavior of elementary particles using these non-abelian Lie groups and is at the core of the unification of the electromagnetic force and weak forces (i.e. U(1) × SU(2)) as well as quantum chromodynamics, the theory of the strong force (based on SU(3)). Thus it forms the basis of our understanding of the Standard Model of particle physics.

History and qualitative description

Gauge theory in electrodynamics

All known fundamental interactions can be described in terms of gauge theories, but working this out took decades. [2] Hermann Weyl's pioneering work on this project started in 1915 when his colleague Amalie Emmy Noether proved that every conserved physical quantity has a matching symmetry, and culminated in 1928 when he published his book applying the geometrical theory of symmetry (group theory) to quantum mechanics. [3] :194 Weyl named the relevant symmetry in Noether's theorem the "gauge symmetry", by analogy to distance standardization in railroad gauges.

Erwin Schrodinger in 1922, three years before working on his famous equation, connected Weyl's group concept to electron charge. Schrodinger showed that the group produced a phase shift in electromagnetic fields that matched the conservation of electric charge. [3] :198 As the theory of quantum electrodynamics developed in the 1930's and 1940's the group transformations played a central role. Many physicists thought there must be an analog for the dynamics of nucleons. Chen Ning Yang in particular was obsessed with this possibility.

Yang and Mills find the nuclear force gauge theory

Yang's core idea was to look for a conserved quantity in nuclear physics comparable to electric charge and use it to develop a corresponding gauge theory comparable to electrodynamics. He settled on conservation of isospin, a quantum number that distinguishes a neutron from a proton, but he made no progress on a theory. [3] :200 Taking a break from Princeton in the summer of 1953, Yang met a collaborator who could help: Robert Mills. As Mills himself describes:

"During the academic year 1953–1954, Yang was a visitor to Brookhaven National Laboratory ... I was at Brookhaven also...and was assigned to the same office as Yang. Yang, who has demonstrated on a number of occasions his generosity to physicists beginning their careers, told me about his idea of generalizing gauge invariance and we discussed it at some length...I was able to contribute something to the discussions, especially with regard to the quantization procedures, and to a small degree in working out the formalism; however, the key ideas were Yang's." [4]

In the summer 1953, Yang and Mills extended the concept of gauge theory for abelian groups, e.g. quantum electrodynamics, to non-abelian groups, selecting the group to provide an explanation for isospin conservation in collisions involving the strong interactions. Yang's presentation of the work at Princeton in February 1954 was challenged by Pauli, asking about the mass in the field developed with the gauge invariance idea. [3] :202 Pauli knew that this might be an issue as he had worked on applying gauge invariance but chose not to publish it, viewing the massless excitations of the theory to be "unphysical 'shadow particles'". [2] :13 Yang and Mills published in October 1954; near the end of the paper, they admit:

We next come to the question of the mass of the quantum, to which we do not have a satisfactory answer. [5]

This problem of unphysical massless excitation blocked further progress. [3]

The idea was set aside until 1960, when the concept of particles acquiring mass through symmetry breaking in massless theories was put forward, initially by Jeffrey Goldstone, Yoichiro Nambu, and Giovanni Jona-Lasinio. This prompted a significant restart of Yang–Mills theory studies that proved successful in the formulation of both electroweak unification and quantum chromodynamics (QCD). The electroweak interaction is described by the gauge group SU(2) × U(1), while QCD is an SU(3) Yang–Mills theory. The massless gauge bosons of the electroweak SU(2) × U(1) mix after spontaneous symmetry breaking to produce the 3 massive weak bosons (
W+
,
W
, and
Z0
) as well as the still-massless photon field. The dynamics of the photon field and its interactions with matter are, in turn, governed by the U(1) gauge theory of quantum electrodynamics. The Standard Model combines the strong interaction with the unified electroweak interaction (unifying the weak and electromagnetic interaction) through the symmetry group SU(3) × SU(2) × U(1). In the current epoch the strong interaction is not unified with the electroweak interaction, but from the observed running of the coupling constants it is believed[ citation needed ] they all converge to a single value at very high energies.

Phenomenology at lower energies in quantum chromodynamics is not completely understood due to the difficulties of managing such a theory with a strong coupling. This may be the reason why confinement has not been theoretically proven, though it is a consistent experimental observation. This shows why QCD confinement at low energy is a mathematical problem of great relevance, and why the Yang–Mills existence and mass gap problem is a Millennium Prize Problem.

Parallel work on non-Abelian gauge theories

In 1953, in a private correspondence, Wolfgang Pauli formulated a six-dimensional theory of Einstein's field equations of general relativity, extending the five-dimensional theory of Kaluza, Klein, Fock, and others to a higher-dimensional internal space. [6] However, there is no evidence that Pauli developed the Lagrangian of a gauge field or the quantization of it. Because Pauli found that his theory "leads to some rather unphysical shadow particles", he refrained from publishing his results formally. [6] Although Pauli did not publish his six-dimensional theory, he gave two seminar lectures about it in Zürich in November 1953. [6]

In January 1954 Ronald Shaw, a graduate student at the University of Cambridge also developed a non-Abelian gauge theory for nuclear forces. [7] However, the theory needed massless particles in order to maintain gauge invariance. Since no such massless particles were known at the time, Shaw and his supervisor Abdus Salam chose not to publish their work. [7] Shortly after Yang and Mills published their paper in October 1954, Salam encouraged Shaw to publish his work to mark his contribution. Shaw declined, and instead it only forms a chapter of his PhD thesis published in 1956. [8] [9]

Mathematical overview

-y-(x^2+y^2+1) plot; BPST instanton.png
X-(x^2+y^2+1) plot; BPST instanton.png
Curvature of BPST Instanton.png
BPST on sphere.png
The dx1σ3 coefficient of a BPST instanton on the (x1,x2)-slice of 4 where σ3 is the third Pauli matrix (top left). The dx2σ3 coefficient (top right). These coefficients determine the restriction of the BPST instanton A with g=2, ρ=1, z=0 to this slice. The corresponding field strength centered around z=0 (bottom left). A visual representation of the field strength of a BPST instanton with center z on the compactification S4 of 4 (bottom right). The BPST instanton is a classical instanton solution to the Yang–Mills equations on 4.

Yang–Mills theories are special examples of gauge theories with a non-abelian symmetry group given by the Lagrangian

with the generators of the Lie algebra, indexed by a, corresponding to the F-quantities (the curvature or field-strength form) satisfying

Here, the fabc are structure constants of the Lie algebra (totally antisymmetric if the generators of the Lie algebra are normalised such that is proportional to ), the covariant derivative is defined as

I is the identity matrix (matching the size of the generators), is the vector potential, and g is the coupling constant. In four dimensions, the coupling constant g is a pure number and for a SU(n) group one has

The relation

can be derived by the commutator

The field has the property of being self-interacting and the equations of motion that one obtains are said to be semilinear, as nonlinearities are both with and without derivatives. This means that one can manage this theory only by perturbation theory with small nonlinearities.[ citation needed ]

Note that the transition between "upper" ("contravariant") and "lower" ("covariant") vector or tensor components is trivial for a indices (e.g. ), whereas for μ and ν it is nontrivial, corresponding e.g. to the usual Lorentz signature,

From the given Lagrangian one can derive the equations of motion given by

Putting these can be rewritten as

A Bianchi identity holds

which is equivalent to the Jacobi identity

since Define the dual strength tensor then the Bianchi identity can be rewritten as

A source enters into the equations of motion as

Note that the currents must properly change under gauge group transformations.

We give here some comments about the physical dimensions of the coupling. In D dimensions, the field scales as and so the coupling must scale as This implies that Yang–Mills theory is not renormalizable for dimensions greater than four. Furthermore, for D = 4 , the coupling is dimensionless and both the field and the square of the coupling have the same dimensions of the field and the coupling of a massless quartic scalar field theory. So, these theories share the scale invariance at the classical level.

Quantization

A method of quantizing the Yang–Mills theory is by functional methods, i.e. path integrals. One introduces a generating functional for n-point functions as

but this integral has no meaning as it is because the potential vector can be arbitrarily chosen due to the gauge freedom. This problem was already known for quantum electrodynamics but here becomes more severe due to non-abelian properties of the gauge group. A way out has been given by Ludvig Faddeev and Victor Popov with the introduction of a ghost field (see Faddeev–Popov ghost) that has the property of being unphysical since, although it agrees with Fermi–Dirac statistics, it is a complex scalar field, which violates the spin–statistics theorem. So, we can write the generating functional as

being

for the field,

for the gauge fixing and

for the ghost. This is the expression commonly used to derive Feynman's rules (see Feynman diagram). Here we have ca for the ghost field while ξ fixes the gauge's choice for the quantization. Feynman's rules obtained from this functional are the following

FeynRulesEN.jpg

These rules for Feynman's diagrams can be obtained when the generating functional given above is rewritten as

with

being the generating functional of the free theory. Expanding in g and computing the functional derivatives, we are able to obtain all the n-point functions with perturbation theory. Using LSZ reduction formula we get from the n-point functions the corresponding process amplitudes, cross sections and decay rates. The theory is renormalizable and corrections are finite at any order of perturbation theory.

For quantum electrodynamics the ghost field decouples because the gauge group is abelian. This can be seen from the coupling between the gauge field and the ghost field that is For the abelian case, all the structure constants are zero and so there is no coupling. In the non-abelian case, the ghost field appears as a useful way to rewrite the quantum field theory without physical consequences on the observables of the theory such as cross sections or decay rates.

One of the most important results obtained for Yang–Mills theory is asymptotic freedom. This result can be obtained by assuming that the coupling constant g is small (so small nonlinearities), as for high energies, and applying perturbation theory. The relevance of this result is due to the fact that a Yang–Mills theory that describes strong interaction and asymptotic freedom permits proper treatment of experimental results coming from deep inelastic scattering.

To obtain the behavior of the Yang–Mills theory at high energies, and so to prove asymptotic freedom, one applies perturbation theory assuming a small coupling. This is verified a posteriori in the ultraviolet limit. In the opposite limit, the infrared limit, the situation is the opposite, as the coupling is too large for perturbation theory to be reliable. Most of the difficulties that research meets is just managing the theory at low energies. That is the interesting case, being inherent to the description of hadronic matter and, more generally, to all the observed bound states of gluons and quarks and their confinement (see hadrons). The most used method to study the theory in this limit is to try to solve it on computers (see lattice gauge theory). In this case, large computational resources are needed to be sure the correct limit of infinite volume (smaller lattice spacing) is obtained. This is the limit the results must be compared with. Smaller spacing and larger coupling are not independent of each other, and larger computational resources are needed for each. As of today, the situation appears somewhat satisfactory for the hadronic spectrum and the computation of the gluon and ghost propagators, but the glueball and hybrids spectra are yet a questioned matter in view of the experimental observation of such exotic states. Indeed, the σ resonance [10] [11] is not seen in any of such lattice computations and contrasting interpretations have been put forward. This is a hotly debated issue.

Open problems

Yang–Mills theories met with general acceptance in the physics community after Gerard 't Hooft, in 1972, worked out their renormalization, relying on a formulation of the problem worked out by his advisor Martinus Veltman. [12] Renormalizability is obtained even if the gauge bosons described by this theory are massive, as in the electroweak theory, provided the mass is only an "acquired" one, generated by the Higgs mechanism.

The mathematics of the Yang–Mills theory is a very active field of research, yielding e.g. invariants of differentiable structures on four-dimensional manifolds via work of Simon Donaldson. Furthermore, the field of Yang–Mills theories was included in the Clay Mathematics Institute's list of "Millennium Prize Problems". Here the prize-problem consists, especially, in a proof of the conjecture that the lowest excitations of a pure Yang–Mills theory (i.e. without matter fields) have a finite mass-gap with regard to the vacuum state. Another open problem, connected with this conjecture, is a proof of the confinement property in the presence of additional fermions.

In physics the survey of Yang–Mills theories does not usually start from perturbation analysis or analytical methods, but more recently from systematic application of numerical methods to lattice gauge theories.

See also

Related Research Articles

<span class="mw-page-title-main">Electroweak interaction</span> Unified description of electromagnetism and the weak interaction

In particle physics, the electroweak interaction or electroweak force is the unified description of two of the four known fundamental interactions of nature: electromagnetism (electromagnetic interaction) and the weak interaction. Although these two forces appear very different at everyday low energies, the theory models them as two different aspects of the same force. Above the unification energy, on the order of 246 GeV, they would merge into a single force. Thus, if the temperature is high enough – approximately 1015 K – then the electromagnetic force and weak force merge into a combined electroweak force. During the quark epoch (shortly after the Big Bang), the electroweak force split into the electromagnetic and weak force. It is thought that the required temperature of 1015 K has not been seen widely throughout the universe since before the quark epoch, and currently the highest human-made temperature in thermal equilibrium is around 5.5x1012 K (from the Large Hadron Collider).

<span class="mw-page-title-main">Pauli matrices</span> Matrices important in quantum mechanics and the study of spin

In mathematical physics and mathematics, the Pauli matrices are a set of three 2 × 2 complex matrices that are Hermitian, involutory and unitary. Usually indicated by the Greek letter sigma, they are occasionally denoted by tau when used in connection with isospin symmetries.

<span class="mw-page-title-main">Scanning tunneling microscope</span> Instrument able to image surfaces at the atomic level by exploiting quantum tunneling effects

A scanning tunneling microscope (STM) is a type of microscope used for imaging surfaces at the atomic level. Its development in 1981 earned its inventors, Gerd Binnig and Heinrich Rohrer, then at IBM Zürich, the Nobel Prize in Physics in 1986. STM senses the surface by using an extremely sharp conducting tip that can distinguish features smaller than 0.1 nm with a 0.01 nm (10 pm) depth resolution. This means that individual atoms can routinely be imaged and manipulated. Most scanning tunneling microscopes are built for use in ultra-high vacuum at temperatures approaching absolute zero, but variants exist for studies in air, water and other environments, and for temperatures over 1000 °C.

In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all spin-12 massive particles, called "Dirac particles", such as electrons and quarks for which parity is a symmetry. It is consistent with both the principles of quantum mechanics and the theory of special relativity, and was the first theory to account fully for special relativity in the context of quantum mechanics. It was validated by accounting for the fine structure of the hydrogen spectrum in a completely rigorous way.

<span class="mw-page-title-main">Noether's theorem</span> Statement relating differentiable symmetries to conserved quantities

Noether's theorem states that every continuous symmetry of the action of a physical system with conservative forces has a corresponding conservation law. This is the first of two theorems proven by mathematician Emmy Noether in 1915 and published in 1918. The action of a physical system is the integral over time of a Lagrangian function, from which the system's behavior can be determined by the principle of least action. This theorem only applies to continuous and smooth symmetries of physical space.

In mathematics, particularly in linear algebra, tensor analysis, and differential geometry, the Levi-Civita symbol or Levi-Civita epsilon represents a collection of numbers; defined from the sign of a permutation of the natural numbers 1, 2, ..., n, for some positive integer n. It is named after the Italian mathematician and physicist Tullio Levi-Civita. Other names include the permutation symbol, antisymmetric symbol, or alternating symbol, which refer to its antisymmetric property and definition in terms of permutations.

The Klein–Gordon equation is a relativistic wave equation, related to the Schrödinger equation. It is second-order in space and time and manifestly Lorentz-covariant. It is a differential equation version of the relativistic energy–momentum relation .

<span class="mw-page-title-main">Hooke's law</span> Physical law: force needed to deform a spring scales linearly with distance

In physics, Hooke's law is an empirical law which states that the force needed to extend or compress a spring by some distance scales linearly with respect to that distance—that is, Fs = kx, where k is a constant factor characteristic of the spring, and x is small compared to the total possible deformation of the spring. The law is named after 17th-century British physicist Robert Hooke. He first stated the law in 1676 as a Latin anagram. He published the solution of his anagram in 1678 as: ut tensio, sic vis. Hooke states in the 1678 work that he was aware of the law since 1660.

In calculus and real analysis, absolute continuity is a smoothness property of functions that is stronger than continuity and uniform continuity. The notion of absolute continuity allows one to obtain generalizations of the relationship between the two central operations of calculus—differentiation and integration. This relationship is commonly characterized in the framework of Riemann integration, but with absolute continuity it may be formulated in terms of Lebesgue integration. For real-valued functions on the real line, two interrelated notions appear: absolute continuity of functions and absolute continuity of measures. These two notions are generalized in different directions. The usual derivative of a function is related to the Radon–Nikodym derivative, or density, of a measure. We have the following chains of inclusions for functions over a compact subset of the real line:

In mathematics, the Radon–Nikodym theorem is a result in measure theory that expresses the relationship between two measures defined on the same measurable space. A measure is a set function that assigns a consistent magnitude to the measurable subsets of a measurable space. Examples of a measure include area and volume, where the subsets are sets of points; or the probability of an event, which is a subset of possible outcomes within a wider probability space.

In the general theory of relativity, the Einstein field equations relate the geometry of spacetime to the distribution of matter within it.

A directional derivative is a concept in multivariable calculus that measures the rate at which a function changes in a particular direction at a given point.

In quantum field theory, Wilson loops are gauge invariant operators arising from the parallel transport of gauge variables around closed loops. They encode all gauge information of the theory, allowing for the construction of loop representations which fully describe gauge theories in terms of these loops. In pure gauge theory they play the role of order operators for confinement, where they satisfy what is known as the area law. Originally formulated by Kenneth G. Wilson in 1974, they were used to construct links and plaquettes which are the fundamental parameters in lattice gauge theory. Wilson loops fall into the broader class of loop operators, with some other notable examples being 't Hooft loops, which are magnetic duals to Wilson loops, and Polyakov loops, which are the thermal version of Wilson loops.

<span class="mw-page-title-main">Propagator</span> Function in quantum field theory showing probability amplitudes of moving particles

In quantum mechanics and quantum field theory, the propagator is a function that specifies the probability amplitude for a particle to travel from one place to another in a given period of time, or to travel with a certain energy and momentum. In Feynman diagrams, which serve to calculate the rate of collisions in quantum field theory, virtual particles contribute their propagator to the rate of the scattering event described by the respective diagram. Propagators may also be viewed as the inverse of the wave operator appropriate to the particle, and are, therefore, often called (causal) Green's functions.

In theoretical physics, Seiberg–Witten theory is an supersymmetric gauge theory with an exact low-energy effective action, of which the kinetic part coincides with the Kähler potential of the moduli space of vacua. Before taking the low-energy effective action, the theory is known as supersymmetric Yang–Mills theory, as the field content is a single vector supermultiplet, analogous to the field content of Yang–Mills theory being a single vector gauge field or connection.

The Newman–Penrose (NP) formalism is a set of notation developed by Ezra T. Newman and Roger Penrose for general relativity (GR). Their notation is an effort to treat general relativity in terms of spinor notation, which introduces complex forms of the usual variables used in GR. The NP formalism is itself a special case of the tetrad formalism, where the tensors of the theory are projected onto a complete vector basis at each point in spacetime. Usually this vector basis is chosen to reflect some symmetry of the spacetime, leading to simplified expressions for physical observables. In the case of the NP formalism, the vector basis chosen is a null tetrad: a set of four null vectors—two real, and a complex-conjugate pair. The two real members often asymptotically point radially inward and radially outward, and the formalism is well adapted to treatment of the propagation of radiation in curved spacetime. The Weyl scalars, derived from the Weyl tensor, are often used. In particular, it can be shown that one of these scalars— in the appropriate frame—encodes the outgoing gravitational radiation of an asymptotically flat system.

f(R) is a type of modified gravity theory which generalizes Einstein's general relativity. f(R) gravity is actually a family of theories, each one defined by a different function, f, of the Ricci scalar, R. The simplest case is just the function being equal to the scalar; this is general relativity. As a consequence of introducing an arbitrary function, there may be freedom to explain the accelerated expansion and structure formation of the Universe without adding unknown forms of dark energy or dark matter. Some functional forms may be inspired by corrections arising from a quantum theory of gravity. f(R) gravity was first proposed in 1970 by Hans Adolph Buchdahl. It has become an active field of research following work by Starobinsky on cosmic inflation. A wide range of phenomena can be produced from this theory by adopting different functions; however, many functional forms can now be ruled out on observational grounds, or because of pathological theoretical problems.

<span class="mw-page-title-main">Gauge theory</span> Physical theory with fields invariant under the action of local "gauge" Lie groups

In physics, a gauge theory is a type of field theory in which the Lagrangian, and hence the dynamics of the system itself, do not change under local transformations according to certain smooth families of operations. Formally, the Lagrangian is invariant.

The table of chords, created by the Greek astronomer, geometer, and geographer Ptolemy in Egypt during the 2nd century AD, is a trigonometric table in Book I, chapter 11 of Ptolemy's Almagest, a treatise on mathematical astronomy. It is essentially equivalent to a table of values of the sine function. It was the earliest trigonometric table extensive enough for many practical purposes, including those of astronomy. Since the 8th and 9th centuries, the sine and other trigonometric functions have been used in Islamic mathematics and astronomy, reforming the production of sine tables. Khwarizmi and Habash al-Hasib later produced a set of trigonometric tables.

In theoretical physics, more specifically in quantum field theory and supersymmetry, supersymmetric Yang–Mills, also known as super Yang–Mills and abbreviated to SYM, is a supersymmetric generalization of Yang–Mills theory, which is a gauge theory that plays an important part in the mathematical formulation of forces in particle physics.

References

  1. "Yang-Mills & The Mass Gap". Clay Mathematics Institute. Retrieved 2024-04-09.
  2. 1 2 O’Raifeartaigh, Lochlainn; Straumann, Norbert (2000-01-01). "Gauge theory: Historical origins and some modern developments". Reviews of Modern Physics. 72 (1): 1–23. doi:10.1103/RevModPhys.72.1. ISSN   0034-6861.
  3. 1 2 3 4 5 Baggott, J. E. (2013). The quantum story: a history in 40 moments (Impression: 3 ed.). Oxford: Oxford Univ. Press. ISBN   978-0-19-956684-6.
  4. Gray, Jeremy; Wilson, Robin (2012-12-06). Mathematical Conversations: Selections from the Mathematical Intelligencer. Springer Science & Business Media. p. 63. ISBN   9781461301950 via Google Books.
  5. Yang, C.N.; Mills, R. (1954). "Conservation of isotopic spin and isotopic gauge invariance". Physical Review . 96 (1): 191–195. Bibcode:1954PhRv...96..191Y. doi: 10.1103/PhysRev.96.191 .
  6. 1 2 3 Straumann, N. (2000). "On Pauli's invention of non-abelian Kaluza-Klein Theory in 1953". arXiv: gr-qc/0012054 .
  7. 1 2 Atiyah, M. (2017). "Ronald Shaw 1929–2016 by Michael Atiyah (1954)". Trinity College Annual Record (memorial). 2017: 137–146.
  8. Shaw, Ronald (September 1956). The problem of particle types and other contributions to the theory of elementary particles (Ph.D. thesis). University of Cambridge. ch. 3, pp. 34–46.
  9. Fraser, Gordon (2008). Cosmic Anger: Abdus Salam – the first Muslim Nobel scientist. Oxford, UK: Oxford University Press. p. 117. ISBN   978-0199208463.
  10. Caprini, I.; Colangelo, G.; Leutwyler, H. (2006). "Mass and width of the lowest resonance in QCD". Physical Review Letters . 96 (13): 132001. arXiv: hep-ph/0512364 . Bibcode:2006PhRvL..96m2001C. doi:10.1103/PhysRevLett.96.132001. PMID   16711979. S2CID   42504317.
  11. Yndurain, F.J.; Garcia-Martin, R.; Pelaez, J.R. (2007). "Experimental status of the ππ isoscalar S wave at low energy: f0(600) pole and scattering length". Physical Review D . 76 (7): 074034. arXiv: hep-ph/0701025 . Bibcode:2007PhRvD..76g4034G. doi:10.1103/PhysRevD.76.074034. S2CID   119434312.
  12. 't Hooft, G.; Veltman, M. (1972). "Regularization and renormalization of gauge fields". Nuclear Physics B. 44 (1): 189–213. Bibcode:1972NuPhB..44..189T. doi:10.1016/0550-3213(72)90279-9. hdl: 1874/4845 .

Further reading

Books
Articles