Final Answers
© 2000-2022 Gérard P. Michon, Ph.D.

Analytical Mechanics
Rational Mechanics & Field Theory

Fermat's principle (least time) for light (c.1655) predates Newton.
Maupertuis principle of least action (1744).
Virtual Work: A substitute for Newton's laws that cancels constraint forces.
Phase Space: Phase describes completely the state of a classical system.
Either velocities or momenta are added to configuration to specify a phase.
The Lagrangian is a function of positions and velocities.
The Hamiltonian depends on positions and momenta.
Poisson brackets: An abstract synthetic view of analytical mechanics.
Liouville's theorem: The Hamiltonian phase volume doesn't change.
Noether's theorem: Conservation laws express the symmetries of physics.
Relativistic point-mass: Lagrangian, Hamiltonian and free momentum.
Charge in a magnetic field: The canonical momentum isn't the linear one.
Field theory: Lagrangian function of a continuum of values and velocities.
Fokker's action principle. Lagrangien formulation of general relativity.

Introduction

Analytical Mechanics is a mathematical reformulation of Newtonian mechanics. Its physical content is the same, but its mathematical structure has interesting features. It can also describe the relativistic dynamics of particles and fields.

Originally, the mathematical formalism may have been invented merely to unify the patterns which appear in the solutions of basic problems in Newtonian mechanics. However, it allows a deeper understanding of the structure of classical mechanics and also of physical laws in general. The most fundamental part of this structure has served as the basis for the current formalism of quantum mechanics developed by P.A.M. Dirac and others: the quantum commutators were directly inspired by Poisson brackets...

Analytical Mechanics

(2008-09-04) Fermat's Principle of Least Time (c. 1655) A unifying principle for the geometrical propagation of light.

Around 1655, Pierre de Fermat (1601-1665) justified all laws of the newly developed science of geometrical optics by postulating that light always travels between two points in the least possible amount of time.

This does say that light travels in a straight line through an homogeneous medium. Less trivially, the law of reflection (whereby the angle of reflection is equal to the angle of incidence) can be justified by saying that a ray of light which bounces in this way traces a shorter distance between two points than any neighboring ray which would bounce in some other way. Finally, Snell's law of refraction can also be derived from Fermat's principle if we only assume that the celerity of light in a medium is inversely proportional to its index of refraction (n).

n₁ sin q₁ = n₂ sin q₂

The law of refraction was first stated by Ibn Sahl in 984. It was rediscovered by Thomas Harriot in July 1601 and independently by Snell (1621) and Descartes (1637) who was the first to publish it. The Dutchman Christiaan Huygens (1678) was instrumental in attributing the Law to his countryman, Snell...

In a medium whose index of refraction is n, the celerity of light is thus c / n. That relation remained controversial among scientists until 1850, when Fizeau and Foucault carried out direct comparisons of the celerities of light in air and water (along the lines Arago had suggested in 1838). Fermat was vindicated.

Before that time, some physicists had wrongly argued that light could be accelerated when passing from a rarer to a denser medium!

Snell's Law Song (2002 lyrics by Marian McKenzie & Walter Fox Smith) in MP3 format:
Solo by Walter F. Smith | Duo by Russ & Eli (courtesy of PhysicsSongs.org).

Why does light bend when it enters glass? (13:36) by Don Lincoln (Fermilab, 2019-05-01).

(2008-09-04) Maupertuis Principle of Least Action (1744) Introducing a quantity which is minimized in Newtonian motion.

Adding up the masses of all bodies multiplied by their respective
speeds and the distances they travel yields the quantity called action,
which is always the least possible in any natural motion.
Pierre-Louis Moreau de Maupertuis. "Sur les lois du mouvement " (1746).

Come back later, we're
still working on this one...

(2005-07-11) D'Alembert's Principle: Virtual Work A synthetic statement equivalent to Newton's laws.

D'Alembert's principle is also known as the principle of virtual work.

(2008-09-03) Phase Space The entire state of a classical system is described by its phase.

Newton's law (F = ma) involves the second-order derivatives of the positions of the particles which compose a system. The entire future evolution is thus determined if the positions and their first-order derivatives are known.

Equivalently, the present situation of a classical system is fully described by the generalized positions (q_i) and generalized momenta (p_i) of all its "particles".

Those can be, respectively, the cartesian coordinates of the positions of the particles and the corresponding cartesian coordinates of their momenta, but they need not be... For example, if particles are constrained to move on the surface of a sphere, their positions would be determined by their latitudes and longitudes.

Newton's laws are such that the above description effectively encodes deterministically not only the future but also the the past of the present system (see Laplace's Demon). Thus, the system evolves along definite trajectories where the aforementioned points are called phases. The term evokes a stage of development which may or may not be part of a repeating cycle.

If the phase of a system is described by N positions (q) and N momenta (p) then its evolution is governed by 2N first-order differential equations. Loosely speaking, N equations would be needed to specify that the p's are essentially the first order derivatives of the q's and N other first-order differential equations would express Newton's laws by stating that forces are the first-order derivatives of momenta.

The positional information alone is traditionally called configuration. The complete description of a phase thus entails specifying both the configuration and the first derivatives of all configuration constituents.

(2008-09-03) Velocities and Momenta Two equivalent ways to specify first-order derivatives in phase space.

For a classical point-mass, the (vectorial) momentum p is proportional to its (vectorial) velocity v. The coefficient of proportionality (m) is the mass of the particle. In more general contexts (e.g., Special Relativity) the relations between momenta and velocities can be more complicated but they remain one-to-one.

The one-to-one correspondence between momenta and velocities implies that either type of variables can be used to describe a point in phase space.

In general, those relations can be specified by introducing two special (scalar) functions of the phase; the Lagrangian (L) and the Hamiltonian (H) which are Legendre transforms of each other with respect to the two sets of variables which form respectively the vectors v and p (as positions q are held constant) :

H(p) = max_v { p.v - L(v) } and L(v) = max_p { p.v - H(p) }

Those theoretical definitions make the following practical relations hold :

¶L	= p_i	¶H	= v_i

¶ v_i		¶ p_i

H + L = p . v = å_i p_i v_i

(2008-09-04) Lagrangian Mechanics (1788) Practical definitions of the Lagrangian.

In classical mechanics, a Lagrangian consists of a kinetic term (T) and a potential term (U).

L = T - U

The symbol T is related to the french term travail (work) which was introduced in 1829 by the French physicist Gaspard-Gustave Coriolis, who proved that the change in kinetic energy is always equal to the "work done" by all applied forces (including friction). On the other hand, U is only defined in the case of conservative forces. This ideal Lagrangian description is appropriate for particle physics but it's no good at certain observation scales (a concept advocated by C.E. Guye) where friction becomes relevant, including ordinary human scale or long-term celestial mechanics (involving "secular terms").

T is normally a function of some quadratic expression of the velocities, involving a generalized tensor of inertia J :

T = ½ f ( v* J v )

As usual, we use a star (*) to denote the transpose (or the transpose conjugate in the complex realm) of any vectorial quantity. Thus, the above just states that the argument of f is some homogeneous quadratic polynomial of the velocity components. J is a symmetric matrix (in the complex realm, J would be Hermitian, in order to make the argument of f real).

Come back later, we're
still working on this one...

Wikipedia : Lagrangian

(2008-09-04) Hamiltonian Mechanics (1833) In a steady force field, the Hamiltonian is conserved.

Clearly, the differential of the Hamiltonian H = p.v - L is:

dH =	å_i	[	p_i dv_i + v_i dp_i -	¶ L	dq_i -	¶ L	dv_i	]	-	¶ L	dt

				¶ q_i		¶ v_i				¶t

The definitions of the momenta make the first and fourth terms in the square bracket cancel each other. Furthermore, since L is assumed to satisfy a principle of least action, the third term can be modified by using the relevant Euler-Lagrange equation, namely:

¶L	=	d	(	¶L	)	=	dp_i

¶ q_i		dt		¶ v_i			dt

Using this and the relation v_i = dq_i / dt the equation becomes:

dH =	å_i	[	dq_i	dp_i -	dp_i	dq_i	]	-	¶ L	dt

			dt		dt				¶t

The partial derivatives of H appear here as coefficients of the differentials:

Fundamental Equations of Hamiltonian Mechanics :

¶H	=	d q_i	¶H	= -	d p_i

¶ p_i		d t	¶ q_i		d t

Our previous expression also gives the total derivative of H [because each square bracket vanishes since, clearly, q_i' p_i' = p_i' q_i' ].

d H	= -	¶ L

dt		¶t

In a steady force field, the right-hand-side vanishes. So, the Hamiltonian H (the total mechanical energy) remains constant throughout the motion.

That resembles a special case of Noether's theorem: If the force laws of a system are time-invariant, then its total energy (H = E) is conserved. So, Hamiltonian energy is the conserved quantity linked to time-symmetry.

Hamiltonian mechanics | Symplectic geometry

A Fight to Fix Symplectic Geometry by Kevin Hartnett (Quanta Magazine, 2017-02-09).

Symplectic geometry (1:03) by Michael Atiyah (Web of Stories, 1997).

(2009-07-06) Poisson Brackets (1809) An abstract synthetic view of classical mechanics.

La vie n'est bonne qu'à deux choses,
à faire des mathématiques et à les professer.
Siméon Denis Poisson (1781-1840) X1798

Consider how a steady function A (¶A/¶t = 0) of the canonical Hamiltonian variables p_i and q_i evolves with time because of the actual motion itself:

dA	=	å_i	¶A	dq_i	+	¶A	dp_i

dt			¶q_i	dt		¶p_i	dt
	=	å_i	¶A	¶H	-	¶A	¶H

			¶q_i	¶p_i		¶p_i	¶q_i
	=	{ A , H }

The compact notation introduced in the last line is what is known as a Poisson bracket (French: Crochet de Poisson ). The general definition is:

Poisson Brackets

{ A , B }	=	å_i	¶A	¶B	-	¶A	¶B

			¶q_i	¶p_i		¶p_i	¶q_i

The quantity A is a constant of the motion if and only if { A , H } = 0

On September 20, 1925, P.A.M. Dirac identified Poisson brackets as the classical counterparts of quantum commutators (whose importance he had just discovered himself). That was a Sunday and the library was closed. So, he had to wait until the next morning to verify his hunch, with the help of what Jacobi had written on the subject in 1841.

Poisson brackets share the following properties with quantum commutators:

{ A , B } = - { B , A } (anticommutativity)
{ A , u B + v C } = u { A , B } + v { A , C } (linearity)
{ A , BC } = { A , B } C + B { A , C } (product rule)
0 = { A , {B,C} } + { B , {C,A} } + { C , {A,B} }

The last relation (known as Jacobi's identity) is characteristic of a Lie algebra. With just anticommutativity. linearity and the product rule, the following equations fully specify the relationship of the Poisson brackets with the canonical variables:

{ q_i , p_j } = d_ij (this is to say: 1 if i=j and 0 otherwise)
{ q_i , q_j } = { p_i , p_j } = 0

With the above relations, the original differential definition of the brackets can be retrieved for polynomial functions of the canonical variables (and other smooth enough functions, by continuity). HINT: First retrieve the following relations, by induction on the degree of the polynomial A.

{ q_i , A } = ¶A/¶p_i
{ p_i , A } = - ¶A/¶q_i

(2008-09-04) Liouville's Theorem Density in phase is conserved in Hamiltonian space.

The quantum counterpart of Liouville's theorem is the unitarity of evolution with time (the norm of a ket doesn't change as it evolves). Liouville's theorem is construed as the statement that information is conserved.

Joseph Liouville (1809-1882; X1825). Liouville's measure.
Josiah Willard Gibbs, Jr. (1839-1903). "Density in phase".
Ludwig Boltzmann (1844-1906).

Come back later, we're
still working on this one...

Why Quantum Information is Never Destroyed (13:42) by Matt O'Dowd (PBS Space Time, 2018-05-23).
Preservation of Phase Volume & Liouville's Theorem for non-Hamiltonian systems by Mark Tuckerman
Application to Electron Orbits in the Earth's Magnetic Field by W. F. G. Swann (1933)
Liouville's theorem and the velocity field of galaxies by Arthur Rex Rivolo (1985)
Wikipedia : Liouville's Theorem

(2003-05-26) Noether's Theorem (1915)

The German mathematician Emmy Noether (1882-1935) established this deep result (Noether's Theorem) in 1915:

For every continuous symmetry of the laws of physics,
there's a conservation law, and vice versa.

The theorem is true of any physical theory based on a Lagrangian formalism, including discrete classical systems of finitely many particles and the classical fields discussed below. It also applies to the quantum counterparts of those...

A simple proof is given elsewhere on this site in the basic case of a classical system with finitely many degrees of freedom... Formally, a continuous symmetry of such a system is expressed by stating that its Lagrangian L(q,v,t) is unchanged (to the first order in e ) if each component q_i is replaced by:

q_i + e h_i

It's understood that, for any constant e, the [new] component v_i remains equal to the time-derivative of the [new] component q_i ... If such a symmetry exists, then Noether's theorem states that the following quantity is a constant of motion :

Conserved Noether charge

å _i	h_i	¶L

		¶ v_i

The conservation of [Hamiltonian] energy for a system with a steady Lagrangian (i.e., a Lagrangian which does not explicitely depend on time) is often construed as a special case of Noether's theorem, although it's established differently.

Electric charge is the conserved quantity obtained for a Lagrangian which is invariant under multiplication of all its arguments by a (complex) phase.

In the case of a scalar field with complex values, the following real 4-vector field turns out to be the current density associated with the field (the time-component of that is the charge density ).

ⁱ/₂ [ y ¶_my* - ¶_my y* ]

Thus, loosely speaking, charge has the same type of algebraic expression as angular momentum and ends up being quantized for the same reasons.

E. Noether's 1918 paper, translated by M.A. Tavel (1971) | Emmy Noether (1882-1935) | Photo in 1931
MathPages | Wikipedia | Noether's Discovery | Noether's Theorem in a Nutshell by John Baez

Lagrangian & Field Theory (1:40:31) by Leonard Susskind (2008-05-05).
Noether's Theorem and The Symmetries of Reality (13:01) by Matt O'Dowd (2018-05-16).
The most significant genius: Emmy Noether (10:23) by Don Lincoln (Fermilab, 2018-07-13).

(2008-09-04) Relativistic Point-Mass Momenta, Lagrangians and Hamiltonians in Special Relativity.

One good way to see what the concepts of analytical mechanics entail is to apply them in the simple context of a relativistic point-mass.

Our starting point will be the expression of momentun known from the 4-vector formalism of Special Relativity. For a single point in free space, the abstract vectors v and p introduced in the previous section are just the ordinary three-dimensional velocity and relativistic momentum of that point (of rest-mass m) :

p =

m v



Ö	1 - v²/c²

This relation does indeed summarize the three relations:

¶L	= p_x	¶L	= p_y	¶L	= p_x

¶ v_x		¶ v_y		¶ v_z

Provided we define L as follows (up to an irrelevant additive constant) :

Lagrangian of a free point-mass :


L = - m c²	Ö	1 - v² / c²

A justification for this formula (without an exra constant) is that it makes the action a Lorentz-invariant scalar. Indeed, the observed action is the product of the observed Lagrangian L by the observer's time. So, L ought to be proportional to the derivative of the particle's proper time with respect to the observer's time.

Computing the Hamiltonian as H = p.v - L, we obtain a familiar expression :

H = E =

m c²



Ö	1 - v²/c²

That would be simply E = m c² if we were to define "m" as the relativistic mass instead of the invariant rest mass. Indeed, the Hamiltonian corresponds to the usual concept of total mechanical energy (which is conserved).

The Hamiltonian transforms like a mechanical energy (the time-component of an energy-momentum 4-vector) whereas the Lagrangian transforms like a quantity of heat. Those two things are very different physical beasts...

We may also invert our first relation and express v as a function of p :

v =

p / m



Ö	1 + p²/(mc)²

An explicit expression of H in terms of p can be desirable:

Hamiltonian of a free point-mass :


H = m c²	Ö	1 + p² / (mc)²

The next section shows how the canonical momenta may differ substantially from the dynamical momenta whose time-derivatives are the applied forces.

(2008-09-08) Relativistic Charge in an Electromagnetic Field Lagrangian for the Lorentz force and associated canonical momentum.

In the electromagnetic field specified by a quadripotential (f/c, A) a point-mass with electric charge q is governed by the following Lagrangian. We'll prove that this expression is valid by showing that it gives the correct formula for the Lorentz force exerted on the particle by the field.

Lagrangian of a charged particle :


L = q ( A.v - f ) - m c²	Ö	1 - v² / c²

This yields : p =

qA +

m v



Ö	1 - v²/c²

H = p.v - L =

qf +

m c²



Ö	1 - v²/c²

Note that ( H/c , p ) is a 4-vector. Let's see how this encodes the familiar expression of the Lorentz force F exerted by the electromagnetic field on the particle. We first examine the coordinates along the x-axis only.

For a stationary Lagrangian, the relevant Euler-Lagrange equation is:

¶L	=	d	(	¶L	)	=	dp_x	= q	dA_x	+ F_x

¶x		dt		¶ v_x			dt		dt

The last equality illustrates the distinction between the canonical momentum p (whose components are partial derivatives of the Lagrangian L with respect to the velocities) and the dynamical momentum whose (total) derivative with respect to time is the applied force (namely, the Lorentz force whose expression we are aiming to retrieve). Therefore:

F_x =	¶L	- q	dA_x

	¶x		dt

Let's expand each of those two terms... The above expression of L yields:

¶L

(

¶f

¶A_x

v_x +

¶A_y

v_y +

¶A_z

v_z

)

¶x

Generically dA = ¶_tA dt + ¶_xA dx + ¶_yA dy + ¶_zA dz. Therefore:

dA_x

(

¶A_x

v_x +

¶A_x

v_y +

¶A_x

v_z

)

¶t

¶x

¶y

¶z

Subtracting those two expansions, we see that F_x is the x-coordinate of:

q [ (- grad f - ¶A/¶t ) + v ´ rot A ]

As F and this vector have identical projections along any arbitrary x-axis, they must be equal and we have indeed retrieved the vectorial expression of the force F = q [ E + v´B ] with the correct expressions of the electromagnetic fields E and B in terms of the potentials f and A.

We may also express H as an explicit function of the canonical momentum:

Hamiltonian of a charged particle :


H = qf + m c²	Ö	1 + (p-qA)² / (mc)²

(2008-09-06) Classical Field Theory & Lagrangian Density Let the q-coordinates be the values of a field at different points in space.

For the above discrete mechanical systems, the value of the Lagrangian L was a function of time (t) the N position coordinates (q) and their derivatives (v).

For a continuous field y, the values of the field at every point of 4D-space play the role of the q coordinates and its partial spacetime derivatives play the role of the velocities v. The Lagrangian itself would be equivalent to the integral over 3D-space of the following Lagrangian density L whose integral over space and time will thus play the role of an action which must be stationary :

L = L ( y , y₀ , y₁ , y₂ , y₃ )
where y_m = ¶_my

Note the subtle point in notation: y_m is just the name of a particular argument of the Lagrangian function. It makes perfect sense to consider the partial derivative of L with respect to y_m. It makes no more sense to consider a derivative "with respect to" ¶_my than it would to speak of the derivative of f (x) "with respect to" 2 to denote the value of d f /dx when x = 2.

A proper Lagrangian density must be a relativistic scalar. So, if y is assumed to be a scalar itself, then one example of a proper L would be of the form:

L = f ( y , ¶_my ¶^my )

[ Einstein's summation convention is used here. ]

The equivalent of the Euler-Lagrange equation is:

¶L	= ¶_m	(	¶L	)

¶y			¶y_m

The role of the conjugate momenta is played by a 4-vector density field p :

p_m =	¶L

	¶y_m

The component p₀ / c can be denoted p_y and called field momentum.

(2019-04-01) Fokker's Action Principle. Fokker Lagrangian (1924) Fokker-Wheeler-Feynman interactions in Einstein's General Relativity.

Come back later, we're
still working on this one...

Final Answers
© 2000-2022 Gérard P. Michon, Ph.D.

Analytical Mechanics
Rational Mechanics & Field Theory

Related articles on this site:

Related Links (Outside this Site)

Introduction

Analytical Mechanics

Final Answers © 2000-2022 Gérard P. Michon, Ph.D.

Analytical MechanicsRational Mechanics & Field Theory

Related articles on this site:

Related Links (Outside this Site)

Introduction

Analytical Mechanics

Final Answers
© 2000-2022 Gérard P. Michon, Ph.D.

Analytical Mechanics
Rational Mechanics & Field Theory