The Principle of Least Action: The Single Rule Behind Physics

11–16 minutes

Introduction

There is a strange and beautiful idea at the heart of physics:

Nature follows the path for which the action is stationary.

This idea is called the principle of least action, or more accurately, the principle of stationary action.

At first, this sounds almost mystical. How can nature “choose” a path? How can a falling object, a ray of light, or a planet know which path is optimal?

But the principle is not saying nature thinks. It is saying that the real path taken by a physical system has a special mathematical property: if we slightly change the path, the action barely changes to first order.

In compact form, the principle is written as:

$$\delta S=0$$

where:

$$S$$ is the action
$$\delta S$$ means a tiny variation in the action
$$\delta S=0$$ means the action is stationary

This single equation connects classical mechanics, optics, electromagnetism, quantum mechanics, and even general relativity.


1. The Problem of Fastest Descent

Imagine a bead sliding without friction from point A to point B under gravity.

The question is:

What shape of ramp gets the bead from A to B in the shortest time?

At first, common sense says:

Take the shortest path: a straight line.

But the straight line is not the fastest path.

Why?

Because if the ramp dips steeply at the beginning, the bead gains speed quickly. Even if the total path is longer, the bead travels faster for most of the journey.

So the real problem is a trade-off:

  • A shorter path reduces distance.
  • A steeper initial drop increases speed.

The fastest path balances these two effects.

This problem is called the brachistochrone problem.

The word comes from Greek:

  • brachistos = shortest
  • chronos = time

So brachistochrone means:

The curve of shortest time.


2. Galileo’s Guess

Galileo believed that the fastest path was an arc of a circle.

A circular arc is indeed faster than many polygonal paths and faster than a straight ramp in many cases.

But it is not the true fastest path.

The actual answer is a different curve called a cycloid.

A cycloid is the path traced by a point on the rim of a rolling wheel.

If a wheel rolls along the ground, a point on its edge traces this shape.

The parametric equations of a cycloid are:

$$x=r(\theta-\sin\theta)$$

$$y=r(1-\cos\theta)$$

where: rr is the radius of the rolling circle and θ\theta is the angle through which the circle has rotated

The astonishing result is:

The fastest descent curve is a cycloid, not a straight line and not a circle.


3. Fermat’s Principle of Least Time

Before the brachistochrone was solved, another optimization principle appeared in optics.

Light traveling through a single medium, such as air, moves in a straight line.

Why?

Because the straight line is the shortest distance between two points.

But light behaves differently when it passes from one medium into another, such as from air into water. It bends. This bending is called refraction.

The law governing refraction is Snell’s Law:

$$\frac{\sin\theta_1}{\sin\theta_2}=\frac{v_1}{v_2}$$

where:

  • θ1\theta_1 is the angle of incidence
  • θ2\theta_2 is the angle of refraction
  • v1v_1 is the speed of light in the first medium
  • v2v_2 is the speed of light in the second medium

Pierre de Fermat proposed that light does not necessarily minimize distance. Instead, light minimizes time.

This is Fermat’s Principle of Least Time: Light travels along the path that takes the least time.

If light moves faster in air and slower in water, the fastest path is not the shortest path. The ray bends so that it spends the right amount of distance in each medium.

This explains Snell’s Law.


4. Bernoulli’s Brilliant Insight

Johann Bernoulli realized that the brachistochrone problem could be treated like an optics problem.

A bead sliding down a ramp speeds up as it falls.

By conservation of energy, the loss of gravitational potential energy becomes kinetic energy.

Suppose the bead has fallen a vertical distance $$y$$.

The gravitational potential energy lost is:

$$mgy$$

The kinetic energy gained is:

$$\frac{1}{2}mv^2$$

So:

$$\frac{1}{2}mv^2=mgy$$

Cancel $$m$$:

$$\frac{1}{2}v^2=gy$$

Multiply by 2:

$$v^2=2gy$$

Therefore:

$$v=\sqrt{2gy}$$

So the bead’s speed increases like:

$$v\propto\sqrt{y}$$

Bernoulli imagined the bead moving through many thin horizontal layers, each with a different speed. This made the mechanics problem look like a refraction problem.

For light, Snell’s Law says:

$$\frac{\sin\theta}{v}=\text{constant}$$

For the falling bead, since:

$$v\propto\sqrt{y}$$

we get:

$$\frac{\sin\theta}{\sqrt{y}}=\text{constant}$$

This equation leads to the cycloid.

So Bernoulli solved a mechanics problem by transforming it into an optics problem.

That was the first major hint that different parts of physics might be governed by the same deeper principle.


5. The Tautochrone Property

The cycloid has another surprising property.

If beads are released from different points along the same cycloid, they reach the bottom in the same amount of time.

This is called the tautochrone property.

The word comes from Greek:

  • tauto = same
  • chronos = time

So tautochrone means:

The curve of equal time.

This means the cycloid is both:

  • the brachistochrone curve: fastest descent
  • the tautochrone curve: same descent time from different starting points

This made the cycloid one of the most beautiful curves in classical physics.


6. Maupertuis and the Idea of Action

In the 1700s, Pierre Louis de Maupertuis proposed a broader idea.

Maybe light and particles are not governed by separate rules.

Maybe there is one quantity nature tries to optimize.

He called this quantity action.

In Maupertuis’ original form, action was roughly:

$$\text{Action}=mvd$$

where:

  • mm is mass
  • vv is velocity
  • dd is distance traveled

For a path made of many small segments, the total action is:

$$S=\sum m v \Delta s$$

In the continuous case, this becomes:

$$S=\int m v ds$$

Maupertuis claimed that nature follows the path of least action.

This was a bold idea, but it was not yet mathematically precise.


7. Euler Makes the Principle Mathematical

Leonhard Euler improved Maupertuis’ idea.

Instead of treating the path as a sum of separate pieces, Euler wrote the action as an integral:

$$S=\int m v , ds$$

Since velocity is:

$$v=\frac{ds}{dt}$$

we can write:

$$ds=vdt$$

Substitute into the action:

$$S=\int m v(vdt)$$

So:

$$S=\int m v^2 dt$$

Kinetic energy is:

$$T=\frac{1}{2}mv^2$$

Therefore:

$$mv^2=2T$$

So Maupertuis’ action becomes:

$$S=\int 2T,dt$$

Euler realized that this form works under certain conditions, especially when total energy is conserved.

Total energy is:

$$E=T+V$$

where:

  • TT is kinetic energy
  • VV is potential energy

So:

$$T=E-V$$

This connection between action, kinetic energy, and potential energy prepared the way for Lagrange and Hamilton.


8. Hamilton’s Principle

The modern form of the principle of least action is usually called Hamilton’s Principle.

It says that the real path taken by a system makes the action stationary:

$$\delta S=0$$

where the action is:

$$S=\int_{t_1}^{t_2} L,dt$$

Here $$L$$ is the Lagrangian.

For many classical systems, the Lagrangian is:

$$L=T-V$$

So the action becomes:

$$S=\int_{t_1}^{t_2}(T-V),dt$$

This is one of the most important formulas in all of physics.

In words:

The action is the time integral of kinetic energy minus potential energy.


9. Least Action or Stationary Action?

The phrase “least action” can be misleading.

In calculus, when we set a derivative equal to zero:

$$\frac{df}{dx}=0$$

we might find:

  • a minimum
  • a maximum
  • a saddle point

Similarly, when we write:

$$\delta S=0$$

we do not always mean the action is the absolute minimum.

We mean it is stationary.

That is why the more accurate name is:

The Principle of Stationary Action

The action does not change to first order under small variations of the path.


10. The Core Idea of Calculus of Variations

In ordinary calculus, you minimize a function.

Suppose:

$$f(x)=x^2$$

To find its minimum, compute:

$$\frac{df}{dx}=2x$$

Set it equal to zero:

$$2x=0$$

So:

$$x=0$$

That is the minimum.

But in physics, we are not minimizing a function of one variable.

We are minimizing a functional.

A functional takes an entire function as input.

For example:

$$S[y]=\int_{t_1}^{t_2} L(y,\dot y,t),dt$$

Here the input is not a number.

The input is an entire path:

$$y(t)$$

So the question becomes:

Which function $$y(t)$$ makes the action stationary?

This is the central question of the calculus of variations.


11. Varying the Path

Suppose the true path is:

$$y(t)$$

Now create a nearby trial path:

$$q(t)=y(t)+\epsilon\eta(t)$$

where:

  • η\eta is a small variation function
  • ϵ\epsilon is a tiny number
  • the endpoints remain fixed

The endpoint condition is:

$$\eta(t_1)=0$$

$$\eta(t_2)=0$$

The action of the trial path is:

$$S[q]=S[y+\epsilon\eta]$$

The true path makes the first-order change in action vanish:

$$\frac{d}{d\epsilon}S[y+\epsilon\eta]\bigg|_{\epsilon=0}=0$$

This is the mathematical meaning of:

$$\delta S=0$$


12. The Euler–Lagrange Equation

From the condition:

$$\delta S=0$$

we obtain the Euler–Lagrange equation:

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q}\right)-\frac{\partial L}{\partial q}=0$$

This equation is the engine of Lagrangian mechanics.

To solve a mechanics problem:

  1. Choose a coordinate $$q$$.
  2. Write the kinetic energy $$T$$.
  3. Write the potential energy $$V$$.
  4. Form the Lagrangian $$L=T-V$$.
  5. Substitute into the Euler–Lagrange equation.

This often gives the equation of motion more easily than Newton’s force method.


13. Example: A Ball Moving Under Gravity

Suppose a ball moves vertically under gravity.

Let its height be:

$$y(t)$$

The kinetic energy is:

$$T=\frac{1}{2}m\dot y^2$$

The potential energy is:

$$V=mgy$$

So the Lagrangian is:

$$L=T-V$$

Therefore:

$$L=\frac{1}{2}m\dot y^2-mgy$$

Now apply the Euler–Lagrange equation:

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot y}\right)-\frac{\partial L}{\partial y}=0$$

First compute:

$$\frac{\partial L}{\partial \dot y}=m\dot y$$

Then:

$$\frac{d}{dt}\left(m\dot y\right)=m\ddot y$$

Next compute:

$$\frac{\partial L}{\partial y}=-mg$$

Substitute:

$$m\ddot y-(-mg)=0$$

So:

$$m\ddot y+mg=0$$

Divide by $$m$$:

$$\ddot y=-g$$

This is exactly the usual equation for vertical motion under gravity.

In Newtonian form:

$$F=ma$$

The force is:

$$F=-mg$$

So:

$$ma=-mg$$

Therefore:

$$a=-g$$

The principle of stationary action gives the same result as Newton’s Second Law.


14. Why This Is Powerful

Newton’s method uses forces.

Lagrange’s method uses energies.

For simple problems, Newton’s method is often easier.

But for complicated systems, the Lagrangian method is much more powerful.

Why?

Because energy is a scalar.

Forces are vectors.

With Newton’s method, you must carefully resolve every force into components.

With Lagrange’s method, you write:

$$L=T-V$$

and let the Euler–Lagrange equation do the work.

This is especially useful for:

  • pendulums
  • double pendulums
  • rotating systems
  • constrained motion
  • planetary motion
  • fields
  • relativity
  • quantum mechanics

15. Example: Simple Pendulum

Consider a pendulum of length $$l$$ and mass $$m$$.

Let the angle from the vertical be:

$$\theta(t)$$

The speed of the bob is:

$$v=l\dot\theta$$

So the kinetic energy is:

$$T=\frac{1}{2}mv^2$$

Substitute:

$$T=\frac{1}{2}ml^2\dot\theta^2$$

The potential energy is:

$$V=mgl(1-\cos\theta)$$

So the Lagrangian is:

$$L=\frac{1}{2}ml^2\dot\theta^2-mgl(1-\cos\theta)$$

Now use the Euler–Lagrange equation:

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot\theta}\right)-\frac{\partial L}{\partial \theta}=0$$

Compute:

$$\frac{\partial L}{\partial \dot\theta}=ml^2\dot\theta$$

So:

$$\frac{d}{dt}\left(ml^2\dot\theta\right)=ml^2\ddot\theta$$

Next:

$$\frac{\partial L}{\partial \theta}=-mgl\sin\theta$$

Therefore:

$$ml^2\ddot\theta-(-mgl\sin\theta)=0$$

So:

$$ml^2\ddot\theta+mgl\sin\theta=0$$

Divide by $$ml^2$$:

$$\ddot\theta+\frac{g}{l}\sin\theta=0$$

This is the equation of motion for a pendulum.

For small angles:

$$\sin\theta\approx\theta$$

So:

$$\ddot\theta+\frac{g}{l}\theta=0$$

This is simple harmonic motion.


16. Why the Lagrangian Method Feels Like a Machine

The beauty of the Lagrangian approach is that it gives a repeatable recipe.

You do not need to identify every force directly.

You only need energy.

The recipe is:

$$L=T-V$$

Then:

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q}\right)-\frac{\partial L}{\partial q}=0$$

This produces the equation of motion.

For multiple coordinates:

$$q_1,q_2,\dots,q_n$$

you write one Euler–Lagrange equation for each coordinate:

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_i}\right)-\frac{\partial L}{\partial q_i}=0$$

for:

$$i=1,2,\dots,n$$

This is why the method is so useful for systems like the double pendulum, where Newton’s force approach becomes extremely messy.


17. From Particles to Fields

The principle of stationary action does not stop with particles.

It extends to fields.

A field assigns a value to every point in space and time.

Examples include:

  • electromagnetic field
  • gravitational field
  • quantum field
  • fluid velocity field
  • temperature field

For fields, the action is usually written as:

$$S=\int \mathcal{L},d^4x$$

where:

  • \mathcal{L} is the Lagrangian density
  • d4xd^4x means integration over spacetime

The field equations come from:

$$\delta S=0$$

This same idea gives equations such as:

  • Maxwell’s equations in electromagnetism
  • Einstein’s field equations in general relativity
  • Schrödinger’s equation in quantum mechanics
  • Dirac’s equation for relativistic electrons
  • equations of quantum field theory

This is why action is one of the deepest ideas in physics.


18. Quantum Mechanics and Action

In classical mechanics, the real path is the stationary-action path.

But quantum mechanics changes the picture.

In Feynman’s path integral formulation, a particle does not take just one path.

Instead, it explores all possible paths.

Each path contributes a complex phase:

$$e^{iS/\hbar}$$

where:

  • SS is the action of that path
  • \hbar is the reduced Planck constant
  • II is the imaginary unit

The probability amplitude is obtained by summing over all paths:

$$\text{Amplitude}=\sum_{\text{all paths}} e^{iS/\hbar}$$

Paths far from stationary action tend to cancel each other out because their phases oscillate rapidly.

Paths near stationary action reinforce each other.

That is why the classical path emerges from quantum mechanics.

In this view, the principle of stationary action is not just a classical rule.

It is a shadow of quantum interference.


19. Why Action Has Units of Planck’s Constant

Action has units of:

$$\text{energy}\times\text{time}$$

or equivalently:

$$\text{momentum}\times\text{distance}$$

Planck’s constant also has units of action.

That is:

$$[h]=[\text{action}]$$

This is not a coincidence.

Quantum mechanics becomes important when the action of a system is comparable to Planck’s constant:

$$S\sim\hbar$$

Classical mechanics appears when the action is much larger than Planck’s constant:

$$S\gg\hbar$$

So action is the bridge between classical and quantum physics.


20. The Big Idea

The principle of stationary action says:

$$\delta S=0$$

where:

$$S=\int L,dt$$

and for many mechanical systems:

$$L=T-V$$

This means:

The actual motion of a system is the one for which the action is stationary under small changes in the path.

From this one principle, we can derive:

$$F=ma$$

We can also derive the equations for pendulums, planetary motion, electromagnetic fields, relativity, and quantum systems.

The principle of least action is powerful because it replaces local force-by-force reasoning with a global rule about entire paths.

Instead of asking:

What force acts at this instant?

we ask:

Which complete path makes the action stationary?

That shift in thinking changed physics forever.


21. Summary

The story begins with the problem of fastest descent.

A bead sliding under gravity follows the cycloid, not the straight line.

Then Fermat showed that light follows the path of least time.

Bernoulli connected mechanics and optics through the brachistochrone problem.

Maupertuis proposed the idea of action.

Euler gave the idea mathematical structure.

Lagrange developed the calculus of variations.

Hamilton gave the modern form:

$$S=\int_{t_1}^{t_2}L,dt$$

with:

$$L=T-V$$

and:

$$\delta S=0$$

This leads to the Euler–Lagrange equation:

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q}\right)-\frac{\partial L}{\partial q}=0$$

The result is one of the most elegant principles in science:

Nature’s laws can be written as the condition that action is stationary.


References

  1. Feynman, Richard P., Robert B. Leighton, and Matthew Sands. The Feynman Lectures on Physics, Volume I: Chapter 26, Optics: The Principle of Least Time. California Institute of Technology.
  2. Goldstein, Herbert, Charles Poole, and John Safko. Classical Mechanics. 3rd ed. Addison-Wesley, 2001.
  3. Lanczos, Cornelius. The Variational Principles of Mechanics. 4th ed. Dover Publications, 1986.
  4. Landau, L. D., and E. M. Lifshitz. Mechanics. 3rd ed. Butterworth-Heinemann, 1976.
  5. Arnold, Vladimir I. Mathematical Methods of Classical Mechanics. 2nd ed. Springer, 1989.
  6. Taylor, John R. Classical Mechanics. University Science Books, 2005.
  7. Gelfand, I. M., and S. V. Fomin. Calculus of Variations. Dover Publications, 2000.
  8. Hamilton, William Rowan. “On a General Method in Dynamics.” Philosophical Transactions of the Royal Society, 1834.
  9. Maupertuis, Pierre Louis Moreau de. “Accord de différentes lois de la nature qui avaient jusqu’ici paru incompatibles.” Mémoires de l’Académie Royale des Sciences de Paris, 1744.
  10. Bernoulli, Johann. “Acta Eruditorum.” 1696–1697, original publications on the brachistochrone problem.

Leave a Reply

Discover more from nerd-ish

Subscribe now to keep reading and get access to the full archive.

Continue reading