Newton's second law of motion in the three-dimensional space can be written as the vector equation
It turns out that there is a direct relationship between (2.40) and the Euler-Lagrange equation. This is difficult to see right now because the notation in (2.40) is very different from the one we have been using. So, let us modify our earlier notation to better match (2.40). First, let us write instead of for the independent variable. Second, let us write instead of for the dependent variable. Then also becomes and we have . In the new notation, the Euler-Lagrange equation becomes
Some remarks on the above change of notation are in order (as it is also a preview of things to come).The change from to is conceptually significant, because it implies that the curves are parameterized by time and thus describe some dynamic behavior (e.g., trajectories of a moving object). In problems such as Dido's problem or catenary problem, where there is no motion with respect to time, this notation would not be justified. Other variational problems, such as the brachistochrone problem, do indeed deal with paths of a moving particle. (We did not, however, explicitly use time when formulating the brachistochrone problem, and it would not make sense to just relabel as in our earlier formulation of that problem. Instead, we would need to reparameterize the -trajectories with respect to time, which yields a different Lagrangian; we will do this in Section 3.2.) In mechanics, as well as in control theory, time is the default independent variable. Accordingly, when we come to the optimal control part of the book, we will make this change of notation permanent. For now, we adopt it just temporarily while we discuss applications of calculus of variations in mechanics. As for the (time-)dependent coordinate variables, it is natural to denote them by and for planar curves and by , and for spatial curves. In general, the choice of a label for the coordinate vector is just a matter of preference and convention; the mechanics literature typically favors or , while in control theory it is customary to use .
Let us now compare (2.41) with (2.40). Is there a choice of the Lagrangian that would make these two equations the same? The answer is yes, and the reader will have no difficulty in seeing that the following Lagrangian does the job:
which is called the action integral. In general, these extremals are not necessarily minima. However, they are indeed minima--hence the term ``least action" is accurate--if the time horizon is sufficiently short; this will follow from the second-order sufficient condition for optimality, to be derived in Section 2.6.2.
For example, if the potential is 0, then the trajectories are the extremals of , which are straight lines. We saw in Example 2.2 that straight lines arise as extremals when the Lagrangian is the arclength; the kinetic energy gives the same extremals. In the presence of gravity, the paths along which the action integral is minimized can be viewed as ``straight lines" (shortest paths, or geodesics) in a curved space whose metric is determined by gravitational forces. This view of mechanics forms the basis for Einstein's theory of general relativity.
Observe the difference between the law that the derivative of the momentum equals the force and the principle that the action integral is minimized. The former condition holds pointwise in time, while the latter is a statement about the entire trajectory. However, their relation is not surprising, because if the action integral is minimized then every small piece of the trajectory must also deliver minimal action. In the limit as the length of the piece approaches 0, we recover the differential statement. We already discussed this point in the general context of the Euler-Lagrange equation (see page ).
Now, what is the physical meaning of the Hamiltonian? Substituting for in (2.28)-(2.29) and using (2.42), we have
which is the total energy (kinetic plus potential). We see that the Hamiltonian not only enables a convenient rewriting of the Euler-Lagrange equation in the form of the canonical equations (2.30), but also has a very clear mechanical interpretation.
Recall that in Section 2.3.4 we studied two special cases for which we found conserved quantities, i.e., functions that remain constant along extremals. We now revisit these two conservation laws--as well as another related case--in the context of the principle of least action, which permits us to see their physical meaning.
CONSERVATION OF ENERGY. In a conservative system, the potential is fixed and does not change with time. Since the kinetic energy does not explicitly depend on time either, we have . In other words, the Lagrangian is invariant under time shifting. In our old notation, this corresponds to the ``no " case from Section 2.3.4, and we know that in this case the Hamiltonian is conserved. We also just saw that the Hamiltonian is the total energy of the system. Therefore, this is nothing but the well-known principle of conservation of energy.
CONSERVATION OF MOMENTUM. Suppose that no force is acting on the system (i.e., the system is closed). Since the force is given by , this implies that must be constant. The kinetic energy depends on but not on . Thus the Lagrangian does not explicitly depend on , which means that it is invariant under parallel translations. This situation corresponds to the ``no " case from Section 2.3.4, where we saw that the momentum, in the present notation, is conserved. A more general statement is that for each coordinate that does not appear in , the corresponding component of the momentum is conserved.
CONSERVATION OF ANGULAR MOMENTUM. Consider a planar motion in a central field; in polar coordinates , this is defined by the property that , i.e., the potential depends only on the radius and not on the angle. This means that no torque is acting on the system, making the Lagrangian invariant under rotations. Now we can use the fact (noted at the end of Section 2.3.3) that the Euler-Lagrange equation looks the same in all coordinate systems. In particular, in polar coordinates we have
These are familiar expressions for the angular momentum.
We remark that all of the above examples are special instances of a general result known as Noether's theorem, which says that invariance of the action integral under some transformation (e.g., time shift, translation, rotation) implies the existence of a conserved quantity.