This video was produced by The Kaizen Effect.
The derivation begins by expressing the problem (which is to find the minimum value of a functional ) in the language of single-variable calculus—meaning, we’ll want to express the functional as a function of the single variable (which I’ll describe later) so that we can use the techniques of single-variable calculus to find the minimum value of which occurs when . Later on, we’ll deal with the more general case in which we solve for the stationary points of . Let the set of coordinates be generalized coordinates which are dependent variables of the independent variable . Let the quantity be a parametric quantity whose magnitude is equal to the length of the curve where can be any arbitrary curve. (This length specifies the magnitude of our parametric quantity—which isn’t limited to being just physical length but can also be an action, a period of time, and so on.)
Let the two coordinates and denote the initial and final coordinate values associated with a system, respectively. In many physics problems, these coordinate values are typically taken to denote one time coordinate (in which case we’d replace the independent variable with ) and the rest of the coordinates are typically taken to denote whichever spatial coordinates are the most convenient for a given problem; but in geometrical problems the generalized coordinates are, of course, taken to be all spatial coordinates. The choice of what kinds of generalized coordinates to use really just depends on the problem you’re trying to solve.
We’ll let be any parametric quantity associated with a system going from to , even those which are not minimized. Now, the whole purpose of this section will be to find the minimum value of —those points in which the parametric quantity does not change with respect to the variables it depends on. But to do this, we must first write an expression which determines the length of any arbitary curve. How does one calculate the magnitude ? To do this, let’s divide the curve into infinitely many, infinitesimally small line segments of length . By taking the infinite sum (which is to say, by taking the integral) of all these small lengths of , we can find that magnitude of is given by
Equation (1) is nice and all, but we should re-express it in terms of something which can be calculated in terms of the independent variable . As a first steps towards doing this, we can rewriting the length using the Pythagorean Theorem to obtain . Let’s substitute this equation into Equation (1) to get
I have written the question marks in the limits of integration to denote that I’m leaving them out for the moment. Using algebraic manipulations, we can express the integral with respect to the independent variable to obtain
where the integrand is some functional of , and and is denoted by . (A functional is something which is a function of a function.) To find the minimum of would involve a procedure which you are already familiar with: the minimum occurs at the point where will not change (up to the first order) with a small change in ; or, written in another way, where . Finding the minimum value of isn’t quite so simple. The minimum value of corresponds to a point where does not change, up to the first order, with small changes in , and . To find this minimum, we must use a technique known as calculus of variations: this is, basically, a procedure in which we use clever techniques to express as a function of a single independent variable so that we can use the techniques of single-variable calculus in order to find its minimum value.
The first step necessary to accomplish this goal will be to assume that there is a curve which is that particular curve whose arc length is minimized. As previously mentioned, we shall let represent any curve between and so long that it is everywhere smooth and continuous. We shall, however, require the two constraints that and . We shall now define a new function which we will let be any smooth curve such that and . Let’s also define a parameter which we'll call which we shall let be defined by the equation
The product is the error between the “correct path” (the one whose arc length is minimized) and the arbitrarily chosen path . By simply letting be a particular function (pick any you like; I have chosen the one illustrated in Figure #), so long as it satisfies the aforementioned constraints, then we can vary with the single parameter and write . The previous sentence, for the purpose of comprehensibility, requires a little explanation. For the two fixed initial conditions and , the function does not vary with the two functions and . The reason why does not vary with is because will not change regardless of what is— depends upon only the initial conditions and being different. Basically, it would be very easy to see visually, on a graph, that by choosing two different initial conditions, the shortest path () connecting those two points will also have to be different.
Figure 1 (click to expand)
Lastly, since we let be a particular function, it follows that it also only depends on the initial conditions. (As you move the two points and apart or towards each other, you could imagine having to elongate or contract.) It follows that is, therefore, not a function of . I have shown in Figure 1 how (due to the way in which we defined it by Equation (1)) varies with in such a way that by adding to the "correct function" , we always manage to land on . Now, represents "any" arbitrary curve; indeed, we could change to whatever we wanted and would still satisfy Equation (1). In other words, we could just add a different function (where changed a little but did not) to and land on again as in Figure 1. What all of this means is that the only thing which depends on in Equation (1) is ; therefore, we can write
By taking the derivative with the respect to on both sides, we get
At this point, we are now able to express the functional as the function . The minimum value of occurs at a point where . In order to investigate the mathematical relationships which satisfy this condition (the condition that is minimized), let’s differentiate both sides of Equation (3), set it equal to zero, and then proceed to use algebra to find mathematical relationships which satisfy this condition. Starting with the first step, we have
(To clarify any potential confusion, I took the partial derivative on both sides; since the function on the left-hand side is a single-variable function, it follows that .) Since is a functional, in order to evaluate the partial derivative , we must use the chain rule to get
Let’s evaluate the partial derivatives and to get
and
Let’s substitute these results into Equation (8) to get
There is great value in employing integration by parts on the second integral in Equation (9) since it’ll allow us to rewrite the integrand of the form, ; this form has the equations of motion right in front of our face as we shall see. From the standpoint of physics, the motivation of this is apparent as the equations of motion will allow us to determine the motion of a system. Recall that the equation for integrating by parts is given by
If we let and , then our second integral can be simplified to
Let’s substitute this result into Equation (9) to get
Since can be any arbitrary function it is, in general, not equal to zero. Therefore, the other term in the product must be zero and we have
Equation (11) is known as the Euler-Lagrange equation and it is the mathematical consequence of minimizing a functional . It is a differential equation which can be solved for the dependent variable(s) such that the functional is minimized. The next few sections will be concerned with different problems in which the question starts off as: find the minimum value of some quantity . These problems start off with a little math to express the quantity as a functional. All of the problems boil down to solving for the coordinates which minimize ; this will be accomplished by solving Equation (11). Although simple to say, we shall see that this can, sometimes, involve a lot of algebra and tinkering—the math will sometimes get a little hairy.
This article is licensed under a CC BY-NC-SA 4.0 license.
References
1. The Kaizen Effect. "Lagrangian Mechanics - Lesson 1: Deriving the Euler-Lagrange Equation & Introduction". Online video clip. YouTube. YouTube, 04 May 2016. Web. 18 May 2017.
Notes
1. When we think about the curve which minimizes the quantity , it is important not to lose track of the generality of our choice of coordinates and . In some problems, we'll just choose and to be spatial coordinates in which case is a measure of distance; but in other problems, we'll choose to be a time coordinate in which case is not a measure of distance. I wanted to mention this early on because a common confusion and ambiguity is whether or not this derivation we'll be doing in this section applies only to functionals which measure length. Be reassured that this is not the case; can measure many other things besides length as we'll see in subsequent sections where we solve some problems using the analysis we developed in this section.
2. The minimum value of some arbitrary single variable function, say , occurs when . This condition implies that for a very small change in time , the change in the function is . You might be wondering: “if changed by a very small amount, then why didn’t change by a very small amount as well?” In reality, did in fact change a little: but this change is captured in only 2nd order (and higher) derivatives and, according to Feynman, “the deviation of the function from its minimum value is only second order [or higher].” The full expression describing the differential change in is, in general, a function of the nth order derivative. In this example, the change in as a function of the first order derivative is zero. The terminology and phrasing used to describe the previous sentence is as follows: we say that “the function does not change up to the first order.”