Abstract
Often times, situations will come up in which a quantity is to be optimized for the maximum benefit. In this paper, we will use techniques from ordinary differential equations and other parts of calculus to solve problems relating to the calculus of variations, a field of study that is closely related to optimization. We will first derive the fundamental equation used in the calculus of variations: the Euler-Lagrange Equation. We will also demonstrate how the Euler-Lagrange Equation can be used to determine the shapes of geodesics on classical three-dimensional figures, such as cylinders, cones, and spheres, which is important for determining the shortest path between 2 points on the surface of a curve. We will also demonstrate how the calculus of variations can be used in finance and physics by showing how the Euler-Lagrange Equation can be used with certain profit trends to determine an optimal production function and by showing how the Euler-Lagrange Equation can be used to determine the path of objects subject to common physical conditions. In doing this, we show how the Calculus of Variations, specifically the Euler-Lagrange Equation, can be used to find optimal curves for situations involving an integral with a first-order differential equation as the integrand.
Introduction
Optimization questions have always been of interest in the field of mathematics. These types of problems typically deal with the idea that we wish to get the most out of a finite amount of input. For example, analysis and expansion upon knowledge of inequalities, such as the Cauchy-Schwarz inequality and subsequent results that came about relating to it (that are still being researched to this day), shows the long-held interest of mathematicians to optimize certain quantities, like an equation.
The study of the Calculus of Variations centers around the same idea. Problems relating to the Calculus of Variations usually involve a fixed parameter and a quantity that must be optimized. For example, the question of the shortest path between two points in a plane involves the use of the Calculus of Variations. The Calculus of Variations has been an important area of analysis with regard to both pure mathematics and other subjects, such as physics, leading to the proof of many important principles. For example, according to Simmons1 the use of Calculus of Variations was central to Einstein’s study of relativity, as well as Schrödinger’s derivation of the wave equation, important for the study of quantum mechanics.
Further reading of Simmons1, suggests the Calculus of Variations became a topic of interest in analysis when Euler and Lagrange derived what is now known as the Euler-Lagrange Equation. Before then, problems relating to the Calculus of Variations were considered by the ancient Greeks. After the creation of Calculus by Newton and Leibniz, these problems were investigated more deeply, and some of these problems were solved through other techniques. However, after the derivation of the Euler-Lagrange Equation, the Calculus of Variations was established as its own subject, and since then many mathematicians have studied and continue to make breakthroughs in this area of study.
In this paper, we shall prove the fundamental equation for the Calculus of Variations: The Euler-Lagrange Equation. Furthermore, we shall look at many situations covered from both Simmons1 and Depurkar2, relating to pure math or otherwise, where principles in the Calculus of Variations will be of particular use. Additionally, we will examine situations that are natural extensions of those mentioned in Simmons1 and Depurkar2, and use the Calculus of Variations to gain insights.
Euler-Lagrange Equation
Deriving the Euler-Lagrange Equation
Here, we shall establish some definitions and assumptions that will be used throughout the paper. First, we shall assume that all optimizing functions, , have continuous partial derivatives with respect to all variables (in our case, those variables are , , and , where is the independent variable and is the function we are trying to find that optimizes ). Next, define admissible functions, , as ones that satisfy all initial conditions and have continuous second derivatives.
We wish to find an admissible function such that the integral:
is minimized, where and are arbitrary bounds. First, we let (abbreviated here as ) be the function that actually minimizes the value of the integral. Then, we declare to be any function of the form:
where is the variation function of from , and is an arbitrary parameter. Note that at and , . This equation also implies that:
We know that at , the integral must have a minimum value, meaning that if we declare the integral as a function of and take the derivative with respect to , the value of the derivative with respect to at is , so we consider the following integral:
Differentiating under the integral sign with respect to , we have that:
which, after the use of the Multivariable Chain Rule, becomes:
Plugging in and setting , we get:
Note that and at , which is how we justify the substitution made. We then split the integral into two and use integration by parts on the integral , turning our equation into:
Since at and , we have that:
Our integral must be equal to , regardless of what the function actually is, therefore we can conclude that all functions that could possibly minimize the integral must satisfy the equation:
which is the well-known Euler-Lagrange Equation, one of the fundamental equations in the Calculus of Variations. In the demonstrations that follow, we will show how the Euler-Lagrange equations can be used to find optimal curves given an integral with a first order differential equation as the integrand. We will show how this equation can be used to show that some curves optimize quantities, such as length, surface area, profit, and time (which we will address in this paper), as well as find the said optimizing curves between points in both two-dimensions and three-dimensions.
Shortest Curve Between Two Points
The Euler-Lagrange Equation can be applied to many problems in the realm of mathematics. For example, Simmons1 proposes that we find the curve that minimizes the distance between two points in the plane, and . Here, we consider the formula for arc length for a function , given by:
Let . Applying the Euler-Lagrange Equation to , we have that:
Integrating both sides with respect to , we have that:
for some arbitrary constant . We then solve for , which is given by:
The right-hand side of our equation is simply a constant, so let , which means that for some arbitrary constant is, indeed, a straight line, as desired.
Minimized Surface Area Created By a Curve
Simmons1 proposes the following: we consider two points, and , and look for the curve connecting the two points such that when the curve is rotated about the x-axis, the surface area of the resultant shape is minimized. We will assume here that the points are chosen in a way such that a minimizing curve exists. To find this curve, which we will call , we shall consider the formula for surface area in this case, given by the integral:
Applying the Euler-Lagrange Equation to the function , we get that must satisfy the equation:
We multiply both sides by , which gets us:
Since , we get that:
Integrating both sides and plugging in , we get:
Let . Solving for , we get that:
Integrating both sides, we have that:
where is an arbitrary constant. This can be manipulated to get that:
Minimized Area Under a Curve
Simmons1 proposes the following: we consider two points, and , and the first-quadrant curve connecting them, . Suppose that the area under the curve is held constant. Our goal is to find the curve that minimizes the arc length between the two points. To find the curve, we want to find that minimizes:
given that:
where is a constant. Here, we will employ the method of Lagrange Multipliers by declaring a function:
Applying the Euler-Lagrange equation, we see that all possible minimizing functions must satisfy the equation:
Adding to both sides and integrating, we get
Squaring both sides and solving for , we get that:
Integrating both sides, we get
which is a semi-circular arc of radius .
Geodesics on Two Dimensional Surfaces
Shortest Path Along a Cylinder
Consider a cylinder of the form in three-dimensional space. Suppose that we choose two points on this figure, and we desire to model the path of the shortest length along the cylinder between these two points, which we will call at and at . This is known as a geodesic, or a curve representing the shortest path between two points on a surface. To find this, we consider the equation for 3-dimensional arc length:
Now, we let and , which transforms our integral into:
Simplifying the integral using Pythagorean Identities, we get that our integral becomes:
Now, we declare a new function :
and apply the Euler-Lagrange Equation:
Note that is a constant, meaning that the right-hand side of our equation is also a constant, which we will set equal to . By integrating both sides, we get that:
where is a constant. Plugging in our initial conditions and letting and be our angles at our positions at time and , respectively, we get that:
Solving this system of equations, we get that:
Therefore, our optimal curve is:
Obviously, this tactic of putting and in terms of polar coordinates doesn’t always work for other objects, and if we modelled this curve between two points on some other three-dimensional shape, it is unlikely that the curve would be the shortest path between said points. However, the general idea of trying to reduce the number of variables using alternative coordinate systems, like polar coordinates, will be useful as we consider geodesics on other objects, such as cones and cylinders.
Geodesics on a Cone
Now, consider a cone of the form with . We desire to find the optimal path between two points along the cone, calling them at and at . Again, we will consider the arc-length formula in three dimensions:
We let and . Our integral becomes:
We can now declare a function :
and apply the Euler-Lagrange equation:
We multiply both sides by to get:
Since , we have that our equation becomes:
Integrating both sides, we get:
where is a constant. Plugging in , we get:
Solving for , we get:
Solving the differential equation, we get that, when (where represents a constant), our optimal curve (given that it passes through our initial points):
As seen with the previous two situations, the general strategy is to consider an alternative coordinate system (in both cases, polar coordinates) in order to reduce the number of variables in our integrand so that we can get the integrand into a form that allows for the Euler-Lagrange equation to be used. In both cases, converting to polar form allowed us to manipulate our integral into a form that allows us to use the Euler-Lagrange equation. However, polar form does not always yield a usable form, and we must consider other coordinate systems.
Geodesics on a Sphere
Now, consider a unit sphere, , and two points on the sphere, at and at . The arc-length between the two points is once again given by:
We then consider the substitutions from Cartesian coordinates to Spherical coordinates:
Our integral becomes:
Now, let
We now apply the Euler-Lagrange Equation to :
Multiplying both sides by gives:
Since , our equation becomes:
Integrating both sides (where represents an arbitrary constant) gives:
We plug into our equation:
Solving for , we get:
We assume that due to domain and range constraints. Integrating both sides ( represents an arbitrary constant):
Therefore, our optimal curve would be the one that passes through the initial points and satisfies the equation:
Applications
Previously, we have derived a way to find the optimal curve/function and applied that method to various aspects of pure mathematics. In this section, we will continue to use the previously derived equations and apply them to other topics. These topics include both finance and physics.
Finance
Depurkar2 proposes a situation where a firm must produce units of a good in hours, and they would like to minimize the cost of production. The firm produces such that represents the number of units produced at time , where is represented in hours. This means that the change in the number of units produced at time is equal to . The firm has two sources of cost:
- One hour of storage costs dollars per unit.
- Production at a rate of rises linearly with the production rate. That is, for every 1 unit increase in the production rate, the cost of production per unit increases by dollars.
Both of these quantities can be expressed in terms of :
The cost from storage from time to can be represented as:
The cost from production at a rate of from time to can be represented as:
This means that the total cost can be represented as:
To minimize the cost, we find the function that minimizes the value of the integral given the initial conditions that and . To do this, we apply the Euler-Lagrange Equation to the function :
where represents an arbitrary constant. Solving this differential equation, we get that:
where represents an arbitrary constant. We plug in our initial conditions of and , and we get that and , getting us the equation:
Although not every production function will be as described, there are many production functions that will ultimately result in an integral with a first-order differential equation as the integrand. In these situations, the Euler-Lagrange equation can still be used, and the resulting differential equation that results can be solved to yield the optimal production function.
Physics
Consider two points, and with . A bead with mass starts at rest and travels along a frictionless curve that connects the two points. We would like to find the optimal curve, , such that the time the bead takes to travel from to is minimized. Simmons1 notes this as the well-known Brachistochrone Problem proposed by Johann Bernoulli in 1696. This situation is notable due to the common physical conditions that are of interest, such as gravity and conservation of mechanical energy. Additionally, time optimization is of great importance, and we show here how the Calculus of Variations can be used to better optimize time scenarios.
To solve this, we use the basic principle that time equals distance divided by speed. Then, by Conservation of Mechanical Energy, we have that , where represents the potential energy at time and represents the kinetic energy at time . Since and (where represents the distance traveled by the bead), we have that:
Since , where is the velocity of the bead, the velocity of the bead is equal to . The distance that must be traveled at that instant can be found using the formula for arc length. Therefore, the total time it takes for the bead to travel along the path is equal to:
From there, let
and apply the Euler-Lagrange Equation:
We multiply both sides by , which gets us:
Since , we get that:
We integrate both sides to get:
where is a constant. Plugging in , we have:
Solving for , we get that:
where represents an arbitrary constant.
Conclusion
The use of the Calculus of Variations to solve optimization problems, both within pure mathematics and otherwise, has become very prominent since the derivation of the Euler-Lagrange equation, as shown above. Our derivation of the equations for geodesics, in addition to the derivation of many optimal curves that are applicable to other fields of study, such as physics and finance, demonstrate the importance of the results obtained from the study of the Calculus of Variations, as well as further results that will come up in the future.
Acknowledgments
I am extremely grateful to Mr. Julius Baldauf and the Lumiere program for supporting my research.