1: Introduction to Lagrange Multipliers
( \newcommand{\kernel}{\mathrm{null}\,}\)
To avoid repetition, it is to be understood throughout that f and g1, g2,…, gm are continuously differentiable on an open set D in Rn.
Suppose that m<n and
g1(X)=g2(X)=⋯=gm(X)=0
on a nonempty subset D1 of D. If X0∈D1 and there is a neighborhood N of X0 such that
f(X)≤f(X0)
for every X in N∩D1, then X0 is a local maximum point of f subject to the constraints Equation ???. However, we will usually say “subject to” rather than “subject to the constraint(s).”
If Equation ??? is replaced by
f(X)≥f(X0),
then “maximum” is replaced by “minimum.” A local maximum or minimum of f subject to Equation ??? is also called a local extreme point of f subject to Equation ???. More briefly, we also speak of constrained local maximum, minimum, or extreme points. If Equation ??? or Equation ??? holds for all X in D1, we omit “local.”
Recall that X0=(x10,x20,…,xn0) is a critical point of a differentiable function L=L(x1,x2,…,xn) if
Lxi(x10,x20,…,xn0)=0,1≤i≤n.
Therefore, every local extreme point of L is a critical point of L; however, a critical point of L is not necessarily a local extreme point of L.
Suppose that the system Equation ??? of simultaneous equations can be solved for x1, …, xm in terms of the xm+1, …, xn; thus,
xj=hj(xm+1,…,xn),1≤j≤m.
Then a constrained extreme value of f is an unconstrained extreme value of
f(h1(xm+1,…,xn),…,hm(xm+1,…,xn),xm+1,…,xn).
However, it may be difficult or impossible to find explicit formulas for h1, h2, …, hm, and, even if it is possible, the composite function Equation ??? is almost always complicated. Fortunately, there is a better way to to find constrained extrema, which also requires the solvability assumption, but does not require an explicit formula as indicated in Equation ???. It is based on the following theorem. Since the proof is complicated, we consider two special cases first.
Suppose that n>m. If X0 is a local extreme point of f subject to
g1(X)=g2(X)=⋯=gm(X)=0
and
|∂g1(X0)∂xr1∂g1(X0)∂xr2⋯∂g1(X0)∂xrm∂g2(X0)∂xr1∂g2(X0)∂xr2⋯∂gm(X0)∂xrm⋮⋮⋱⋮∂gm(X0)∂xr1∂gm(X0)∂xr2⋯∂gm(X0)∂xrm|≠0
for at least one choice of r1<r2<⋯<rm in {1,2,…,n}, then there are constants λ1, λ2, …, λm such that X0 is a critical point of
f−λ1g1−λ2g2−⋯−λmgm;
that is,
∂f(X0)∂xi−λ1∂g1(X0)∂xi−λ2∂g2(X0)∂xi−⋯−λm∂gm(X0)∂xi=0,
1≤i≤n.
The following implementation of this theorem is the method of Lagrange multipliers.
- Find the critical points of f−λ1g1−λ2g2−⋯−λmgm,treating λ1, λ2, …λm as unspecified constants.
- Find λ1, λ2, …, λm so that the critical points obtained in (a) satisfy the constraints.
- Determine which of the critical points are constrained extreme points of f. This can usually be done by physical or intuitive arguments.
- If a and b1, b2, …, bm are nonzero constants and c is an arbitrary constant, then the local extreme points of f subject to g1=g2=⋯=gm=0 are the same as the local extreme points of af−c subject to b1g1=b2g2=⋯=bmgm=0. Therefore, we can replace f−λ1g1−λ2g2−⋯−λmgm by af−λ1b1g1−λ2b2g2−⋯−λmbmgm−c to simplify computations. (Usually, the “−c” indicates dropping additive constants.) We will denote the final form by L (for Lagrangian).
[theorem:1]