2: Extrema Subject to One Constraint
( \newcommand{\kernel}{\mathrm{null}\,}\)
Here is Theorem [theorem:1] with m=1.
Suppose that n>1. If X0 is a local extreme point of f subject to g(X)=0 and gxr(X0)≠0 for some r∈{1,2,…,n}, then there is a constant λ such that
fxi(X0)−λgxi(X0)=0, 1≤i≤n; thus, X0 is a critical point of f−λg.
For notational convenience, let r=1 and denote
U=(x2,x3,…xn) and U0=(x20,x30,…xn0).
Since gx1(X0)≠0, the Implicit Function Theorem (Corollary 6.4.2, p. 423) implies that there is a unique continuously differentiable function h=h(U), defined on a neighborhood N⊂Rn−1 of U0, such that (h(U),U)∈D for all U∈N, h(U0)=x10, and
g(h(U),U)=0,U∈N.
Now define
λ=fx1(X0)gx1(X0),
which is permissible, since gx1(X0)≠0. This implies Equation ??? with i=1. If i>1, differentiating Equation ??? with respect to xi yields
∂g(h(U),U)∂xi+∂g(h(U),U)∂x1∂h(U)∂xi=0,U∈N.
Also,
∂f(h(U),U))∂xi=∂f(h(U),U)∂xi+∂f(h(U),U)∂x1∂h(U)∂xi,U∈N.
Since (h(U0),U0)=X0, Equation ??? implies that
∂g(X0)∂xi+∂g(X0)∂x1∂h(U0)∂xi=0.
If X0 is a local extreme point of f subject to g(X)=0, then U0 is an unconstrained local extreme point of f(h(U),U); therefore, Equation ??? implies that
∂f(X0)∂xi+∂f(X0)∂x1∂h(U0)∂xi=0.
Since a linear homogeneous system
[abcd][uv]=[00]
has a nontrivial solution if and only if
|abcd|=0, (Theorem 6.1.15), Equations ??? and ??? imply that
|∂f(X0)∂xi∂f(X0)∂x1∂g(X0)∂xi∂g(X0)∂x1|=0,\; so\;\;|∂f(X0)∂xi∂g(X0)∂xi∂f(X0)∂x1∂g(X0)∂x1|=0,
since the determinants of a matrix and its transpose are equal. Therefore, the system
[∂f(X0)∂xi∂g(X0)∂xi∂f(X0)∂x1∂g(X0)∂x1][uv]=[00]
has a nontrivial solution (Theorem 6.1.15). Since gx1(X0)≠0, u must be nonzero in a nontrivial solution. Hence, we may assume that u=1, so
[∂f(X0)∂xi∂g(X0)∂xi∂f(X0)∂x1∂g(X0)∂x1][1v]=[00].
In particular,
∂f(X0)∂x1+v∂g(X0)∂x1=0,\; so\;\;−v=fx1(X0)gx1(X0).
Now Equation ??? implies that −v=λ, and Equation ??? becomes
[∂f(X0)∂xi∂g(X0)∂xi∂f(X0)∂x1∂g(X0)∂x1][1−λ]=[00].
Computing the topmost entry of the vector on the left yields Equation ???.
[example:1]
Find the point (x0,y0) on the line
ax+by=d
closest to a given point (x1,y1).
Solution
We must minimize √(x−x1)2+(y−y1)2 subject to the constraint. This is equivalent to minimizing (x−x1)2+(y−y1)2 subject to the constraint, which is simpler. For, this we could let
L=(x−x1)2+(y−y1)2−λ(ax+by−d);
however,
L=(x−x1)2+(y−y1)22−λ(ax+by)
is better. Since
Lx=x−x1−λa\quad and \quad Ly=y−y1−λb,
(x0,y0)=(x1+λa,y1+λb), where we must choose λ so that ax0+by0=d. Therefore,
ax0+by0=ax1+by1+λ(a2+b2)=d,
so
λ=d−ax1−by1a2+b2,
x0=x1+(d−ax1−by1)aa2+b2, and y0=y1+(d−ax1−by1)ba2+b2.
The distance from (x1,y1) to the line is
√(x0−x1)2+(y0−y1)2=|d−ax1−by1|√a2+b2.
[example:2]
Find the extreme values of f(x,y)=2x+y subject to
x2+y2=4.
Solution
Let
L=2x+y−λ2(x2+y2);
then
Lx=2−λx and Ly=1−λy,
so (x0,y0)=(2/λ,1/λ). Since x20+y20=4, λ=±√5/2. Hence, the constrained maximum is 2√5, attained at (4/√5,2/√5), and the constrained minimum is −2√5, attained at (−4/√5,−2/√5).
[example:3]
Find the point in the plane
3x+4y+z=1 closest to (−1,1,1).
Solution
We must minimize
f(x,y,z)=(x+1)2+(y−1)2+(z−1)2
subject to Equation ???. Let
L=(x+1)2+(y−1)2+(z−1)22−λ(3x+4y+z);
then
Lx=x+1−3λ,Ly=y−1−4λ, and Lz=z−1−λ,
so
x0=−1+3λ,y0=1+4λ,z0=1+λ.
From Equation ???,
3(−1+3λ)+4(1+4λ)+(1+λ)−1=1+26λ=0, so λ=−1/26 and
(x0,y0,z0)=(−2926,2226,2526).
The distance from (x0,y0,z0) to (−1,1,1) is
√(x0+1)2+(y0−1)2+(z0−1)2=1√26.
[example:4]
Assume that n≥2 and xi≥0, 1≤i≤n.
- Find the extreme values of n∑i=1xi subject to n∑i=1x2i=1.
- Find the minimum value of n∑i=1x2i subject to n∑i=1xi=1.
(a) Let
L=n∑i=1xi−λ2n∑i=1x2i; then
Lxi=1−λxi,\; so\;\;xi0=1λ,1≤i≤n.
Hence, n∑i=1x2i0=n/λ2, so λ=±√n and
(x10,x20,…,xn0)=±(1√n,1√n,…,1√n).
Therefore, the constrained maximum is √n and the constrained minimum is −√n.
(b) Let
L=12n∑i=1x2i−λn∑i=1xi; then
Lxi=xi−λ,\; so\;\;xi0=λ,1≤i≤n.
Hence, n∑i=1xi0=nλ=1, so xi0=λ=1/n and the constrained minimum is
n∑i=1x2i0=1n There is no constrained maximum. (Why?)
[example:5]
Show that
x1/py1/q≤xp+yq,x,y≥0,
if
1p+1q=1,p>0, and q>0.
Solution
We first find the maximum of
f(x,y)=x1/py1/q
subject to
xp+yq=σ,x≥0,y≥0,
where σ is a fixed but arbitrary positive number. Since f is continuous, it must assume a maximum at some point (x0,y0) on the line segment Equation ???, and (x0,y0) cannot be an endpoint of the segment, since f(pσ,0)=f(0,qσ)=0. Therefore, (x0,y0) is in the open first quadrant.
Let
L=x1/py1/q−λ(xp+yq).
Then
Lx=1pxf(x,y)−λp and Ly=1qyf(x,y)−λq=0,
so x0=y0=f(x0,y0)/λ. Now Equations ??? and ??? imply that x0=y0=σ. Therefore,
f(x,y)≤f(σ,σ)=σ1/pσ1/q=σ=xp+yq.
This can be generalized (Exercise [exer:53]). It can also be used to generalize Schwarz’s inequality (Exercise [exer:54]).
[theorem:2]