In Chapter 5, integration was treated as antidifferentiation. Now we adopt another, measure-theoretical approach.
Lebesgue's original theory was based on Lebesgue measure (Chapter 7, §8). The more general modern treatment develops the integral for functions f : S \rightarrow E in an arbitrary measure space. Henceforth, (S, \mathcal{M}, m) is fixed, and the range space E is E^{1}, E^{*}, C, E^{n}, or another complete normed space. Recall that in such a space, \sum_{i}\left|a_{i}\right|<\infty implies that \sum a_{i} converges and is permutable (Chapter 7, §2).
We start with elementary maps, including simple maps as a special case.
Definition
Let f : S \rightarrow E be elementary on A \in \mathcal{M}; so f=a_{i} on A_{i} for some \mathcal{M}-partition
A=\bigcup_{i} A_{i} \text { (disjoint).}
(Note that there may be many such partitions.)
We say that f is integrable (with respect to m), or m-integrable, on A iff
\sum\left|a_{i}\right| m A_{i}<\infty.
(The notation "|a_{i}| m A_{i}" always makes sense by our conventions (2*) in Chapter 4, §4.) If m is Lebesgue measure, then we say that f is Lebesgue integrable, or L-integrable.
We then define \int_{A} f, the m-integral of f on A, by
\int_{A} f=\int_{A} f d m=\sum_{i} a_{i} m A_{i}.
(The notation "dm" is used to specify the measure m.)
The "classical" notation for \int_{A} f d m is \int_{A} f(x) d m(x).
Note 1. The assumption
\sum\left|a_{i}\right| m A_{i}<\infty
implies
(\forall i) \quad\left|a_{i}\right| m A_{i}<\infty;
so a_{i}=0 if m A_{i}=\infty, and m A_{i}=0 if |a_{i}|=\infty. Thus by our conventions, all "bad" terms a_{i} m A_{i} vanish. Hence the sum in (1) makes sense and is finite.
Note 2. This sum is also independent of the particular choice of \{A_{i}\}. For if \{B_{k}\} is another \mathcal{M}-partition of A, with f=b_{k} on B_{k}, say, then f=a_{i}=b_{k} on A_{i} \cap B_{k} whenever A_{i} \cap B_{k} \neq \emptyset. Also,
(\forall i) \quad A_{i}=\bigcup_{k}\left(A_{i} \cap B_{k}\right) \text { (disjoint);}
so
(\forall i) \quad a_{i} m A_{i}=\sum_{k} a_{i} m(A_{i} \cap B_{k}),
and hence (see Theorem 2 of Chapter 7, §2, and Problem 11 there)
\sum_{i} a_{i} m A_{i}=\sum_{i} \sum_{k} a_{i} m\left(A_{i} \cap B_{k}\right)=\sum_{k} \sum_{i} b_{k} m\left(A_{i} \cap B_{k}\right)=\sum_{k} b_{k} m B_{k}.
(Explain!)
This makes our definition (1) unambiguous and allows us to choose any \mathcal{M}-partition \{A_{i}\}, with f constant on each A_{i}, when forming integrals (1).
Corollary \PageIndex{1}
Let f : S \rightarrow E be elementary and integrable on A \in \mathcal{M}. Then the following statements are true.
(i) |f|<\infty a.e. on A.
(ii) f and |f| are elementary and integrable on any \mathcal{M}-set B \subseteq A, and
\left|\int_{B} f\right| \leq \int_{B}|f| \leq \int_{A}|f|.
(iii) The set B=A(f \neq 0) is \sigma-finite (Definition 4 in Chapter 7, §5), and
\int_{A} f=\int_{B} f.
(iv) If f=a (constant) on A,
\int_{A} f=a \cdot m A.
(v) \int_{A}|f|=0 iff f=0 a.e. on A.
(vi) If m Q=0, then
\int_{A} f=\int_{A-Q} f
(so we may neglect sets of measure 0 in integrals).
(vii) For any k in the scalar field of E, k f is elementary and integrable, and
\int_{A} k f=k \int_{A} f.
Note that if f is scalar valued, k may be a vector. If E=E^{*}, we assume k \in E^{1}.
- Proof
-
(i) By Note 1, |f|=|a_{i}|=\infty only on those A_{i} with m A_{i}=0. Let Q be the union of all such A_{i}. Then m Q=0 and |f|<\infty on A-Q, proving (i).
(ii) If \{A_{i}\} is an \mathcal{M}-partition of A,\{B \cap A_{i}\} is one for B. (Verify!) We have f=a_{i} and |f|=|a_{i}| on B \cap A_{i} \subseteq A_{i}.
Also,
\sum\left|a_{i}\right| m\left(B \cap A_{i}\right) \leq \sum\left|a_{i}\right| m A_{i}<\infty.
(Why?) Thus f and |f| are elementary and integrable on B, and (ii) easily follows by formula (1).
(iii) By Note 1, f=0 on A_{i} if m A_{i}=\infty. Thus f \neq 0 on A_{i} only if m A_{i}<\infty. Let \{A_{i_{k}}\} be the subsequence of those A_{i} on which f \neq 0; so
(\forall k) \quad m A_{i_{k}}<\infty.
Also,
B=A(f \neq 0)=\bigcup_{k} A_{i_{k}} \in \mathcal{M} \text{ (}\sigma \text {-finite!).}
By (ii), f is elementary and integrable on B. Also,
\int_{B} f=\sum_{k} a_{i_{k}} m A_{i_{k}},
while
\int_{A} f=\sum_{i} a_{i} m A_{i}.
These sums differ only by terms with a_{i}=0. Thus (iii) follows.
The proof of (iv)-(vii) is left to the reader.\quad \square
Note 3. If f : S \rightarrow E^{*} is elementary and sign-constant on A, we also allow that
\int_{A} f=\sum_{i} a_{i} m A_{i}=\pm \infty.
Thus here \int_{A} f exists even if f is not integrable. Apart from claims of integrability and \sigma-finiteness, Corollary 1(ii)-(vii) hold for such f, with the same proofs.
Example
Let m be Lebesgue measure in E^{1}. Define f=1 on R (rationals) and f=0 on E^{1}-R ; see Chapter 4, §1, Example (c). Let A=[0,1].
By Corollary 1 in Chapter 7, §8, A \cap R \in \mathcal{M}^{*} and m(A \cap R)=0. Also, A-R \in \mathcal{M}^{*}.
Thus \{A \cap R, A-R\} is an \mathcal{M}^{*}-partition of A, with f=1 on A \cap R and f=0 on A-R.
Hence f is elementary and integrable on A, and
\int_{A} f=1 \cdot m(A \cap R)+0 \cdot m(A-R)=0.
Thus f is L-integrable (even though it is nowhere continuous).
Theorem \PageIndex{1} (additivity)
(i) If f : S \rightarrow E is elementary and integrable or elementary and nonnegative on A \in \mathcal{M}, then
\int_{A} f=\sum_{k} \int_{B_{k}} f
for any \mathcal{M}-partition \left\{B_{k}\right\} of A.
(ii) If f is elementary and integrable on each set B_{k} of a finite \mathcal{M}-partition
A=\bigcup_{k} B_{k},
it is elementary and integrable on all of A, and (2) holds again.
- Proof
-
(i) If f is elementary and integrable or elementary and nonnegative on A=\bigcup_{k} B_{k}, it is surely so on each B_{k} by Corollary 2 of §1 and Corollary 1(ii) above.
Thus for each k, we can fix an \mathcal{M}-partition B_{k}=\bigcup_{i} A_{k i}, with f constant (f=a_{k i}) on A_{k i}, i=1,2, \ldots. Then
A=\bigcup_{k} B_{k}=\bigcup_{k} \bigcup_{i} A_{k i}
is an \mathcal{M}-partition of A into the disjoint sets A_{k i} \in \mathcal{M}.
Now, by definition,
\int_{B_{k}} f=\sum_{i} a_{k i} m A_{k i}
and
\int_{A} f=\sum_{k, i} a_{k i} m A_{k i}=\sum_{k}\left(\sum_{i} a_{k i} m A_{k i}\right)=\sum_{k} \int_{B_{k}} f
by rules for double series. This proves formula (2).
(ii) If f is elementary and integrable on B_{k}(k=1, \ldots, n), then with the same notation, we have
\sum_{i}\left|a_{k i}\right| m A_{k i}<\infty
(by integrability); hence
\sum_{k=1}^{n} \sum_{i}\left|a_{k i}\right| m A_{k i}<\infty.
This means, however, that f is elementary and integrable on A, and so clause (ii) follows.\quad \square
Caution. Clause (ii) fails if the partition \{B_{k}\} is infinite.
Theorem \PageIndex{2}
(i) If f, g : S \rightarrow E^{*} are elementary and nonnegative on A, then
\int_{A}(f+g)=\int_{A} f+\int_{A} g.
(ii) If f, g : S \rightarrow E are elementary and integrable on A, so is f \pm g, and
\int_{A}(f \pm g)=\int_{A} f \pm \int_{A} g.
- Proof
-
Arguing as in the proof of Theorem 1 of §1, we can make f and g constant on sets of one and the same \mathcal{M}-partition of A, say, f=a_{i} and g=b_{i} on A_{i} \in \mathcal{M}; so
f \pm g=a_{i} \pm b_{i} \text { on } A_{i}, \quad i=1,2, \ldots.
In case (i), f, g \geq 0; so integrability is irrelevant by Note 3, and formula (1) yields
\int_{A}(f+g)=\sum_{i}\left(a_{i}+b_{i}\right) m A_{i}=\sum_{i} a_{i} m A_{i}+\sum b_{i} m A_{i}=\int_{A} f+\int_{A} g.
In (ii), we similarly obtain
\sum_{i}\left|a_{i} \pm b_{i}\right| m A_{i} \leq \sum\left|a_{i}\right| m A_{i}+\sum_{i}\left|b_{i}\right| m A_{i}<\infty.
(Why?) Thus f \pm g is elementary and integrable on A. As before, we also get
\int_{A}(f \pm g)=\int_{A} f \pm \int_{A} g,
simply by rules for addition of convergent series. (Verify!)\quad \square
Note 4. As we know, the characteristic function C_{B} of a set B \subseteq S is defined
C_{B}(x)=\left\{\begin{array}{ll}{1,} & {x \in B,} \\ {0,} & {x \in S-B.}\end{array}\right.
If g : S \rightarrow E is elementary on A, so that
g=a_{i} \text { on } A_{i}, 1,2, \ldots,
for some \mathcal{M}-partition
A=\bigcup A_{i},
then
g=\sum_{i} a_{i} C_{A_{i}} \text { on } A.
(This sum always exists for disjoint sets A_{i}. Why?) We shall often use this notation.

If m is Lebesgue measure in E^{1}, the integral
\int_{A} g=\sum_{i} a_{i} m A_{i}
has a simple geometric interpretation; see Figure 33. Let A=[a, b] \subset E^{1}; let g be bounded and nonnegative on E^{1}. Each product a_{i} m A_{i} is the area of a rectangle with base A_{i} and altitude a_{i}. (We assume the A_{i} to be intervals here.) The total area,
\int_{A} g=\sum_{i} a_{i} m A_{i},
can be treated as an approximation to the area under some curve y=f(x), where f is approximated by g (Theorem 3 in §1). Integration historically arose from such approximations.
Integration of elementary extended-real functions. Note 3 can be extended to sign-changing functions as follows.
Definition
If
f=\sum_{i} a_{i} C_{A_{i}} \quad\left(a_{i} \in E^{*}\right)
on
A=\bigcup_{i} A_{i} \quad\left(A_{i} \in \mathcal{M}\right),
we set
\int_{A} f=\int_{A} f^{+}-\int_{A} f^{-},
with
f^{+}=f \vee 0 \geq 0 \text { and } f^{-}=(-f) \vee 0 \geq 0;
see §2.
By Theorem 2 in §2, f^{+} and f^{-} are elementary and nonnegative on A; so
\int_{A} f^{+} \text { and } \int_{A} f^{-}
are defined by Note 3, and so is
\int_{A} f=\int_{A} f^{+}-\int_{A} f^{-}
by our conventions (2*) in Chapter 4, §4.
We shall have use for formula (3), even if
\int_{A} f^{+}=\int_{A} f^{-}=\infty;
then we say that \int_{A} f is unorthodox and equate it to +\infty, by convention; cf. Chapter 4, §4. (Other integrals are called orthodox.) Thus for elementary and (extended) real functions, \int_{A} f is always defined. (We further develop this idea in §5.)
Note 5. With f as above, we clearly have
f^{+}=a_{i}^{+} \text { and } f^{-}=a_{i}^{-} \text { on } A_{i},
where
a_{i}^{+}=\max \left(a_{i}, 0\right) \text { and } a_{i}^{-}=\max \left(-a_{i}, 0\right).
Thus
\int_{A} f^{+}=\sum a_{i}^{+} \cdot m A_{i} \text { and } \int_{A} f^{-}=\sum a_{i}^{-} \cdot m A_{i},
so that
\int_{A} f=\int_{A} f^{+}-\int_{A} f^{-}=\sum_{i} a_{i}^{+} \cdot m A_{i}-\sum_{i} a_{i}^{-} \cdot m A_{i}.
If \int_{A} f^{+}<\infty or \int_{A} f^{-}<\infty, we can subtract the two series termwise (Problem 14 of Chapter 4, §13) to obtain
\int_{A} f=\sum_{i}\left(a_{i}^{+}-a_{i}^{-}\right) m A_{i}=\sum_{i} a_{i} m A_{i}
for a_{i}^{+}-a_{i}^{-}=a_{i}. Thus formulas (3) and (4) agree with our previous definitions.