A Document With An Image

The Deﬁnite Integral

2002 Donald Kreider and Dwight Lahr

Thus far, we have discussed the Tangent Line Problem. Its solution led to the deﬁnition of the derivative

and to the rich array of applications that we have been studying. Now, we are ready to state the other

fundamental problem with which calculus deals: The Area Problem.

The Area Problem: Find the area of the region in the xy-plane lying above the interval [a, b] on the

x-axis and under the graph of the nonnegative continuous function y = f(x).

a b

Area under the curve

and above the interval

[a,b] on the x-axis.

)(xfy =

The problem seems approachable enough. That is, we are all familiar with areas and know how to

calculate them for some basic geometric ﬁgures. In fact, before we go very far let’s list three assumptions

about areas that we can all can agree to.

Assumptions about Areas:

1. Area is a nonnegative number.

2. The area of a rectangle is its length times its width.

3. Area is additive. That is, if a region is completely divided into a ﬁnite number of non-overlapping

subregions, then the area of the region is the sum of the areas of the subregions.

We probably do not need to say much about these assumptions. We have all computed the area of a square

and a rectangle, and have subdivided regions into smaller regions whose areas we then added willy-nilly as

the situation required to ﬁnd the original area.

Returning to the Area Problem, because rectangles are convenient, our approach will be to use them to

approximate the area of the region in question. We will start by considering an example and introducing

some terminology.

Upper and Lower Sums; the Method of Exhaustion

Example 1: Suppose we want to use rectangles to approximate the area under the graph of y = x + 1

on the interval [0, 1]. Here are two possible ways to do it, as illustrated in the sketches.

In both sketches, we use ﬁve rectangles, but in the left picture, the area of the rectangle on each subinterval

exceeds the area under the graph, while in the right picture the area of each rectangle is less than that of

the corresponding subregion. The triangles at the top of each rectangle represent the amount by which we

go over or fall short of the area of the region under the graph. We will call the sum of the areas of the

rectangles in the left picture an Upper Riemann Sum, and the sum of the areas of the rectangles in the right

picture a Lower Riemann Sum. Let’s calculate these quantities.

Each rectangle is of width 0.2. In the Upper Sum, the height of each rectangle is f evaluated at the

right endpoint of the subinterval; in the Lower Sum the heights are f evaluated at the left endpoint of the

subinterval). Upper Sum = .2f(.2)+.2f(.4) + .2f(.6)+.2f(.8) + .2f(1) =

. Lower Sum = .2f(0) + .2f(.2)+

.2f(.4) + .2f (.6) + .2f(.8) =

. Now, in the example we are considering, the region is the sum of a rectangle

and a triangle. So, we know that the exact area is 1 +

. And of course,

is between

and

Note that we can get a better approximation to the area under the graph by using rectangles of smaller

width. For example, if we double the number of recatngles to 10 so that the width of each rectangle is 0.1,

then the Upper Sum =

and Lower Sum =

The process of increasing the number of rectangles to improve the approximation to the area whose

value we seek is reminiscent of the Greek Method of Exhaustion. The inventors of calculus asked: Instead of

stopping with a ﬁnite number of rectangles, why not take the limit of the sum of the areas of the rectangles as

their widths approach 0? This should yield the exact value of the area, if (as always, an important proviso)

the limit exists. Why not, indeed!

Let n stand for the number of rectangles, U for the Upper Riemann Sum, and L for the Lower Rie-

mann Sum. Here are some values for the same example we have been discussing. You can compare these

approximations with the exact value of 1.5:

n U L

100 1.505000000 1.495000000

150 1.503333333 1.496666667

200 1.502500000 1.497500000

300 1.501666667 1.498333333

500 1.501000000 1.499000000

General Procedure for ﬁnding the Area Under a Curve and Above an Interval: The above

example suggests the following procedure for calculating the area under a curve.

1. Let y = f(x) be given and deﬁned on an interval [a, b]. Subdivide the interval [a, b] into n subin-

tervals. Label the endpoints of the subintervals a = x

≤ x

··· ≤ x

= b. Deﬁne

P = {x

, x

, . . . , x

} to be a partition of [a, b].

2. Let ∆x

= x

− x

i−1

be the width of the i

subinterval, 1 ≤ i ≤ n.

3. Form the Upper Riemann Sum U(P, f): the height of each rectangle is the maximum value M

of the

function on that i

subinterval.

U(P, f) = M

∆x

+ M

∆x

+ M

∆x

+ ··· + M

∆x

4. Form the Lower Riemann Sum L(P, f): the height of each rectangle is the minimum value m

of the

function on that i

subinterval.

L(P, f) = m

∆x

+ m

∆x

+ m

∆x

+ ··· + m

∆x

5. Take the limit as n → ∞ and the maximum ∆x

→ 0.

We have that L(P, f) ≤ Area ≤ U (P, f). So, if the limit of the Upper Riemann Sums and the limit of

the Lower Riemann Sums approach a common value, this number is deﬁned to be the area under the

curve and above the interval [a, b].

Sigma Notation

From our discussion of the example above, we seem to have deﬁned a working procedure to ﬁnd the area

of a region lying above an interval of the x-axis and under the graph of a function. But before going further,

the process can be facilitated by introducing some useful notation.

Sigma Notation: If m and n are integers with m ≤ n, and if f is a function deﬁned on the integers

from m to n, then the symbol

i=m

f(i), called sigma notation, is deﬁned to be f(m) + f(m + 1) + f(m +

2) + ··· + f(n).

So, sigma notation is just a way of writing the sum in a compact form. (The word sigma comes from the

Greek letter Σ.)

Example 2: Here are three examples:

i=1

i = 1 + 2 + 3 + 4 ··· + n

i=1

= 1

+ 2

+ 3

+ 4

··· + n

i=1

1 = 1 + 1 + 1 + 1 ··· + 1

| {z }

ntimes

Note that we can evaluate the sums in the above example by simply adding the numbers.

Example 3: If we add 1 to itself n times, the sum is n. So,

i=1

1 = n. For instance,

i=1

1 =

1 + 1 + 1 + 1 + 1 = 5.

Also,

i=1

i =

n(n + 1)

For instance,

i=1

i = 1 + 2 + 3 + 4 + 5 = 15 = 5 · 6/2. This sum is often referred to as the Gauss sum

because when he was a young school boy, the mathematical genius Gauss was able to solve the problem of

adding the ﬁrst 100 numbers for his teacher in lightning speed. Here is probably the way he did it:

1 2 3 4 5 ··· 98 99 100

100 99 98 97 96 ··· 3 2 1

101 101 101 101 101 ··· 101 101 101

That is, if you write the numbers ﬁrst from 1 to 100, then in reverse order from 100 to 1, and add them,

you get 100 times 101. But this is twice the answer you want, so you must divide by 2. Hence, the answer

that you want is (100 · 101)/2 = 5050. Pretty clever! Notice that there is nothing special about 100. We

could prove the result we have stated for n in an analogous way, but we won’t bother with the details here.

We will simply state the result for the sum of the squares of the ﬁrst n numbers:

i=1

n(n + 1)(2n + 1)

Example 4:

i=1

(3i

+ 2i + 1) = 3

i=1

+ 2

i=1

i +

i=1

3 ·10 · 11 ·21

2 ·10 · 11

+ 10

= 1275

The Area Problem Revisited

So far as an example we have considered a region whose top boundary is a line. And based on that

example, we have outlined some fairly general procedures. Let’s approximate the area of another region

that we recognize just to be sure that we are going in the right direction. We will also make use of the

terminology we have introduced. Note that in sigma notation the Upper and Lower Riemann sums can be

stated compactly as

U(P, f) =

i=1

∆x

L(P, f) =

i=1

∆x

where M

and m

are, respectively, the maximum and minimum values of f on the i

subinterval [x

i−1

, x

1 ≤ i ≤ n.

Example 5: Let f(x) =

√

1 −x

on the interval [-1,1]. Using 5 subintervals of equal width, ﬁnd U(P, f)

and L(P, f). To solve this problem, ﬁrst note that the region is bounded by a semicircle and the x-axis.

In this example, we are given that the widths of the rectangles are all the same, namely, ∆x = 2/5 = 0.4.

To form the Upper Riemann Sum, we use the maximum height of the rectangle on each subinterval. So,

we get that U (P, f) = ∆x(f(−.6) + f(−.2) + f(0) + f(.2) + f(.6)) ≈ .4(4.559591794) ≈ 1.823836718.

The Lower Riemann Sum uses the minimum height of the rectangle on each subinterval. Thus, L(P, f ) =

∆x(f(−.6) + f (−.2) + f(.6)) ≈ .4(2.579795897) ≈ 1.031918359. The actual value of the area is (π · 1

)/2

which is approximately 1.570796327. Once again, we would expect that as we let ∆x → 0, we would get a

better and better approximation of the area under the graph.

),( fPU

),( fPL

There are two other Riemann Sums that are convenient to use because their formulas do not depend

on the characteristics of the function. Given a partition P of [a, b], P = {a = x

, x

, . . . , x

= b}, and

∆x

= x

− x

i−1

the width of the i

subinterval, 1 ≤ i ≤ n; let f be deﬁned on [a, b]. Then the Right

Riemann Sum is

i=1

f(x

)∆x

and the Left Riemann Sum is

n−1

i=0

f(x

)∆x

The left Riemann Sum uses the left endpoint of each subinterval to determine the height of the rectangle

on that subinterval, while the Right Riemann Sum uses the right endpoint. Note that if f is decreasing on

[a, b], then U(P, f) is a Left Riemann Sum, and L(P, f) is a Right Riemann Sum. Similar comments apply

to increasing functions.

Right Riemann

Sum

Left Riemann

Sum

Example 6: Approximate the area of the region under the graph of the function f(x) =

√

1 −x

on the

interval [0,1] using a Left and a Right Riemann Sum ﬁrst with 5 rectangles of equal width, then with 100.

Note that because the function that describes the quarter-circle is decreasing on [0,1], the question asks us to

ﬁnd the Upper and Lower Riemann Sums for 5, and then 100 subintervals of equal width. A big advantage to

the left and right Riemann sums is that their formulas are easily programmed into a programmable calculator

or a computer. In this example, in the case of 5 rectangles, x

= 0 + i/5, 0 ≤ i ≤ 5, and we want to ﬁnd the

Left Riemann Sum

i=0

1 −x

i=0

1 −

and then the Right Riemann Sum

i=1

1 −x

i=1

1 −

These evaluate to Left Riemann Sum ≈ .8592622072 and Right Riemann Sum ≈ .6592622072. The actual

value is π/4 ≈ .7853981635.

With 100 subintervals, we can get even closer to π/4 by evaluating the sums

100

i=0

1 −x

100

i=0

1 −

100

≈ .7901042579

100

i=1

1 −x

100

i=1

1 −

100

≈ .7801042577

Right Riemann Sum Left Riemann Sum

n = 100

Applet: Riemann Sums Try it!

The Deﬁnite Integral

Believe it or not, we almost have the deﬁnition of the deﬁnite integral in hand. We will state it formally

so that we can refer to it conveniently as needed.

Deﬁnition: Let P be a partition of the interval [a, b], P = {x

, x

, . . . , x

} with a = x

≤ x

≤

··· ≤ x

= b. Let ∆x

= x

−x

i−1

be the width of the i

subinterval, 1 ≤ i ≤ n. Let f be a function deﬁned

on [a, b]. Next form the Upper Riemann Sum U(P, f) where the height of the rectangle on each subinterval

is the maximum value of f on that subinterval; and form the Lower Riemann Sum L(P, f), where the height

of the rectangle on each subinterval is the minimum height of f on that subinterval. Then we say that f

is Riemann integrable on [a, b] if there exists a unique number Φ such that L(P, f) ≤ Φ ≤ U(P, f) for all

partitions of [a, b]. We write the number Φ as

Φ =

f(x)dx

and call it the deﬁnite integral of f over [a, b].

The integral symbol is a stylized Greek sigma Σ from the summation notation we introduced above. The

x is a so-called dummy variable in that it merely tells us the variable with respect to which we are integrating;

hence, we could equally well write

f(t)dt or

f(r)dr.

The deﬁnition looks a bit awkward to verify. However, there are two important theorems that come to

our aid from advanced analysis, and which we rely on in practice.

Theorem 1: If f is Riemann integrable on [a, b], then

f(x)dx = lim

n→∞

||P ||→0

i=1

f(c

)∆x

where c

is any point in the subinterval [x

i−1

, x

], and ||P || is the maximum length of the ∆x

So, the Upper Riemann Sum, the Lower Riemann Sum, the Left Riemann Sum, and the Right Riemann

Sum are all special cases of the sum in the above limit where we choose the points c

in very particular ways.

(That is, where f is a maximum, or a minimum, or the left endpoint, or the right endpoint, respectively.)

In the examples, we usually take the subintervals to be of equal length, so as n → ∞, the length of each

subinterval automatically goes to 0.

The above theorem is much more than a theoretical result. We will see that we use it extensively in

applications as a guide in setting up a mathematical model connected with the problem. But more about

that later. The theorem below allows us to work eﬀectively with the integral because most of the functions

in which we will be interested are continuous or piecewise continuous.

Theorem 2: If f is continuous on [a, b], then f is Riemann integrable on [a, b].

This theorem tells us that for continuous functions, we can use the limit of any convenient Riemann sums

to evaluate the integral.

Example 7: Use an Upper Riemann Sum and a Lower Riemann Sum, ﬁrst with 8, then with 100

subintervals of equal length to approximate the area under the graph of y = f (x) = x

on the interval [0, 1].

First with 8 subintervals:

U(P, f) =

i=1

≈ .3984375

L(P, f) =

i=0

≈ .2734375

Then with 100 subintervals:

U(P, f) =

100

i=1

10000

= .33835

L(P, f) =

100

i=0

10000

= .32835

Exercises: Problems Check what you have learned!

Videos: Tutorial Solutions See problems worked out!