23  Properties

Highlights of this Chapter: we prove many foundational theorems about the derivative that one sees in an early calculus course. We see how to take the derivative of scalar multiples, sums, products, quotients and compositions. We also compute - directly from the definition - the derivative of exponential functions. This leads to an important discovery: there is a unique simplest, or natural exponential, whose derivative is itself. This is the origin of e in Analysis.

From the definition, we move on to confirm the basic properties of the derivative well known and loved in introductory calculus courses. Most of these are straightforward, the only exception whose proof requires more thought than usually let on in Calculus I is the chain rule.

However before jumping in we prove one small oft-useful result often not mentioned in a calculus class, relating differentiability to continuity.

The converse is not true as we saw previously: the absolute value is continuous, but not differentiable.

Proposition 23.1 Let f be differentiable at aR. Then f is continuous at a.

Proof. Since f is differentiable at a, we know the limit of the difference quotient is finite limxaf(x)f(a)xa=f(a) We also know that limxa(xa)=0$ So, using the limit theorems we may multiply these together and get what we want. Precisely, let xna be any sequence with xna for all n. Then we have

0=(0)(f(a))=(limxna)(limf(xn)f(a)xna)=lim((xna)f(xn)f(a)xna)=lim(f(xn)f(a))

Thus lim(f(xn)a)=0 so by the limit theorems we see limf(xn)=a. Since xn was arbitrary with xna this holds for any such sequence, we see that f is continuous at a using the sequence definition.

There is a little gap not explicitly spelled out at the end of the proof above, that we should fill in now (to assure ourselves this style of reasoning always works). We just proved that for sequences xna the property we want holds, but continuity requires this fact for all arbitrary sequences. How do we bridge this gap? Let yna be an arbitrary sequence: then we split into the subsequences xna and the subsequence of all terms =a. If either of these is finite, we can just truncate the original sequence at a point past which all terms are of one or the other: each of these has limf(xn)=f(a) so we are done. In the case that both are infinite, we just use that we have separated our sequence into a union of two subsequences, each with the same limit! Thus the overall limit exists.

23.1 Differentiation and Field Operations

Here we prove the ‘derivative laws’ of Calculus I:

Theorem 23.1 Let f be a function and cR. Then if f is differentiable at a point aR so is cf, and (cf)(a)=c(f(a))

Proof. Let’s use the difference quotient with a+hn to change things up: Let hn0 be arbitrary, and we wish to compute the limit limcf(a+hn)cf(a)hn By the limit laws we can pull out the constant c, and the remainder converges to f(a), as f is assumed to be differentiable at a.

=climf(a+hn)f(a)hn=cf(a)

Because this is true for all sequences hn0 with hn0, the limit exists, and equals cf(a).

Exercise 23.1 Let f,g be functions which are both differentiable at a point aR. Then f+g is also differentiable at a, and (f+g)(a)=f(a)+g(a)

Theorem 23.2 (The Product Rule) Let f,g be functions which are both differentiable at a point aR. Then fg is differentiable at a and

(fg)(a)=f(a)g(a)+f(a)g(a)

Proof. Let f,g be differentiable at aR, and choose an arbitrary sequence ana. Then we wish to compute

limf(an)g(an)f(a)g(a)ana

To the numerator we add 0=f(an)g(a)f(an)g(a) and regroup with algebra:

=limf(an)g(an)f(an)g(a)+f(an)g(a)f(a)g(a)ana =limf(an)g(an)f(an)g(a)ana+f(an)g(a)f(a)g(a)ana

Using the limit laws, we can take each of these limits individually so long as they exist (which we will show they do). But even more, note that the first term has a common factor of f(an) in the numerator that can be factored out, and the second a common factor of g(a). Thus, by the limit laws, we see

=(limf(an))(limg(an)g(a)ana)+g(a)(f(an)f(a)ana)

Because f is differentiable at a, its continuous at a, and so we know limf(an)=f(a). The other two limits above converge to the derivatives f(a) and g(a) respectively. Thus, alltogether we find the resulting limit to be

f(a)g(a)+f(a)g(a)

As this was the result for an arbitrary sequence ana with ana, it must be the same for all sequences, meaning the limit exists, and

(fg)(a)=f(a)g(a)+f(a)g(a)

Exercise 23.2 (The Reciprocal Rule) Let f be a function and aR be a point such that f(a)0 and f is differentiable at a. Then 1/f is also differentiable at a and (1f)(a)=f(a)f(a)2

Theorem 23.3 (The Quotient Rule) Let f,g be a functions which are differentiable at a point aR and assume g(a)0. Then the function f/g is also differentiable at a and (fg)(a)=f(a)g(a)f(a)g(a)g(a)2

Exercise 23.3 Use the Reciprocal Rule and Product Rule to prove the quotient rule.

23.2 The Chain Rule

Theorem 23.4 (The Chain Rule) If g(x) is differentiable at aR and f(x) is differentiable at g(a) then the composition fg is differentiable at a, with (fg)(a)=f(g(a))g(a)

Proof (Wish this Worked!). We are taking the derivative at a, so let xna wtih xna be arbitrary. Then the limit defining [f(g(a))] is

limf(g(xn))f(g(a))xna

We multiply the numerator and denominator of this fraction by $g(xn)g(a) and regroup:

f(g(xn))f(g(a))xna=f(g(xn))f(g(a))xnag(xn)g(a)g(xn)g(a)=limf(g(xn))f(g(a))g(xn)g(a)g(xn)g(a)xna

Because g is continuous at a, we know g(xn)a, and because f is differentiable at g(a) we recognize the first term here as the limit defining f at g(a)! Since the second term is the limit defining the derivative of g, both of these exist by our assumptions, and so by the limit theorems we can compute

=(limf(g(xn))f(g(a))g(xn)g(a))(limg(xn)g(a)xna) =f(g(a))g(a)

Unfortunately, this proof fails at one crucial step! Wile we do know that xna0 (in the definition of limxa, we only choose sequences xna with xna) we do not know that the other denominator g(xn)g(a) is nonzero.

If this problem could only happen finitely many times it would be no trouble - we could just truncate the beginning of our sequence and rest assured we had not affected the value of the limit. But functions - even differentiable functions - can be pretty wild. The function x2sin(1/x) (from ) ends up equaling zero infinitely often in any neighborhood of zero! So such things are a real concern.

Happily the fix - while tedious - is straightforward. It’s given below.

Exercise 23.4 We define the auxiliary function d(y) as follows:

d(y)={f(y)f(g(a))yg(a)yg(a)f(g(a))y=g(a)

This function equals our problematic difference quotient most of the time, but equals the quantity we want it to be when the denominator is zero.

Prove that d is continuous at g(c) and we may use d in place of the difference quotient in our computation: that for all xa, the following equality holds:

f(g(x))f(g(a))xa=d(g(x))g(x)g(a)xa

Given this, the original proof is rescued:

Proof. We are taking the derivative at a, so let xna with xna be arbitrary. Then the limit defining [f(g(a))] is (by the exercise)

limf(g(xn))f(g(a))xna=limd(g(xn))g(xn)g(a)xna

Because d is continuous at g(a) and g(xn)g(a) we know d(g(xn))d(g(a))=f(g(a)). And, as g is differentiable at a we know the limit of the difference quotient exists. Thus, by the limit laws we can separate them and

=(limd(g(xn)))(g(xn)g(a)xna)=f(g(a))g(a)

23.3 Exponentials and Logs

In this section we look at how to find derivatives of functions which are defined not explicitly, but by functional equations. We will take the exponential as our example case; on the final project you will analyze the trigonometric functions this way.

Proposition 23.2 Let E(x) be an exponential function. Then E is differentiable on the entire real line, and E(x)=E(0)E(x)

First we show that this formula holds so long as E is actually differentiable at zero. Thus, differentiability at a single point is enough to ensure differentiability everywhere and fully determine the formula!

Proof. Let xR, and hn0. Then we compute E(x) by the following limit: E(x)=limE(x+hn)E(x)hn

Using the property of exponentials and the limit laws, we can factor an E(x) out of the entire numerator:

=limE(x)E(hn)E(x)hn=E(x)limE(hn)1hn

But, E(0)=1 so the limit here is actually the *derivative of E at zero$!

E(x)=E(x)E(0)

Next, we tackle the slightly more subtle problem of showing that E is in fact differentiable at zero. This is tricky because all we have assumed is that E is continuous and satisfies the law of exponents: how are we going to pull differentiability out of this? The trick is two parts (1) show the right and left hand limits defining the derivative exist, and (2) show they’re equal.

In fact, E(0) is a known number, its the natural log of a (Cite where we’ll prove this later)

Proof. STEP 1: Show that the left and right hand limits defining the derivative exist: E is convex () so the difference quotient is monotone increasing (), and so the limit limx0 exists (as a sup) and limx0+ exists (as an inf), .

STEP2: Now that we know each of these limits exist, let’s show they are equal using the definition:

To compute the lower limit, we can choose any sequence approaching 0 from below: let hn be a positive sequence with hn0, then hn will do:

limh0E(h)1h=limE(hn)1hn And by we see E(hn)=1/E(hn). Thus limE(hn)1hn=lim1E(hn)1hn=lim1E(hn)hn1Ehn=limE(hn)1hn1E(hn)

But, since E is continuous (by definition) and E(0)=1 () the limit theorems imply lim1E(hn)=1limE(hn)=1E(limhn)=1E(0)=1 Thus, lim(E(hn)1hn1E(hn))=(limE(hn)1hn)(lim1E(hn))=limE(hn)1hn But this last limit evaluates exactly to the limit from above since hn>0 and hn0. Stringing all of this together, we finally see limh0E(h)1h=limh0+E(h)1h Thus, by we see that since both one sided limits exist and are equal the entire limit exists: E is differentiable at 0.

This theorem tells us that the exponential functions have a remarkable property: they are their own derivatives, up to a constant multiple! While the functional equation alone did not provide us any means of distinguishing between different exponential functions, differentiation selects a single best, or simplest exponential out of the lot: the one where that constant multiple is just 1!

Definition 23.1 We write exp(x) for the exponential function which has exp(0)=1.

Note that by the chain rule we know such a thing exists so long as any exponential exists. If E(x) is any exponential then E(x/E(0)) has derivative 1 at x=0!

Recalling our work with irrational exponents, we saw that if E is an exponential with E(1)=a, then we may write E(x)=ax for any xR (defined as a limit of rational exponents). So, our special exponential exp comes with a special number as its base.

Definition 23.2 We denote by the letter e the base of the exponential exp(x): that is, e=exp(1), and exp(x)=ex Note that by definition we have (ex)=ex

This is the origin of the number e from the perspective of analysis! At this point in the story we do not know it’s value, but we now have a hint on how to get it: we just need to construct a means of computing the exponential function, and then plug in 1.

23.3.1 Logarithms

For every exponential, the inverse function is a logarithm (). So, E be any exponential, and L a logarithm. Then L(E(x))=x, and differentiating with the chain rule yields

[L(E(x))]=L(E(x))E(x)=L(E(x))E(x)E(0)

The other side of the equality was x, whose derivative is 1: thus

1=L(E(x))E(x)E(0) L(E(x))=1E(0)E(x)

Thus, L() is a function that takes the positive number E(x) to E(0)/E(x): it divides E(0) by its input!

Proposition 23.3 If L(x) is a logarithm function, then for some positive kR L(x)=1kx (Indeed k=E(0) where E is the inverse of L)

This tells us that like the exponential function, there is a natural logarithm - the one where the arbitrary constant appearing during differentiation is equal to 1.

Definition 23.3 The natural logarithm log(x) is the logarithm function for which log(x)=1x

Corollary 23.1 (log and exp are Inverses)  

Proof. Since exp is an exponential, we know its derivative is some logarithm L. But, differentiating L yields L(x)=1exp(0)1x

Since exp(0)=1 by definition this says that L(x)=1/x, which is the defining property of the natural logarithm. Thus L=log.

23.4 The Power Rule

Perhaps the most memorable fact from Calculus I is the power rule, that (xn)=nxn1. In this short section, we prove the power level at various levels of generality, starting with natural number exponents and proceeding to arbitrary real exponents.

Proposition 23.4 (The Power Rule: Natural Number Exponents) If n is a natural number, xn is differentiable at all real numbers and (xn)=nxn1

Proof. This is directly proved via induction on n, starting from the base case x=1, which holds as if f(x)=x and aR,

limxaf(x)axa=xaxa=1

Now, assume (xn)=nxn1 and consider xn+1. Using the product rule, we compute the derivative of xn+1=xxn

(xxn)=(x)xn+x(xn)=1xn+x(nxn1)=xn+nxn=(n+1)xn+1

Exercise 23.5 (The Power Rule: Integer Exponents) Let nZ and consider the function xn (which is defined as 1/x|n| when n<0). Then xn is differentiable at all x0 and (xn)=nxn1

Using this, we can extend what we know to rational exponents:

Proposition 23.5 (The Power Rule: Rational Exponents) Let r=p/q be any rational number and f(x)=xr. Then f is differentiable for all x>0 and f(x)=rxr1

Proof. Let r=p/q where without loss of generality p,q0 and q>1 (as if q=1 we are in the integer exponent case). Then let f(x)=xp/q, and note that f(x)q=xp. Then we can differentiate both sides of this inequality:

[f(x)q]=qf(x)q1f(x) [xp]=pxp1

Equating these gives qf(x)q1f(x)=pxp1, and solving for f:

f(x)=pxp1qf(x)q1

Using that f(x)=xp/q we can simplify the right hand side further:

f(x)=pxp1q(xp/q)q1=pxp1qxpq1a=pqx(p1)pq1q

This exponent simplifies as expected, yielding f(x)=pqxpq1

Now that we know the power rule for all rational exponents, it is time to consider arbitrary real exponents, recalling that we define xa as a limit of rational exponents.

Theorem 23.5 ( The General Power Rule) If aR and f(x)=xa. Then f is differentiable for all x>0, and (xa)=axa1

Exercise 23.6 Recall the definition of xa for irrational a is limxan for an a sequence of rational numbers converging to a. Use this definition to attempt a proof of the general power rule, by computing

(xa)=(limxan)

In your proof, you will end up getting stuck at a point where you need to interchange two limits: point out where this happens, and then show that if you are justified in interchanging the limits, that the generalized power rule holds.

Alternatively, we can give a complete proof of the power rule using exponentials and logarithms.

Proof. Let exp be the natural exponential, and log be the natural log. Then exp(log(x))=x, and so exp(log(xa))=xa. Using the property of logarithms and powers () this simplifies

xn=exp(log(xa))=exp(alog(x))

By the chain rule,

[exp(alog(x))]=exp(alog(x))[alog(x)]=exp(alog(x))alog(x)=exp(alog(x))a1x

But, recalling that exp(alog(x))=exp(log(xa))=xa this simplifies to

=xaa1x=axa1