Inverse Function Theorems

[math] \newcommand{\ex}[1]{\item } \newcommand{\sx}{\item} \newcommand{\x}{\sx} \newcommand{\sxlab}[1]{} \newcommand{\xlab}{\sxlab} \newcommand{\prov}[1] {\quad #1} \newcommand{\provx}[1] {\quad \mbox{#1}} \newcommand{\intext}[1]{\quad \mbox{#1} \quad} \newcommand{\R}{\mathrm{\bf R}} \newcommand{\Q}{\mathrm{\bf Q}} \newcommand{\Z}{\mathrm{\bf Z}} \newcommand{\C}{\mathrm{\bf C}} \newcommand{\dt}{\textbf} \newcommand{\goesto}{\rightarrow} \newcommand{\ddxof}[1]{\frac{d #1}{d x}} \newcommand{\ddx}{\frac{d}{dx}} \newcommand{\ddt}{\frac{d}{dt}} \newcommand{\dydx}{\ddxof y} \newcommand{\nxder}[3]{\frac{d^{#1}{#2}}{d{#3}^{#1}}} \newcommand{\deriv}[2]{\frac{d^{#1}{#2}}{dx^{#1}}} \newcommand{\dist}{\mathrm{distance}} \newcommand{\arccot}{\mathrm{arccot\:}} \newcommand{\arccsc}{\mathrm{arccsc\:}} \newcommand{\arcsec}{\mathrm{arcsec\:}} \newcommand{\arctanh}{\mathrm{arctanh\:}} \newcommand{\arcsinh}{\mathrm{arcsinh\:}} \newcommand{\arccosh}{\mathrm{arccosh\:}} \newcommand{\sech}{\mathrm{sech\:}} \newcommand{\csch}{\mathrm{csch\:}} \newcommand{\conj}[1]{\overline{#1}} \newcommand{\mathds}{\mathbb} [/math]

In this section we shall prove some basic theorems about continuous functions and the inverses of monotonic functions, which we have already used in studying the exponential function. These theorems are all geometrically obvious. We shall show that they also follow logically from the definitions of continuity and monotonicity using the least upper bound property of the real numbers. This shows, as much as anything, that these definitions say what we want them to say. To put it facetiously, if these theorems could not be proved, we would change the definitions until they could be.

INTERMEDIATE VALUE THEOREM

If [math]f[/math] is continuous on the closed interval [math][a, b][/math] and [math]w[/math] is any real number such that [math]f (a) \lt w \lt f (b)[/math], then there exists at least one real number [math]c[/math] such that [math]a \lt c \lt b[/math] and [math]f(c) = w[/math]. A similar theorem is obtained if the inequalities [math]f(a) \lt w \lt f(b)[/math] are replaced by [math]f(a) \gt w \gt f (b)[/math]. The proof, in all essentials, is the same. To see that (3.1) is geometrically obvious, look at Figure. The horizontal line [math]y = w[/math] must certainly cut the curve [math]y = f (x)[/math] at least once.


Show Proof

Consider the subset [math]L[/math] of the interval [math][a, b][/math] that consists of all numbers [math]x[/math] in [math][a, b][/math] such that [math]f(x) \lt w[/math]. The set [math]L[/math] is not empty because in particular it contains [math]a[/math]. Since every number in [math]L[/math] is less than [math]b[/math], the number [math]b[/math] is an upper bound for [math]L[/math]. It follows by the Least Upper Bound Property of the real numbers (page 7) that [math]L[/math] has a least upper bound, which we denote by [math]c[/math]. Moreover, [math]c[/math] lies in [math][a, b][/math]. There are three possibilities:

  • [math]f (c) \lt w,[/math]
  • [math]f (c) \gt w,[/math]
  • [math]f (c)= w.[/math]

We shall show that (i) and (ii) are, in fact, not possible. Suppose that (i) holds. Since [math]f(b) \gt w[/math], it follows that [math]c \lt b[/math]. Set [math]\epsilon = w - f(c)[/math], which is positive. Since [math]f[/math] is continuous at [math]c[/math], there exists a positive number [math]\delta[/math] such that whenever [math]|x - c| \lt \delta[/math] and [math]x[/math] is in [math][a, b][/math], then [math]|f(x) - f(c)| \lt \epsilon[/math]. Hence, there exist numbers [math]x[/math] in [math][a, b][/math] larger than [math]c[/math] for which [math]|f(x) - f(c)| \lt \epsilon[/math]. For any such [math]x[/math],

[[math]] f(x) - f(c) \leq |f(x) - f(c)| \lt \epsilon = w - f(c), [[/math]]
and so [math]f(x) \lt w[/math], which implies that [math]x[/math] belongs to [math]L[/math]. Thus there are numbers in [math]L[/math] which are larger than [math]c[/math], which contradicts the fact that [math]c[/math] is an upper bound. Next suppose that (ii) holds. Since [math]f(a) \lt w[/math], it follows that [math]a \lt c[/math]. This time set [math]\epsilon = f(c) - w[/math], which is positive. The continuity of [math]f[/math] at [math]c[/math] implies the existence of a positive number [math]\delta[/math] such that if [math]| x-c | \lt \delta[/math] and [math]x[/math] is in [math][a, b][/math], then [math]|f(x) - f(c)| \lt \epsilon[/math]. Pick [math]x_0[/math] in [math][a, b][/math] such that [math]|x_{0} - c | \lt \delta[/math] and [math]x_{0} \lt c[/math]. We contend that [math]x_0[/math] is also an upper bound for [math]L[/math]. For if [math]x[/math] is any number such that [math]x_{0} \leq x \leq c[/math], then [math]|f(x) - f(c)| \lt \epsilon[/math]. Consequently,

[[math]] - f(x) + f(c) \leq |f(x) - f(c)| \lt \epsilon = f(c) - w, [[/math]]
and so [math]-f(x) \lt -w[/math], or, equivalently, [math]f(x) \gt w[/math], which implies that [math]x[/math] is not in [math]L[/math]. This proves the contention that [math]x_0[/math] is an upper bound for [math]L[/math], which contradicts the fact that [math]c[/math] is the least upper bound. The only remaining possibility is (iii). Hence [math]f(c) = w[/math], and the proof is complete.

This theorem, (3.1), was used in Section 2, where it was asserted that, for any real number [math]x[/math], there exists a number [math]y[/math] such that [math]x = \ln y[/math]. We have previously shown that the natural logarithm takes on arbitrarily large positive and negative values. Hence we can “surround” a given number [math]x[/math] with values of [math]\ln[/math]. That is, there exist numbers [math]a[/math] and [math]b[/math] for which [math]\ln a \lt x \lt \ln b[/math]. The existence of a number [math]y[/math] sueh that [math]x = \ln y[/math] now follows immediately from (3.1). An interval was defined on page 4 to be any subset [math]I[/math] of the set of all real numbers with the property that, if [math]a[/math] and [math]c[/math] belong to [math]I[/math] and [math]a \leq b \leq c[/math], then [math]b[/math] also belongs to [math]I[/math]. The following proposition is therefore fully equivalent to Theorem (3.1). [More precisely, it is equivalent to the conjunction of (3.1) and its companion theorem with the inequality [math]f(a) \gt w \gt f(b)[/math].]

Theorem

If the domain of a continuous real-valuedfunction is an interval, then so is its range.

The reader should verify that (3.1) and (3.2) are equivalent. We have already proved that every strictly monotonic function has an inverse [see (2.4), page 250]. However, more is needed than simply existence:

Theorem

If [math]f[/math] is a strictly increasing continuousfunction whose domain is an interval, then the same is true of the inuerse function [math]f^{-1}[/math]. A companion theorem is obtained if “increasing” is replaced by “decreasing. ”


Show Proof

There are three things to be proved: (i) [math]f^{-1}[/math] is strictly increasing, (ii) the domain of [math]f^{-1}[/math] is an interval, and (iii) [math]f^{-1}[/math] is a continuous function. The first is completely straightforward, and we leave it as a problem. The second follows at once from (3.2) and the observation that the domain of [math]f^{-1}[/math] is equal to the range of [math]f[/math]: In proving (iii), we shall assume that the interval which is the domain of [math]f[/math] is neither empty nor consists of a single point. This is reasonable, because in these two cases the assumption that fis continuous is not particularly meaningful. Let [math]b[/math] be a number in the domain of [math]f^{-1}[/math], let [math]a = f^{-1}(b)[/math], and let [math]\epsilon[/math] be an arbitrary positive number. If [math]a[/math] is an endpoint of the interval which is the domain of [math]f[/math], the following argument must be modified slightly. We shall assume that [math]a[/math] is not an endpoint and also that [math]\epsilon[/math] is sufficiently small that both [math]a + \epsilon[/math] and [math]a - \epsilon[/math] are in the domain of [math]f[/math] (see Figure). Set [math]\delta[/math] equal to the smaller of the two numbers [math]f(a + \epsilon) - b[/math] and [math]b - f(a - \epsilon)[/math]. Then, if [math]y[/math] is any number in the domain of [math]f^{-1}[/math] such that [math]|y - b| \lt \delta[/math], we know that

[[math]] f(a - \epsilon) \lt y \lt f(a + \epsilon). [[/math]]

Since [math]f^{-1}[/math] is strictly increasing, we have

[[math]] a - \epsilon = f^{-1}(f(a - \epsilon)) \lt f^{-1}(y) \lt f^{-1}(f(a + \epsilon)) = a + \epsilon, [[/math]]
which implies that [math]-\epsilon \lt f^{-1}(y) -a \lt \epsilon[/math]. Since [math]a = f^{-1}(b)[/math], the latter inequalities are equivalent to

[[math]] | f^{-1}(y) - f^{-1}(b)| \lt \epsilon, [[/math]]
which proves that [math]f^{-1}[/math] is continuous. This completes the proof.

Our final theorem concerns the differentiability of an inverse function. It was used in Section 2, where we asserted that the exponential function [math]\exp[/math] is differentiable.

Theorem

Let [math]f[/math] be a strictly monotonic differentiable function whose domain is an interval. If [math]b = f[/math] (a) and if [math]f'(a) \neq 0[/math], then [math]f^{-1}[/math] is differentiable at [math]b[/math]. Moreover,

[[math]] (f^{-1})'(b)= \frac{1}{f'(a)}. [[/math]]


Show Proof

According to the definition of the derivative, we must show that

[[math]] \lim_{y \rightarrow b} \frac{ f^{-1}(y) - f^{-1}(b)}{y - b} =\frac{1}{f'(a)}. [[/math]]
Let [math]x = f^{-1}(y)[/math]. Then [math]y = f(x)[/math] and, of course, [math]a = f^{-1}(b)[/math]. Hence

[[math]] \frac{f^{-1}(y) - f^{-1}(b)}{y - b} = \frac{x - a}{f(x) - f(a)} = \frac{1}{\frac{f(x) - f(a)}{x-a}}. [[/math]]
Since the reciprocal of a limit is the limit of the reciprocal, we know that

[[math]] \frac{1}{f'(a)} = \frac{1}{\lim_{x \rightarrow a} \frac{f(x)-f(a)}{x-a}} = \lim_{x \rightarrow a} \frac{1}{\frac{ f(x)-f(a)} {x-a}}. [[/math]]
It is a simple matter to finish the proof provided we know that [math]x[/math] approaches [math]a[/math] as [math]y[/math] approaches [math]b[/math]. But this is just the assertion that [math]f^{-1}[/math] is continuous at [math]b[/math], which we know to be true as a result of Theorem (3.3). Hence

[[math]] \begin{eqnarray*} \frac{1}{f'(a)} &=& \lim_{y \rightarrow b} \frac{1}{ \frac{f(x) - f(a)}{ x-a}}\\ &=& \lim_{y \rightarrow b} \frac {f^{-1}(y) - f^{-1}(b)}{y - b} = (f^{-1})'(b), \end{eqnarray*} [[/math]]
and the proof is complete.

We have also used Theorem before to establish the differentiability of the function [math]g[/math] defined by [math]g(x) = x^{1/n}[/math], where [math]n[/math] is a positive integer and [math]x[/math] is any positive real number (see page 72). The inverse function [math]f[/math], defined by [math]f(x) = x^n[/math], for every positive real number [math]x[/math], is strictly increasing and has a positive derivative at every point in the interval [math](0, \infty)[/math]. Theorem tells us at once that [math]g[/math] is a differentiable function.

General references

Doyle, Peter G. (2008). "Crowell and Slesnick's Calculus with Analytic Geometry" (PDF). Retrieved Oct 29, 2024.