The derivative f'(c) is the limit of the difference quotient (f(x) - f(c))/(x - c) as x → c. It measures the instantaneous rate of change. The mean value theorem says that somewhere between a and b, the derivative equals the average rate of change.
Definition of the derivative
f'(c) = lim (f(x) - f(c)) / (x - c) as x → c, if the limit exists. Equivalently, f'(c) = lim (f(c+h) - f(c)) / h as h → 0. Differentiability at c implies continuity at c, but not vice versa: |x| is continuous at 0 but not differentiable there.
Scheme
; Derivative as limit of difference quotient
(define (approx-derivative f c h)
  (/ (- (f (+ c h)) (f c)) h))
(define (f x) (* x x)) ; f(x) = x^2, f'(x) = 2x
(display "Approximating f'(3) for f(x) = x^2:") (newline)
(for-each (lambda (h)
            (display " h=") (display h)
            (display ": ") (display (approx-derivative f 3 h))
            (newline))
          (list 1.0 0.1 0.01 0.001 0.0001 0.00001))
(display "Exact: f'(3) = 6") (newline)
; |x| is not differentiable at 0
(display "Derivative of |x| at 0:") (newline)
(display " from right: ")
(display (approx-derivative abs 0 0.001)) (newline)
(display " from left: ")
(display (approx-derivative abs 0 -0.001)) (newline)
(display "Left and right limits differ: not differentiable")
Python
# Numerical derivative
def deriv(f, c, h=1e-8):
    return (f(c + h) - f(c)) / h

f = lambda x: x**2
print(f"f'(3) = {deriv(f, 3):.6f} (exact: 6)")
g = lambda x: abs(x)
print(f"|x|' at 0 from right: {deriv(g, 0, 0.001)}")
print(f"|x|' at 0 from left: {deriv(g, 0, -0.001)}")
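One caveat worth a sketch: the symmetric difference quotient (f(c+h) - f(c-h)) / (2h) averages the two one-sided slopes, so it can report a finite value even where the derivative does not exist. For |x| at 0 it returns exactly 0, which is why the one-sided checks above matter:

```python
# Symmetric (central) difference quotient -- averages the one-sided slopes.
def central(f, c, h):
    return (f(c + h) - f(c - h)) / (2 * h)

# For |x| at 0 the left slope is -1 and the right slope is +1;
# the central quotient averages them to 0, masking the corner.
print("central difference for |x| at 0:", central(abs, 0, 0.001))  # 0.0
```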
Chain rule
If g is differentiable at c and f is differentiable at g(c), then (f ∘ g)'(c) = f'(g(c)) * g'(c). The derivative of a composition is the product of derivatives along the chain.
Scheme
; Chain rule: (f o g)'(c) = f'(g(c)) * g'(c)
; f(x) = x^2, g(x) = sin(x)
; (f o g)(x) = sin^2(x)
; (f o g)'(x) = 2*sin(x)*cos(x) = sin(2x)
(define (approx-deriv f c)
  (let ((h 0.00001))
    (/ (- (f (+ c h)) (f c)) h)))
(define (f x) (* x x))
(define (g x) (sin x))
(define (fog x) (f (g x)))
(define c 1.0)
(display "Chain rule at c = 1:") (newline)
(display " numerical (fog)'(1) = ")
(display (approx-deriv fog c)) (newline)
(display " f'(g(1)) * g'(1) = ")
(display (* (approx-deriv f (g c)) (approx-deriv g c))) (newline)
(display " exact sin(2) = ")
(display (sin 2.0))
Python
# Python equivalent
import math

def deriv(f, c, h=1e-5):
    return (f(c + h) - f(c)) / h
f = lambda x: x**2
g = lambda x: math.sin(x)
fog = lambda x: f(g(x))
c = 1.0
print("Chain rule at c = 1:")
print(" numerical (fog)'(1) =", deriv(fog, c))
print(" f'(g(1)) * g'(1) =", deriv(f, g(c)) * deriv(g, c))
print(" exact sin(2) =", math.sin(2.0))
Mean value theorem
If f is continuous on [a, b] and differentiable on (a, b), there exists c in (a, b) with f'(c) = (f(b) - f(a)) / (b - a). The tangent at c is parallel to the secant from a to b. Rolle's theorem is the special case where f(a) = f(b), giving f'(c) = 0.
Scheme
; MVT: find c where f'(c) = (f(b)-f(a))/(b-a)
; f(x) = x^3, [a,b] = [1, 3]
(define (f x) (* x x x))
(define a 1.0)
(define b 3.0)
(define avg-slope (/ (- (f b) (f a)) (- b a)))
(display "Average slope on [1,3]: ")
(display avg-slope) (newline) ; (27-1)/2 = 13
; f'(x) = 3x^2 = 13 => x = sqrt(13/3)
(define c-exact (sqrt (/ 13.0 3.0)))
(display "MVT point c = sqrt(13/3) = ")
(display c-exact) (newline)
(display "f'(c) = 3c^2 = ")
(display (* 3 c-exact c-exact)) (newline)
(display "Matches average slope? ")
(display (< (abs (- (* 3 c-exact c-exact) avg-slope)) 0.0001))
Python
# Python equivalent
import math
f = lambda x: x**3
a, b = 1.0, 3.0
avg_slope = (f(b) - f(a)) / (b - a)
print("Average slope on [1,3]:", avg_slope)
# f'(x) = 3x^2 = 13 => x = sqrt(13/3)
c = math.sqrt(13 / 3)
print("MVT point c = sqrt(13/3) =", c)
print("f'(c) = 3c^2 =", 3 * c * c)
print("Matches average slope?", abs(3 * c * c - avg_slope) < 0.0001)
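Rolle's theorem (the f(a) = f(b) special case mentioned above) can be checked the same way. A small sketch, using f(x) = x^2 - 4x on [0, 4] as an illustrative choice (not from the original listing): f(0) = f(4) = 0, and f'(x) = 2x - 4 vanishes at c = 2.

```python
# Rolle's theorem: f(0) = f(4), so some c in (0, 4) has f'(c) = 0.
# Here f'(x) = 2x - 4, which vanishes at c = 2.
f = lambda x: x**2 - 4*x

def deriv(f, c, h=1e-8):
    return (f(c + h) - f(c)) / h

a, b = 0.0, 4.0
c = 2.0  # the point guaranteed by Rolle's theorem
print("f(0) =", f(a), " f(4) =", f(b))   # equal endpoint values
print("f'(2) =", deriv(f, c))            # approximately 0
```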
L'Hopital's rule and Taylor's theorem
L'Hopital's rule: if f(c) = g(c) = 0, f and g are differentiable near c with g'(x) ≠ 0 there, and lim f'(x)/g'(x) exists as x → c, then lim f(x)/g(x) = lim f'(x)/g'(x). Taylor's theorem: f(x) = f(c) + f'(c)(x-c) + f''(c)(x-c)^2/2! + ... + f^(n)(c)(x-c)^n/n! + R_n, with an explicit remainder term. The Taylor polynomial is the best polynomial approximation of its degree near c.
Scheme
; Taylor polynomial for e^x around c=0
; e^x = 1 + x + x^2/2! + x^3/3! + ...
(define (factorial n)
  (if (<= n 1) 1 (* n (factorial (- n 1)))))
(define (taylor-exp x n)
  (let loop ((k 0) (sum 0.0))
    (if (> k n) sum
        (loop (+ k 1) (+ sum (/ (expt x k) (factorial k)))))))
(define x 1.0)
(display "Taylor approximations of e^1:") (newline)
(for-each (lambda (n)
            (display " degree ") (display n)
            (display ": ") (display (taylor-exp x n))
            (display " (error: ") (display (abs (- (exp x) (taylor-exp x n))))
            (display ")") (newline))
          (list 1 2 3 5 10 15))
(display "Exact e = ") (display (exp 1.0))
Python
# Python equivalent
import math

def taylor_exp(x, n):
    return sum(x**k / math.factorial(k) for k in range(n + 1))

x = 1.0
print("Taylor approximations of e^1:")
for n in [1, 2, 3, 5, 10, 15]:
approx = taylor_exp(x, n)
error = abs(math.e - approx)
print(" degree " + str(n) + ": " + str(approx) + " (error: " + str(error) + ")")
print("Exact e =", math.e)
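The code above covers Taylor's theorem; L'Hopital's rule can be checked numerically in the same spirit. A sketch for the classic 0/0 limit sin(x)/x as x → 0, where the rule gives lim cos(x)/1 = 1:

```python
import math

# lim sin(x)/x as x -> 0 has the 0/0 form; L'Hopital: lim cos(x)/1 = 1.
f = lambda x: math.sin(x)
g = lambda x: x

for x in [0.1, 0.01, 0.001, 1e-6]:
    print(f"x = {x}: f(x)/g(x) = {f(x) / g(x)}")
print("L'Hopital limit: cos(0)/1 =", math.cos(0.0))  # 1.0
```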
📐 Calculus Ch.5 — derivative rules: the computational counterpart to this rigorous treatment
🤖 ML Ch.3 — gradient descent: the chain rule drives all of neural network training
🍞 Capucci 2021 — categorical differentiation: the chain rule as composition in a double category
Translation notes
Numerical differentiation via difference quotients suffers from cancellation error: too-small h makes (f(c+h) - f(c)) lose significant digits. The sweet spot is around h = 10^-8 for double precision. The exact derivative is a limit, not a computation.
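A quick sketch of that trade-off, using sin at c = 1 so the exact derivative cos(1) is known:

```python
import math

# Forward-difference error vs h: truncation error shrinks with h,
# but cancellation (rounding) error grows once h gets too small.
def deriv(f, c, h):
    return (f(c + h) - f(c)) / h

exact = math.cos(1.0)
for h in [1e-2, 1e-4, 1e-8, 1e-12]:
    err = abs(deriv(math.sin, 1.0, h) - exact)
    print(f"h = {h:.0e}: error = {err:.2e}")
# The error bottoms out near h = 1e-8; for much smaller h,
# cancellation typically dominates and the error grows again.
```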