
16.7 Subdifferentials and Directional Derivatives 587
(b) If S ⊂ ri(A) is compact, show that f
n
converges to f uniformly on S.
HINT: Show that g = sup
k
f
k
is bounded above on each ball B
r
(a) and use the
Lipschitz condition from Theorem 16.6.2.
P. A function f defined on a convex subset A of R
n
is called a quasi-convex function if
the sublevel set {a ∈ A : f(a) ≤ α} is a convex set for each α ∈ R.
(a) Show that f is quasi-convex if and only if
f(λa + (1 −λ)b) ≤ max{f (a), f(b)} for all a, b ∈ A.
(b) By part (a), a convex function is always quasi-convex. Give an example of a func-
tion f on R that is quasi-convex but not convex.
(c) Suppose that f is differentiable on R. Show that f is quasi-convex if and only if
f(y) ≥ f(x) + f
0
(x)(y − x) for all x, y ∈ R.
16.7. Subdifferentials and Directional Derivatives
We now turn to the notions of derivatives and subdifferentials. For a convex
function of one variable, we had two different notions that were useful, the left and
right derivatives and the subdifferential set of supporting hyperplanes to epi(f).
Both have natural generalizations to higher dimensions.
16.7.1. DEFINITION. Suppose that A is a convex subset of R
n
, a ∈ A, and f
is a convex function on A. A subgradient of f at a is a vector s ∈ R
n
such that
f(x) ≥ f(a) + hx − a, si for all x ∈ A.
The set of all subgradients of f at a is the subdifferential and is denoted by ∂f(a).
These terms are motivated by the corresponding terms for differentiable func-
tions f : R
n
→ R, so we recall them. The gradient of f at a, denoted ∇f (a),
is the n-tuple of partial derivatives:
¡
∂
∂x
1
f(a), . . . ,
∂
∂x
n
f(a)
¢
. The differential of
f at a is the hyperplane in R
n+1
given by those points (x, t) ∈ R
n
× R so that
t = f (a) + hx − a, ∇f(a)i.
If f is a convex function on A and a ∈ A, then a vector s determines a hyper-
plane of R
n
× R by
H = {(x, t) ∈ R
n
× R : t = f(a) + hx − a, si}
= {(x, t) ∈ R
n
× R :
(x, t), (−s, 1)
®
= f (a) − ha, si}
The condition that s be a subgradient is precisely that the graph of f is contained
in the half-space H
+
= {(x, t) ∈ R
n
× R : t ≥ f(a) + hx − a, si}. Clearly,
this is equivalent to saying that epi(f ) is contained in H
+
. Since H contains the
point (a, f(a)), we conclude that the subgradients of f at a correspond to the sup-
porting hyperplanes of epi(f) at (a, f(a)). It is important that the vector (−s, 1)
determining H has a 1 in the (n+1)st coordinate. This ensures that the hyperplane
is nonvertical, meaning that it is not of the form H
0
×R for some hyperplane H
0
of
R
n
. In the case n = 1, this rules out vertical tangents.