L.
Vandenberghe EE236C (Spring 2013-14)
8. Conjugate functions
• closed functions
• conjugate function
8-1
Closed set
a set C is closed if it contains its boundary:
xk ∈ C, xk → x̄ =⇒ x̄ ∈ C
operations that preserve closedness
• the intersection of (finitely or infinitely many) closed sets is closed
• the union of a finite number of closed sets is closed
• inverse under linear mapping: {x | Ax ∈ C} is closed if C is closed
Conjugate functions 8-2
Image under linear mapping
the image of a closed set under a linear mapping is not necessarily closed
example (C is closed, AC = {Ax | x ∈ C} is open):
R2+
C = {(x1, x2) ∈ | x1x2 ≥ 1}, A= 1 0 , AC = R++
sufficient condition: AC is closed if
• C is closed and convex
• and C does not have a recession direction in the nullspace of A. i.e.,
Ay = 0, x̂ ∈ C, x̂ + αy ∈ C ∀α ≥ 0 =⇒ y=0
in particular, this holds for any A if C is bounded
Conjugate functions 8-3
Closed function
definition: a function is closed if its epigraph is a closed set
examples
• f (x) = − log(1 − x2) with dom f = {x | |x| < 1}
• f (x) = x log x with dom f = R+ and f (0) = 0
• indicator function of a closed set C: f (x) = 0 if x ∈ C = dom f
not closed
• f (x) = x log x with dom f = R++, or with dom f = R+ and f (0) = 1
• indicator function of a set C if C is not closed
Conjugate functions 8-4
Properties
sublevel sets: f is closed if and only if all its sublevel sets are closed
minimum: if f is closed with bounded sublevel sets then it has a minimizer
common operations on convex functions that preserve closedness
• sum: f + g is closed if f and g are closed (and dom f ∩ dom g 6= ∅)
• composition with affine mapping: f (Ax + b) is closed if f is closed
• supremum: supα fα(x) is closed if each function fα is closed
Conjugate functions 8-5
Outline
• closed functions
• conjugate function
Conjugate function
f (x)
xy
the conjugate of a function f is
f ∗(y) = sup (y T x − f (x))
x∈dom f
x
(0, −f ∗(y))
f ∗ is closed and convex even if f is not
Fenchel’s inequality
f (x) + f ∗(y) ≥ xT y ∀x, y
(extends inequality xT x/2 + y T y/2 ≥ xT y to non-quadratic convex f )
Conjugate functions 8-6
Quadratic function
1 T
f (x) = x Ax + bT x + c
2
strictly convex case (A ≻ 0)
1
f (y) = (y − b)T A−1(y − b) − c
∗
2
general convex case (A 0)
1
f ∗(y) = (y − b)T A†(y − b) − c, dom f ∗ = range(A) + b
2
Conjugate functions 8-7
Negative entropy and negative logarithm
negative entropy
n
X n
X
f (x) = xi log xi f ∗(y) = eyi−1
i=1 i=1
negative logarithm
n
X n
X
f (x) = − log xi f ∗(y) = − log(−yi) − n
i=1 i=1
matrix logarithm
f (X) = − log det X (dom f = Sn++) f ∗(Y ) = − log det(−Y ) − n
Conjugate functions 8-8
Indicator function and norm
indicator of convex set C: conjugate is support function of C
0 x∈C
f (x) = f ∗(y) = sup y T x
+∞ x ∈
6 C x∈C
norm: conjugate is indicator of unit dual norm ball
0 kyk∗ ≤ 1
f (x) = kxk f ∗(y) =
+∞ kyk∗ > 1
(see next page)
Conjugate functions 8-9
proof: recall the definition of dual norm:
kyk∗ = sup xT y
kxk≤1
∗ T
to evaluate f (y) = supx y x − kxk we distinguish two cases
• if kyk∗ ≤ 1, then (by definition of dual norm)
y T x ≤ kxk ∀x
and equality holds if x = 0; therefore supx (y T x − kxk) = 0
• if kyk∗ > 1, there exists an x with kxk ≤ 1, xT y > 1; then
f ∗(y) ≥ y T (tx) − ktxk = t(y T x − kxk)
and r.h.s. goes to infinity if t → ∞
Conjugate functions 8-10
The second conjugate
f ∗∗(x) = sup (xT y − f ∗(y))
y∈dom f ∗
• f ∗∗ is closed and convex
• from Fenchel’s inequality (xT y − f ∗(y) ≤ f (x) for all y and x):
f ∗∗(x) ≤ f (x) ∀x
equivalently, epi f ⊆ epi f ∗∗ (for any f )
• if f is closed and convex, then
f ∗∗(x) = f (x) ∀x
equivalently, epi f = epi f ∗∗ (if f is closed convex); proof on next page
Conjugate functions 8-11
proof (f ∗∗ = f if f is closed and convex): by contradiction
suppose (x, f ∗∗(x)) 6∈ epi f ; then there is a strict separating hyperplane:
T
a z−x
≤c<0 ∀(z, s) ∈ epi f
b s − f ∗∗(x)
for some a, b, c with b ≤ 0 (b > 0 gives a contradiction as s → ∞)
• if b < 0, define y = a/(−b) and maximize l.h.s. over (z, s) ∈ epi f :
f ∗(y) − y T x + f ∗∗(x) ≤ c/(−b) < 0
this contradicts Fenchel’s inequality
• if b = 0, choose ŷ ∈ dom f ∗ and add small multiple of (ŷ, −1) to (a, b):
T
a + ǫŷ z−x ∗ T ∗∗
≤ c + ǫ f (ŷ) − x ŷ + f (x) < 0
−ǫ s − f ∗∗(x)
now apply the argument for b < 0
Conjugate functions 8-12
Conjugates and subgradients
if f is closed and convex, then
y ∈ ∂f (x) ⇐⇒ x ∈ ∂f ∗(y) ⇐⇒ xT y = f (x) + f ∗(y)
proof: if y ∈ ∂f (x), then f ∗(y) = supu(y T u − f (u)) = y T x − f (x)
f ∗(v) = sup (v T u − f (u))
u
≥ v T x − f (x)
= xT (v − y) − f (x) + y T x
= f ∗(y) + xT (v − y)
for all v; therefore, x is a subgradient of f ∗ at y (x ∈ ∂f ∗(y))
reverse implication x ∈ ∂f ∗(y) =⇒ y ∈ ∂f (x) follows from f ∗∗ = f
Conjugate functions 8-13
Some calculus rules
separable sum
f (x1, x2) = g(x1) + h(x2) f ∗(y1, y2) = g ∗(y1) + h∗(y2)
scalar multiplication: (for α > 0)
f (x) = αg(x) f ∗(y) = αg ∗(y/α)
addition to affine function
f (x) = g(x) + aT x + b f ∗(y) = g ∗(y − a) − b
infimal convolution
f (x) = inf (g(u) + h(v)) f ∗(y) = g ∗(y) + h∗(y)
u+v=x
Conjugate functions 8-14
References
• J.-B. Hiriart-Urruty, C. Lemaréchal, Convex Analysis and Minimization
Algoritms (1993), chapter X
• D.P. Bertsekas, A. Nedić, A.E. Ozdaglar, Convex Analysis and
Optimization (2003), chapter 7.
• R. T. Rockafellar, Convex Analysis (1970)
Conjugate functions 8-15