ISyE 3013: Optimization for Machine Learning
Lecture #03 2025-09-09
Instructor: Swati Padmanabhan, Teaching Assistant: Zhenyi Zhang
Last class, we studied second-order optimality conditions, both necessary and sufficient, for twice
continuously¹ differentiable functions. We saw that requiring positive semidefiniteness of the
Hessian at all points (a relaxation of the pointwise sufficiency condition) guarantees that every
local minimum is a global minimum. This special class of functions
falls under the umbrella of convexity. We then defined convex sets and used this definition to provide
a “secant inequality”-based definition of convex functions. In this lecture, we continue our study
of this very important class of functions and derive some of its equivalent characterizations.
3.1 Convex Sets, Convex Functions
Definition 3.D1. A set K ⊆ Rd is convex if for every pair of points x, y ∈ K, we have [x, y] ⊆ K.
Example 3.E1. Two important convex sets for us are: (1) halfspaces, and (2) ellipsoids.
Proof. (1) Convexity of a halfspace.
A halfspace is defined as the set H := {u ∈ Rd : a⊤u ≤ b}. Here, a is the normal vector that
points away from the halfspace. In order to prove that this set is convex, we must prove that given
any two points in H, the line segment joining them lies wholly inside H. Recall, what it means for
“a point to be contained in a set” is that “the point satisfies the equation (or inequality) describing
the set”.
[Figure: the halfspace x + y ≤ 1 in R², shaded below the line through (0, 1) and (1, 0); the point
(1, 1) lies outside it.]
Let us put this insight to work here. We start with two arbitrary points in H:
x, y ∈ H ⇐⇒ a⊤x ≤ b, a⊤y ≤ b. (3.1.1)
Now consider a z ∈ [x, y], i.e., z = λ · x + (1 − λ) · y for some λ ∈ [0, 1]. Then we have,
a⊤ z = a⊤ (λ · x + (1 − λ) · y) = λ · a⊤ x + (1 − λ) · a⊤ y ≤ λ · b + (1 − λ) · b = b,
where the third step uses Equation (3.1.1). Since this holds for all λ ∈ [0, 1], every point on the
line segment joining x and y lies in H, thereby proving the convexity of H.
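The argument above can be sanity-checked numerically. The sketch below (with the example choice a = (1, 1), b = 1, matching the halfspace x + y ≤ 1 in the figure; these values are illustrative, not from the notes) samples pairs of points in H and verifies that random convex combinations remain in H. This is supporting evidence, not a proof.

```python
import numpy as np

rng = np.random.default_rng(1)
a = np.array([1.0, 1.0])  # normal vector of H = {u : a^T u <= b} (example values)
b = 1.0

def in_halfspace(u):
    """Membership test for H, with a small tolerance for floating point."""
    return a @ u <= b + 1e-12

# Sample pairs x, y in H and check that a random convex combination stays in H.
checked = 0
while checked < 1000:
    x, y = rng.uniform(-5, 5, size=(2, 2))
    if not (in_halfspace(x) and in_halfspace(y)):
        continue
    lam = rng.random()
    assert in_halfspace(lam * x + (1 - lam) * y)
    checked += 1
print("checked 1000 sampled segments")
```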
(2) Convexity of an ellipsoid.
We now prove convexity of the ellipsoid. The ellipsoid is defined as the set
E := {x ∈ Rd : (x − xc)⊤ P(x − xc) ≤ 1}, P ≻ 0, xc ∈ Rd. (3.1.2)
For our purpose here, the following equivalent definition is useful:
E := {x ∈ Rd : ∥√P(x − xc)∥2 ≤ 1}, P ≻ 0, xc ∈ Rd. (3.1.3)
These notes have not been subjected to the usual scrutiny reserved for formal peer-reviewed publications. Thank
you for reporting any typos!
¹Please see the lecture notes for the precise conditions; we are simplifying some nuance in this summary.
The eigenvalues of the positive definite matrix P determine the lengths of the axes of the ellipsoid
(the semi-axis along the i-th eigenvector of P has length 1/√λi).
Why is E from Equation (3.1.3) a convex set? Let u, v ∈ E. This is equivalent to:
∥√P(u − xc)∥2 ≤ 1, ∥√P(v − xc)∥2 ≤ 1. (3.1.4)
Now consider a z ∈ [u, v], i.e., z = λ · u + (1 − λ) · v for some λ ∈ [0, 1]. Then we have,
∥√P(z − xc)∥2 = ∥√P(λ · u + (1 − λ) · v − xc)∥2
= ∥λ · √P(u − xc) + (1 − λ) · √P(v − xc)∥2
≤ λ · ∥√P(u − xc)∥2 + (1 − λ) · ∥√P(v − xc)∥2
≤ λ · 1 + (1 − λ) · 1 = 1,
where the second step writes xc = λ · xc + (1 − λ) · xc, the third uses the triangle inequality (and
absolute homogeneity of the norm), and the fourth uses Equation (3.1.4). Since z ∈ E for every
λ ∈ [0, 1], the set E is convex.
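A similar numerical sanity check works for the ellipsoid. The sketch below builds an example positive definite P (the specific P, center, and dimension are illustrative choices, not from the notes), forms √P via an eigendecomposition, samples points of E by mapping the unit ball through √P⁻¹, and checks that convex combinations stay in E.

```python
import numpy as np

rng = np.random.default_rng(0)

# An example positive definite P: A A^T + eps*I is PD for any square A.
A = rng.standard_normal((3, 3))
P = A @ A.T + 0.1 * np.eye(3)
xc = rng.standard_normal(3)

# Matrix square root of P via its eigendecomposition (valid since P ≻ 0).
w, V = np.linalg.eigh(P)
sqrtP = V @ np.diag(np.sqrt(w)) @ V.T

def in_ellipsoid(x):
    """Membership test from Equation (3.1.3): ||sqrt(P)(x - xc)||_2 <= 1."""
    return np.linalg.norm(sqrtP @ (x - xc)) <= 1 + 1e-12

def sample_point():
    """Sample a point of E by pulling back a point of the open unit ball."""
    d = rng.standard_normal(3)
    d *= rng.random() / np.linalg.norm(d)   # random direction, radius < 1
    return xc + np.linalg.solve(sqrtP, d)   # x = xc + sqrt(P)^{-1} d

for _ in range(1000):
    u, v = sample_point(), sample_point()
    lam = rng.random()
    z = lam * u + (1 - lam) * v
    assert in_ellipsoid(u) and in_ellipsoid(v) and in_ellipsoid(z)
print("all sampled convex combinations stayed inside E")
```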
For now, we have only proved the convexity of these sets. Throughout the course, we'll see them
play central roles in wide-ranging applications. As an example, the problem class of linear programs,
which we saw in the first lecture, is convex² partly because its feasible set is the intersection of
halfspaces (convexity is preserved under intersection). Similarly, ellipsoids are enormously useful
in data analysis (e.g., in identifying outliers), statistics (e.g., experiment design), in robotics (e.g.,
collision avoidance), and in approximating convex bodies (e.g., in the ellipsoid method, featured in
the newspaper article below). A wonderful resource on ellipsoids in optimization is [Tod16].
Definition 3.D2. A function f : K → R defined on a convex set K ⊆ Rd is convex (or convex over
K) if for every pair of points x, y ∈ K, we have
f (λ · x + (1 − λ) · y) ≤ λ · f (x) + (1 − λ) · f (y), for all λ ∈ [0, 1].
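Definition 3.D2 can also be tested numerically on sampled points. The helper below (a hypothetical `is_secant_convex`, not from the notes) checks the secant inequality on random pairs: passing all trials is merely supporting evidence for convexity, while a single violation certifies non-convexity.

```python
import numpy as np

rng = np.random.default_rng(2)

def is_secant_convex(f, lo=-10.0, hi=10.0, trials=10_000):
    """Test the secant inequality of Definition 3.D2 on random points.
    Returns False as soon as a violation is found."""
    for _ in range(trials):
        x, y = rng.uniform(lo, hi, size=2)
        lam = rng.random()
        lhs = f(lam * x + (1 - lam) * y)
        rhs = lam * f(x) + (1 - lam) * f(y)
        if lhs > rhs + 1e-9:
            return False
    return True

print(is_secant_convex(lambda x: x * x))  # convex: secant inequality holds
print(is_secant_convex(np.sin))           # not convex: violations exist
```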
Example 3.E2. Here are some convex functions.
[Figure: plots of four convex functions — f(x) = |x|, f(x) = ex, f(x, y) = x² + y², and
f(x, y) = |x| + |y|.]
²Note: until now, we defined convex sets and convex functions; convex programs will be introduced shortly.
We'll see later that there is a close link between convex functions and their epigraphs.
Problem 3.P1. Prove the convexity of: (1) f (x) = |x|, (2) f (x) = x2 .
With Definition 3.D2 in hand, we are now ready to revisit a big question we posed in the first
lecture: is there a class of functions for which a local minimum is in fact a global minimum? We
answer this question affirmatively in Theorem 3.T1 below.
3.1.1 Local Minima are Global Minima
Theorem 3.T1. When f is convex, any local minimizer x⋆ is a global minimizer of f . If, in
addition, f is differentiable, then any stationary point x⋆ is a global minimizer of f .
Proof. Let x⋆ be a local minimizer of f, and suppose, for contradiction, that x⋆ is not a global
minimizer, i.e., there exists z ̸= x⋆ with f(z) < f(x⋆). Consider the line segment joining the two
points, and let x ∈ (x⋆, z). Then
f(x) = f(λz + (1 − λ)x⋆) ≤ λf(z) + (1 − λ)f(x⋆) = λ(f(z) − f(x⋆)) + f(x⋆) < f(x⋆),
where the first step holds for some λ ∈ (0, 1) (since we chose x ∈ (x⋆, z)), the second step is by
Definition 3.D2, and the final step is because f(z) < f(x⋆) and λ > 0. Since x can be chosen
arbitrarily close to x⋆, this contradicts the local minimality of x⋆. To see the second part of the
claim, we revisit the definition of convexity:
λf (z) + (1 − λ)f (x⋆ ) ≥ f (λz + (1 − λ)x⋆ ),
where λ ∈ (0, 1). Rearranging terms and dividing throughout by λ (valid since λ > 0) yields:
f(z) − f(x⋆) ≥ (f(λz + (1 − λ)x⋆) − f(x⋆)) / λ.
The inequality above is preserved if we take λ → 0⁺:
f(z) − f(x⋆) ≥ lim_{λ→0⁺} (f(x⋆ + λ(z − x⋆)) − f(x⋆)) / λ
= (d/dλ) f(x⋆ + λ(z − x⋆)) |_{λ=0}
= ∇f(x⋆)⊤(z − x⋆),
where the second step is by the definition of the derivative of f restricted to a line (note that
λz + (1 − λ)x⋆ = x⋆ + λ(z − x⋆)), and the third by the connection between the directional derivative
and the gradient. At a stationary point, ∇f(x⋆) = 0, so the above gives f(z) ≥ f(x⋆) for every z,
which concludes both claims.
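The limiting step in this proof can be observed numerically. The sketch below uses an example convex quadratic f(x) = x⊤Ax with A ≻ 0 (an illustrative choice, not from the notes): the difference quotient along the segment from x⋆ to z approaches the directional derivative ∇f(x⋆)⊤(z − x⋆) as λ → 0⁺, and stays below f(z) − f(x⋆).

```python
import numpy as np

# Example convex quadratic f(x) = x^T A x with A ≻ 0, and its gradient.
A = np.array([[2.0, 0.5], [0.5, 1.0]])
f = lambda x: x @ A @ x
grad = lambda x: 2 * A @ x

xstar = np.array([0.3, -0.7])
z = np.array([2.0, 1.5])

# The quotient (f(x* + λ(z - x*)) - f(x*)) / λ should approach
# the directional derivative ∇f(x*)^T (z - x*) as λ → 0+.
direc = grad(xstar) @ (z - xstar)
for lam in [1e-1, 1e-3, 1e-5]:
    dq = (f(xstar + lam * (z - xstar)) - f(xstar)) / lam
    print(lam, dq, direc)
```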
Problem 3.P2. Is a function that satisfies either of the following properties necessarily convex?
1. “every local minimum is a global minimum”.
2. “every stationary point is a global minimum”.
3.1.2 First-Order Taylor Approximator as Global Underestimator
In general, convex functions need not be differentiable. But when they are, they admit an alternate,
equivalent, first-order characterization, whose proof follows that of Theorem 3.T1.
Theorem 3.T2. Let f : K → R be a continuously differentiable function on convex set K ⊆ Rd .
Then f is convex on K if and only if f (y) ≥ f (x) + ∇f (x)⊤ (y − x) for any x, y ∈ K.
Proof. One direction follows the same idea as Theorem 3.T1. For the other direction, suppose that
the stated inequality holds for all points in K. We apply it to the two pairs of points (x, z) and
(y, z), where z = λx + (1 − λ)y for some λ ∈ [0, 1]:
f(x) ≥ f(z) + ∇f(z)⊤(x − z),
f(y) ≥ f(z) + ∇f(z)⊤(y − z). (3.1.5)
Multiplying the first inequality by λ, the second by (1 − λ), and adding gives:
λf(x) + (1 − λ)f(y) ≥ f(z) + ∇f(z)⊤(λx − λz + (1 − λ)y − (1 − λ)z),
which simplifies to the claim, since λx + (1 − λ)y − z = 0.
In plain English, Theorem 3.T2 states the following: the linear approximation of a convex function
at any point lies entirely below the function, everywhere. Thus, local information about a convex
function (its value and gradient at a point) tells us something about the function everywhere.
This is one of the most important properties of convex functions, since it is what enables the
development of fast (polynomial-time) algorithms for optimizing them: finding a minimum of an
arbitrary function can require exhaustive search, but this key fact enables a binary-search-like
strategy. We will see this powerful principle in action in all the algorithms in this class.
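As a quick numerical check of Theorem 3.T2, the sketch below uses the quadratic f(x) = 0.6·x² − 1.2·x + 1.6 from Example 3.E3 below and verifies on random points that every tangent line underestimates f. (For this particular f, the gap works out to 0.6·(y − x)² ≥ 0, so the inequality holds with no slack issues.)

```python
import numpy as np

rng = np.random.default_rng(3)

# The convex quadratic from Example 3.E3 and its derivative.
f = lambda x: 0.6 * x**2 - 1.2 * x + 1.6
fprime = lambda x: 1.2 * x - 1.2

# Theorem 3.T2: f(y) >= f(x) + f'(x) (y - x) for all x, y.
for _ in range(10_000):
    x, y = rng.uniform(-10, 10, size=2)
    assert f(y) >= f(x) + fprime(x) * (y - x) - 1e-9
print("every sampled tangent line underestimates f")
```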
Example 3.E3. In the plot below (for f(x) = 0.6 · x² − 1.2 · x + 1.6), we show the tangent
f(u) = f(x) + f′(x) · (u − x) at three different points, illustrating Theorem 3.T2.
[Figure: the parabola f(x) with tangent lines drawn at (x, f(x)), (y, f(y)), and (z, f(z)); each
tangent lies entirely below the curve.]
Problem 3.P3. Prove that a function is convex if and only if its epigraph is a convex set.
Remark 3.R1. Theorem 3.T2 provides an alternate proof of the second part of Theorem 3.T1:
since f(y) ≥ f(x) + ∇f(x)⊤(y − x) for all x, y ∈ dom(f), if for some x⋆ ∈ dom(f) we have
∇f(x⋆) = 0, then f(y) ≥ f(x⋆) for all y ∈ dom(f).
Problem 3.P4. In the previous lecture, we showed ex ≥ 1 + x using an application of Taylor’s
Theorem. Prove the same inequality using Theorem 3.T2.
Readings
The material in these notes is based on the following excellent sources: [BV04, Chapters 2 and 3]
and [NW06, Chapter 2.1].
References
[BV04] Stephen P. Boyd and Lieven Vandenberghe. Convex Optimization. Cambridge University Press, 2004.
[NW06] Jorge Nocedal and Stephen J. Wright. Numerical Optimization. Springer, 2006.
[Tod16] Michael J. Todd. Minimum-Volume Ellipsoids: Theory and Algorithms. SIAM, 2016.