Pontryagin’s maximum principle
Emo Todorov
Applied Mathematics and Computer Science & Engineering
University of Washington
Winter 2012
Emo Todorov (UW) AMATH/CSE 579, Winter 2012 Lecture 5 1/9
Pontryagin’s maximum principle

For deterministic dynamics ẋ = f(x, u) we can compute extremal open-loop
trajectories (i.e. local minima) by solving a boundary-value ODE problem
with given x(0) and λ(T) = ∂/∂x q_T(x), where λ(t) is the gradient of the
optimal cost-to-go function (called the costate).

Definition (deterministic Hamiltonian)

    H(x, u, λ) ≜ ℓ(x, u) + f(x, u)^T λ

Theorem (continuous-time maximum principle)

If x(t), u(t), 0 ≤ t ≤ T is the optimal state-control trajectory starting
at x(0), then there exists a costate trajectory λ(t) with λ(T) = ∂/∂x q_T(x)
satisfying

    ẋ = H_λ(x, u, λ) = f(x, u)
    −λ̇ = H_x(x, u, λ) = ℓ_x(x, u) + f_x(x, u)^T λ
    u = argmin_ũ H(x, ũ, λ)
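As a sketch of how such boundary-value problems can be attacked numerically, here is a minimal single-shooting example on a made-up scalar linear-quadratic problem (the constants a, b, q, r, qf are assumptions, not from the slides): guess λ(0), integrate the Hamiltonian ODE forward, and adjust λ(0) until the terminal condition λ(T) = ∂/∂x q_T(x(T)) holds.

```python
import numpy as np

# Single-shooting sketch for the PMP boundary-value problem on a
# hypothetical scalar LQ problem, where the minimized Hamiltonian gives
#   xdot = a x - (b^2/r) lam,   lamdot = -q x - a lam
# with x(0) given and lam(T) = qf * x(T) (gradient of final cost qf x^2/2).
a, b, q, r, qf = -1.0, 1.0, 1.0, 1.0, 1.0
x0, T, n = 1.0, 1.0, 1000
dt = T / n

def shoot(lam0):
    # integrate the Hamiltonian ODE forward from a guessed initial costate
    x, lam = x0, lam0
    for _ in range(n):
        x, lam = (x + dt * (a * x - b**2 / r * lam),
                  lam + dt * (-q * x - a * lam))
    return x, lam

def residual(lam0):
    x, lam = shoot(lam0)
    return lam - qf * x          # terminal condition lam(T) = qf x(T)

# for a linear system the residual is affine in lam0,
# so a single secant step solves the boundary condition exactly
r0, r1 = residual(0.0), residual(1.0)
lam0 = -r0 / (r1 - r0)
assert abs(residual(lam0)) < 1e-9
```

For nonlinear dynamics the residual is no longer affine, and the secant step becomes one iteration of a root-finding loop on λ(0).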
Derivation from the HJB equation (continuous time)

For deterministic dynamics ẋ = f(x, u) the optimal cost-to-go in the
finite-horizon setting satisfies the HJB equation

    −v_t(x, t) = min_u { ℓ(x, u) + f(x, u)^T v_x(x, t) } = min_u H(x, u, v_x(x, t))

If the optimal control law is π(x, t), we can set u = π and drop the ’min’:

    0 = v_t(x, t) + ℓ(x, π(x, t)) + f(x, π(x, t))^T v_x(x, t)

Now differentiate w.r.t. x and suppress the dependences for clarity:

    0 = v_tx + ℓ_x + π_x^T ℓ_u + (f_x + f_u π_x)^T v_x + v_xx f

Using the identity v̇_x = v_tx + v_xx f and regrouping yields

    0 = v̇_x + ℓ_x + f_x^T v_x + π_x^T (ℓ_u + f_u^T v_x) = v̇_x + H_x + π_x^T H_u

Since u is optimal we have H_u = 0, thus −λ̇ = H_x(x, π, λ) where λ = v_x.
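To make the identity λ = v_x concrete, here is a small numerical check on a hypothetical scalar LQ problem (all constants are made up). There the cost-to-go is known to be v(x, t) = s(t) x²/2 with s(t) given by a Riccati equation, so λ(t) = s(t) x(t) along the optimal trajectory should satisfy the costate equation −λ̇ = H_x.

```python
import numpy as np

# Check that lambda = v_x on a hypothetical scalar LQ problem:
# dynamics xdot = a x + b u, cost l = (q x^2 + r u^2)/2, final cost qf x^2/2.
# Here v(x,t) = s(t) x^2/2 with s(t) from the Riccati equation, so the
# costate lambda(t) = s(t) x(t) should satisfy -lamdot = H_x = q x + a lam.
a, b, q, r, qf, T = -0.5, 1.0, 1.0, 0.5, 2.0, 1.0
n = 20000
dt = T / n

# backward Riccati pass: -sdot = q + 2 a s - s^2 b^2 / r, with s(T) = qf
s = np.empty(n + 1)
s[n] = qf
for i in range(n, 0, -1):
    sdot = -(q + 2 * a * s[i] - s[i] ** 2 * b ** 2 / r)
    s[i - 1] = s[i] - dt * sdot

# forward pass under the optimal control u = -(b/r) s x
x = np.empty(n + 1)
x[0] = 1.0
for i in range(n):
    x[i + 1] = x[i] + dt * (a * x[i] - (b ** 2 / r) * s[i] * x[i])

lam = s * x                        # candidate costate lambda = v_x
lamdot = np.gradient(lam, dt)      # numerical time derivative
residual = np.max(np.abs(lamdot + q * x + a * lam)[5:-5])
assert residual < 1e-2             # -lamdot = H_x, up to O(dt) Euler error
```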
Derivation via Lagrange multipliers (discrete time)

Optimize total cost subject to dynamics constraints x_{k+1} = f(x_k, u_k).
Define the Lagrangian L(x, u, λ) as

    L = q_T(x_N) + ∑_{k=0}^{N−1} [ ℓ(x_k, u_k) + (f(x_k, u_k) − x_{k+1})^T λ_{k+1} ]
      = q_T(x_N) − x_N^T λ_N + x_0^T λ_0 + ∑_{k=0}^{N−1} [ H(x_k, u_k, λ_{k+1}) − x_k^T λ_k ]

Setting L_x = L_λ = 0 and explicitly minimizing w.r.t. u yields

Theorem (discrete-time maximum principle)

If x_k, u_k, 0 ≤ k ≤ N is the optimal state-control trajectory starting at x_0,
then there exists a costate trajectory λ_k with λ_N = ∂/∂x q_T(x_N) satisfying

    x_{k+1} = H_λ(x_k, u_k, λ_{k+1}) = f(x_k, u_k)
    λ_k = H_x(x_k, u_k, λ_{k+1}) = ℓ_x(x_k, u_k) + f_x(x_k, u_k)^T λ_{k+1}
    u_k = argmin_ũ H(x_k, ũ, λ_{k+1})
Gradient of the total cost

The maximum principle provides an efficient way to evaluate the gradient of
the total cost w.r.t. u, and thereby optimize the controls numerically.

Theorem (gradient)

For a given control trajectory u_k, let x_k, λ_k be such that

    x_{k+1} = f(x_k, u_k)
    λ_k = ℓ_x(x_k, u_k) + f_x(x_k, u_k)^T λ_{k+1}

with x_0 given and λ_N = ∂/∂x q_T(x_N). Let J(x, u) be the total cost. Then

    ∂J/∂u_k = H_u(x_k, u_k, λ_{k+1}) = ℓ_u(x_k, u_k) + f_u(x_k, u_k)^T λ_{k+1}

Note that x_k can be found in a forward pass (since it does not depend on λ),
and then λ_k can be found in a backward pass.
Proof by induction

The cost accumulated from time k until the end can be written recursively as

    J_k(x_{k···N}, u_{k···N−1}) = ℓ(x_k, u_k) + J_{k+1}(x_{k+1···N}, u_{k+1···N−1})

Noting that u_k affects future costs only through x_{k+1} = f(x_k, u_k), we have

    ∂J_k/∂u_k = ℓ_u(x_k, u_k) + f_u(x_k, u_k)^T ∂J_{k+1}/∂x_{k+1}

We need to show that λ_k = ∂J_k/∂x_k. For k = N this holds because J_N = q_T.
For k < N we have

    ∂J_k/∂x_k = ℓ_x(x_k, u_k) + f_x(x_k, u_k)^T ∂J_{k+1}/∂x_{k+1}

which is identical to λ_k = ℓ_x(x_k, u_k) + f_x(x_k, u_k)^T λ_{k+1}.
Enforcing terminal states

• The final state x(T) is usually different from the minimum of the final
  cost q_T, because it reflects a trade-off between final and running cost.

• We can instead enforce a given terminal state x(T) = x* as a boundary
  condition, and remove the boundary condition on λ(T).

• Once the solution is found, we can construct a function q_T such that
  λ(T) = ∂/∂x q_T(x(T)). However, if λ(T) ≠ 0 then x(T) is not the
  minimum of this q_T.

• We can also define the problem as infinite-horizon average cost, in which
  case it is usually suboptimal to have an asymptotic state different from
  the minimum of the state cost function. The maximum principle does not
  apply to infinite-horizon problems, so one has to use the HJB equations.
More tractable problems

When the dynamics and cost are in the restricted form

    ẋ = a(x) + B u
    ℓ(x, u) = q(x) + (1/2) u^T R u

the Hamiltonian can be minimized analytically, which yields the ODE

    ẋ = a(x) − B R⁻¹ B^T λ
    λ̇ = −q_x(x) − a_x(x)^T λ

with boundary conditions x(0) and λ(T) = ∂/∂x q_T(x). If B, R depend on x,
the second equation has additional terms involving the derivatives of B, R.

We have H_u = R(x) u + B(x)^T λ and H_uu = R(x) ≻ 0. Thus the maximum
principle here is both a necessary and a sufficient condition for a local
minimum.
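The analytic minimization can be verified numerically: setting H_u = R u + B^T λ to zero gives u* = −R⁻¹ B^T λ, and since H_uu = R is positive definite, u* should beat every perturbed control. A small sketch with arbitrary made-up matrices:

```python
import numpy as np

# For the restricted form, the u-dependent part of the Hamiltonian is
#   H(u) = u^T R u / 2 + (B u)^T lam,
# minimized at u* = -R^{-1} B^T lam. Quick check with made-up values.
rng = np.random.default_rng(0)
n, m = 3, 2
B = rng.standard_normal((n, m))
M = rng.standard_normal((m, m))
R = M @ M.T + m * np.eye(m)       # symmetric positive definite
lam = rng.standard_normal(n)

H = lambda u: 0.5 * u @ R @ u + (B @ u) @ lam
u_star = -np.linalg.solve(R, B.T @ lam)

# every perturbed control gives a strictly larger Hamiltonian,
# since H(u* + du) - H(u*) = du^T R du / 2 > 0 for R positive definite
for _ in range(100):
    du = rng.standard_normal(m)
    assert H(u_star + du) > H(u_star)
```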
Pendulum example

Passive dynamics:

    a(x) = [ x2 ; −k sin(x1) ]

    a_x(x) = [ 0, 1 ; −k cos(x1), 0 ]

Optimal control:

    u = −(1/r) λ2

ODE (with q = 0):

    ẋ1 = x2
    ẋ2 = −k sin(x1) − (1/r) λ2
    λ̇1 = k cos(x1) λ2
    λ̇2 = −λ1

[Figure: cost-to-go and trajectories]
[Figure: control law (from HJB)]
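A discrete-time sketch of this example: plain gradient descent on the control trajectory using the maximum-principle gradient (forward state pass, backward costate pass). The horizon, cost weights, terminal target, and line search are assumptions added for illustration, not from the slides.

```python
import numpy as np

# Discrete-time pendulum swing-up sketch via the maximum-principle gradient.
# All constants here are made up for illustration.
k_grav, r, dt, N, w = 1.0, 0.1, 0.05, 100, 10.0

def f(x, u):
    # Euler-discretized pendulum, x = (angle, velocity)
    return np.array([x[0] + dt * x[1],
                     x[1] + dt * (-k_grav * np.sin(x[0]) + u)])

def fx(x):
    # Jacobian of f w.r.t. x
    return np.array([[1.0, dt],
                     [-dt * k_grav * np.cos(x[0]), 1.0]])

target = np.array([np.pi, 0.0])   # swing up to the inverted position

def cost(U):
    x, J = np.zeros(2), 0.0
    for u in U:
        J += 0.5 * dt * r * u**2
        x = f(x, u)
    return J + 0.5 * w * np.sum((x - target) ** 2)

def gradient(U):
    X = [np.zeros(2)]
    for u in U:                          # forward pass (independent of lam)
        X.append(f(X[-1], u))
    lam = w * (X[-1] - target)           # lam_N = dqT/dx
    G = np.empty(N)
    for k in reversed(range(N)):         # backward costate pass
        G[k] = dt * r * U[k] + dt * lam[1]   # H_u = l_u + f_u^T lam_{k+1}
        lam = fx(X[k]).T @ lam           # l_x = 0: running cost has no x term
    return G

# gradient descent with a simple backtracking line search
U = np.zeros(N)
c = cost(U)
for _ in range(200):
    G, step = gradient(U), 1.0
    while step > 1e-12 and cost(U - step * G) >= c:
        step *= 0.5
    if step <= 1e-12:
        break
    U = U - step * G
    c = cost(U)
# c is now below the cost of the zero-control trajectory
```

This is exactly the forward/backward scheme from the gradient theorem, applied to the pendulum dynamics above; a practical implementation would use a better optimizer (e.g. quasi-Newton) on the same gradient.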