0% found this document useful (0 votes)

44 views

Lecture 2 Deterministic

The document describes a deterministic dynamic programming problem called the stagecoach problem. It involves finding the optimal route through multiple stages (stagecoach runs) to minimize the total cost of life insurance. Dynamic programming is used to solve this by starting with the final stage and working backwards, determining the optimal decision at each stage based on the previous stages. Deterministic dynamic programming problems have states and decisions at each stage that deterministically lead to the next state.

Uploaded by

Armee Justitia

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views

Lecture 2 Deterministic

Uploaded by

Armee Justitia

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

Deterministic Dynamic

Programming

1 / 17 Ya-Tang Chuang Dynamic Programming

T HE S TAGECOACH P ROBLEM
The stagecoach problem is a problem specially constructed to
illustrate the features and to introduce the terminology of dynamic
programming.

A mythical fortune seeker in Missouri who decided to go west to join

the gold rush in California during the mid-19th century. The journey
would require traveling by stagecoach through unsettled country
where there was serious danger of attack by marauders.

The possible routes are shown in the figure, where each state is
represented by a circled letter and the direction of travel is always
from left to right in the diagram.

2 / 17 Ya-Tang Chuang Dynamic Programming

T HE S TAGECOACH P ROBLEM
Four stages were required to travel from his point of embarkation in
state A (Missouri) to his destination in state J (California).

3 / 17 Ya-Tang Chuang Dynamic Programming

T HE S TAGECOACH P ROBLEM
Life insurance policies were offered to stagecoach passengers. The
cost of the policy for taking any given stagecoach run was based on a
careful evaluation of the safety of that run. Thus, the safest route
should be the one with the cheapest total life insurance policy.

We shall now focus on the question of which route minimizes the total
cost of the policy.

4 / 17 Ya-Tang Chuang Dynamic Programming

T HE S TAGECOACH P ROBLEM

How should we find the route with cheapest cost?

5 / 17 Ya-Tang Chuang Dynamic Programming

T HE S TAGECOACH P ROBLEM

How should we find the route with cheapest cost?

How about selecting the cheapest road at each stage?

5 / 17 Ya-Tang Chuang Dynamic Programming

T HE S TAGECOACH P ROBLEM

How should we find the route with cheapest cost?

How about selecting the cheapest road at each stage?

One possible approach to solving this problem is to use trial and

error. However, the number of possible routes is large (18)

5 / 17 Ya-Tang Chuang Dynamic Programming

T HE S TAGECOACH P ROBLEM

Dynamic programming starts with a small portion of the original

problem and finds the optimal solution for this smaller problem

6 / 17 Ya-Tang Chuang Dynamic Programming

T HE S TAGECOACH P ROBLEM

Dynamic programming starts with a small portion of the original

problem and finds the optimal solution for this smaller problem

We start with the smaller problem where the fortune seeker has
nearly completed his journey and has only one more stage
(stagecoach run) to go.

6 / 17 Ya-Tang Chuang Dynamic Programming

T HE S TAGECOACH P ROBLEM

Dynamic programming starts with a small portion of the original

problem and finds the optimal solution for this smaller problem

We start with the smaller problem where the fortune seeker has
nearly completed his journey and has only one more stage
(stagecoach run) to go.

At each subsequent iteration, the problem is enlarged by

increasing by 1 the number of stages left to go to complete the
journey

6 / 17 Ya-Tang Chuang Dynamic Programming

C HARACTERISTICS OF DYNAMIC P ROGRAMMING
P ROBLEMS
These features characterize dynamic programming problems
1. The problem can be divided into stages, with a decision required
at each stage.

2. Each stage has a number of states associated with the

beginning of that stage.

3. The effect of the decision at each stage is to transform the

current state to a state associated with the beginning of the next
stage (possibly according to a probability distribution).

4. The solution procedure is designed to find an optimal policy for

the overall problem

7 / 17 Ya-Tang Chuang Dynamic Programming

C HARACTERISTICS OF DYNAMIC P ROGRAMMING
P ROBLEMS
These features characterize dynamic programming problems
5. The optimal immediate decision depends on only the current
state and not on how you got there. This is the principle of
optimality for dynamic programming

6. The solution procedure begins by finding the optimal policy for

the last stage (backward induction)

7. A recursive relationship that identifies the optimal policy for

stage n, given the optimal policy for stage n + 1, is available.

fn∗ (sn ) = min{c(sn , xn ) + fn+1

∗
(sn+1 )}
xn

8 / 17 Ya-Tang Chuang Dynamic Programming

C HARACTERISTICS OF DYNAMIC P ROGRAMMING
P ROBLEMS
Notation which will be used are summarized below:
N: number of stages

n: label for current stage (n = 1, 2, . . . , N )

sn : current state for stage n

xn : decision variable for stage n

xn∗ : optimal value of xn (given sn )

fn (sn , xn ): objective function if system starts in state sn at stage

n, immediate decision is xn , and optimal decisions are made
thereafter

9 / 17 Ya-Tang Chuang Dynamic Programming

C HARACTERISTICS OF DYNAMIC P ROGRAMMING
P ROBLEMS
Notation which will be used are summarized below:
The recursive relationship will always be of the form

fn∗ (sn ) = min{fn (sn , xn )}

fn∗ (sn ) = max{fn (sn , xn )},

where fn∗ (sn ) = fn (sn , xn∗ )

10 / 17 Ya-Tang Chuang Dynamic Programming

D ETERMINISTIC DYNAMIC PROGRAMMING

The state at the next stage is completely determined by the state

and decision at the current stage

states sn might be representable by a discrete state variable (as

for the stagecoach problem) or by a continuous state variable

decision variables (x1 , x2 , . . . , xN ) also can be either discrete or

continuous.

11 / 17 Ya-Tang Chuang Dynamic Programming

D ISTRIBUTING M EDICAL T EAMS TO C OUNTRIES
The World Health Organization (WHO) is devoted to improving health
care in the underdeveloped countries of the world. It now has five
medical teams available to allocate among three such countries to
improve their medical care

The WHO needs to determine how many teams (if any) to allocate to
each of these countries to maximize the total effectiveness of the five
teams. The the number allocated to each country must be an integer.

The measure of performance being used is additional person-years of

life.

12 / 17 Ya-Tang Chuang Dynamic Programming

D ISTRIBUTING M EDICAL T EAMS TO C OUNTRIES
The measure of performance being used is additional person-years of
life.

What are the decision variables xn and state variables sn ?

13 / 17 Ya-Tang Chuang Dynamic Programming

T HE D ISTRIBUTION OF E FFORT P ROBLEM
The preceding example illustrates a particularly common type of
dynamic programming problem called the distribution of effort
problem.

Stage n: activity n (n = 1, 2, . . . , N).

xn : amount of resource allocated to activity n

State sn : amount of resource still available for allocation to

remaining activities

When the system starts at stage n in state sn , the choice of xn results

in the next state at stage n + 1 being sn+1 = sn − xn

14 / 17 Ya-Tang Chuang Dynamic Programming

D ISTRIBUTING S CIENTISTS TO R ESEARCH T EAMS
A government space project is conducting research on a certain
engineering problem that must be solved before people can fly safely
to Mars.

Three research teams are currently trying three different approaches

for solving this problem. The estimate has been made that, under
present circumstances, the probability that the respective teams —
call them 1, 2, and 3 — will not succeed is 0.40, 0.60, and 0.80,
respectively. Thus, the current probability that all three teams will fail
is 0.40 × 0.60 × 0.80 = 0.192. Because the objective is to minimize
the probability of failure, two more top scientists have been assigned
to the project.

Following table gives the estimated probability that the respective

teams will fail when 0, 1, or 2 additional scientists are added to that
team.

15 / 17 Ya-Tang Chuang Dynamic Programming

D ISTRIBUTING S CIENTISTS TO R ESEARCH T EAMS

The problem is to determine how to allocate the two additional

scientists to minimize the probability that all three teams will fail.

16 / 17 Ya-Tang Chuang Dynamic Programming

D ISTRIBUTING S CIENTISTS TO R ESEARCH T EAMS

The problem is to determine how to allocate the two additional

scientists to minimize the probability that all three teams will fail.

17 / 17 Ya-Tang Chuang Dynamic Programming

Dynamic Programming
100% (1)
Dynamic Programming
15 pages
Stochastic Dynamic Programming
No ratings yet
Stochastic Dynamic Programming
18 pages
Optimization Theory with Applications
From Everand
Optimization Theory with Applications
Donald A. Pierre
4/5 (4)
Dynamic Programming 2
No ratings yet
Dynamic Programming 2
39 pages
Dynamic Programming
No ratings yet
Dynamic Programming
8 pages
Dynamic Programming - Mirage Group
No ratings yet
Dynamic Programming - Mirage Group
37 pages
Optimal Control Theory
No ratings yet
Optimal Control Theory
28 pages
Monte Carlo Optimization Procedure For Chance Constrained Programming - Simulation Study Results
No ratings yet
Monte Carlo Optimization Procedure For Chance Constrained Programming - Simulation Study Results
8 pages
Dynamic Programming (DP)
No ratings yet
Dynamic Programming (DP)
32 pages
Dpp
No ratings yet
Dpp
20 pages
dpp (1)
No ratings yet
dpp (1)
14 pages
Lecture 8 Dynamic Programming
No ratings yet
Lecture 8 Dynamic Programming
32 pages
Monte-Carlo Planning in Large Pomdps
No ratings yet
Monte-Carlo Planning in Large Pomdps
9 pages
Chapter 2
No ratings yet
Chapter 2
50 pages
OR
No ratings yet
OR
34 pages
Dynamic Programming
No ratings yet
Dynamic Programming
10 pages
On Interior-Point Methods, Related Dynamical Systems Results and Cores of Targets For Linear Programming
No ratings yet
On Interior-Point Methods, Related Dynamical Systems Results and Cores of Targets For Linear Programming
11 pages
Nonlinear Programming Based Sliding Mode Control of An Inverted Pendulum
No ratings yet
Nonlinear Programming Based Sliding Mode Control of An Inverted Pendulum
5 pages
Paper 15-Improving The Solution of Traveling Salesman Problem Using Genetic, Memetic Algorithm and Edge Assembly Crossover
No ratings yet
Paper 15-Improving The Solution of Traveling Salesman Problem Using Genetic, Memetic Algorithm and Edge Assembly Crossover
4 pages
Operations Research: Chapter 3 (I)
No ratings yet
Operations Research: Chapter 3 (I)
39 pages
A New Approach in Dynamic Traveling Salesman Problem: A Hybrid of Ant Colony Optimization and Descending Gradient
No ratings yet
A New Approach in Dynamic Traveling Salesman Problem: A Hybrid of Ant Colony Optimization and Descending Gradient
9 pages
Topic10 DTMC LimitingDistribution
No ratings yet
Topic10 DTMC LimitingDistribution
3 pages
Adam S. Charles Nicholas P. Bertrand John Lee Christopher J. Rozell
No ratings yet
Adam S. Charles Nicholas P. Bertrand John Lee Christopher J. Rozell
5 pages
Mathematics Promotional Exam Cheat Sheet
No ratings yet
Mathematics Promotional Exam Cheat Sheet
8 pages
A New Efficient Algorithm For Solving Systems of Multivariate Polynomial Equations
No ratings yet
A New Efficient Algorithm For Solving Systems of Multivariate Polynomial Equations
12 pages
Engineering Design Optimization Husk!: The Conjugate Gradient Algorithm
No ratings yet
Engineering Design Optimization Husk!: The Conjugate Gradient Algorithm
7 pages
Evolutionary Assignment
No ratings yet
Evolutionary Assignment
5 pages
Stochastic Online Opitmization Using Kalman Recursion - Paper
No ratings yet
Stochastic Online Opitmization Using Kalman Recursion - Paper
55 pages
Dynamic Programming 7707
No ratings yet
Dynamic Programming 7707
51 pages
SAHADEB - Categorical - Data - LECTURES - Till Session 6
No ratings yet
SAHADEB - Categorical - Data - LECTURES - Till Session 6
165 pages
Chapter 4 Constrained Optimization: FX XR GX HX U M V PN
No ratings yet
Chapter 4 Constrained Optimization: FX XR GX HX U M V PN
5 pages
Dynamic Programming Treatment of The Travelling Salesman Problem
No ratings yet
Dynamic Programming Treatment of The Travelling Salesman Problem
4 pages
Chap 4 Linear Regression With One Regressor
No ratings yet
Chap 4 Linear Regression With One Regressor
46 pages
Chance Constrained Quadratic Bi-Level Programming Problem: Surapati Pramanik, Durga Banerjee
No ratings yet
Chance Constrained Quadratic Bi-Level Programming Problem: Surapati Pramanik, Durga Banerjee
8 pages
K-Faraz Et Al Sci-Rep (2021) REVISED SupplInfo
No ratings yet
K-Faraz Et Al Sci-Rep (2021) REVISED SupplInfo
17 pages
zhou2013
No ratings yet
zhou2013
11 pages
A Hill Climbing Differential Evolutionary Algorithm For Solving Multiple Travelling Salesman Problem
No ratings yet
A Hill Climbing Differential Evolutionary Algorithm For Solving Multiple Travelling Salesman Problem
4 pages
P1.7 Genetic Algorithms in Geophysical Fluid Dynamics
No ratings yet
P1.7 Genetic Algorithms in Geophysical Fluid Dynamics
7 pages
Pronósticos. KNN
No ratings yet
Pronósticos. KNN
17 pages
An Evolutionary Algorithm For Minimizing Multimodal Functions
No ratings yet
An Evolutionary Algorithm For Minimizing Multimodal Functions
7 pages
Bdqifjwkla
No ratings yet
Bdqifjwkla
2 pages
Kelompok1 Filename 0theperformanceevaluationofvarioustechniquesfortransp-Annotated
No ratings yet
Kelompok1 Filename 0theperformanceevaluationofvarioustechniquesfortransp-Annotated
12 pages
L4_CMMI 1_2020
No ratings yet
L4_CMMI 1_2020
8 pages
Lectures 19 and 20
No ratings yet
Lectures 19 and 20
15 pages
Jahanshahloo2006 (Topsis)
No ratings yet
Jahanshahloo2006 (Topsis)
10 pages
16895-Article Text-20389-1-2-20210518
No ratings yet
16895-Article Text-20389-1-2-20210518
8 pages
Lec29 ImportanceSampling
No ratings yet
Lec29 ImportanceSampling
84 pages
My Project
No ratings yet
My Project
67 pages
DS
No ratings yet
DS
129 pages
CH 2 Linear Programming
No ratings yet
CH 2 Linear Programming
8 pages
Numerical Methods For Advancing Interfaces
No ratings yet
Numerical Methods For Advancing Interfaces
32 pages
On The Use Non-Stationary Penalty Functions T o Solve Nonlinear Constrained Optimization Problems With GA's
No ratings yet
On The Use Non-Stationary Penalty Functions T o Solve Nonlinear Constrained Optimization Problems With GA's
6 pages
Big Data JPM
No ratings yet
Big Data JPM
31 pages
Digital Signal Processing Lab
No ratings yet
Digital Signal Processing Lab
28 pages
5 - Lecture 5 - S-Plane To Z-Plane Mapping & Transfer Function - (2nd Term 2021-2022)
No ratings yet
5 - Lecture 5 - S-Plane To Z-Plane Mapping & Transfer Function - (2nd Term 2021-2022)
12 pages
Modelling of Failure Processes
No ratings yet
Modelling of Failure Processes
34 pages
Csitnepal: UNIT:-1 Principles of Analyzing Algorithms and Problems
No ratings yet
Csitnepal: UNIT:-1 Principles of Analyzing Algorithms and Problems
83 pages
Process Optimisation: Dynamic Programming
No ratings yet
Process Optimisation: Dynamic Programming
35 pages
A Parametric Augmented Lagrangian Algorithm For Real-Time Economic NMPC
No ratings yet
A Parametric Augmented Lagrangian Algorithm For Real-Time Economic NMPC
6 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
IIM7064 Dynamic Programming
No ratings yet
IIM7064 Dynamic Programming
18 pages
IS 657 Information Systems Governance and Risk Management Intro IT Governance
No ratings yet
IS 657 Information Systems Governance and Risk Management Intro IT Governance
59 pages
Some Results of Non-Coprime Graph of The Dihedral Group FOR: A Prime Power
No ratings yet
Some Results of Non-Coprime Graph of The Dihedral Group FOR: A Prime Power
4 pages
(13144081 - Cybernetics and Information Technologies) An Indonesian Hoax News Detection System Using Reader Feedback and Naïve Bayes Algorithm
No ratings yet
(13144081 - Cybernetics and Information Technologies) An Indonesian Hoax News Detection System Using Reader Feedback and Naïve Bayes Algorithm
13 pages
Customer Satisfaction Analysis Based On SERVQUAL Method To Determine Service Level of Academic Information Systems On Higher Education
No ratings yet
Customer Satisfaction Analysis Based On SERVQUAL Method To Determine Service Level of Academic Information Systems On Higher Education
5 pages
Ch. 3 Linear Programming-1
No ratings yet
Ch. 3 Linear Programming-1
22 pages
1046 - IEEE Trans 2023 - Lewis Cited Me
No ratings yet
1046 - IEEE Trans 2023 - Lewis Cited Me
11 pages
ICS QUIZ 2024 On 7th Nov 2024, 07.30PM
No ratings yet
ICS QUIZ 2024 On 7th Nov 2024, 07.30PM
11 pages
IOT_EX-2
No ratings yet
IOT_EX-2
5 pages
Lecture 15 Affine Cipher
No ratings yet
Lecture 15 Affine Cipher
18 pages
ESS Question Paper
No ratings yet
ESS Question Paper
4 pages
MOGALE_1 _ ICT1511-19-S1 _ Online Assessment
No ratings yet
MOGALE_1 _ ICT1511-19-S1 _ Online Assessment
17 pages
312 Lab 4
No ratings yet
312 Lab 4
2 pages
EC 2029 Digital Image Processing May June 2013 Question Paper
No ratings yet
EC 2029 Digital Image Processing May June 2013 Question Paper
2 pages
Data Classification Using Support Vector Machine: Durgesh K. Srivastava, Lekha Bhambhu
No ratings yet
Data Classification Using Support Vector Machine: Durgesh K. Srivastava, Lekha Bhambhu
7 pages
DSP 3
No ratings yet
DSP 3
5 pages
Download Complete Equations of Phase Locked Loops Dynamics on the Circle Torus and Cylinder World Scientific Series on Nonlinear Science Series a World Scientific Series on Nonlinear Science Series a Jacek Kudrewicz PDF for All Chapters
100% (1)
Download Complete Equations of Phase Locked Loops Dynamics on the Circle Torus and Cylinder World Scientific Series on Nonlinear Science Series a World Scientific Series on Nonlinear Science Series a Jacek Kudrewicz PDF for All Chapters
67 pages
12.probability Distributions Binomial
No ratings yet
12.probability Distributions Binomial
3 pages
Uri Kartoun PHD Final 2007
No ratings yet
Uri Kartoun PHD Final 2007
418 pages
Numerical Differentiation and NR Method PDF
No ratings yet
Numerical Differentiation and NR Method PDF
5 pages
Regular Expressions
No ratings yet
Regular Expressions
15 pages
Homework Assignment #2
No ratings yet
Homework Assignment #2
2 pages
Namma Kalvi 12th Computer Science Question Bank em 220030
No ratings yet
Namma Kalvi 12th Computer Science Question Bank em 220030
73 pages
Unit 4 Notes FAI
No ratings yet
Unit 4 Notes FAI
18 pages
Unit2 PDF
No ratings yet
Unit2 PDF
15 pages
Control of Integrating Processes Using Dynamic Matrix Control
No ratings yet
Control of Integrating Processes Using Dynamic Matrix Control
6 pages
Chapter 5 Clustering
No ratings yet
Chapter 5 Clustering
40 pages
T. Y. B. Sc. (Mathematics) (Sem. - V) Examination March - 2023 MTH - 505: Mathematics Graph Theory
No ratings yet
T. Y. B. Sc. (Mathematics) (Sem. - V) Examination March - 2023 MTH - 505: Mathematics Graph Theory
2 pages
Information Retrieval: Unit 4: Web Search - Part 1
No ratings yet
Information Retrieval: Unit 4: Web Search - Part 1
63 pages
Neural Networks
100% (1)
Neural Networks
26 pages
Econometrics For MPM, LNotes 2
No ratings yet
Econometrics For MPM, LNotes 2
45 pages
2022 1 Linear Programming
No ratings yet
2022 1 Linear Programming
8 pages
05 Divide and Conquer I
No ratings yet
05 Divide and Conquer I
82 pages
Ai Data
No ratings yet
Ai Data
8 pages
Canny Edge Detection Step by Step in Python - Computer Vision - by Sofiane Sahir - Towards Data Science
No ratings yet
Canny Edge Detection Step by Step in Python - Computer Vision - by Sofiane Sahir - Towards Data Science
24 pages