
Uploaded by

lahlou khalid

Mila University Centre

M1: I2A
Uncertain decision
Work 2 (reinforcement learning)
1. Definition
Gym is an open-source project created by OpenAI for reinforcement learning experiments.

2. Install OpenAI Gym

- pip install gym
- pip install "gym[toy_text]"

3. The Frozen Lake Environment

Frozen Lake involves crossing a frozen lake from Start (S) to Goal (G) by walking over Frozen (F)
fields without falling into any Hole (H); see the figure below.

 0    1    2    3
 S    F    F    F
 4    5    6    7
 F    H    F    H
 8    9   10   11
 F    F    F    H
12   13   14   15
 H    F    F    G
Figure: Frozen Lake environment (the state number is shown above each field)
import gym
# Create the Frozen Lake environment; render_mode="human" draws it in a window.
env = gym.make("FrozenLake-v1", render_mode="human")
env.reset()   # put the environment in its initial state
env.render()  # draw the current state (for console text, use render_mode="ansi" and print(env.render()))
3.1 State space
The environment consists of 16 fields arranged in a 4-by-4 grid. The states are numbered from 0 to 15
(see the figure above). There are four types of fields: the start field (S), frozen fields (F), holes (H),
and the goal field (G). An episode ends when the agent steps on a hole or reaches the goal field.
env.observation_space   # Discrete(16)
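
The numbering of the states can be illustrated without Gym. The sketch below copies the map from the figure and converts a state number into a (row, column) position; the names LAKE_MAP, state_to_cell, and tile_at are hypothetical helpers introduced for this example, not part of Gym.

```python
# The 4x4 map from the figure, one string per row (an illustration, not Gym code).
LAKE_MAP = ["SFFF", "FHFH", "FFFH", "HFFG"]
N_COLS = 4


def state_to_cell(state):
    """Convert a state number (0..15) into a (row, column) pair."""
    return divmod(state, N_COLS)


def tile_at(state):
    """Return the field type ('S', 'F', 'H' or 'G') of a state."""
    row, col = state_to_cell(state)
    return LAKE_MAP[row][col]


def is_terminal(state):
    """An episode ends on a hole (H) or on the goal (G)."""
    return tile_at(state) in "HG"


holes = [s for s in range(16) if tile_at(s) == "H"]
print(holes)  # [5, 7, 11, 12]
```

States are numbered row by row, so state 6 is in row 1, column 2 (counting from 0).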

3.2 Action space

env.action_space   # Discrete(4)
There are 4 possible actions: left (0), down (1), right (2), up (3).
To take a random action, we use:
random_action = env.action_space.sample()
env.step(random_action)
This function returns a tuple of five values, for example:
(1, 0.0, False, False, {'prob': 0.3333333333333333})
1: the new state; 2: the reward; 3: a Boolean (terminated) that is True when the agent reaches the goal
or falls into a hole; 4: a Boolean (truncated) that is True when the episode is cut off by the step limit;
5: a dictionary of extra information.
The 'prob' entry is the probability of the transition that actually occurred. The agent does not always
move in the intended direction, because the frozen lake is slippery.
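
The reset/step calls described above can be chained into a loop. The following is a minimal sketch, assuming a Gym version in which env.reset() returns (state, info) and env.step() returns the five values listed above; run_random_episode is a hypothetical helper name, not a Gym function.

```python
def run_random_episode(env, max_steps=100):
    """Play one episode with uniformly random actions.

    Returns (total_reward, visited_states).
    """
    state, info = env.reset()
    visited = [state]
    total_reward = 0.0
    for _ in range(max_steps):
        action = env.action_space.sample()
        state, reward, terminated, truncated, info = env.step(action)
        visited.append(state)
        total_reward += reward
        # Stop on a hole or the goal (terminated) or on the step limit (truncated).
        if terminated or truncated:
            break
    return total_reward, visited


# Example use (requires gym to be installed):
#   import gym
#   env = gym.make("FrozenLake-v1")
#   print(run_random_episode(env))
#   env.close()
```

The function only relies on reset, step, and action_space.sample, so it should work with any environment that follows the same interface.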

3.3 Probability transition

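env.P[s][a]   (or env.unwrapped.P[s][a], which bypasses the wrappers added by gym.make)
This gives the probabilities of reaching each state adjacent to s, or of staying in s, when taking action a.
Each entry is a tuple (probability, next state, reward, terminated). For example, env.P[0][1] (action
down from the start state) is:
[(0.3333333333333333, 0, 0.0, False), (0.3333333333333333, 4, 0.0, False), (0.3333333333333333, 1, 0.0, False)]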
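
To see where these numbers come from, the table can be rebuilt by hand. This is only a sketch, assuming the usual slippery rule (the agent moves in the intended direction or in one of the two perpendicular directions, each with probability 1/3) and a reward of 1 for reaching the goal; move and transitions are hypothetical helper names, not Gym functions.

```python
LAKE_MAP = ["SFFF", "FHFH", "FFFH", "HFFG"]
N = 4
LEFT, DOWN, RIGHT, UP = 0, 1, 2, 3
MOVES = {LEFT: (0, -1), DOWN: (1, 0), RIGHT: (0, 1), UP: (-1, 0)}


def move(state, direction):
    """Next state when actually moving in `direction`; walls keep the agent in place."""
    row, col = divmod(state, N)
    d_row, d_col = MOVES[direction]
    new_row = min(max(row + d_row, 0), N - 1)
    new_col = min(max(col + d_col, 0), N - 1)
    return new_row * N + new_col


def transitions(state, action):
    """Return [(probability, next_state, reward, terminated)] for one (state, action)."""
    row, col = divmod(state, N)
    if LAKE_MAP[row][col] in "HG":
        # Holes and the goal are absorbing: nothing happens any more.
        return [(1.0, state, 0.0, True)]
    result = []
    # Assumed slippery rule: intended direction or one of its two neighbours.
    for direction in ((action - 1) % 4, action, (action + 1) % 4):
        nxt = move(state, direction)
        tile = LAKE_MAP[nxt // N][nxt % N]
        result.append((1 / 3, nxt, float(tile == "G"), tile in "HG"))
    return result


P = {s: {a: transitions(s, a) for a in range(4)} for s in range(16)}
print(P[0][DOWN])
```

Under these assumptions, P[0][DOWN] lists the next states 0, 4, and 1, matching the entry shown above.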
3.4 Leaving the environment
env.close()   # release the resources used by the environment

4. Questions
1. Simulate an episode.
2. Implement an algorithm that determines the optimal policy for reaching the goal.
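
One possible approach to the second question is value iteration, which works directly on a transition table shaped like env.P. The sketch below is an assumption about how it could be done, not the expected solution; q_value and value_iteration are hypothetical names, and the two-state table toy_P is invented purely to check the code.

```python
def q_value(P, V, s, a, gamma):
    """Expected return of taking action a in state s, then following values V."""
    return sum(p * (r + (0.0 if done else gamma * V[s2]))
               for p, s2, r, done in P[s][a])


def value_iteration(P, n_states, n_actions, gamma=0.99, tol=1e-8):
    """Return (state values, greedy policy) for a table P[s][a] of
    (probability, next_state, reward, terminated) tuples."""
    V = [0.0] * n_states
    while True:
        delta = 0.0
        for s in range(n_states):
            best = max(q_value(P, V, s, a, gamma) for a in range(n_actions))
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            break
    policy = [max(range(n_actions), key=lambda a: q_value(P, V, s, a, gamma))
              for s in range(n_states)]
    return V, policy


# Invented two-state example: in state 0, action 0 stays put, action 1 reaches
# the terminal state 1 with reward 1.
toy_P = {
    0: {0: [(1.0, 0, 0.0, False)], 1: [(1.0, 1, 1.0, True)]},
    1: {0: [(1.0, 1, 0.0, True)], 1: [(1.0, 1, 0.0, True)]},
}
values, policy = value_iteration(toy_P, n_states=2, n_actions=2, gamma=0.9)
print(values, policy)
```

For Frozen Lake, the same function could be called with P = env.unwrapped.P, n_states = env.observation_space.n, and n_actions = env.action_space.n.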
