Reinforcement Learning
• A supervised learning agent needs to be told the correct move for each position it encounters, but such feedback is seldom available.
• What can the agent do in the absence of feedback from a teacher?
• Without some feedback about what is good and what is bad, the agent has no grounds for deciding which move to make.
• The agent needs to know that something good has happened when it (accidentally) checkmates the opponent, and that something bad has happened when it is checkmated, not vice versa.
• This kind of feedback is called a reward,
or reinforcement.
• In games like chess, the reinforcement is
received only at the end of the game. In other
environments, the rewards come more
frequently.
• In ping-pong, each point scored can be considered a reward.
• Our framework for agents regards the reward
as part of the input percept, but the agent
must be “hardwired” to recognize that part as
a reward rather than as just another sensory
input.
• The underlying framework is the Markov decision process (MDP): states, actions, a transition model, and rewards.
• An optimal policy is a policy that maximizes
the expected total reward.
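The notion of a policy that maximizes expected total reward can be made concrete with value iteration on a toy MDP. The two states, the "stay"/"move" actions, the rewards, and the discount factor below are all invented for illustration; this is a sketch, not a prescribed algorithm from the slides.

```python
# Value iteration on a hypothetical 2-state MDP.
# P[s][a] is a list of (probability, next_state, reward) triples.
P = {
    0: {"stay": [(1.0, 0, 0.0)], "move": [(0.9, 1, 1.0), (0.1, 0, 0.0)]},
    1: {"stay": [(1.0, 1, 2.0)], "move": [(1.0, 0, 0.0)]},
}
gamma = 0.9  # discount factor for future rewards

V = {0: 0.0, 1: 0.0}
for _ in range(200):  # repeated Bellman backups; converges geometrically
    V = {
        s: max(
            sum(p * (r + gamma * V[s2]) for p, s2, r in outcomes)
            for outcomes in P[s].values()
        )
        for s in P
    }

# The optimal policy picks, in each state, the action whose
# expected discounted return is largest.
policy = {
    s: max(
        P[s],
        key=lambda a: sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a]),
    )
    for s in P
}
print(policy)
```

Here the agent learns to move toward state 1 and then stay, since staying in state 1 yields reward 2 forever.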
• The task of reinforcement learning is to use
observed rewards to learn an optimal (or
nearly optimal) policy for the environment.
• Imagine playing a new game whose rules
you don’t know; after a hundred or so moves,
your opponent announces, “You lose.”
• This is what reinforcement learning is like: from sparse, delayed feedback, the agent must work out which of its earlier moves were responsible.
• In many complex domains, reinforcement
learning is the only feasible way to train a
program to perform at high levels.
• For example, in game playing, it is very
hard for a human to provide accurate and
consistent evaluations of large numbers of
positions, which would be needed to train
an evaluation function directly from
examples.
• Instead, the program can be told when it has won or lost, and it can use this information to learn an evaluation function that gives reasonably accurate estimates of the probability of winning from any given position.
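This win/loss-only learning can be sketched with Monte Carlo estimation on an invented mini-game (a race to a 5-point lead under random play); the game, its positions, and the sample counts are all hypothetical.

```python
import random

# "Positions" are current score differences; the only feedback per game
# is the final outcome (1.0 = win, 0.0 = loss).
random.seed(1)
values = {}  # position -> running estimate of P(win from here)
counts = {}  # position -> number of observed visits

def play_random_game():
    """Play one random game; return the positions visited and the outcome."""
    diff, visited = 0, []
    while abs(diff) < 5:
        visited.append(diff)
        diff += random.choice([-1, 1])  # each point goes to either player
    return visited, 1.0 if diff > 0 else 0.0

for _ in range(20000):
    visited, outcome = play_random_game()
    for pos in visited:
        # Incremental mean: nudge the estimate toward the observed outcome.
        counts[pos] = counts.get(pos, 0) + 1
        v = values.get(pos, 0.0)
        values[pos] = v + (outcome - v) / counts[pos]

# Under symmetric random play, P(win) from score difference d is (d + 5) / 10,
# so values[0] should approach 0.5 and values[3] should approach 0.8.
print(round(values[0], 2), round(values[3], 2))
```

From nothing but final outcomes, the program recovers a reasonably accurate evaluation function over positions, exactly the idea described above.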
• It is extremely difficult to program an agent
to fly a helicopter; yet given appropriate
negative rewards for crashing, wobbling, or
deviating from a set course, an agent can
learn to fly by itself.