PPT Lecture 1 & 2
Definition: A computer program is said to learn from experience E with respect to some class of tasks T and
performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.
1. The first design choice we face is to choose the type of training experience from which our system
will learn.
One key attribute is whether the training experience provides direct or indirect feedback regarding
the choices made by the performance system.
With indirect feedback, the learner faces an additional problem of credit assignment: determining the degree
to which each move in the sequence deserves credit or blame for the final outcome.
2. A second important attribute of the training experience is the degree to which the learner controls the
sequence of training examples. The learner might rely on the teacher to select informative board
states and to provide the correct move for each. Alternatively, the learner might itself propose board
states that it finds particularly confusing and ask the teacher for the correct move. Or the learner
may have complete control over both the board states and (indirect) training classifications, as it does
when it learns by playing against itself with no teacher.
3. A third important attribute of the training experience is how well it represents the distribution of
examples over which the final system performance P must be measured. In general, learning is most
reliable when the training examples follow a distribution similar to that of future test examples.
In our checkers learning scenario, the performance metric P is the percent of games the system wins
in the world tournament. If its training experience E consists only of games played against itself,
there is an obvious danger that this training experience might not be fully representative of the
distribution of situations over which it will later be tested.
In order to complete the design of the learning system, we must now choose the target function to be learned.
Let us call this target function V and use the notation V : B → R to denote that V maps
any legal board state from the set B to some real value (we use R to denote the set
of real numbers).
What exactly should be the value of the target function V for any given board state? Of course any
evaluation function that assigns higher scores to better board states will do.
Let us therefore define the target value V(b) for an arbitrary board state b in B, as follows:
1. if b is a final board state that is won, then V(b) = 100
2. if b is a final board state that is lost, then V(b) = -100
3. if b is a final board state that is drawn, then V(b) = 0
4. if b is not a final state in the game, then V(b) = V(b'), where b' is the best final board state that can be
achieved starting from b and playing optimally until the end of the game (assuming the opponent also plays optimally).
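The ideal target function V in Mitchell's checkers example scores won final states +100, lost final states -100, and draws 0, and gives a non-final state the value of the best final state reachable under optimal play by both sides. This can be sketched as minimax over final-state values; the helpers `is_final`, `outcome`, and `moves` below are hypothetical stand-ins, not part of the original text:

```python
def V(board, to_move, is_final, outcome, moves):
    """Ideal target value V(b); to_move is +1 for black, -1 for red."""
    if is_final(board):
        # Final states are scored directly from the game outcome.
        return {"win": 100, "loss": -100, "draw": 0}[outcome(board)]
    values = [V(nxt, -to_move, is_final, outcome, moves)
              for nxt in moves(board, to_move)]
    # Black (the learner) maximizes; the opponent minimizes.
    return max(values) if to_move == 1 else min(values)

# Toy two-move game: from "root", black can reach final state "a" (a win)
# or "b" (a draw), so the optimal value of "root" is 100.
is_final = lambda b: b in ("a", "b")
outcome  = lambda b: "win" if b == "a" else "draw"
moves    = lambda b, player: ["a", "b"]
print(V("root", 1, is_final, outcome, moves))  # 100
```

Note that this definition is nonoperational: evaluating it exactly requires searching to the end of the game, which is precisely why the program learns an approximation instead.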
In the current discussion we will use the symbol V̂ (read "V hat") to refer to the function that is actually learned by
our program, to distinguish it from the ideal target function V.
Now that we have specified the ideal target function V, we must choose a representation that the learning
program will use to describe the function V̂ that it will learn. As with earlier design choices, we
again have many options. We could, for example, allow the program to represent V̂ using a large table with
a distinct entry specifying the value for each distinct board state. Or we could allow it to represent V̂ using a
collection of rules that match against features of the board state, or a quadratic polynomial function of
predefined board features, or an artificial neural network. In general, this choice of representation
involves a crucial tradeoff. On one hand, we wish to pick a very expressive representation to allow
representing as close an approximation as possible to the ideal target function V.
On the other hand, the more expressive the representation, the more training data the program will require
in order to choose among the alternative hypotheses it can represent. To keep the discussion brief, let us
choose a simple representation:
for any given board state, the function V̂ will be calculated as a linear combination
of the following board features:
x1: the number of black pieces on the board
x2: the number of red pieces on the board
x3: the number of black kings on the board
x4: the number of red kings on the board
x5: the number of black pieces threatened by red (i.e., which can be captured on red's next turn)
x6: the number of red pieces threatened by black
Thus, our learning program will represent V̂(b) as a linear function of the form

V̂(b) = w0 + w1*x1 + w2*x2 + w3*x3 + w4*x4 + w5*x5 + w6*x6

where w0 through w6 are numerical coefficients, or weights, to be chosen by the learning algorithm. Learned values for the weights
w1 through w6 will determine the relative importance of the various board features in determining the
value of the board, whereas the weight w0 will provide an additive constant to the board value.
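The linear evaluation function above is straightforward to compute once the board features are known. A minimal sketch, where the weight and feature values are made-up illustration numbers (a real system would extract x1..x6 from an actual board state and learn the weights):

```python
def v_hat(weights, features):
    """Linear combination V̂(b) = w0 + w1*x1 + ... + w6*x6.

    weights[0] is the additive constant w0; weights[1:] pair with the
    board features x1..x6 in order.
    """
    w0, ws = weights[0], weights[1:]
    return w0 + sum(w * x for w, x in zip(ws, features))

# Arbitrary example values, NOT learned weights or a real board:
weights  = [0.5, 1.0, -1.0, 2.0, -2.0, -0.5, 0.5]   # w0..w6
features = [12, 12, 0, 0, 1, 1]                      # x1..x6

print(v_hat(weights, features))  # 0.5 + 12 - 12 + 0 - 0 - 0.5 + 0.5 = 0.5
```

During learning, only the seven numbers in `weights` change; the form of V̂ stays fixed, which is what makes this a very restricted (but easily trainable) hypothesis space.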