Lecture 05: Adversarial Search
ADVERSARIAL SEARCH
The concept of games in AI
Search in multiagent environments
• Each agent needs to consider the actions of other agents
and how they affect its own welfare.
• The unpredictability of other agents introduces contingencies
into the agent’s problem-solving process.
Game theory
• Game theory views any multiagent environment as a game.
• The impact of each agent on the others is “significant,” regardless of
whether the agents are cooperative or competitive.
• Types of games:

                          Deterministic                   Chance
  Perfect information     Chess, Checkers, Go, Othello    Backgammon, Monopoly
  Imperfect information                                   Bridge, poker, Scrabble,
                                                          nuclear war
Adversarial search
• Adversarial search (commonly known as games) covers competitive
environments in which the agents’ goals are in conflict.
• Zero-sum games of perfect information
• Deterministic, fully observable environments, turn-taking, two-player
• The utility values at the end are always equal and opposite.
Games vs. Search problems
• Complexity: games are usually too hard to solve exactly
• Chess: b ≈ 35, d ≈ 100 (50 moves per player) → a game graph of ~10^40
distinct nodes, but a search tree of 35^100, or ~10^154, nodes
• Go: b up to 361 (!)
• Time limits: some decision must be made even when computing the
optimal decision is infeasible
• Efficiency: games penalize inefficiency severely
• Several interesting ideas on how to make the best possible use of
time have been spawned by game-playing research.
Primary assumptions
• Two players only, called MAX and MIN.
• MAX moves first, and then they take turns moving until the game ends
• Winner gets reward, loser gets penalty.
• Both players have complete knowledge of the game’s state
• E.g., chess, checkers, Go, etc. Counterexample: poker
• No element of chance
• No dice thrown, no cards drawn, etc.
• Zero-sum games
• The total payoff to all players is the same for every game instance.
• Rational players
• Each player always tries to maximize his/her utility
Games as search
• 𝑆0 – Initial state: How the game is set up at the start
• E.g., board configuration of chess
• 𝑃𝐿𝐴𝑌𝐸𝑅(𝑠): Which player has the move in a state, MAX/MIN?
• 𝐴𝐶𝑇𝐼𝑂𝑁𝑆(𝑠): The set of legal moves available in state 𝑠
• 𝑅𝐸𝑆𝑈𝐿𝑇(𝑠, 𝑎) – Transition model: Result of move 𝑎 on state 𝑠
• 𝑇𝐸𝑅𝑀𝐼𝑁𝐴𝐿 − 𝑇𝐸𝑆𝑇(𝑠): Is the game finished?
• States where the game has ended are called terminal states
• 𝑈𝑇𝐼𝐿𝐼𝑇𝑌 (𝑠, 𝑝) – Utility function: A numerical value of a terminal
state 𝑠 for a player 𝑝
• E.g., chess: win (+1), lose (-1) and draw (0), backgammon: [0, 192]
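
A minimal sketch of this formalization as a Python interface (the method
names are illustrative, not from any particular library):

  import abc

  class Game(abc.ABC):
      # S0: how the game is set up at the start
      @abc.abstractmethod
      def initial_state(self): ...

      # PLAYER(s): which player has the move in state s (MAX or MIN)
      @abc.abstractmethod
      def player(self, s): ...

      # ACTIONS(s): the set of legal moves in state s
      @abc.abstractmethod
      def actions(self, s): ...

      # RESULT(s, a): transition model, the state reached by move a in s
      @abc.abstractmethod
      def result(self, s, a): ...

      # TERMINAL-TEST(s): true when the game has ended in state s
      @abc.abstractmethod
      def terminal_test(self, s): ...

      # UTILITY(s, p): numeric value of terminal state s for player p
      @abc.abstractmethod
      def utility(self, s, p): ...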
The game tree of Tic-Tac-Toe

Examples of game: Checkers
• Complexity
• ~10^18 nodes, which may require 100k years at 10^6 positions/sec
• Chinook (1989-2007)
• The first computer program that won the world champion title in a
competition against humans
• 1992: won 2 games in a match against world champion Marion Tinsley (final
score: 2-4, with 33 draws). 1994: 6 draws
• Chinook’s search
• Ran on regular PCs and played perfectly by using alpha-beta search
combined with a database of 39 trillion endgame positions
Examples of game: Chess
• Complexity
• b ≈ 35, d ≈ 100 → ~10^154 nodes (!!)
• Completely impractical to search this
• Deep Blue (May 11, 1997)
• Kasparov lost a 6-game match against IBM’s Deep Blue (1 win for
Kasparov, 2 wins for Deep Blue, and 3 draws).
• In the future, the focus will be on allowing computers to LEARN to
play chess rather than being TOLD how to play
Deep Blue
• Ran on a parallel computer with 30 IBM RS/6000
processors doing alpha–beta search
• Searched up to 30 billion positions per move, average depth 14 plies
(able to reach up to 40 plies)
• Evaluation function: 8000 features, many describing highly specific
patterns of pieces
• Opening book of ~4000 positions, plus a database of 700,000
grandmaster games
• Working at 200 million positions/sec, even Deep Blue
would require 10^100 years to evaluate all possible games.
• (The universe is only ~10^10 years old.)
Go
1 million trillion trillion trillion trillion more configurations than chess!
• Complexity
• Board of 19x19, b ≈ 361, average game depth d ≈ 200
• ~10^174 possible board configurations.
• Control of territory is unpredictable until the endgame
• AlphaGo (2016) by Google
• Beat 9-dan professional Lee Sedol (4-1)
• Machine learning + Monte Carlo tree search guided by a “value network”
and a “policy network” (implemented using deep neural networks)
• Learned from human games + learned by itself (self-play games)
An overview of AlphaGo
Optimal decisions in games
• Minimax algorithm
• Optimal decisions in multiplayer games
Optimal decisions in games
• Normal search problem
• The optimal solution is a sequence of actions leading to a goal state.
• Games
• The optimal strategy is a contingent plan that guarantees the best
achievable outcome for the player.
• It can be determined from the minimax value of each node:

MINIMAX(s) =
  UTILITY(s)                                   if TERMINAL-TEST(s)
  max_{a ∈ ACTIONS(s)} MINIMAX(RESULT(s, a))   if PLAYER(s) = MAX
  min_{a ∈ ACTIONS(s)} MINIMAX(RESULT(s, a))   if PLAYER(s) = MIN

(assuming that both players play optimally from there to the end of the game)
An example of a two-ply game tree
Minimax algorithm
• Make a minimax decision from the current state, using a
recursive computation of minimax values at each successor
• The recursion proceeds all the way down to the leaves, and the minimax
values are then backed up through the tree as the recursion unwinds.
Minimax algorithm
function MINIMAX-DECISION(state) returns an action
  return argmax_{a ∈ ACTIONS(state)} MIN-VALUE(RESULT(state, a))

function MAX-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← -∞
  for each a in ACTIONS(state) do
    v ← MAX(v, MIN-VALUE(RESULT(state, a)))
  return v

function MIN-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← +∞
  for each a in ACTIONS(state) do
    v ← MIN(v, MAX-VALUE(RESULT(state, a)))
  return v
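
The same algorithm as a minimal Python sketch, assuming a game object with
the interface sketched earlier and, for simplicity, a utility(s) that
returns the terminal value from MAX’s viewpoint:

  import math

  def minimax_decision(state, game):
      # Choose the action whose successor has the highest MIN-VALUE
      return max(game.actions(state),
                 key=lambda a: min_value(game.result(state, a), game))

  def max_value(state, game):
      if game.terminal_test(state):
          return game.utility(state)   # terminal value from MAX's viewpoint
      v = -math.inf
      for a in game.actions(state):
          v = max(v, min_value(game.result(state, a), game))
      return v

  def min_value(state, game):
      if game.terminal_test(state):
          return game.utility(state)
      v = math.inf
      for a in game.actions(state):
          v = min(v, max_value(game.result(state, a), game))
      return v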
Properties of Minimax algorithm
• A complete depth-first exploration of the game tree
• Completeness
• Yes (if the tree is finite)
• Optimality
• Yes (against an optimal opponent)
• Time complexity
• O(b^m)
• Space complexity
• O(bm) (depth-first exploration)
Note: b = the number of legal moves at each point; m = the maximum depth of the tree
Optimality in multiplayer games
• A single value is replaced with a vector of values.
→ the UTILITY function returns a vector of utilities
• For terminal states, this vector gives the utility of the state
from each player’s viewpoint.
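
As a sketch, the backup rule becomes: whichever player moves in a state
picks the successor whose utility vector is best in that player’s own
component (utility_vector and the player-as-index convention are
illustrative assumptions):

  def multi_value(state, game):
      # Returns the backed-up utility VECTOR, e.g. (v_A, v_B, v_C)
      if game.terminal_test(state):
          return game.utility_vector(state)
      p = game.player(state)               # index of the player to move
      return max((multi_value(game.result(state, a), game)
                  for a in game.actions(state)),
                 key=lambda vec: vec[p])   # maximize own component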
Optimality in multiplayer games
• Multiplayer games usually involve alliances, which are made
and broken as the game proceeds.
Alpha-beta pruning
Problem with minimax search
• The number of game states is exponential in the tree’s depth
→ Do not examine every node
• Alpha-beta pruning: Prune away branches that cannot
possibly influence the final decision
• Bounded lookahead
• Limit the search depth
• This is what human chess players do: look ahead a few moves and pick
what looks best
Alpha-beta pruning: An example
Another way to look at this is as a simplification of the formula for MINIMAX.
Let the two unevaluated successors of node 𝐶 have values 𝑥 and 𝑦.
Then the value of the root node is given by

MINIMAX(root) = max(min(3, 12, 8), min(2, x, y), min(14, 5, 2))
              = max(3, min(2, x, y), 2)
              = max(3, z, 2)   where z = min(2, x, y) ≤ 2
              = 3

i.e., the value of the root is independent of the pruned leaves x and y.
Alpha-beta pruning
• If a move 𝑛 is determined to be worse than a move 𝑚 that has already
been examined, then examining 𝑛 any further is pointless: it can be pruned.
𝜶 = the value of the best (i.e., highest-value) choice we have found so far
at any choice point along the path for MAX.
β = the value of the best (i.e., lowest-value) choice we have found so far
at any choice point along the path for MIN.
Alpha-beta search algorithm
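
A Python sketch of the search, under the same assumptions as the minimax
sketch; α and β are passed down the tree and tightened as better choices
are found:

  import math

  def alpha_beta_decision(state, game):
      best_action, best_value = None, -math.inf
      for a in game.actions(state):
          v = min_value(game.result(state, a), game, -math.inf, math.inf)
          if v > best_value:
              best_action, best_value = a, v
      return best_action

  def max_value(state, game, alpha, beta):
      if game.terminal_test(state):
          return game.utility(state)
      v = -math.inf
      for a in game.actions(state):
          v = max(v, min_value(game.result(state, a), game, alpha, beta))
          if v >= beta:            # MIN above would never allow this line
              return v             # beta cutoff: prune the remaining moves
          alpha = max(alpha, v)
      return v

  def min_value(state, game, alpha, beta):
      if game.terminal_test(state):
          return game.utility(state)
      v = math.inf
      for a in game.actions(state):
          v = min(v, max_value(game.result(state, a), game, alpha, beta))
          if v <= alpha:           # MAX above already has a better choice
              return v             # alpha cutoff
          beta = min(beta, v)
      return v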
Properties of alpha-beta pruning
• Pruning does not affect the final result
• In the worst case it is no worse than plain minimax (no pruning occurs)
• Good move ordering improves effectiveness of pruning
• With "perfect ordering“: time complexity 𝑂(𝑏 𝑚/2 ) → x2 search depth
• The effective branching factor becomes 𝑏 instead of 𝑏.
• E.g., for chess, about 6 instead of 35.
Imperfect real-time decisions
• Evaluation functions
• Cutting off search
• Forward pruning
• Search versus Lookup
Heuristic minimax
• Both minimax and alpha-beta pruning search all the way to
terminal states.
• Searching that deep is usually impractical, because moves must be made
in a reasonable amount of time (~minutes).
• Cut off the search earlier with some depth limit
• Use an evaluation function
• An estimate of the desirability of a position (win, lose, or draw?)
Evaluation functions
• The evaluation function should order the terminal states in
the same way as the true utility function does
• States that are wins must evaluate better than draws, which in turn
must be better than losses.
• The computation must not take too long!
• For nonterminal states, the ordering should be strongly
correlated with the actual chances of winning.
Evaluation functions
• For chess, typically linear weighted sum of features
Eval(s) = w1·f1(s) + w2·f2(s) + … + wn·fn(s)
• where 𝑓𝑖 could be the numbers of each kind of piece on the board,
and 𝑤𝑖 could be the values of the pieces
• E.g., 𝐸𝑣𝑎𝑙(𝑠) = 9𝑞 + 5𝑟 + 3𝑏 + 3𝑛 + 𝑝
• Implicit strong assumption: the contribution of each feature is
independent of the values of the other features.
• E.g., assigning the value 3 to a bishop ignores the fact that bishops are
more powerful in the endgame → use a nonlinear combination
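
A sketch of the linear, material-only evaluation above in Python;
count(s, piece) is a hypothetical helper returning (number of MAX’s pieces
minus number of MIN’s pieces) of the given kind:

  # Piece weights w_i: queen 9, rook 5, bishop 3, knight 3, pawn 1
  WEIGHTS = {'q': 9, 'r': 5, 'b': 3, 'n': 3, 'p': 1}

  def eval_material(s, count):
      # Eval(s) = w1*f1(s) + ... + wn*fn(s), with f_i a material difference
      return sum(w * count(s, piece) for piece, w in WEIGHTS.items())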
Cutting off search
• MINIMAX-CUTOFF is identical to MINIMAX-VALUE except that
1. 𝑇𝑒𝑟𝑚𝑖𝑛𝑎𝑙? is replaced by 𝐶𝑢𝑡𝑜𝑓𝑓?
2. 𝑈𝑡𝑖𝑙𝑖𝑡𝑦 is replaced by 𝐸𝑣𝑎𝑙
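
A sketch of the resulting heuristic minimax in Python (same assumptions as
before, plus a hypothetical game.eval heuristic):

  def h_minimax(state, game, depth, limit, maximizing):
      # CUTOFF? replaces TERMINAL-TEST: stop at terminals or at the limit
      if game.terminal_test(state) or depth >= limit:
          return game.eval(state)      # EVAL replaces UTILITY
      values = [h_minimax(game.result(state, a), game,
                          depth + 1, limit, not maximizing)
                for a in game.actions(state)]
      return max(values) if maximizing else min(values)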
A more sophisticated cutoff test
• Quiescent positions are those unlikely to exhibit wild swings
in value in the near future.
• E.g., in chess, positions in which favorable captures can be made
are not quiescent for an evaluation function counting material only
• Quiescence search: expand nonquiescent positions until
quiescent positions are reached.
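
A sketch of how the cutoff test can defer to quiescence search;
has_pending_capture and game.capture_moves are hypothetical, chess-flavored
names, and real implementations also bound this extra search:

  def cutoff_value(state, game, maximizing):
      if not has_pending_capture(state):    # quiescent: safe to apply EVAL
          return game.eval(state)
      # Nonquiescent: expand only the "noisy" moves (captures) and back up
      values = [cutoff_value(game.result(state, a), game, not maximizing)
                for a in game.capture_moves(state)]
      if not values:
          return game.eval(state)
      return max(values) if maximizing else min(values)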
Quiescent positions: An example
Two chess positions that differ only in the position of the rook at lower right.
In (a), Black has an advantage of a knight and two pawns, which should be
enough to win the game. In (b), White will capture the queen, giving it an
advantage that should be strong enough to win.
A more sophisticated cutoff test
• Horizon effect: The program is facing an inevitable serious loss and
temporarily avoids it by delaying tactics, pushing it beyond the search horizon.
A more sophisticated cutoff test
• Singular extension: a move that is “clearly better” than all
other moves in a given position.
• The search is extended along a legal singular-extension move → a deeper
search tree, yet still cheap, since only a few moves qualify as singular.
• Beam search
• Forward pruning: consider only a “beam” of the 𝑛 best moves
• Most humans consider only a few moves from each position
• PROBCUT, or probabilistic cut, algorithm (Buro, 1995)
• Search vs. Lookup
• Use table lookup rather than search for the opening and the endgame
Stochastic games
Stochastic behaviors
• Uncertain outcomes controlled by chance, not an adversary!
• Why wouldn’t we know what the result of an action will be?
• Explicit randomness: rolling dice
• Unpredictable opponents: the ghosts respond randomly
• Actions can fail: when a robot is moving, wheels might slip
Expectimax search
• Values reflect the average-case (expectimax) outcomes, not
worst-case (minimax) outcomes
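
A minimal expectimax sketch in Python with alternating MAX and chance
nodes; game.chance_outcomes(s), yielding (probability, state) pairs, is an
assumed helper:

  def emax_value(state, game):
      if game.terminal_test(state):
          return game.utility(state)
      return max(chance_value(game.result(state, a), game)
                 for a in game.actions(state))

  def chance_value(state, game):
      if game.terminal_test(state):
          return game.utility(state)
      # Probability-weighted average instead of a worst-case min
      return sum(p * emax_value(s2, game)
                 for p, s2 in game.chance_outcomes(state))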
Expectimax pruning
• Pruning is only possible with knowledge of a fixed, bounded range of leaf values.
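
A sketch of why such bounds help: if every leaf value lies in [lo, hi], a
chance node can be abandoned as soon as even the most optimistic completion
cannot beat MAX’s best alternative α (names here are illustrative):

  def chance_value_with_pruning(outcomes, value_of, alpha, hi):
      # outcomes: (probability, child) pairs; all values known to be <= hi
      total, remaining = 0.0, 1.0
      for p, child in outcomes:
          total += p * value_of(child)
          remaining -= p
          optimistic = total + remaining * hi   # best this node could still be
          if optimistic <= alpha:
              return optimistic                 # prune the remaining children
      return total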
Depth-limited expectimax
THE END