Note 2
Note 2
Game Theory
PG
Email: [email protected]
(3) A set of payoff functions for each player that give a payoff
Strategy:
Pure strategy:
s=s 1 , s 2 , s 3 , .. . , s n
Strategy profile: A strategy profile is a vector of strategies, one for
Example
Let you have an examination in three hours and you must decide
how long to study for the exam and how long to just relax knowing
S = {1,2,….3,12}
i
Where, si ∈ S i
determines how many 15 units you will spend
{ }
S1 S 2 S 3 S 4 . .. .. ..... .. .... S7 . .. S 12
Si = 1 2 3 4 . .. .. ..... .. ... 7 . .. 12
↓ ↓ ↓ ↓ . .. .. ..... .. ..... ↓ . .. ↓
studying for the exam. 15 30 45 60 . .. .. ..... .. ...... 105 . .. 180
For example, if you chose si =7 then you will spend 1 hour and 45
Cartesian Product:
A={a,b,c,d}
B= {e,f,g,h}
(b,e),….,(d,h)}
by the agents.
⟨ N , {S i }ni=1 , {v i (.)}ni=1 ⟩
The normal form game will be a triple of sets:
N= Set of players
n
{ v i (.) }i=1= Set of all players’ payoff functions over the strategy
s−i = (s1, s2, c, si−1, si+1, c, sn). Separating a strategy profile s into
the strategy of player i and the strategies of the other players, we write
−i
I refer to the “− i ” players as player i’s opponents, but note that this
year in prison. If one and only one of them finks, she will be freed and
used as a witness against the other, who will spend four years in
prison. If they both fink, each will spend three years in prison. This
situation may be modelled as a strategic game:
Players The two suspects.
Actions Each player’s set of actions is {Quiet(Mum), Fink}.
Players: N= {1,2}
M=Mum/Quiet, F= Finks
=-5.
Here, the payoff values have been decided considering the concept
that if freed (F,M), then no jail, but jail for one year if (M, M),
for three years, so the difference between the no jail value and the
jail for three years should be three (-1+(-3)=-4). In the end, the
7
difference between no Jain and jail for four years should be four (-
1+(-4)=-5).
The suspect 1’s ordering of the action profiles, from best to worst,
For suspect 2
Player 2
M F
[
Player 1 M (−2 ,−2 ) (−5 ,−1)
F (−1 ,−5 ) (−4 ,−4 ) ]
Solution concept:
Iterated elimination of strictly dominated strategies.
Solution by iterated elimination of strictly dominated strategies.
¿
In an n- player game, strategy si ,for player i is strictly dominated if
there exists some strategy si// such that,
Π i ( s1 , s1 , ,s1 , ... s''i ,...s n, )>, Π i ( s1 , s1 , ,s1 , ... s'i ,...s n , ),
Nash equilibrium
¿ ¿
Π i ( si , s−1 )≥Π i (s 'i , s−i )
¿ s ' ∈ Si
For all i and for all N players.
Nash equilibrium in games with continuous strategies:
Continuous variables : For example si ∈ S i could be a price, which in
theory could take any positive value.
¿ ¿ ¿
Π i ( si , s−1 )≥Π i (s 'i , s−i )
8
s ' ∈ Si
For all i and for all N players. In other words each player
s Π ( s ) max Π i ( si , s−1 ) '
choses to maximize
i i s i
player 1
Work hard Goof off
player 2 Work hard 2,2 0,3
Goof off 3,0 1,1
Application of PD game: Duopoly
In a simple model of a duopoly, two firms produce the same good, for
which each firm charges either a low price or a high price. Each firm
wants to achieve the highest possible profit. If both firms, choose
High then each earns a profit of $1000. If one firm chooses High and
the other chooses Low then the firm choosing High obtains no
customers and makes a loss of $200, whereas the firm choosing Low
earns a profit of $1200 (its unit profit is low, but its volume is high).
If both firms, choose Low then each earns a profit of $600. Each firm
cares only about its profit, so we can represent its preferences by the
profit it obtains, yielding the game in Figure.
Firm 2
q=0 (1-q)=1
Low price High price
Firm 1. p=0 Low 1000,100 -200,1200 price
(1-p)=1 High 0 price
1200,-200 600,600
PSNE = (HP, HP)=(600,600)
EU_1(Low)=1000*q +-200*(1-q)=1000q-200+200q=1200q-200
EU_1(High)=1200*q +600*(1-q)= 1200q-600q+600= 600q+600
EU_1(Low)= EU_1(High)
1200q-200=600q+600
10
600q=800
q = 1.33 (q \in [0,1]); q>1
Apple
Samsung
In a simple model of a duopoly, two firms produce the same good for
which each firm charges either a low price or a high price. Each firm
wants to achieve the highest possible profit.
Battle of the Sexes (BOS):
Player 1
Opera Football
Player 2 Opera 2,1 0,0
Football 0,0 1,2
For the battle of the sexes; however, there is no strategy that is
always best: Playing F is the best if your opponent plays F and
playing O is the best if your opponent plays O.
NE: two
Bach or Stravinsky (cooperation):
Conflict and cooperation both are present in both the PD and BOS.
The next game is purely conflictual.
Matching of Pennies:
Two people choose, simultaneously, whether to show the Head or the
Tail of a coin. If they show the same side, person 2 pays person 1 a
dollar; if they show different sides, person 1 pays person 2 a dollar.
Each person cares only about the amount of money she receives, and
(naturally!) prefers to receive more than less. A strategic game that
11
Player 1
Head Tail
Player 2 Head 1,-1 -1,1
Tale -1,1 1,-1
In this game the players’ interests are diametrically opposite. This is
called “Strictly Competitive”. Player 1 wants to take the same action
as the other player, whereas player 2 wants to take the opposite
action.
NE: no
Economic example:
Game between new firm and established firm.
The established producer prefers the newcomer’s product to look
different from its own, so that its customer’s will not be tempted to
buy the newcomer’s product, whereas the newcomer prefers that the
products look alike.
Stag Hunt:
Each of a group of hunters has two options: she may remain attentive
to the pursuit of a stag, or catch a hare. If all hunters pursue the stag,
they catch it and share it equally; if any hunter devotes her energy to
catching a hare, the stag escapes, and the hare belongs to the defecting
hunter alone. Each hunter prefers a share of the stag to a hare. The
strategic game that corresponds to this specification is:
12
function,
C ( q )=q
2
i i i for i∈ {1 , 2}
q =100− p
where, q=q 1 +q 2
Competitive equilibrium:
MC=2 qi , equating this to the price gives us each first supply
function.
2 qi = p
p p
or , q i= i. e . q 1=q2 =
2 2
profit
Π i =pq−c =(100−q )q i−q2i =(100−50)25−252 = Rs.625.
Π 1 =51×24−24 2=625 .
This means firm 1 instead look for the best choice it can
make. Its best choice depends on the quantity that firm 2 will
produce- what will be that?
The actions are choices of quantity, and payoffs are the
profits.
N= {1, 2}
Strategy sets: S1=[ 0 , α ] for , i∈ {1 , 2} and firms choose quantities si ∈ S i .
We use the term belief for a player’s assessment about the strategies
of the others in the game.
For example, consider the prisoners’ dilemma game, in which each
player chooses between strategies M and F. Player 1 may say to
himself, “I think the other guy is likely to play strategy F” or “I think
the other guy will surely play strategy M.” These are two alternative
beliefs that player 1 could have. Unfortunately, a statement such as “I
think the other guy is likely to play strategy F” is somewhat
ambiguous. Because we aim to model decision making
mathematically, we need a precise way of representing a player’s
beliefs. The key is to use probabilities. Here is an illustration.
Continuing with the prisoners’ dilemma example, suppose we let p
represent the likelihood that player 1 thinks the other player will
select strategy M in the game. Formally, p is a probability—a number
between zero and one—where p = 1 means that player 1 is certain that
player 2 will select strategy M, and p = 0 means that player 1 is sure
player 2 will not choose M. The probability that player 1 believes the
other player will select F is 1 − p. Thus, if p = 1>2, then player 1
believes that the other player is equally likely to select M and F. The
15
Player 2
M F
[
Player 1 M (−2 ,−2 ) (−5 ,−1)
F (−1 ,−5 ) (−4 ,−4 ) ]
Note that if p ∈ (0, 1), then player 1 thinks it is possible that player 2
will play strategy M and also thinks it is possible that player 2 will
play F. Understand that player 1 might not believe that player 2
actually randomizes (by, say, flipping a coin). It’s just that player 1
may be uncertain about what strategy player 2 will choose, and so he
associates probabilities with player 2’s strategies. Furthermore, player
l’s belief may not be accurate; he could be certain that player 2 will
select M when in fact player 2 selects F.
Mathematically, a belief of player i is a probability distribution over
the strategies of the other players. Let us denote such a probability
distribution θ and writeθ ∈ Δ S , where Δ S is the set of
−i −i −i −i
∑s ∈ s θ j ( s j )=1
j j
Example:
16
17
Figure-6.1a
Strategy D is dominated by strategy U. Neither of player 2’s strategy
is dominated.
Figure-6.1b
Strategy D is dominated by strategy M. U is not dominated by M.
Figure-6.1.c
For player 1, no pure strategy is dominated by another pure strategy.
Obviously, neither U nor M dominates the other, and D
does not dominate these strategies. In addition, neither
U nor M dominates D.
For instance, although U is better than D when player 2
selects L, D performs better than U when player 2
selects R. However, a mixed strategy dominates D.
Consider player 1’s mixed strategy of selecting U with
probability 1/2, M with probability 1/2, and D with
probability zero. We represent this mixed strategy as
(1/2, 1/2, 0). If player 2 selects L, then this mixed
18
Player 1
[
M (−2 ,−2 ) (−5 ,−1 )
F (−1 ,−5 ) (−4 ,−4 ) ]
Here M is dominated by F. So, we would predict that neither would
select M. Bothe players would be better off if they each selected M.
Major tension in strategic settings:
The clash between individual and group interests.
The Concept of Efficiency:
When Chris belief that Alex is going to the opera, his unique best
response is to go to the opera. Similarly, if he believes that Alex will
go to the Football game, then he should go to the game.
For every unique belief there is a best response.
Mapping beliefs into a choice of action and this choice of action must
be a best response to the beliefs.
This is called player i’s best response function.
Example:
For some beliefs a player will have more than one best response
strategy.
P2
L C R
[ ]
U 3,3 5,1 6,2
M 4 ,1 8,4 3,6
P1 D 4 ,0 9,6 6,8
s−i . If he has a unique best response to s−i the BRi ( s−i ) will contain only
F.O.C.
90−2 q1 −q 2=0
90−q2
Or ,q 1=
2
If firm 1 believes that firm 2 will choose the quantity q2 then it should
choose q 1 according to the best response function.
{
90−q2
BR 1 ( q 2 )= if , 0≤q2 ≤90
2
0 if , q 2 >90
22
Example:
There may be more than one best response to a given belief. In a finite
game every belief has at least one best response.
Thus his best response is strategy D, the strategy that yields the
greatest expected payoff given his belief. Strategy D is his only best
response
23
Example:
For player 2:
R is the dominated strategy for player 2.
And
24
For player:1
For player 1 neither U nor M is dominated. D is dominated by neither
U nor M.
Now calculate “Mixed Strategy”:
D is dominated by the mixed strategy with probability 1/3 for U, 2/3
for M and 0 for D.
(1/3,2/3,0).
Set of UD1 = {U,M}.
Now derive the set of best responses.
Expected payoff of playing U with beliefs {p,(1-p)}.
i.e. 6.p+0.(1-p)=6p
Expected payoff of playing M is
2p+(1-p)5
Expected payoff of playing D is
p.3+(1-p).3
Conclusion:
Strategies are best responses if and only if they are not strictly
dominated.
25
Weak dominance
Example:
Information set:
29
Case 1:
Every player i has a collection of information sets hi ∈ H i . When
h2 ={x i }, this means that the information set at x i is a singleton( it
31
includes only the node x i . Hence player 2 has information that says “I
am at x i ’.
Case: 2
When player 2 has two information sets h2 ={x 1 }and , h2 ={x 2 }.
Case :3
A game in which player 2 does not know whether he is at x1 or x2 , the
it must be the case that his information is
“I know that I am at either x1 or x2 , but I do not know at which of the
two I am ”.
i.e. h2 ={x 1 , x 2 }
Player 1 choose from the action set A 1 = {O, F}, and player 2 chooses
from A2 ={o, f} without observing the choice of player 1.
32
33
34
35
36
37