0% found this document useful (0 votes)

46 views6 pages

Game1ba Nash Eqm Pure-1

Uploaded by

Emiliano Salazar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views6 pages

Game1ba Nash Eqm Pure-1

Uploaded by

Emiliano Salazar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Game Theory

Nash Equilibrium in Pure Strategies

Tetsuya Hoshino January 17, 2023

In this note, we study the notion of Nash equilibrium in a normal-form game (Nash, 1950).
We discuss the justifications based on a self-enforcing agreement.

Example 1. Consider the following prisoners’ dilemma:

C D
C −1, −1 −3, 0
D 0, −3 −2, −2

Table 1: the prisoners’ dilemma

Suppose that players 1 and 2 have agreed on playing a strategy profile a = (a1 , a2 ). We
examine whether this agreement is “stable” or not, by asking whether there exists at least one
player who wants to deviate from that strategy profile.
Suppose that players 1 and 2 have agreed to play strategy profile (D, D). Then, they are
willing to play it. This is because if (player 1 thinks) player 2 will play D, player 1 is willing to
play D, and similarly, if (player 2 thinks) player 1 will play D, player 2 is willing to play D. In
this sense, the agreement is self-enforcing; strategy profile (D, D) is said to be a (pure-strategy)
Nash equilibrium.
Suppose that players 1 and 2 have agreed to play strategy profile (D, C). Then, they are
not willing to play it. This is because if (player 2 thinks) player 1 will play D, he deviates to
playing D rather than C. In this sense, the agreement is not self-enforcing. □

1 Pure-Strategy Nash Equilibrium

The notion of Nash equilibrium captures the “self-enforcement” of the agreement of play. More
specifically, if players reach an agreement to play strategy profile a∗ before actually playing
game G, then no player will deviate from the agreement a∗ when actually playing it. That is,
the agreement a∗ is self-enforcing. We note that the Nash equilibrium does not attempt to
examine how players can reach such a stable agreement.

Definition 1. In a normal-form game G, a Nash equilibrium (in pure strategies) is a

strategy profile a∗ ∈ A such that for each i ∈ I,

ui (a∗i , a∗−i ) ≥ ui (ai , a∗−i ), ∀ai ∈ Ai . (1)

1
We provide an equivalent definition of Nash equilibrium.

Definition 2. In a normal-form game G, player i’s best-response correspondence (in

pure strategies) is a correspondence BRi : A−i → 2Ai such that BRi (a−i ) is the set of player
i’s optimal strategies when players −i play strategies a−i . That is,

BRi (a−i ) = argmax ui (ai , a−i ).

ai ∈Ai

Definition 3. In a normal-form game G, a Nash equilibrium (in pure strategies) is a

strategy profile a∗ ∈ A such that for each i ∈ I,

a∗i ∈ BRi (a∗−i ).

The equivalence between Definitions 1 and 3 is straightforward. To see this, we note that
inequality (1) says that strategy a∗i is optimal for player i when players −i play strategies a∗−i .
Formally, a∗i ∈ argmaxai ui (ai , a∗i ) ≡ BRi (a∗−i ).
Although somewhat repetitive, we oﬀer a memorable paraphrase of Nash equilibrium.

A strategy profile a∗ is a (pure-strategy) Nash equilibrium if every player i plays a best

response a∗i to players −i’s strategies a∗−i (that is, if no player has any unilateral profitable
deviation).

Exercise 1. Find a (pure-strategy) Nash equilibrium in the Cournot game described in the
previous note.

Exercise 2. Find a (pure-strategy) Nash equilibrium in the Bertrand game described in the
previous note.

Equilibrium Multiplicity We consider another simple example.

Example 2. Consider the following coordination game:

L R
U 1, 1 −2, 0
D 0, −2 0, 0

Table 2: a coordination game

Player 1 chooses either U or D, while player 2 chooses either L or R. This game has two
pure-strategy Nash equilibria, (U, L) and (D, R). Both players are strictly better-off at the
former than at the latter.
Suppose that players are playing the inefficient equilibrium (D, R). Can they move to the
efficient equilibrium (U, L)? In general, the answer is “no.” By the nature of Nash equilibrium,

2
if (player 1 thinks) player 2 will play R, player 1 does not deviate from playing D. Similarly,
if (player 2 thinks) player 1 will play D, player 2 does not deviate from playing L. That is,
neither player alone can escape from the (ineﬃcient) equilibrium (D, R). □

It is worth summarizing the implication obtained from this example.

A normal-form game G may have multiple (pure-strategy) Nash equilibria, and it is possible
that an equilibrium is strictly better than another equilibrium for all players. It is, in
general, diﬃcult that when players are stuck in a “bad” equilibrium, no player escapes
from the bad one to another one alone.

Exercise 3. Consider the Braess’s paradox in the previous note. Show that:
1. There are multiple (pure-strategy) Nash equilibria.
2. Every (pure-strategy) Nash equilibrium has five players choosing route SAG and five
players choosing route SBG.

Equilibrium Non-Existence We consider another simple example.

Example 3. Consider the following matching pennies game:

H T
H +1, −1 −1, +1
T −1, +1 +1, −1

Table 3: a matching pennies

Each player has a penny and secretly turns the penny to Heads (H) or Tails (T ). If the pennies
match then player 1 gets the both pennies and player 2 loses his penny; otherwise, player 2
gets the both pennies and player 1 loses her penny. This game has no pure-strategy Nash
equilibrium. □

2 How to Reach Nash Equilibrium

Rational Reasoning If players think rationally enough before playing a game, they may arrive
at a Nash equilibrium. Consider the prisoners’ dilemma of Example 1. In this game, it is always
in a player’s interest to play D, regardless of the opponent’s strategy; hence, rational players
can reach the Nash equilibrium (D, D).
However, rational reasoning alone cannot always lead to Nash equilibrium. Consider the
coordination game of Example 2. In this game, there are two (pure-strategy) Nash equilibria,
(U, L) and (D, R). If player 1 conjectures player 2 will play L, then player 1 will rationally
choose U . If, in addition, player 2 conjectures player 1 will play D, then player 2 will rationally
choose R. In the end, they will end up playing a non-equilibrium (U, R).

3
Box (Leniency as Prisoner’s Dilemma): The prisoner’s dilemma is a fictional situation,
but we have such a situation in reality. A leniency system is a system that aims to
detect illegal activities by artificially creating a prisoners’ dilemma-like situation. It has
been introduced in Japan and has been successful. In many developed countries, collusive
behavior that restricts competition and increase prices (e.g., cartels) are illegal under the
Antitrust Law. In Japan, the Fair Trade Commission (FTC) is in charge of detecting them.
Firms engaging collusion may be subject to surcharges.
The leniency system in Japan is a system that reduces or exempts surcharges if a firm
confesses to having engaged in collusion prior to the FTC’s investigation. Specifically, the
first firm to report is exempted from the full surcharge, the second from 50%, and the third
from 30%.
There are two firms, 1 and 2, that are engaged in collusion. There is a probability of
10% that the collusion is discovered even if both firms remain silent. Assume that the
surcharge of 20 is imposed when the collusion is discovered. This situation is formalized as
follows:

Silent Confess
Silent −2, −2 −20, 0
Confess 0, −20 −5, −5

If both firms keep silent, the collusion is discovered with probability 0.1, and thus each firm
incurs the expected surcharge 0.1 × 20 = 2. If firm i alone confesses (but firm −i is silent),
it will be fully exempt from the surcharge. If firm i alone is silent (but firm −i confesses),
it will have to pay the full surcharge 20. If both firms confess, we assume that there is a
50-50 chance that each is judged as the first to report and as the second; in this case, firm
i’s expected penalty is 0.5 × (0% × 20) + 0.5 × (50% × 20) = 5.
This game has a unique Nash equilibrium, in which both confess. Moreover, this game
is exactly the prisoners’ dilemma in the sense that both will be better oﬀ if they keep silent.

That is, rational thinking (in each player’s brain) itself does not lead to a Nash equilibrium.
This is because rational thinking does not give a “right” conjecture about an opponent’s be-
havior. This observation suggests that Nash equilibrium requires more than rational thinking.
Indeed—as we shall see later—we can summarize as follows:

Roughly speaking, it holds that:a

rationality + right conjecture of opponents’ play =⇒ Nash equilibrium

a
More precisely, we require common knowledge of rationality and common knowledge of the conjecture,
but we do not discuss these requirements in detail.

This observation casts doubt on how players can reach right conjectures about opponents’ play.
In what follows, we discuss possible ways they arrive at the right conjectures.

4
Communication As suggested in Example 1, if players “agree” to play a Nash equilibrium
in advance, then they are willing to follow the agreement. In Example 2, if players can
communicate to decide, for example, to play strategy profile (U, L) before playing the game,
then they will play as agreed when actually playing the game. We note that this agreement
itself is non-binding. In other words, there is no external force (e.g., a law enforcement agency)
that compels players to follow the agreement or imposes any penalty on players who violate
the agreement.
It is worth summarizing this interpretation again:

A Nash equilibrium is a self-enforcing strategy profile. That is, players can implement it
by agreement (without any external device).

We note that a Nash equilibrium per se does not require any external device to enforce it,
but in reality, we use an external device to enforce a particular Nash equilibrium.

Example 4. Which side of the road do people drive? This situation is modeled as a coordi-
nation game that has two (pure-strategy) Nash equilibria: All drive on the left-hand side, and
all drive on the right-hand side. In Japan, people drive on the left-hand side, while in the US
and Mexico, people drive on the right-hand side.
Okinawa, a small island in Japan, was occupied by the US after World War II and returned
to Japan in 1972. Okinawa people drove on the right-hand side of the road because Okinawa
was occupied by the US when people started driving. This rule remained in effect for some time
after the reversion. However, Japan decided to switch from driving on the right-hand side of
the road to the left-hand side in 1978, because of the Geneva Convention on Road Traffic, which
requires uniform rules within the country. Such a change will never occur voluntarily, since
each person will drive on the right-hand side if others drive on the right-hand side (which is the
nature of Nash equilibrium). Therefore, Japan amended its Road Traffic Law and mobilized
the police etc. to force Okinawa people to drive on the left-side of the road. □

Learning What if players cannot communicate? In large communities, it is diﬃcult for people
to communicate with others. Even in such communities, we observe Nash equilibrium behavior.
In Example 4, people are forced to follow a particular Nash equilibrium by laws, but it is also
possible that a Nash equilibrium spontaneously emerges without any such enforcement.

Example 5. Which side do people stand on escalators? Similar to Example 4, this situation
is modeled as a coordination game that has two (pure-strategy) Nash equilibria: All stand on
the left, and all stand on the right. Which conventions are prevalent varies from community to
community. In Japan, two conventions coexist: In Tokyo, people stand on the left; in Osaka,
people stand on the right.
This convention is considered as a spontaneous occurrence. When escalators were intro-
duced, people might have stood on a random side, but as they “play” this escalator game

5
every morning in a station, they might have learned the convention that they should stand on
a particular side. In Tokyo, some people, by chance, might have stood on the left; then, people
following them now have incentives to stand on the left. Similarly, in Osaka, some people,
by chance, might have stood on the right; then, people following them now have incentives to
stand on the right.1 □

Nash equilibrium may emerge as a result of trial-and-error learning, but in general this
trial-and-error process may take a long time until it converges, or it may not converge. That
being said, we can still say that:

Behavior patterns entrenched in society are likely to be Nash equilibrium.

Indeed, if a behavioral pattern is not a Nash equilibrium, then some people will deviate from
the pattern to taking diﬀerent behavior, which breaks the pattern. Hence, if a behavioral
pattern is established in society, it should be a Nash equilibrium, so that no player alone will
not deviate.

References
Nash, J. F. (1950). Equilibrium points in n-person games. Proceedings of the National Academy
of Sciences, 36 (1), 48–49.

1
There have been recent campaigns to call for people to stand on both sides (for safety, etc.), but they have
not been successful at least not in Japan, as far as I know. It is generally diﬃcult to change people’s Nash
equilibrium behavior.

User Guide
100% (7)
User Guide
448 pages
Sterling Espionage Case Appeal Decision
86% (7)
Sterling Espionage Case Appeal Decision
118 pages
1 Supply Chain Management Fundamentals
95% (20)
1 Supply Chain Management Fundamentals
174 pages
Annual Reports on Chemistry 1962
100% (12)
Annual Reports on Chemistry 1962
576 pages
Uveit Foster
75% (12)
Uveit Foster
954 pages
Supply Chain Management Overview
50% (2)
Supply Chain Management Overview
67 pages
General Chemistry II Exam 1 Solutions
78% (9)
General Chemistry II Exam 1 Solutions
5 pages
Organic vs Inorganic Polymers Explained
100% (3)
Organic vs Inorganic Polymers Explained
37 pages
Pediatric Cardiology MCQs and Answers
79% (14)
Pediatric Cardiology MCQs and Answers
3 pages
MBBS 1st Year Syllabus Overview
89% (9)
MBBS 1st Year Syllabus Overview
14 pages
PVC Solvent Cement Formulation Guide
90% (10)
PVC Solvent Cement Formulation Guide
2 pages
Social Media Marketing Literature Review
100% (4)
Social Media Marketing Literature Review
27 pages
IRQs
100% (2)
IRQs
2 pages
Cassava Production and Demand in Laos
No ratings yet
Cassava Production and Demand in Laos
46 pages
Income Tax Ordinance 2001 Overview
67% (3)
Income Tax Ordinance 2001 Overview
23 pages
Radiology MCQs for Medical Students
80% (5)
Radiology MCQs for Medical Students
7 pages
Birth Asphyxia: Definition and Management
75% (4)
Birth Asphyxia: Definition and Management
12 pages
Android Facebook Integration Exercise
100% (3)
Android Facebook Integration Exercise
10 pages
PU Coatings: Chemistry and Applications
86% (7)
PU Coatings: Chemistry and Applications
5 pages
Simple Distillation Process Explained
63% (8)
Simple Distillation Process Explained
2 pages
Intermolecular Forces and Boiling Points
75% (4)
Intermolecular Forces and Boiling Points
1 page
Globalization's Impact on Legal Education in India
75% (4)
Globalization's Impact on Legal Education in India
14 pages
Hydrogenation and Hydrogenolysis Overview
75% (4)
Hydrogenation and Hydrogenolysis Overview
12 pages
Cardiac Tamponade: Causes and Management
100% (3)
Cardiac Tamponade: Causes and Management
7 pages
BTX Aromatics Production and Uses
No ratings yet
BTX Aromatics Production and Uses
6 pages
Dermatology MCQs and Answers Guide
67% (3)
Dermatology MCQs and Answers Guide
7 pages
ARDS Nursing Management and Interventions
0% (1)
ARDS Nursing Management and Interventions
2 pages
Separation of Benzene and Chlorobenzene
No ratings yet
Separation of Benzene and Chlorobenzene
5 pages
PES 2013 PS3 Option File Guide
No ratings yet
PES 2013 PS3 Option File Guide
1 page
Stoichiometric Principles in Industrial Chemistry
100% (1)
Stoichiometric Principles in Industrial Chemistry
6 pages

Game1ba Nash Eqm Pure-1

Uploaded by

Game1ba Nash Eqm Pure-1

Uploaded by

Game Theory

Nash Equilibrium in Pure Strategies

Example 1. Consider the following prisoners’ dilemma:

Table 1: the prisoners’ dilemma

1 Pure-Strategy Nash Equilibrium

Definition 1. In a normal-form game G, a Nash equilibrium (in pure strategies) is a

ui (a∗i , a∗−i ) ≥ ui (ai , a∗−i ), ∀ai ∈ Ai . (1)

Definition 2. In a normal-form game G, player i’s best-response correspondence (in

BRi (a−i ) = argmax ui (ai , a−i ).

Definition 3. In a normal-form game G, a Nash equilibrium (in pure strategies) is a

a∗i ∈ BRi (a∗−i ).

A strategy profile a∗ is a (pure-strategy) Nash equilibrium if every player i plays a best

Equilibrium Multiplicity We consider another simple example.

Example 2. Consider the following coordination game:

Table 2: a coordination game

It is worth summarizing the implication obtained from this example.

Equilibrium Non-Existence We consider another simple example.

Example 3. Consider the following matching pennies game:

Table 3: a matching pennies

2 How to Reach Nash Equilibrium

Roughly speaking, it holds that:a

rationality + right conjecture of opponents’ play =⇒ Nash equilibrium

Behavior patterns entrenched in society are likely to be Nash equilibrium.

You might also like