博弈论斯坦福game theory stanford week 5.1_

最新推荐文章于 2020-01-13 18:01:45 发布

weixin_30564901

最新推荐文章于 2020-01-13 18:01:45 发布

阅读量339

点赞数

CC 4.0 BY-SA版权

原文链接：http://www.cnblogs.com/zangzelin/p/8598970.html

本文解析了斯坦福大学博弈论课程中的练习题，包括纯策略纳什均衡的求解、重复博弈的子博弈完美纳什均衡分析、无限重复博弈中的可信威胁策略等关键概念的应用。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

title: 博弈论斯坦福game theory stanford week 5-1
tags: note
notebook: 6- 英文课程-15-game theory
---

博弈论斯坦福game theory stanford week 5-1

练习

1. Question 1

Two players play the following normal form game.

1 2 Left Middle Right

Left 4,2 3,3 1,2

Middle 3,3 5,5 2,6

Right 2,1 6,2 3,3

Which is the pure strategy Nash equilibrium of this stage game (if it is played only once)?

a) (Left, Left);

b) (Left, Middle);

c) (Left, Right);

d) (Middle, Left);

e) (Middle, Middle);

f) (Middle, Right);

g) (Right, Left);

h) (Right, Middle);

i) (Right, Right).

Correct 
(i) is the unique Nash equilibrium of the stage game.

(Right, Right) is a Nash equilibrium of the stage game because Right is the best response when the other player is playing Right.
It is also the unique Nash equilibrium. To see this, check that in all other cases at least one player has an incentive to deviate.

Question 2

Two players play the following normal form game.

1 2 Left Middle Right

Left 4,2 3,3 1,2

Middle 3,3 5,5 2,6

Right 2,1 6,2 3,3

Suppose that the game is repeated for two periods. What is the outcome from the subgame perfect Nash equilibrium of the whole game:

a) (Left, Left) is played in both periods.

b) (Right, Right) is played in both periods.

Correct 
(b) is true.

The stage game has a unique Nash equilibrium.
In the second period, (Right, Right) must be played regardless of the outcome obtained in the first period.
Then, it is optimal for both players to maximize the current payoff at the first period and play (Right, Right).

c) (Middle, Middle) is played in the first period, followed by (Left, Left)

d) (Middle, Middle) is played in the first period, followed by (Right, Right)

Question 3

Two players play the following normal form game.

1 2 Left Middle Right

Left 4,2 3,3 1,2

Middle 3,3 5,5 2,6

Right 2,1 6,2 3,3

Suppose that there is a probability p that the game continues next period and a probability (1−p) that it ends. What is the threshold p∗ such that when p≥p∗ (Middle, Middle) is sustainable as a subgame perfect equilibrium by grim trigger strategies, but when $gif.latex?p<p$ playing Middle in all periods is not a best response? [Here the grim strategy is: play Middle if the play in all previous periods was (Middle, Middle); play Right otherwise.]

a) 1/2;

b) 1/3;

c) 1/4;

This should not be selected

d) 2/5.

Question 4

Consider the following game:

1 2 Left Middle Right

Left 1,1 5,0 0,0

Middle 0,5 4,4 0,0

Right 0,0 0,0 3,3

Which are the pure strategy Nash equilibria of this stage game? There can be more than one.

a) (Left, Right);

Un-selected is correct

b) (Left, Left);

Correct 

(b) and (f) are pure strategy Nash equilibria of the stage game.

(Left, Left) and (Right, Right) are Nash equilibria of the stage game because Right is the best response when the other player is playing Right, and Left is the best response when the other player is playing Left.
There are no other pure strategy Nash equilibria. To see this, check that in all other cases at least one player has an incentive to deviate.

c) (Left, Middle);

Un-selected is correct

d) (Middle, Right);

Un-selected is correct

e) (Middle, Left);

Un-selected is correct

f) (Right, Right).

Correct 
(b) and (f) are pure strategy Nash equilibria of the stage game.

(Left, Left) and (Right, Right) are Nash equilibria of the stage game because Right is the best response when the other player is playing Right, and Left is the best response when the other player is playing Left.
There are no other pure strategy Nash equilibria. To see this, check that in all other cases at least one player has an incentive to deviate.

g) (Right, Middle);

Un-selected is correct

h) (Right, Left);

Un-selected is correct

i) (Middle, Middle);

Un-selected is correct

Question 5

Consider the following game:

1 2 Left Middle Right

Left 1,1 5,0 0,0

Middle 0,5 4,4 0,0

Right 0,0 0,0 3,3

Suppose that the game is repeated for two periods. Which of the following outcomes could occur in some subgame perfect equilibrium? (There might be more than one).

a) (Middle, Middle) is played in the first period, followed by (Right, Right)

Correct 
(a), (b) and (c) are all correct.

Recall that playing a Nash equilibrium of the stage game in each period forms a subgame perfect Nash equilibrium of the whole game. Then, (b) and (c) are subgame perfect Nash equilibria.
Outcome (a) can be obtained when both players play the following strategy:
Play Middle in the first period.
If outcome in first period was (Middle, Middle) play Right in the second period; otherwise play Left.
It is easy to check that this grim strategy forms a subgame perfect Nash equilibrium:
Suppose that player 1 plays this strategy.
If player 2 plays the same strategy, he/she will receive a total payoff of 4+3=7 (assume no discounting).
If player 2 deviates to (Left, Right), he/she will receive a total payoff of 5+1=6 (which is lower than the payoff of following the grim strategy).

b) (Left, Left) is played in both periods.

Correct 
(a), (b) and (c) are all correct.

Recall that playing a Nash equilibrium of the stage game in each period forms a subgame perfect Nash equilibrium of the whole game. Then, (b) and (c) are subgame perfect Nash equilibria.
Outcome (a) can be obtained when both players play the following strategy:
Play Middle in the first period.
If outcome in first period was (Middle, Middle) play Right in the second period; otherwise play Left.
It is easy to check that this grim strategy forms a subgame perfect Nash equilibrium:
Suppose that player 1 plays this strategy.
If player 2 plays the same strategy, he/she will receive a total payoff of 4+3=7 (assume no discounting).
If player 2 deviates to (Left, Right), he/she will receive a total payoff of 5+1=6 (which is lower than the payoff of following the grim strategy).

c) (Right, Right) is played in both periods.

Correct 
(a), (b) and (c) are all correct.

Recall that playing a Nash equilibrium of the stage game in each period forms a subgame perfect Nash equilibrium of the whole game. Then, (b) and (c) are subgame perfect Nash equilibria.
Outcome (a) can be obtained when both players play the following strategy:
Play Middle in the first period.
If outcome in first period was (Middle, Middle) play Right in the second period; otherwise play Left.
It is easy to check that this grim strategy forms a subgame perfect Nash equilibrium:
Suppose that player 1 plays this strategy.
If player 2 plays the same strategy, he/she will receive a total payoff of 4+3=7 (assume no discounting).
If player 2 deviates to (Left, Right), he/she will receive a total payoff of 5+1=6 (which is lower than the payoff of following the grim strategy).

Question 6

Consider the following trust game:

There is a probability p that the game continues next period and a probability (1−p) that it ends. The game is repeated indefinitely. Which statement is true? [Grim trigger in (c) and (d) is player 1 playing Not play and player 2 playing Distrust forever after a deviation from ((Play,Share), (Trust)).]

a) There exists a pure strategy Nash equilibrium in the one-shot game with player 2 playing Trust.

b) There exists a pure strategy subgame perfect equilibrium with player 2 playing Trust in any period in the finitely repeated game.

This should not be selected

c) ((Play,Share), (Trust)) is sustainable as a subgame perfect equilibrium by grim trigger in the indefinitely repeated game with a probability of continuation of p≥5/9.

Question 7

In an infinitely repeated Prisoner's Dilemma, a version of what is known as a "tit for tat" strategy of a player i is described as follows:

There are two "statuses" that player i might be in during any period: "normal" and "revenge";
In a normal status player i cooperates;
In a revenge status player i defects;
From a normal status, player i switches to the revenge status in the next period only if the other player defects in this period;
From a revenge status player i automatically switches back to the normal status in the next period regardless of the other player's action in this period.
Consider an infinitely repeated game so that with probability p that the game continues to the next period and with probability (1−p) it ends.

Cooperate (C) Defect (D)

Cooperate (C) 4,4 0,5

Defect (D) 5,0 1,1

True or False:

When player 1 uses the above-described "tit for tat" strategy and starts the first period in a revenge status (thus plays defect for sure), in any infinite payoff maximizing strategy, player 2 plays defect in the first period

True.

Correct 
True.

If player 1 uses "tit for tat" strategy and starts in a revenge status, the payoff in the first period is higher for player 2 from defection than cooperation.
Moreover, the action played by 2 in the first period when 1 begins in revenge status doesn't affect the remaining periods since 1 switches to normal status in the second period regardless of what player 2 does in the first period.

False.

Question 8

In an infinitely repeated Prisoner's Dilemma, a version of what is known as a "tit for tat" strategy of a player i is described as follows:

There are two "statuses" that player i might be in during any period: "normal" and "revenge";

In a normal status player i cooperates;

In a revenge status player i defects;

From a normal status, player i switches to the revenge status in the next period only if the other player defects in this period;

From a revenge status player i automatically switches back to the normal status in the next period regardless of the other player's action in this period.

Consider an infinitely repeated game so that with probability p that the game continues to the next period and with probability (1−p) it ends.

Cooperate (C) Defect (D)

Cooperate (C) 4,4 0,5

Defect (D) 5,0 1,1

What is the payoff for player 2 from always cooperating when player 1 uses this tit for tat strategy and begins in a normal status? How about always defecting when 1 begins in a normal status?

a) 4+4p+4p2+4p3+… ; 5+p+p2+p3+…

b) 4+4p+4p2+4p3+… ; 5+p+5p2+p3+…

Correct 
(b) is true.

If 2 always cooperates, then 1 stays `normal' and cooperates always as well, and the payoff to each player is 4 in each period.
If 2 always defects, then 1 is normal in odd periods and switches to revenge in even periods (because 2 defects). 1 cooperates in odd periods and defects in even periods, thus 2 earns 5 in odd periods and 1 in even periods.

c) 5+4p+4p2+4p3+… ; 4+4p+4p2+4p3+…

d) 5+4p+4p2+4p3+… ; 5+p+p2+p3+…

Question 9

In an infinitely repeated Prisoner's Dilemma, a version of what is known as a "tit for tat" strategy of a player i is described as follows:

There are two "statuses" that player i might be in during any period: "normal" and "revenge";

In a normal status player i cooperates;

In a revenge status player i defects;

From a normal status, player i switches to the revenge status in the next period only if the other player defects in this period;

From a revenge status player i automatically switches back to the normal status in the next period regardless of the other player's action in this period.

Consider an infinitely repeated game so that with probability p that the game continues to the next period and with probability (1−p) it ends.

Cooperate (C) Defect (D)

Cooperate (C) 4,4 0,5

Defect (D) 5,0 1,1

What is the threshold p∗ such that when p≥p∗ always cooperating by player 2 is a best response to player 1 playing tit for tat and starting in a normal status, but when p<p∗ always cooperating is not a best response?

a) 1/2

b) 1/3

Correct 
(b) is true.

From part (2), in order to sustain cooperation, we need 4+4p+4p2+4p3+...≥5+p+5p2+p3+... , which is 4+4p≥5+p, thus p≥1/3.
p* = 1/3.
Note that this just checks always cooperating against always defecting. However, you can easily check that if player 2 wants to defect in the first period,then s/he should also do so in the second period (our answer from part (1)). Then the third period looks just like we are starting the game over, so player 2 would want to defect again...

c) 1/4

d) 1/5

转载于:https://www.cnblogs.com/zangzelin/p/8598970.html