博弈论 斯坦福game theory stanford week 5.1_

本文解析了斯坦福大学博弈论课程中的练习题,包括纯策略纳什均衡的求解、重复博弈的子博弈完美纳什均衡分析、无限重复博弈中的可信威胁策略等关键概念的应用。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >


title: 博弈论 斯坦福game theory stanford week 5-1
tags: note
notebook: 6- 英文课程-15-game theory
---

博弈论 斯坦福game theory stanford week 5-1

练习

1. Question 1

Two players play the following normal form game.

1  2 Left Middle Right

Left 4,2 3,3 1,2

Middle 3,3 5,5 2,6

Right 2,1 6,2 3,3

Which is the pure strategy Nash equilibrium of this stage game (if it is played only once)?

a) (Left, Left);

b) (Left, Middle);

c) (Left, Right);

d) (Middle, Left);

e) (Middle, Middle);

f) (Middle, Right);

g) (Right, Left);

h) (Right, Middle);

i) (Right, Right).

Correct 
(i) is the unique Nash equilibrium of the stage game.

(Right, Right) is a Nash equilibrium of the stage game because Right is the best response when the other player is playing Right.
It is also the unique Nash equilibrium. To see this, check that in all other cases at least one player has an incentive to deviate.

Question 2

Two players play the following normal form game.

1  2 Left Middle Right

Left 4,2 3,3 1,2

Middle 3,3 5,5 2,6

Right 2,1 6,2 3,3

Suppose that the game is repeated for two periods. What is the outcome from the subgame perfect Nash equilibrium of the whole game:

a) (Left, Left) is played in both periods.

b) (Right, Right) is played in both periods.

Correct 
(b) is true.

The stage game has a unique Nash equilibrium.
In the second period, (Right, Right) must be played regardless of the outcome obtained in the first period.
Then, it is optimal for both players to maximize the current payoff at the first period and play (Right, Right).

c) (Middle, Middle) is played in the first period, followed by (Left, Left)

d) (Middle, Middle) is played in the first period, followed by (Right, Right)

Question 3

Two players play the following normal form game.

1  2 Left Middle Right

Left 4,2 3,3 1,2

Middle 3,3 5,5 2,6

Right 2,1 6,2 3,3

Suppose that there is a probability p that the game continues next period and a probability (1−p) that it ends. What is the threshold p∗ such that when p≥p∗ (Middle, Middle) is sustainable as a subgame perfect equilibrium by grim trigger strategies, but when gif.latex?p<p playing Middle in all periods is not a best response? [Here the grim strategy is: play Middle if the play in all previous periods was (Middle, Middle); play Right otherwise.]

a) 1/2;

b) 1/3;

c) 1/4;

This should not be selected

d) 2/5.

Question 4

Consider the following game:

1  2 Left Middle Right

Left 1,1 5,0 0,0

Middle 0,5 4,4 0,0

Right 0,0 0,0 3,3

Which are the pure strategy Nash equilibria of this stage game? There can be more than one.

a) (Left, Right);

Un-selected is correct 

b) (Left, Left);

Correct 

(b) and (f) are pure strategy Nash equilibria of the stage game.

(Left, Left) and (Right, Right) are Nash equilibria of the stage game because Right is the best response when the other player is playing Right, and Left is the best response when the other player is playing Left.
There are no other pure strategy Nash equilibria. To see this, check that in all other cases at least one player has an incentive to deviate.

c) (Left, Middle);

Un-selected is correct 

d) (Middle, Right);

Un-selected is correct 

e) (Middle, Left);

Un-selected is correct 

f) (Right, Right).

Correct 
(b) and (f) are pure strategy Nash equilibria of the stage game.

(Left, Left) and (Right, Right) are Nash equilibria of the stage game because Right is the best response when the other player is playing Right, and Left is the best response when the other player is playing Left.
There are no other pure strategy Nash equilibria. To see this, check that in all other cases at least one player has an incentive to deviate.

g) (Right, Middle);

Un-selected is correct 

h) (Right, Left);

Un-selected is correct 

i) (Middle, Middle);

Un-selected is correct 

Question 5

Consider the following game:

1  2 Left Middle Right

Left 1,1 5,0 0,0

Middle 0,5 4,4 0,0

Right 0,0 0,0 3,3

Suppose that the game is repeated for two periods. Which of the following outcomes could occur in some subgame perfect equilibrium? (There might be more than one).

a) (Middle, Middle) is played in the first period, followed by (Right, Right)

Correct 
(a), (b) and (c) are all correct.

Recall that playing a Nash equilibrium of the stage game in each period forms a subgame perfect Nash equilibrium of the whole game. Then, (b) and (c) are subgame perfect Nash equilibria.
Outcome (a) can be obtained when both players play the following strategy:
Play Middle in the first period.
If outcome in first period was (Middle, Middle) play Right in the second period; otherwise play Left.
It is easy to check that this grim strategy forms a subgame perfect Nash equilibrium:
Suppose that player 1 plays this strategy.
If player 2 plays the same strategy, he/she will receive a total payoff of 4+3=7 (assume no discounting).
If player 2 deviates to (Left, Right), he/she will receive a total payoff of 5+1=6 (which is lower than the payoff of following the grim strategy).

b) (Left, Left) is played in both periods.

Correct 
(a), (b) and (c) are all correct.

Recall that playing a Nash equilibrium of the stage game in each period forms a subgame perfect Nash equilibrium of the whole game. Then, (b) and (c) are subgame perfect Nash equilibria.
Outcome (a) can be obtained when both players play the following strategy:
Play Middle in the first period.
If outcome in first period was (Middle, Middle) play Right in the second period; otherwise play Left.
It is easy to check that this grim strategy forms a subgame perfect Nash equilibrium:
Suppose that player 1 plays this strategy.
If player 2 plays the same strategy, he/she will receive a total payoff of 4+3=7 (assume no discounting).
If player 2 deviates to (Left, Right), he/she will receive a total payoff of 5+1=6 (which is lower than the payoff of following the grim strategy).

c) (Right, Right) is played in both periods.

Correct 
(a), (b) and (c) are all correct.

Recall that playing a Nash equilibrium of the stage game in each period forms a subgame perfect Nash equilibrium of the whole game. Then, (b) and (c) are subgame perfect Nash equilibria.
Outcome (a) can be obtained when both players play the following strategy:
Play Middle in the first period.
If outcome in first period was (Middle, Middle) play Right in the second period; otherwise play Left.
It is easy to check that this grim strategy forms a subgame perfect Nash equilibrium:
Suppose that player 1 plays this strategy.
If player 2 plays the same strategy, he/she will receive a total payoff of 4+3=7 (assume no discounting).
If player 2 deviates to (Left, Right), he/she will receive a total payoff of 5+1=6 (which is lower than the payoff of following the grim strategy).

Question 6

Consider the following trust game:

There is a probability p that the game continues next period and a probability (1−p) that it ends. The game is repeated indefinitely. Which statement is true? [Grim trigger in (c) and (d) is player 1 playing Not play and player 2 playing Distrust forever after a deviation from ((Play,Share), (Trust)).]

a) There exists a pure strategy Nash equilibrium in the one-shot game with player 2 playing Trust.

b) There exists a pure strategy subgame perfect equilibrium with player 2 playing Trust in any period in the finitely repeated game.

This should not be selected 

c) ((Play,Share), (Trust)) is sustainable as a subgame perfect equilibrium by grim trigger in the indefinitely repeated game with a probability of continuation of p≥5/9.

Question 7

In an infinitely repeated Prisoner's Dilemma, a version of what is known as a "tit for tat" strategy of a player i is described as follows:

There are two "statuses" that player i might be in during any period: "normal" and "revenge";
In a normal status player i cooperates;
In a revenge status player i defects;
From a normal status, player i switches to the revenge status in the next period only if the other player defects in this period;
From a revenge status player i automatically switches back to the normal status in the next period regardless of the other player's action in this period.
Consider an infinitely repeated game so that with probability p that the game continues to the next period and with probability (1−p) it ends.

Cooperate (C) Defect (D)

Cooperate (C) 4,4 0,5

Defect (D) 5,0 1,1

True or False:

When player 1 uses the above-described "tit for tat" strategy and starts the first period in a revenge status (thus plays defect for sure), in any infinite payoff maximizing strategy, player 2 plays defect in the first period

True.

Correct 
True.

If player 1 uses "tit for tat" strategy and starts in a revenge status, the payoff in the first period is higher for player 2 from defection than cooperation.
Moreover, the action played by 2 in the first period when 1 begins in revenge status doesn't affect the remaining periods since 1 switches to normal status in the second period regardless of what player 2 does in the first period.

False.

Question 8

In an infinitely repeated Prisoner's Dilemma, a version of what is known as a "tit for tat" strategy of a player i is described as follows:

There are two "statuses" that player i might be in during any period: "normal" and "revenge";

In a normal status player i cooperates;

In a revenge status player i defects;

From a normal status, player i switches to the revenge status in the next period only if the other player defects in this period;

From a revenge status player i automatically switches back to the normal status in the next period regardless of the other player's action in this period.

Consider an infinitely repeated game so that with probability p that the game continues to the next period and with probability (1−p) it ends.

Cooperate (C) Defect (D)

Cooperate (C) 4,4 0,5

Defect (D) 5,0 1,1

What is the payoff for player 2 from always cooperating when player 1 uses this tit for tat strategy and begins in a normal status? How about always defecting when 1 begins in a normal status?

a) 4+4p+4p2+4p3+… ; 5+p+p2+p3+…

b) 4+4p+4p2+4p3+… ; 5+p+5p2+p3+…

Correct 
(b) is true.

If 2 always cooperates, then 1 stays `normal' and cooperates always as well, and the payoff to each player is 4 in each period.
If 2 always defects, then 1 is normal in odd periods and switches to revenge in even periods (because 2 defects). 1 cooperates in odd periods and defects in even periods, thus 2 earns 5 in odd periods and 1 in even periods.

c) 5+4p+4p2+4p3+… ; 4+4p+4p2+4p3+…

d) 5+4p+4p2+4p3+… ; 5+p+p2+p3+…

Question 9

In an infinitely repeated Prisoner's Dilemma, a version of what is known as a "tit for tat" strategy of a player i is described as follows:

There are two "statuses" that player i might be in during any period: "normal" and "revenge";

In a normal status player i cooperates;

In a revenge status player i defects;

From a normal status, player i switches to the revenge status in the next period only if the other player defects in this period;

From a revenge status player i automatically switches back to the normal status in the next period regardless of the other player's action in this period.

Consider an infinitely repeated game so that with probability p that the game continues to the next period and with probability (1−p) it ends.

Cooperate (C) Defect (D)

Cooperate (C) 4,4 0,5

Defect (D) 5,0 1,1

What is the threshold p∗ such that when p≥p∗ always cooperating by player 2 is a best response to player 1 playing tit for tat and starting in a normal status, but when p<p∗ always cooperating is not a best response?

a) 1/2

b) 1/3

Correct 
(b) is true.

From part (2), in order to sustain cooperation, we need 4+4p+4p2+4p3+...≥5+p+5p2+p3+... , which is 4+4p≥5+p, thus p≥1/3.
p* = 1/3.
Note that this just checks always cooperating against always defecting. However, you can easily check that if player 2 wants to defect in the first period,then s/he should also do so in the second period (our answer from part (1)). Then the third period looks just like we are starting the game over, so player 2 would want to defect again...

c) 1/4

d) 1/5

转载于:https://www.cnblogs.com/zangzelin/p/8598970.html

内容概要:本文档详细介绍了一个基于MATLAB实现的跨尺度注意力机制(CSA)结合Transformer编码器的多变量时间序列预测项目。项目旨在精准捕捉多尺度时间序列特征,提升多变量时间序列的预测性能,降低模型计算复杂度与训练时间,增强模型的解释性和可视化能力。通过跨尺度注意力机制,模型可以同时捕获局部细节和全局趋势,显著提升预测精度和泛化能力。文档还探讨了项目面临的挑战,如多尺度特征融合、多变量复杂依赖关系、计算资源瓶颈等问题,并提出了相应的解决方案。此外,项目模型架构包括跨尺度注意力机制模块、Transformer编码器层和输出预测层,文档最后提供了部分MATLAB代码示例。 适合人群:具备一定编程基础,尤其是熟悉MATLAB和深度学习的科研人员、工程师和研究生。 使用场景及目标:①需要处理多变量、多尺度时间序列数据的研究和应用场景,如金融市场分析、气象预测、工业设备监控、交通流量预测等;②希望深入了解跨尺度注意力机制和Transformer编码器在时间序列预测中的应用;③希望通过MATLAB实现高效的多变量时间序列预测模型,提升预测精度和模型解释性。 其他说明:此项目不仅提供了一种新的技术路径来处理复杂的时间序列数据,还推动了多领域多变量时间序列应用的创新。文档中的代码示例和详细的模型描述有助于读者快速理解和复现该项目,促进学术和技术交流。建议读者在实践中结合自己的数据集进行调试和优化,以达到最佳的预测效果。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值