题意:OpenAI Gym CarRacing-v2 赛道终止处理
问题背景:
I am using gym==0.26.0
library and am trying to understand what means that an episode is finished/done in the CarRacing-v2 environment. In the documentation is written this.
我正在使用 `gym==0.26.0` 库,并尝试理解在 CarRacing-v2 环境中“episode finished/done”意味着什么。文档中写道:
### Rewards
The reward is -0.1 every frame and +1000/N for every track tile visited,
where N is the total number of tiles visited in the track. For example,
if you have finished in 732 frames, your reward is
1000 - 0.1*732 = 926.8 points.
### Episode Termination
The episode finishes when all of the tiles are visited. The car