强化学习
JSerenity
这个作者很懒,什么都没留下…
展开
专栏收录文章
- 默认排序
- 最新发布
- 最早发布
- 最多阅读
- 最少阅读
-
Key Concepts in RL
Part 1: Key Concepts in RL for review. need to update. In a nutshell, RL is the study of agents and how they learn by trial and error. It formalizes the idea that rewarding or punishing an agent for i...原创 2019-05-19 00:11:41 · 206 阅读 · 0 评论 -
Spinning Up USER DOCUMENTATION
Spinning Up USER DOCUMENTATION (For review) Environments Spinning Up requires Python3, OpenAI Gym, and OpenMPI. MuJoCo(Optional but Preferred) Algorithms VPG, TRPO, PPO, DDPG, TD3, SAC. The On-Policy ...原创 2019-05-17 23:37:57 · 287 阅读 · 0 评论 -
Learn to play Pong with PG from scratch and pixels
Learn to play Pong with PG from scratch and pixels http://karpathy.github.io/2016/05/31/rl/ Policy Gradients(PG) is default choice for attacking RL problems. DQN changed Q-Learning. PG is preferred be...原创 2019-06-08 15:15:42 · 409 阅读 · 0 评论
分享