强化学习资源——Hands-On Reinforcement Learning、Deep Reinforcement Learning Hands-On等

20年12月26日更新,如果链接失效,请保存过资料的朋友在评论区分享一下永久链接,让技术在社区内自由地流动!

Hands-On Intelligent Agents with OpenAI Gym

一本讲怎么配强化学习环境的书,写的还行。
在这里插入图片描述

链接: https://pan.baidu.com/s/1WPPn_jYwpwKKujRt1lqCRQ 提取码: 27rm


Hands-On Reinforcement Learning with Python

在这里插入图片描述
PDF 链接:https://pan.baidu.com/s/1CzplQERRjwgi0b9pQTebFw 密码:u3v5
代码 链接:https://pan.baidu.com/s/1rrmuZTPML-ZLJZlGWhUVdA 密码:qzf8
这本书的代码是Tensorflow

Deep Reinforcement Learning Hands-On

The topic of this book is Reinforcement Learning—which is a subfield of Machine Learning—focusing on the general and challenging problem of learning optimal behavior in complex environment. The learning process is driven only by reward value and observations obtained from the environment. This model is very general and can be applied to many practical situations from playing games to optimizing complex manufacture processes. Due to flexibility and generality, the field of Reinforcement Learning is developing very quickly and attracts lots of attention both from researchers trying to improve existing or create new methods, as well as from practitioners interested in solving their problems in the most efficient way. This book was written as an attempt to fill the obvious lack of practical and structured information about Reinforcement Learning methods and approaches. On one hand, there are lots of research activity all around the world, new research papers are being published almost every day, and a large portion of Deep Learning conferences such as NIPS or ICLR is dedicated to RL methods. There are several large research groups focusing on RL methods application in Robotics, Medicine, multi-agent systems, and others. The information about the recent research is widely available, but is too specialized and abstract to be understandable without serious efforts. Even worse is the situation with the practical aspect of RL application, as it is not always obvious how to make a step from the abstract method described in the mathematical-heavy form in a research paper to a working implementation solving actual problem. This makes it hard for somebody interested in the field to get an intuitive understanding of methods and ideas behind papers and conference talks. There are some very good blog posts about various RL aspects illustrated with working examples,
评论 18
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值