
学术
rockray21
我只是知识的搬运工
展开
-
An Optimistic Perspective on Offline Reinforcement Learning
An Optimistic Perspective on Offline Reinforcement Learning摘要1 introduction2 Off-policy Reinforcement Learning如有错误,欢迎指正本文翻译为机翻,仅作初步了解学习使用,需要用到的时候再回来整理。如有侵权,请私信本人。原文链接:https://arxiv.org/pdf/1907.04543.pdf参考链接:https://tech.sina.com.cn/roll/2020-04-15/do原创 2020-12-06 18:17:06 · 527 阅读 · 0 评论 -
自博弈学习初步
如有错误,欢迎指正本文学习过程中的归纳总结如有侵权,请私信本人参考链接:https://blog.youkuaiyun.com/weixin_37837522/article/details/91907661https://www.jianshu.com/p/bcbc41125c54https://zhuanlan.zhihu.com/p/30282616对于alphazero的准备知识重点看这一篇https://blog.youkuaiyun.com/windowsyun/article/details/88701原创 2020-12-06 15:09:10 · 4905 阅读 · 0 评论 -
curriculum learning
如有错误,欢迎指正本文学习过程中的归纳总结如有侵权,请私信本人参考链接:https://www.dazhuanlan.com/2019/11/21/5dd617335da12/https://blog.youkuaiyun.com/qq_25011449/article/details/82914803关于transfer Learning和fine-tuning的区别就是,transfer Learning是一种理念(concept),而fine-tuning则是其实现的具体方法。而Curriculum原创 2020-12-06 10:01:09 · 1120 阅读 · 0 评论