- 博客(19)
- 收藏
- 关注
原创 git clone报错“××× Could not resolve host:github.com“的解决办法
git clone报错"××× Could not resolve host:github.com"的解决办法
2022-09-17 14:52:12
4010
原创 DOA算法3:Matrix Pencil
介绍Matrix Pencil在DOA计算中的应用,着重理解广义特征值,读者可以设定特殊的N、M、L值进行理解,计算一遍会有更深的印象。
2022-07-23 15:30:42
3538
2
原创 Planning and Learning with Tabular Methods: Part3
ContentsHeuristic SearchRollout AlgorithmMonte Carlo Tree SearchReferencesHeuristic SearchDecision-time planning methods, collectively known as heuristic search, are classical state-space planning methods in AI.The approximate value function is applied
2021-08-02 14:56:43
202
原创 Planning and Learning with Tabular Methods: Part 2
ContentsExpected vs. Sample UpdatesIs it better devoted to a few expected updates or to bbb times as many sample updates?Trajectory SamplingReal-time Dynamic ProgrammingPlanning at Decision TimeExpected vs. Sample UpdatesSample updates can in many cases
2021-07-24 16:14:25
244
原创 Planning and Learning with Tabular Methods: Part 1
ContentsModels and PlanningPlanning and learning methodsRandom-sample one-step tabular Q-planningDyna: Integrated Planning, Acting, and LearningThe reason to put forward Dyna-QModel learning and direct RLDyna-QMethods of reinforcement learning fall into t
2021-07-18 17:12:52
545
转载 git clone 出现fatal: unable to access ‘https://github ××× 类错误解决方法
将命令行里的http或https改为git重新执行如下图所示:
2021-06-15 20:23:33
3014
原创 Windy Gridworld: A simple implementation of Tabular RL(Sarsa and Q-learning)
ContentsDescription of the problemDescription of the problemThe windy gridworld is a simple example in the textbook. By programming to reproduce and solve this problem, I began to really understand Sarsa and Q-learning. Now I introduce this to you, and
2021-04-27 21:23:19
1597
2
原创 Q-learning、Expected Sarsa、Double Learning
Contents一级目录二级目录三级目录一级目录二级目录三级目录
2021-04-21 23:58:49
732
原创 Sarsa: One of classical algorithms of RL
Sarsa: One of classical algorithms of RLWhat is TD learning?On policy and Off-policyA brief introduction of SarsaA simple implementationWhat is TD learning?“TD learning” means “temporal-difference learning”, witch is a combination of Monte Carlo ideas(MC
2021-04-10 21:53:18
2262
13
原创 着手开篇
着手开篇写在前面专题随笔PTA练习Reinforcement Learning功能快捷键合理的创建标题,有助于目录的生成如何改变文本的样式插入链接与图片如何插入一段漂亮的代码片生成一个适合你的列表创建一个表格设定内容居中、居左、居右SmartyPants创建一个自定义列表如何创建一个注脚注释也是必不可少的KaTeX数学公式新的甘特图功能,丰富你的文章UML 图表FLowchart流程图导出与导入导出导入写在前面朋友你好!不论你是因为何种原因点开了这篇博客,都是我的荣幸!此时落笔的我是一名大三学生,通信
2021-04-10 21:37:46
169
1
空空如也
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人