Reinforcement Learning: An Introduction (2nd edition) Reading Notes
Chap 2
S 2.1
Action value = the expected (mean) reward given that the action is selected: q*(a) = E[R_t | A_t = a].
S 2.2
Action-value methods = estimating the values of actions + using the estimates to make action selection decisions.
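Sample-average method -> estimate each action's value as the average of the rewards actually received when that action was selected.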
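In the book's Section 2.2, the sample-average estimate of an action's value at step t is the sum of the rewards received when that action was taken before t, divided by the number of times it was taken:

$$Q_t(a) = \frac{\sum_{i=1}^{t-1} R_i \cdot \mathbb{1}_{A_i = a}}{\sum_{i=1}^{t-1} \mathbb{1}_{A_i = a}}$$

If the denominator is zero, Q_t(a) is set to some default value such as 0. Below is a minimal Python sketch of this idea together with greedy selection (pick the action with the highest current estimate). The class name `SampleAverage`, its method names, and the `default` parameter are illustrative assumptions, not something taken from these notes or from the book.

```python
import random


class SampleAverage:
    """Sample-average action-value estimates for a k-armed bandit (illustrative sketch)."""

    def __init__(self, k, default=0.0):
        self.sums = [0.0] * k    # total reward received for each action
        self.counts = [0] * k    # number of times each action has been taken
        self.default = default   # estimate returned for an action never taken (assumed 0)

    def update(self, action, reward):
        """Record that `action` was taken and yielded `reward`."""
        self.sums[action] += reward
        self.counts[action] += 1

    def value(self, action):
        """Return Q(a): the mean of the rewards observed for `action` so far."""
        n = self.counts[action]
        return self.sums[action] / n if n > 0 else self.default

    def greedy_action(self):
        """Return an action with the highest estimate, breaking ties at random."""
        values = [self.value(a) for a in range(len(self.counts))]
        best = max(values)
        return random.choice([a for a, v in enumerate(values) if v == best])
```

For example, after `est = SampleAverage(k=3)` and the updates `est.update(0, 1.0)`, `est.update(0, 3.0)`, `est.update(1, 5.0)`, the estimate for action 0 is the mean of its two rewards, action 2 keeps the default, and `est.greedy_action()` picks action 1. Storing running sums and counts keeps the sketch close to the formula; an incremental update rule would avoid storing the sums but is a different formulation.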
2020-10-12 10:26:30