PRML 读书笔记-Chapter1

最新推荐文章于 2021-07-11 14:21:58 发布

原创最新推荐文章于 2021-07-11 14:21:58 发布 · 397 阅读

0 ·

CC 4.0 BY-SA版权

文章标签：

#读书笔记 #PRML

机器学习专栏收录该内容

8 篇文章

订阅专栏

强化学习是一种通过在给定环境中采取行动以最大化奖励的智能算法。它涉及在探索未知行动效果与利用已知高收益行动之间寻找平衡。线性模型用于拟合数据，最小化误差函数以确定系数。RMS（根均方）是衡量函数与训练数据点之间差异的指标。

reinforcement learning

Finding suitable actions to take in a given situation in order to maximize a reward.
A general feature of reinforcement learning is the trade-off between exploration,in which the system tries out new kinds of actions to see how effective they are,and exploitation, in which the system makes use of actions that are known to yield a high reward.
Too strong a focus on either exploration or exploitation will yield poor results.

linear models

Functions,such as the polynomial,which are linear in the unknown parameters have important properties and are called linear model.
for instance:
y(x,W) = w0 + w1*x + w2*x2 + w3*x3 + … + wm*xm

Error function

The values of the coefficients will be determined by fitting the polynomial to the training data.This can be done by minimization an error function the measures the misfit between the function y(x,W),for any given value of W, and the training set data points.
##Root - Mean -Square##
RMS,defined by