reinforcement learning
Finding suitable actions to take in a given situation in order to maximize a reward.
A general feature of reinforcement learning is the trade-off between exploration,in which the system tries out new kinds of actions to see how effective they are,and exploitation, in which the system makes use of actions that are known to yield a high reward.
Too strong a focus on either exploration or exploitation will yield poor results.
linear models
Functions,such as the polynomial,which are linear in the unknown parameters have important properties and are called linear model.
for instance:
y(x,W) = w0 + w1*x + w2*x2 + w3*x3 + … + wm*xm
Error function
The values of the coefficients will be determined by fitting the polynomial to the training data.This can be done by minimization an error function the measures the misfit between the function y(x,W),for any given value of W, and the training set data points.
##Root - Mean -Square##
RMS,defined by
强化学习是一种通过在给定环境中采取行动以最大化奖励的智能算法。它涉及在探索未知行动效果与利用已知高收益行动之间寻找平衡。线性模型用于拟合数据,最小化误差函数以确定系数。RMS(根均方)是衡量函数与训练数据点之间差异的指标。
2578

被折叠的 条评论
为什么被折叠?



