An introduction to Linear Regression

最新推荐文章于 2022-06-01 13:20:28 发布

原创

最新推荐文章于 2022-06-01 13:20:28 发布 · 567 阅读

·

1

·

CC 4.0 BY-SA版权

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

文章标签：

本文介绍了线性回归的基本概念，包括线性回归的意图、最小均方误差（LMS）算法、梯度下降法以及在偏差与方差之间的权衡。还探讨了正则化在L1（Lasso）和L2（岭回归）中的作用，以及批量梯度下降和随机梯度下降的区别。此外，文章提到了线性回归在概率和线性代数中的解释，以及如何从几何角度理解线性回归的投影。最后，文章简要讨论了逻辑回归和广义线性模型的概念。

An Introduction to Linear Regression

Intent

supervised learning:
- regression (continious)
- classfication (discrete)

$h(\theta)=\displaystyle\sum_{i=0}^{n}\theta_iX_i=\theta^{T}X$

For historical reasons, this function h is called a hypothesis.

$\theta^{'}s \quad\text{is the parameters}$

$(x^{(i)},y^{(i)})\qquad(training \quad example)$

$\{(x^{(i)},y^{(i)}); i=1,...,m\}\qquad(training \quad set)$

[1]

要让 $h(\theta)$ 接近于trainset中的 $y^{(i)}$

cost function: $J(\theta) =\frac{1}{2}\displaystyle\sum_{i=1}^{m}(h_{\theta}(x^{(i)} - y^{(i)})^2$

为了让train出的model能stable，需要在bias和variance中tradeoff。regularized regression 会使用L1(lasso)和L2(ridge)作为penalty。其中L1相当于feature selection(parameter变为0)。而L2主要是减少variance。cross validation和bootstrap也是常用的trade bias和variance的方法。

LMS

gradient descent

$\theta_j = \theta_j - \alpha\frac{\partial }{\partial \theta_j}J(\theta)$
$\alpha$ is learning rate

∂∂θj

最低0.47元/天解锁文章

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。