Machine Learning by Andrew Ng
💡 Study notes for Andrew Ng's Machine Learning course, Week 5
🐠 My collected study notes: omnibus edition
✓ Course page: Stanford Machine Learning
🍭 Reference resources
Outline
Cost Function and Back-propagation
Notation
- $L$ = total number of layers in the network
- $s_l$ = number of units (not counting the bias unit) in layer $l$
- $K$ = number of output units/classes
We denote $h_\Theta(x)_k$ as the hypothesis that results in the $k$-th output: given an input $x$, $h_\Theta(x)_k$ is the $k$-th element of the output.

1.Cost Function
The cost functions for regularized logistic regression and for a neural network are compared below.

The cost function for regularized logistic regression was

$$J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log h_\theta(x^{(i)}) + \left(1 - y^{(i)}\right) \log\left(1 - h_\theta(x^{(i)})\right) \right] + \frac{\lambda}{2m} \sum_{j=1}^{n} \theta_j^2$$

For a neural network, the cost function generalizes this, summing the logistic cost over all $K$ output units and regularizing over all $L-1$ parameter matrices:

$$J(\Theta) = -\frac{1}{m} \sum_{i=1}^{m} \sum_{k=1}^{K} \left[ y_k^{(i)} \log\left(h_\Theta(x^{(i)})\right)_k + \left(1 - y_k^{(i)}\right) \log\left(1 - \left(h_\Theta(x^{(i)})\right)_k\right) \right] + \frac{\lambda}{2m} \sum_{l=1}^{L-1} \sum_{i=1}^{s_l} \sum_{j=1}^{s_{l+1}} \left(\Theta_{j,i}^{(l)}\right)^2$$
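To make the neural-network cost concrete, here is a minimal NumPy sketch for a 3-layer network; the function names, the array shapes, and the sigmoid activation are assumptions chosen for illustration, not code from the course:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def nn_cost(Theta1, Theta2, X, Y, lam):
    """Regularized cost J(Theta) for an assumed 3-layer network.

    X: (m, n) inputs; Y: (m, K) one-hot labels.
    Theta1: (s2, n + 1); Theta2: (K, s2 + 1).
    """
    m = X.shape[0]
    a1 = np.hstack([np.ones((m, 1)), X])        # add bias unit
    a2 = np.hstack([np.ones((m, 1)), sigmoid(a1 @ Theta1.T)])
    h = sigmoid(a2 @ Theta2.T)                  # h_Theta(x), one row per example
    # cross-entropy term: sum over the m examples and the K output units
    J = -np.sum(Y * np.log(h) + (1 - Y) * np.log(1 - h)) / m
    # regularization term: skip the bias column of each Theta
    J += lam / (2 * m) * (np.sum(Theta1[:, 1:] ** 2) + np.sum(Theta2[:, 1:] ** 2))
    return J
```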

Here, we define $\delta_j^{(l)}$ as the error of $a_j^{(l)}$, the activation of unit $j$ in layer $l$.

2.Back-propagation Algorithm
As with other machine-learning algorithms, our goal is to minimize the cost function.

Therefore, we need to compute
- $J(\Theta)$
- $\frac{\partial}{\partial \Theta_{i,j}^{(l)}} J(\Theta)$
The algorithm to compute $\frac{\partial}{\partial \Theta_{i,j}^{(l)}} J(\Theta)$ is as below.

The whole algorithm is as below
step1 and step2, forward propagation: set $a^{(1)} = x^{(i)}$ and compute the activations $a^{(l)}$ for $l = 2, 3, \ldots, L$


step3, compute the error of the output layer

$$\delta^{(L)} = a^{(L)} - y^{(i)}$$

step4, compute $\delta^{(L-1)}, \delta^{(L-2)}, \ldots, \delta^{(2)}$ and so on (there is no $\delta^{(1)}$, since the input layer has no error), using

$$\delta^{(l)} = \left( \left(\Theta^{(l)}\right)^T \delta^{(l+1)} \right) \mathbin{.*} g'\left(z^{(l)}\right), \qquad g'\left(z^{(l)}\right) = a^{(l)} \mathbin{.*} \left(1 - a^{(l)}\right)$$



step5, compute $\frac{\partial}{\partial \Theta_{i,j}^{(l)}} J(\Theta)$ by accumulating the errors over the $m$ training examples,

$$\Delta_{i,j}^{(l)} := \Delta_{i,j}^{(l)} + a_j^{(l)} \delta_i^{(l+1)}$$

and then forming the regularized gradients

$$D_{i,j}^{(l)} = \frac{1}{m} \Delta_{i,j}^{(l)} + \frac{\lambda}{m} \Theta_{i,j}^{(l)} \quad (j \neq 0), \qquad D_{i,j}^{(l)} = \frac{1}{m} \Delta_{i,j}^{(l)} \quad (j = 0)$$

so that $D_{i,j}^{(l)} = \frac{\partial}{\partial \Theta_{i,j}^{(l)}} J(\Theta)$.
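The five steps above can be vectorized over all $m$ examples at once. Below is a minimal NumPy sketch that pairs with the hypothetical `nn_cost` above; again, the names and shapes are assumptions, and the lecture's loop over examples is replaced by matrix operations:

```python
def sigmoid_grad(z):
    g = sigmoid(z)
    return g * (1 - g)                            # g'(z) = a .* (1 - a)

def nn_gradients(Theta1, Theta2, X, Y, lam):
    """Backpropagation gradients D1, D2 for the assumed 3-layer network."""
    m = X.shape[0]
    # steps 1-2: forward propagation, keeping the pre-activations z
    a1 = np.hstack([np.ones((m, 1)), X])
    z2 = a1 @ Theta1.T
    a2 = np.hstack([np.ones((m, 1)), sigmoid(z2)])
    a3 = sigmoid(a2 @ Theta2.T)
    # step 3: output-layer error, delta^(L) = a^(L) - y
    d3 = a3 - Y
    # step 4: propagate the error back, dropping the bias column
    d2 = (d3 @ Theta2)[:, 1:] * sigmoid_grad(z2)
    # step 5: Delta accumulation (here done for all m examples at once) ...
    D1 = d2.T @ a1 / m
    D2 = d3.T @ a2 / m
    # ... plus regularization, which skips the bias column (j = 0)
    D1[:, 1:] += lam / m * Theta1[:, 1:]
    D2[:, 1:] += lam / m * Theta2[:, 1:]
    return D1, D2
```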


3.Back-propagation Intuition

Intuitively, $\delta_j^{(l)}$ measures how much the cost changes when the weighted input $z_j^{(l)}$ of unit $j$ in layer $l$ is perturbed; formally, for the unregularized single-example cost, $\delta_j^{(l)} = \frac{\partial}{\partial z_j^{(l)}} \operatorname{cost}(i)$.
Back-propagation in Practice
1.Implementation Note: Unrolling Parameters
Advanced optimizers expect the parameters and the gradient as single long vectors, so unroll the matrices $\Theta^{(1)}, \Theta^{(2)}, \ldots$ into one vector before passing them in, and reshape them back into matrices inside the cost function; use gradient checking to assure everything goes well.
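As a sketch of what unrolling might look like in NumPy (the helper names and the 3-layer shapes are assumptions carried over from the earlier sketches):

```python
def unroll(Theta1, Theta2):
    """Flatten the parameter matrices into one long vector for the optimizer."""
    return np.concatenate([Theta1.ravel(), Theta2.ravel()])

def reshape_params(theta_vec, n, s2, K):
    """Recover the matrices from the vector inside the cost function."""
    Theta1 = theta_vec[: s2 * (n + 1)].reshape(s2, n + 1)
    Theta2 = theta_vec[s2 * (n + 1):].reshape(K, s2 + 1)
    return Theta1, Theta2
```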


2.Gradient Checking

To verify a backpropagation implementation, approximate each partial derivative numerically with a two-sided difference,

$$\frac{\partial}{\partial \Theta_j} J(\Theta) \approx \frac{J(\Theta_1, \ldots, \Theta_j + \epsilon, \ldots, \Theta_n) - J(\Theta_1, \ldots, \Theta_j - \epsilon, \ldots, \Theta_n)}{2\epsilon}$$

with a small $\epsilon$ (e.g. $\epsilon = 10^{-4}$), and check that it matches the backpropagation gradient. Once verified, turn gradient checking off: it is far too slow to run during training.
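A minimal sketch of this check, assuming `J` is a function of the unrolled parameter vector (as in the hypothetical helpers above):

```python
def numerical_gradient(J, theta, eps=1e-4):
    """Two-sided finite-difference approximation of dJ/dtheta."""
    grad = np.zeros_like(theta, dtype=float)
    for i in range(theta.size):
        perturb = np.zeros_like(theta, dtype=float)
        perturb[i] = eps                          # perturb one parameter at a time
        grad[i] = (J(theta + perturb) - J(theta - perturb)) / (2 * eps)
    return grad
```

The result should agree with the unrolled backpropagation gradient to several decimal places; since it costs two full cost evaluations per parameter, it is only practical as a one-off check.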



3.Random Initialization

Initializing all weights to zero fails: every unit in a layer then computes the same function and receives the same updates, so the weights stay identical (the symmetry problem). Instead, initialize each $\Theta_{i,j}^{(l)}$ to a random value in $[-\epsilon_{\text{init}}, \epsilon_{\text{init}}]$.
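A minimal sketch, assuming the same layer-size variables as the earlier snippets (the $\epsilon_{\text{init}} = 0.12$ default is a common choice, not a fixed rule):

```python
def rand_initialize_weights(l_in, l_out, eps_init=0.12):
    """Uniform weights in [-eps_init, eps_init]; shape includes the bias column."""
    return np.random.rand(l_out, l_in + 1) * 2 * eps_init - eps_init
```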



4.Putting It Together

6 steps to train a network:
1. Randomly initialize the weights
2. Implement forward propagation to get $h_\Theta(x^{(i)})$ for any $x^{(i)}$
3. Implement the cost function $J(\Theta)$
4. Implement backpropagation to compute the partial derivatives $\frac{\partial}{\partial \Theta_{i,j}^{(l)}} J(\Theta)$
5. Use gradient checking to confirm that backpropagation works, then disable it
6. Use gradient descent or a built-in optimization function to minimize $J(\Theta)$ with respect to $\Theta$
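One way the pieces sketched above might be wired together, using scipy's L-BFGS-B in place of the course's fminunc (all helper names here are the hypothetical ones from the earlier sketches):

```python
from scipy.optimize import minimize

def cost_and_grad(theta_vec, X, Y, n, s2, K, lam):
    """Return (cost, unrolled gradient), the form scipy expects with jac=True."""
    Theta1, Theta2 = reshape_params(theta_vec, n, s2, K)
    J = nn_cost(Theta1, Theta2, X, Y, lam)
    D1, D2 = nn_gradients(Theta1, Theta2, X, Y, lam)
    return J, unroll(D1, D2)

# theta0 = unroll(rand_initialize_weights(n, s2), rand_initialize_weights(s2, K))
# result = minimize(cost_and_grad, theta0, args=(X, Y, n, s2, K, lam),
#                   jac=True, method='L-BFGS-B', options={'maxiter': 200})
```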


Ideally, we want $h_\Theta(x^{(i)}) \approx y^{(i)}$. But remember that $J(\Theta)$ is not a convex function, and thus we can end up in a local minimum instead.

Application of Neural Networks
1.Autonomous Driving
skip.
Review
skip. Further reading