Machine Learning (Notes on 李宏毅's Open Course) - Machine Learning and Deep Learning

This post covers the basics of machine learning and deep learning, including functions for regression, classification, and other tasks, with examples such as PM2.5 prediction and chess. It then walks through the procedure for finding such functions, including gradient descent, and the construction of linear and more sophisticated models. It also introduces activation functions such as the hard sigmoid and ReLU, and shows how a new model can be built by adding features. Finally, it describes how parameters are updated during optimization, with an example of iterative updates.


Machine Learning and Deep Learning

1. Functions

  1. Regression: e.g., predicting PM2.5 values
  2. Classification: e.g., chess (choosing the next move)
  3. Others: structured learning

2. The procedure for finding the function

  1. Functions with unknown parameters
  2. Define loss from training data
  3. Optimization
    1. gradient descent (a minimal sketch follows this list)
      A) randomly set an initial value $w^0$
      B) compute the gradient $\partial L/\partial w$
      C) update $w$ iteratively: $w^{t+1} = w^t - \eta \, \partial L/\partial w$
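
The sketch below runs steps A) to C) on a toy quadratic loss; the loss function, the initial value, and the learning rate $\eta$ are illustrative assumptions, not values from the lecture.

```python
# Minimal gradient descent on a single parameter w (toy example).

def loss(w):
    return (w - 3.0) ** 2      # assumed toy loss, minimized at w = 3

def grad(w):
    return 2.0 * (w - 3.0)     # dL/dw, derived by hand from the toy loss

w = 0.0                        # A) (pseudo-)randomly chosen initial value
eta = 0.1                      # learning rate, a hyperparameter
for _ in range(100):           # C) update w iteratively
    g = grad(w)                # B) compute dL/dw
    w -= eta * g               # move against the gradient
print(w)                       # approaches 3.0
```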

3. Models

  1. linear model
  2. sophisticated model: piecewise linear curves
    all piecewise linear curves = constant + sum of a set of (hard) sigmoids
    activation functions (sketched in code after this list):
    1. hard sigmoid: can be represented by the sum of two ReLUs

    2. rectified linear unit (ReLU): $\max(0, wx+b)$
    3. soft sigmoid: $\cfrac{c}{1+e^{-(wx+b)}} = c \cdot \mathrm{sigmoid}(wx+b)$
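
A minimal NumPy sketch of the three activations above; $w$, $b$, $c$ are free parameters, and the default values here are illustrative assumptions.

```python
import numpy as np

def relu(x, w=1.0, b=0.0):
    """Rectified linear unit: max(0, wx + b)."""
    return np.maximum(0.0, w * x + b)

def hard_sigmoid(x, w=1.0, b=0.0, c=1.0):
    """Hard sigmoid as the sum of two ReLUs (the second with weight -1):
    flat at 0, then a ramp of slope w, then flat at height c."""
    return relu(x, w, b) - relu(x, w, b - c)

def soft_sigmoid(x, w=1.0, b=0.0, c=1.0):
    """Soft sigmoid: c * sigmoid(wx + b)."""
    return c / (1.0 + np.exp(-(w * x + b)))

x = np.linspace(-3.0, 3.0, 7)
print(hard_sigmoid(x))   # 0 below the ramp, c above it
print(soft_sigmoid(x))   # smooth version of the same shape
```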
  3. Beyond piecewise curves
    approximate a continuous curve by a piecewise linear curve;
    to get a good approximation, we need sufficiently many pieces (see the sketch below)
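
A toy illustration of this claim, assuming $\sin(x)$ as the target curve: `np.interp` builds the piecewise linear curve through evenly spaced knots, and the worst-case error shrinks as the number of pieces grows.

```python
import numpy as np

# More pieces -> better piecewise-linear approximation (toy demo on sin).
x = np.linspace(0.0, np.pi, 1001)
for n_pieces in (2, 4, 8, 16):
    knots = np.linspace(0.0, np.pi, n_pieces + 1)   # piece boundaries
    approx = np.interp(x, knots, np.sin(knots))     # piecewise linear curve
    print(n_pieces, np.max(np.abs(approx - np.sin(x))))  # max error drops
```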
  4. New model: more features (a vectorized sketch follows this list)
    $y = b + \sum_{i}{c_i \cdot \mathrm{sigmoid}\left(\sum_{j}w_{ij}x_j+b_i\right)}$
    $r_i = W_i X + b_i, \quad a_i = \mathrm{sigmoid}(r_i)$
    $y = b + CA$
    optimization of the new model:
    $\varTheta = [W \; B \; C]$
    $gradient = \begin{bmatrix} \cfrac{\partial L}{\partial\varTheta_1} \\ \cfrac{\partial L}{\partial\varTheta_2} \\ \vdots \\ \cfrac{\partial L}{\partial\varTheta_n} \end{bmatrix}$
    $g = \nabla{L(\varTheta^0)}$
    $\begin{bmatrix}\varTheta_1^1 \\ \varTheta_2^1 \\ \vdots \\ \varTheta_n^1 \end{bmatrix} = \begin{bmatrix} \varTheta_1^0 \\ \varTheta_2^0 \\ \vdots \\ \varTheta_n^0 \end{bmatrix} - \eta \, g$
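
A minimal NumPy sketch of this model and one update $\varTheta^1 = \varTheta^0 - \eta g$; the sizes, the squared-error loss, and the finite-difference gradient (used instead of an analytic one, for brevity) are illustrative assumptions, not the lecture's implementation.

```python
import numpy as np

def sigmoid(r):
    return 1.0 / (1.0 + np.exp(-r))

n_feat, n_sig = 3, 4                          # j features, i sigmoids (assumed sizes)

def predict(theta, x):
    # Unpack the flat parameter vector theta = [W, B, C, b].
    W = theta[:n_sig * n_feat].reshape(n_sig, n_feat)
    B = theta[n_sig * n_feat : n_sig * n_feat + n_sig]
    C = theta[n_sig * n_feat + n_sig : -1]
    b = theta[-1]
    a = sigmoid(W @ x + B)                    # r_i = W_i x + b_i, a_i = sigmoid(r_i)
    return b + C @ a                          # y = b + C A

def loss(theta, x, y_true):
    return (predict(theta, x) - y_true) ** 2  # assumed squared-error loss

def numerical_grad(theta, x, y_true, eps=1e-6):
    # g = gradient of L at theta, approximated by central differences.
    g = np.zeros_like(theta)
    for i in range(theta.size):
        d = np.zeros_like(theta)
        d[i] = eps
        g[i] = (loss(theta + d, x, y_true) - loss(theta - d, x, y_true)) / (2 * eps)
    return g

rng = np.random.default_rng(0)
theta = rng.normal(size=n_sig * n_feat + 2 * n_sig + 1)   # Theta^0, random init
x, y_true = rng.normal(size=n_feat), 1.0                  # one toy training example
eta = 0.1
print(loss(theta, x, y_true))
theta = theta - eta * numerical_grad(theta, x, y_true)    # Theta^1 = Theta^0 - eta * g
print(loss(theta, x, y_true))                             # loss should decrease
```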
  5. epoch | batch | update | iteration (counted in the sketch after this list)
    number of samples: 1000
    batch size: 10
    updates (iterations) per epoch: 1000 / 10 = 100
    epochs: 1
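
The loop below just counts updates to make the terminology concrete; the numbers match the example above.

```python
# One epoch = one pass over all samples; one update (iteration) per batch.
n_samples, batch_size, n_epochs = 1000, 10, 1

updates = 0
for epoch in range(n_epochs):
    for start in range(0, n_samples, batch_size):
        # a real trainer would compute the gradient on this batch here
        updates += 1
print(updates)   # -> 100 updates in 1 epoch
```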

video link: https://speech.ee.ntu.edu.tw/~hylee/ml/2021-spring.html
