A Closer Look at "A Neural Network in 11 lines of Python"

The toy program “A Neural Network in 11 lines of Python” is well known among machine learning beginners. I wonder how many people have actually worked through it, because the author's presentation of the derivatives is a bit unusual. Let’s go through it in detail, especially the gradient back-propagation part.

First of all, the code:

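import numpy as np

# Inputs: four samples, three features each (the third column is always 1)
X = np.array([ [0,0,1],[0,1,1],[1,0,1],[1,1,1] ])
# Targets: XOR of the first two input columns
y = np.array([[0,1,1,0]]).T
# Weights drawn uniformly from [-1, 1)
syn0 = 2*np.random.random((3,4)) - 1
syn1 = 2*np.random.random((4,1)) - 1
for j in range(60000):  # the original Python 2 code uses xrange
    # Forward pass
    l1 = 1/(1+np.exp(-(np.dot(X,syn0))))
    l2 = 1/(1+np.exp(-(np.dot(l1,syn1))))
    # Backward pass
    l2_delta = (y - l2)*(l2*(1-l2))
    l1_delta = l2_delta.dot(syn1.T) * (l1 * (1-l1))
    # Weight update
    syn1 += l1.T.dot(l2_delta)
    syn0 += X.T.dot(l1_delta)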

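It’s a simple MLP with one hidden layer. syn0 holds the weights from the input layer to the hidden layer (3×4), and syn1 holds the weights from the hidden layer to the output layer (4×1).
X is the input, l1 is the activation of the hidden layer, and l2 is the activation of the output layer (a single node).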

First, the forward propagation:
input -> hidden: $l_1 = \mathrm{sigmoid}(X \cdot \mathrm{syn0})$
hidden -> output: $l_2 = \mathrm{sigmoid}(l_1 \cdot \mathrm{syn1})$
MSE loss: $L = \frac{1}{2}(y - l_2)^2$ (we will talk about which loss this code actually uses later)
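The forward pass and loss above can be sketched in a few lines of NumPy. This is only an illustration: the helper names `sigmoid`, `forward` and `mse_loss`, the fixed random seed, and the choice to sum the loss over the four samples are assumptions made here, not part of the original 11 lines.

```python
import numpy as np

def sigmoid(z):
    # Element-wise logistic function
    return 1.0 / (1.0 + np.exp(-z))

def forward(X, syn0, syn1):
    # input -> hidden, then hidden -> output
    l1 = sigmoid(X.dot(syn0))
    l2 = sigmoid(l1.dot(syn1))
    return l1, l2

def mse_loss(y, l2):
    # L = 1/2 * (y - l2)^2, summed over all samples (assumed reduction)
    return 0.5 * np.sum((y - l2) ** 2)

X = np.array([[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]])
y = np.array([[0, 1, 1, 0]]).T

rng = np.random.default_rng(0)      # fixed seed, only for reproducibility
syn0 = 2 * rng.random((3, 4)) - 1   # input -> hidden weights
syn1 = 2 * rng.random((4, 1)) - 1   # hidden -> output weights

l1, l2 = forward(X, syn0, syn1)
loss = mse_loss(y, l2)
print(l1.shape, l2.shape, loss)
```

With four samples, l1 has shape (4, 4) and l2 has shape (4, 1), one row per sample.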

Then, back propagation:
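We want to know $\frac{\partial L}{\partial \mathrm{syn1}}$ and $\frac{\partial L}{\partial \mathrm{syn0}}$.
For $\frac{\partial L}{\partial \mathrm{syn1}}$:
$\frac{\partial L}{\partial \mathrm{syn1}} = \frac{\partial L}{\partial l_2} \cdot \frac{\partial l_2}{\partial \mathrm{syn1}}$, in which
$\frac{\partial L}{\partial l_2} = l_2 - y$
$\frac{\partial l_2}{\partial \mathrm{syn1}} = \frac{\partial l_2}{\partial (l_1 \cdot \mathrm{syn1})} \cdot \frac{\partial (l_1 \cdot \mathrm{syn1})}{\partial \mathrm{syn1}} = l_2(1 - l_2) \cdot l_1$
Multiplying these two parts, $\frac{\partial L}{\partial \mathrm{syn1}} = (l_2 - y) \cdot l_2(1 - l_2) \cdot l_1$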
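One way to sanity-check the syn1 derivative is to compare it with a finite-difference estimate. The sketch below assumes the loss is summed over the four samples and writes the analytic gradient in matrix form as `l1.T.dot((l2 - y) * l2 * (1 - l2))`; the names `loss_fn`, `analytic_syn1`, `numeric_syn1` and the step size `eps` are illustrative choices, not taken from the original code.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss_fn(X, y, syn0, syn1):
    # Half squared error, summed over samples (assumed reduction)
    l1 = sigmoid(X.dot(syn0))
    l2 = sigmoid(l1.dot(syn1))
    return 0.5 * np.sum((y - l2) ** 2)

X = np.array([[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]])
y = np.array([[0, 1, 1, 0]]).T
rng = np.random.default_rng(1)
syn0 = 2 * rng.random((3, 4)) - 1
syn1 = 2 * rng.random((4, 1)) - 1

# Analytic gradient: (l2 - y) * l2 * (1 - l2) * l1, arranged to match syn1's shape
l1 = sigmoid(X.dot(syn0))
l2 = sigmoid(l1.dot(syn1))
analytic_syn1 = l1.T.dot((l2 - y) * l2 * (1 - l2))

# Central finite differences, one weight at a time
eps = 1e-6
numeric_syn1 = np.zeros_like(syn1)
for i in range(syn1.shape[0]):
    for k in range(syn1.shape[1]):
        plus, minus = syn1.copy(), syn1.copy()
        plus[i, k] += eps
        minus[i, k] -= eps
        numeric_syn1[i, k] = (loss_fn(X, y, syn0, plus)
                              - loss_fn(X, y, syn0, minus)) / (2 * eps)

max_diff_syn1 = np.max(np.abs(analytic_syn1 - numeric_syn1))
print(max_diff_syn1)  # should be tiny if the derivation is right
```

Note that `-analytic_syn1` is exactly the quantity the original code adds to syn1, namely `l1.T.dot(l2_delta)` with `l2_delta = (y - l2)*(l2*(1-l2))`.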

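And for $\frac{\partial L}{\partial \mathrm{syn0}}$:
$\frac{\partial L}{\partial \mathrm{syn0}} = \frac{\partial L}{\partial l_2} \cdot \frac{\partial l_2}{\partial l_1} \cdot \frac{\partial l_1}{\partial \mathrm{syn0}}$
$\frac{\partial L}{\partial l_2} = l_2 - y$, as derived above
$\frac{\partial l_2}{\partial l_1} = \frac{\partial l_2}{\partial (l_1 \cdot \mathrm{syn1})} \cdot \frac{\partial (l_1 \cdot \mathrm{syn1})}{\partial l_1} = l_2(1 - l_2) \cdot \mathrm{syn1}$
$\frac{\partial l_1}{\partial \mathrm{syn0}} = \frac{\partial l_1}{\partial (X \cdot \mathrm{syn0})} \cdot \frac{\partial (X \cdot \mathrm{syn0})}{\partial \mathrm{syn0}} = l_1(1 - l_1) \cdot X$
Multiplying these three parts together, $\frac{\partial L}{\partial \mathrm{syn0}} = (l_2 - y) \cdot l_2(1 - l_2) \cdot \mathrm{syn1} \cdot l_1(1 - l_1) \cdot X$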
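The syn0 derivative can be checked numerically in the same spirit. In this sketch the analytic gradient is arranged in matrix form the way the code's `l1_delta` and `X.T.dot(...)` arrange it; the helper `numeric_grad`, the seed, and the summed loss are assumptions made for the illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss_fn(X, y, syn0, syn1):
    # Half squared error, summed over samples (assumed reduction)
    l2 = sigmoid(sigmoid(X.dot(syn0)).dot(syn1))
    return 0.5 * np.sum((y - l2) ** 2)

def numeric_grad(f, w, eps=1e-6):
    # Central finite-difference estimate of df/dw, entry by entry
    grad = np.zeros_like(w)
    for idx in np.ndindex(*w.shape):
        w_plus, w_minus = w.copy(), w.copy()
        w_plus[idx] += eps
        w_minus[idx] -= eps
        grad[idx] = (f(w_plus) - f(w_minus)) / (2 * eps)
    return grad

X = np.array([[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]])
y = np.array([[0, 1, 1, 0]]).T
rng = np.random.default_rng(2)
syn0 = 2 * rng.random((3, 4)) - 1
syn1 = 2 * rng.random((4, 1)) - 1

l1 = sigmoid(X.dot(syn0))
l2 = sigmoid(l1.dot(syn1))

# (l2 - y) * l2(1 - l2) * syn1 * l1(1 - l1) * X, arranged to match syn0's shape
output_term = (l2 - y) * l2 * (1 - l2)                 # shape (4, 1)
hidden_term = output_term.dot(syn1.T) * l1 * (1 - l1)  # shape (4, 4)
analytic_syn0 = X.T.dot(hidden_term)                   # shape (3, 4)

numeric_syn0 = numeric_grad(lambda w: loss_fn(X, y, w, syn1), syn0)
max_diff_syn0 = np.max(np.abs(analytic_syn0 - numeric_syn0))
print(max_diff_syn0)  # should be tiny if the derivation is right
```

Here `hidden_term` is the negative of the code's `l1_delta`, so `syn0 += X.T.dot(l1_delta)` moves syn0 against this gradient.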

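These derivatives all match the code, apart from the sign and the transposes needed to make the matrix shapes line up: the code computes y - l2 instead of l2 - y and then adds the result to the weights, which is the same as subtracting the gradient with a learning rate of 1. So I believe this piece of code is in effect using MSE as the cost function, although the author does not make this explicit.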
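As a point of contrast (this is an added illustration, not something the original post works out), suppose the loss were binary cross-entropy instead of MSE. Then the sigmoid derivative at the output would cancel out of the syn1 gradient:

```latex
L_{\mathrm{CE}} = -\bigl[\, y \log l_2 + (1 - y)\log(1 - l_2) \,\bigr]

\frac{\partial L_{\mathrm{CE}}}{\partial l_2}
  = -\frac{y}{l_2} + \frac{1 - y}{1 - l_2}
  = \frac{l_2 - y}{l_2(1 - l_2)}

\frac{\partial L_{\mathrm{CE}}}{\partial \mathrm{syn1}}
  = \frac{l_2 - y}{l_2(1 - l_2)} \cdot l_2(1 - l_2) \cdot l_1
  = (l_2 - y) \cdot l_1
```

The code's l2_delta keeps the factor l2*(1-l2), which is what the MSE derivation predicts and not what cross-entropy would give.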


Reference:
A Neural Network in 11 lines of Python (Part 1)
