什么是反向传播?什么是梯度下降?有哪几种梯度下降算法?梯度下降和学习率有什么关系?为什么要归一化到[0,1]或[-1,1]
https://blog.youkuaiyun.com/u012526120/article/details/49183279
http://www.sohu.com/a/131923387_473283
https://www.cnblogs.com/volcao/p/9144362.html
https://www.cnblogs.com/bnuvincent/p/9612686.html
https://blog.youkuaiyun.com/weixin_38208741/article/details/79983310
https://segmentfault.com/a/1190000012645225?utm_source=tag-newest
https://blog.youkuaiyun.com/Miss_yuki/article/details/80618813
激活函数
keras 后台为theano 通道顺序为[batchsize,波段, 行,列];后台tensorflow通道顺序为[num,行,列,波段],但是tensorflow可以指定通道的顺序:DATA_FORMAT = 'channels_first',即通道维靠前。
https://blog.youkuaiyun.com/qq_39622065/article/details/81228915
待完整...