Authors: G. E. Hinton et al.
Date: 2006
Type: article
Source: Neural Computation
Verdict: the eve of Deep Learning
Paper link: http://www.cs.toronto.edu/~hinton/absps/ncfast.pdf
The paper is quite "hardcore": dense with algorithmic explanations, mathematical formulas, and terminology. The authors are also fond of describing the model with biological vocabulary, e.g. synapse strength and mind, which made reading confusing (head-scratching). It assumes background in RBMs (restricted Boltzmann machines) and the wake-sleep algorithm, which I happen to lack, so reading was difficult; these notes are only a brief outline with excerpts.
1 Purpose
- To design a generative model that surpasses discriminative models.
- To train a deep, densely connected belief network efficiently.
Explaining-away effects make inference difficult in densely connected belief nets that have many hidden layers.
- challenges
- It is difficult to infer the conditional distribution of the hidden activities given a data vector.
- Variational methods use simple approximations to the true conditional distribution, but the approximation may be poor, especially at the deepest hidden layer, where the variational prior assumes independence.
- Variational learning also requires all of the parameters to be learned together, which makes the learning time scale poorly as the number of parameters increases.
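The explaining-away effect mentioned above can be seen in a tiny numerical sketch (the distributions and numbers below are made up for illustration, not from the paper): two independent binary causes A and B share an observed effect E. Before observing E, A and B are independent; after observing E, learning that B is active lowers the posterior on A.

```python
# Two independent binary causes A, B of an effect E, noisy-OR style:
# E almost never fires without a cause (a small "leak" of 0.01).
def p_e_given(a, b):
    return 1.0 if (a or b) else 0.01

p_a, p_b = 0.1, 0.1  # independent priors on the causes

def prior(a, b):
    return (p_a if a else 1 - p_a) * (p_b if b else 1 - p_b)

# P(A=1 | E=1): marginalize over the unknown cause B.
z = sum(prior(a, b) * p_e_given(a, b) for a in (0, 1) for b in (0, 1))
p_a_given_e = sum(prior(1, b) * p_e_given(1, b) for b in (0, 1)) / z

# P(A=1 | E=1, B=1): once B is known to be on, it "explains away" A,
# and the posterior on A falls back to its prior.
z_b = sum(prior(a, 1) * p_e_given(a, 1) for a in (0, 1))
p_a_given_e_b = prior(1, 1) * p_e_given(1, 1) / z_b

print(round(p_a_given_e, 3), round(p_a_given_e_b, 3))  # ≈ 0.505 and 0.1
```

Conditioning on E couples the otherwise independent causes, which is exactly what makes exact posterior inference hard when many hidden units share children.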
2 The previous work
- Back-propagation nets
- Support vector machines
3 The proposed method
- The authors designed a hybrid model whose top two hidden layers form an undirected associative memory, while the remaining hidden layers form a directed acyclic graph that converts the representations in the associative memory into observable variables, such as the pixels of an image.
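The building block of this hybrid model is the RBM, trained greedily layer by layer. Below is a minimal sketch of one contrastive-divergence (CD-1) update for a single RBM layer, assuming binary units; variable names (`W`, `b_vis`, `b_hid`, `lr`) and hyperparameters are my own illustrative choices, not the paper's.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_step(v0, W, b_vis, b_hid, lr=0.1):
    """One CD-1 update in place on a batch of binary visible vectors."""
    # Up pass: hidden probabilities given the data, then a binary sample.
    p_h0 = sigmoid(v0 @ W + b_hid)
    h0 = (rng.random(p_h0.shape) < p_h0).astype(float)
    # Down pass: reconstruct the visible units from the hidden sample.
    p_v1 = sigmoid(h0 @ W.T + b_vis)
    # Up pass again on the reconstruction.
    p_h1 = sigmoid(p_v1 @ W + b_hid)
    # Approximate log-likelihood gradient: data stats minus reconstruction stats.
    W += lr * (v0.T @ p_h0 - p_v1.T @ p_h1) / len(v0)
    b_vis += lr * (v0 - p_v1).mean(axis=0)
    b_hid += lr * (p_h0 - p_h1).mean(axis=0)
    return ((v0 - p_v1) ** 2).mean()  # reconstruction error, for monitoring

# Toy usage: a fixed batch of 16 random 6-bit vectors, 4 hidden units.
v = rng.integers(0, 2, size=(16, 6)).astype(float)
W = 0.01 * rng.standard_normal((6, 4))
b_v, b_h = np.zeros(6), np.zeros(4)
errs = [cd1_step(v, W, b_v, b_h) for _ in range(50)]
```

Stacking such layers (each trained on the hidden activities of the one below) yields the directed lower layers, with the top two layers kept undirected as the associative memory.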
my understanding