Paper Info:Gradient-Based Learning Applied to Document Recognition
YANN LECUN, MEMBER, IEEE, L´EONBOTTOU, YOSHUA BENGIO, AND PATRICK HAFFNER
I. Introduction
II. CNN for isolatedcharacter recognition
Features of Tradition Pattern Recognition:
1. hand-designedfeature extractor
2. trainable classifier
Problem: Images too large;topology of input (space or temporal correlations) ignored
Solution:
Using Convolutional Networks
Features: 1)local receptive fields 2)shared weight 3)spatial or temporalsubsampling(Once a feature has been detected, location less important)->LeNet-5
III. Results andcomparison with other methods
IV. Multimodule systems and graph transformer networks(GTN)
V. Multiple object recognition: HOS (The first method for character string recognition)
Isolated characters TO strings of characters
optimizing a global criterion
A now classical method for segmentation andrecognition—HOS
Good candidate locations for cuts can be found by locating minima in the vertical projection profile, or minima of the distance between the upper and lower contours of the word.
Structure of the Process
Question: What's the meaning of Interpretation graph?
Definitions in the paper:
The goal of the recognitiontransformer is to generate a graph, called the interpretation graph orrecognition graph that contains all the possible interpretations for all thepossible segmentations of the input.
The interpretation graph hasalmost the same structure as the segmentation graph, except that each arc isreplaced by a set of arcs from and to the same node.
VI. Global training for graph transformer networks
?global training? The whole process?
1.Viterbi training 2.discriminative Viterbitraining 3.Forward training 4.discriminative forward training 5.remarks
VII. Multiple object recognition: Space displacement neural network
No segmentation needed
Problems: Expensive; neighbors; notsize-normalized
Solution:Convolutional Networks- A replicatedconvolutional
network, also called an SDNN
A. Interpreting theOutput of an SDNN with a GTN
B. Experiments withSDNN
C. Global Training ofSDNN
D. Object Detection and Spotting with SDNN
VIII. Graph transformernetworks and transducers
IX. & X. Applications
(Online Handwriting recognition system and check reading system)
本文探讨了基于梯度的学习方法在文档识别领域的应用,包括卷积神经网络(CNN)用于孤立字符识别,以及多模块系统和图转换网络(GTN)在字符串识别中的作用。文中还详细介绍了优化全局标准的图转换网络训练方法,以及空间位移神经网络(SDNN)在无需分段的情况下进行多对象识别的技术。
7613

被折叠的 条评论
为什么被折叠?



