文章目录
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
一. 简介
发表:CVPR2022
机构:微软
代码:https://github.com/microsoft/unilm/tree/master/trocr
摘要:
Text recognition is a long standing-research problem for document digitalization. Existing approaches are usually built based on CNN for image understanding and RNN for char-level text generation. In addition, another language model is usually needed to improve the overall accuracy as a