表格OCR相关资源整理【ICDAR】【表格识别】【持续更新...】

最新推荐文章于 2025-06-08 13:46:00 发布

智能血压计

最新推荐文章于 2025-06-08 13:46:00 发布

阅读量6.3k

点赞数 6

CC 4.0 BY-SA版权

分类专栏： OCR 文字检测图像识别文章标签：语音识别

本文链接：https://blog.youkuaiyun.com/lz867422770/article/details/105046048

图像识别同时被 3 个专栏收录

15 篇文章

订阅专栏

OCR

11 篇文章

订阅专栏

文字检测

11 篇文章

订阅专栏

本文概述了表格检测和结构识别技术，包括基于unet的表格检测方法、完整的印刷体表格解决方案，以及多个用于训练和评估的公开数据集，如ICDAR2013、ctdar2019等。同时，介绍了ICDAR2019会议中发表的16篇相关论文，涉及表格检测、结构识别及新数据集发布。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

定义：
- 表格检测（Table Detection）任务是从一个页面中检测出表格所在的区域
- 表格结构识别（Table Structure Recognition）任务则是在检测到的表格区域的基础上，进一步将表格的内容与逻辑结构识别出来
代码：
- 运用unet实现对文档表格的自动检测，表格重建：https://github.com/chineseocr/table-ocr
- 完整印刷体表格解决方案：https://github.com/Rid7/Table-OCR
数据集：

名称	说明	内容	量级	地址
ICDAR2013	PDF	美国政府文件和欧盟文件		http://www.tamirhassan.com/html/dataset.html
icdar2017页面对象识别	页面截图
ctdar2019	分为两类数据，历史文档和现在文档			GitHub - cndplab-founder/ICDAR2019_cTDaR: The ICDAR 2019 cTDaR is to evaluate the performance of methods for table detection (TRACK A) and table recognition (TRACK B). For the first track, document images containing one or several tables are provided. For TRACK B two subtracks exist: the first subtrack (B.1) provides the table region. Thus, only the table structure recognition must be performed. The second subtrack (B.2) provides no a-priori information. This means, the table region and table structure detection has to be done.
TABLE2LATEX-450K	latex		46.6万	https://github.com/bloomberg/TABLE2LATEX
DECO	电子表格		1165	DECO: A Dataset of Annotated Spreadsheets for Layout and Table Recognition \| Database Systems Group
第三方个人数据	扫描英文表格检测		403	https://github.com/sgrpanchal31/table-detection-dataset

论文：
- ICDAR2019会议中，共有16篇与表格识别相关的论文
- 其中5篇针对表格检测任务
- 8篇针对表格结构识别任务
- 1篇在同时进行了表格检测与结构识别的任务
- 2篇则是发布了新的表格识别相关的数据集

任务	论文名称	说明	作者	代码	数据
识别	A Genetic-based Search for Adaptive Table Recognition in Spreadsheets	传统图像，应用于excel截图
识别	Deep Splitting and Merging for Table Structure Decomposition	ICDAR2013表格竞赛表格结构识别子任务的数据集State-of-the-art	adobe研究院
识别	DeepTabStr:Deep Learning based Table Structure Recognition
识别	ReS2TIM: Reconstruct SyntacticStructures from Table Images	icdar2013 f1 0.74
识别	Rethinking Semantic Segmentationfor Table Structure Recognition in Documents	不可处理跨行跨列
识别	Rethinking Table Recognitionusing Graph Neural Networks	有框线无框线表格均可处理没有提供预训练模型		GitHub - shahrukhqasim/TIES-2.0: Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Recognition using Graph Neural Networks (2019)	合成，提供数据生产工具
识别	TableStructure Extraction with Bi-directional Gated Recurrent Unit Networks
端到端检测识别	TableNet: Deep Learning Model for End-to-end Table Detection and Tabular Data Extraction from Scanned Document Images	icdar2013检测和识别F1分别为96.62%和91.51%
检测	A GAN-based Feature Generator forTable Detection	ICDAR13/17 state-of-the-art	北京大学王选计算机研究所
检测	A YOLO-based Table Detection Method
检测	Faster R-CNN BasedTable Detection Combining Corner Locating				ICDAR2017 POD数据集
检测	Table Detection in Invoice Documents by Graph Neural Networks				取自 RVL-CDIP invoice data
端到端	CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents			GitHub - DevashishPrasad/CascadeTabNet: This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"