Tesseract-OCR Background
The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but it is probably one of the most accurate open source OCR engines available. The source code will read a binary, grey or color image and output text. Image input is managed by the Leptonica Image Processing Library which can read a wide variety of image formats.
TesseractDotnet
TesseractDot
是Tesseract-OCR的.NET项目, 方便.NET开发人员使用Tesseract-OCR.但是我还没发现C++可用的类库,源码也无法编译成dll.
更多详情请访问项目主页: http://code.google.com/p/tesseractdotnet/
另外推荐一些文章:


Tesseract-OCR引擎在1995年的UNLV Accuracy测试中表现出色,尽管2006年前发展缓慢,但仍是准确度较高的开源OCR引擎之一。它能处理二值、灰度和彩色图像,依赖于Leptonica库进行图像输入。TesseractDotnet是针对.NET开发者的一个项目,简化了在.NET环境中使用Tesseract-OCR的过程。对于C++开发人员,尚缺乏可用的类库。了解更多详情,请访问相关项目主页。
1103





