依赖包都在…/pkg下
- 安装rpm,一些基本的依赖
rpm -ivh ./*
gtk2-devel yasm glibc.i686 libstdc++.so.6 libgtk-x11-2.0.so libatk-1.0.so.0 libcairo.so.2 libcups.so.2 libgdk-x11-2.0.so.0 libgdk_pixbuf-2.0.so.0 libgtk-x11-2.0.so.0 libpango-1.0.so.0 libpangocairo-1.0.so.0 libICE.so.6 libSM.so.6 libmng.so.1 libpng12.so.0 libGLU.so.1 autoconf automake libtool yum libjpeg-devel libpng-devel libtiff-devel zlib-devel
- 安装开发工具,运维同事应该已经安好
yum groupinstall "Development Tools"
- 安装leptonica1.7
解压,cd
./configure --prefix=/usr/local/
make && make install
配置环境
vim \etc\bashrc
加入
PKG_CONFIG_PATH=$PKG_CONFIG_PATH:/usr/local/lib/pkgconfig
export PKG_CONFIG_PATH
CPLUS_INCLUDE_PATH=$CPLUS_INCLUDE_PATH:/usr/local/include/
export CPLUS_INCLUDE_PATH
C_INCLUDE_PATH=$C_INCLUDE_PATH:/usr/local/leptonica/include/leptonica
export C_INCLUDE_PATH
LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib
export LD_LIBRARY_PATH
LIBRARY_PATH=$LIBRARY_PATH:/usr/local/lib
export LIBRARY_PATH
TESSDATA_PREFIX=/root/tesseract/
export TESSDATA_PREFIX
最后刷新
source /etc/bashrc
回到tesseract目录,开始安装tesseract
./autogen.sh
./configure --with-extra-includes=/usr/local/include --with-extra-libraries=/usr/local/include
有可能会报 configure: error: Leptonica 1.74 or higher is required. Try to install libleptonica-dev package.
那就要加入环境变量
vim /etc/profile
在最后插入
export LD_LIBRARY_PATH=$LD_LIBRARY_PAYT:/usr/local/lib
export LIBLEPT_HEADERSDIR=/usr/local/include
export PKG_CONFIG_PATH=/usr/local/lib/pkgconfig
最后刷新下文件
source /etc/profile
然后我们再执行
./autogen.sh
./configure --with-extra-includes=/usr/local/include --with-extra-libraries=/usr/local/include
这时就没有 leptonica问题啦,随后make下
make && sudo make install
如果用代码测试一切都正常,但到项目中却莫明奇妙地失败,执行下
cp /usr/local/lib/*.so.* /usr/lib64/
如果报
Error in findTiffCompression: function not present
Error in pixReadStreamTiff: function not present
Error in pixReadStream: tiff: no pix returned
Error in pixRead: pix not read
Unsupported image type.
相关。
那就是没有少了png和jpge依赖库,重新安装rpm里的
yum install libjpeg-devel
yum install libpng-devel
进入leptonica的安装文件夹重新编译
./configure
make
make install
输入tesseract -v 因该可以看到如下:
tesseract 4.1.1
leptonica-1.74.4
libjpeg 6b (libjpeg-turbo 1.2.90) : libpng 1.5.13 : libtiff 4.0.3 : zlib 1.2.7 : libopenjp2 2.3.1
Found AVX
Found SSE
到whl 里,安装Anaconda2,和其他whl
激活conda 中的py2, python denoise_recog.py
或者/root/anaconda2/bin/python denoise_recog.py
G可以看到
>> 结算业务申请书 <type 'unicode'>
GOOD LUCK TO U!