由于最近想到tracking的特征要用到深度学习,于是就想装个深度学习框架来耍耍,其实就是懒得想特征0.0,那么比较常用的有caffe、theano还有torch等,由于不太会linux懒癌发作不想学,于是就想在windows下来装个试试,一开始用的是caffe,还没开始装前是这个表情,debug时我是这个表情
,装完心太累了只剩这个表情
。后面会po出caffe的安装过程以及一些巨坑,免得自己以后换电脑要重新再来一次这个磨人的过程。但是今天我要讲一下theano,相比caffe的debug过程,theano粑粑还是充满人情味和良心的
。(我的安装过程大部分基于官方文档照葫芦画瓢)
一、安装dependencies
(一)VS2013+CUDA7.0
VS2013嘛,宇宙最强大的工具包,懒癌发作程序员必备,所以就不说怎么装了地球人都懂。CUDA7.0的话请到官方网站下载(传送门在此),如果超链接失效了请随手百度一下就有了。
(二)Microsoft Visual C++ Compiler for Python 2.7
传送门:http://www.microsoft.com/en-us/download/details.aspx?id=44266,然后下载安装这个东西,这个东西将会被安装在C:\Program Files (x86)\Common Files\Microsoft\Visual C++ for Python\9.0这个路径里,这个东西后面要用到。
(三)stdint.h
下载stdint.h文件,放入C:\Program Files (x86)\Common Files\Microsoft\Visual C++ for Python\9.0\VC\include这个文件夹里,由于这个文件官方网站只是给出了代码,但是要翻墙才能看到,我就小小地在这里粘贴一下:
// ISO C9x compliant stdint.h for Microsoft Visual Studio
// Based on ISO/IEC 9899:TC2 Committee draft (May 6, 2005) WG14/N1124
//
// Copyright (c) 2006-2013 Alexander Chemeris
//
// Redistribution and use in source and binary forms, with or without
// modification, are permitted provided that the following conditions are met:
//
// 1. Redistributions of source code must retain the above copyright notice,
// this list of conditions and the following disclaimer.
//
// 2. Redistributions in binary form must reproduce the above copyright
// notice, this list of conditions and the following disclaimer in the
// documentation and/or other materials provided with the distribution.
//
// 3. Neither the name of the product nor the names of its contributors may
// be used to endorse or promote products derived from this software
// without specific prior written permission.
//
// THIS SOFTWARE IS PROVIDED BY THE AUTHOR ``AS IS'' AND ANY EXPRESS OR IMPLIED
// WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
// MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO
// EVENT SHALL THE AUTHOR BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
// SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
// PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS;
// OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY,
// WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR
// OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF
// ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
//
///////////////////////////////////////////////////////////////////////////////
#ifndef _MSC_VER // [
#error "Use this header only with Microsoft Visual C++ compilers!"
#endif // _MSC_VER ]
#ifndef _MSC_STDINT_H_ // [
#define _MSC_STDINT_H_
#if _MSC_VER > 1000
#pragma once
#endif
#if _MSC_VER >= 1600 // [
#include <stdint.h>
#else // ] _MSC_VER >= 1600 [
#include <limits.h>
// For Visual Studio 6 in C++ mode and for many Visual Studio versions when
// compiling for ARM we should wrap <wchar.h> include with 'extern "C++" {}'
// or compiler give many errors like this:
// error C2733: second C linkage of overloaded function 'wmemchr' not allowed
#ifdef __cplusplus
extern "C" {
#endif
# include <wchar.h>
#ifdef __cplusplus
}
#endif
// Define _W64 macros to mark types changing their size, like intptr_t.
#ifndef _W64
# if !defined(__midl) && (defined(_X86_) || defined(_M_IX86)) && _MSC_VER >= 1300
# define _W64 __w64
# else
# define _W64
# endif
#endif
// 7.18.1 Integer types
// 7.18.1.1 Exact-width integer types
// Visual Studio 6 and Embedded Visual C++ 4 doesn't
// realize that, e.g. char has the same size as __int8
// so we give up on __intX for them.
#if (_MSC_VER < 1300)
typedef signed char int8_t;
typedef signed short int16_t;
typedef signed int int32_t;
typedef unsigned char uint8_t;
typedef unsigned short uint16_t;
typedef unsigned int uint32_t;
#else
typedef signed __int8 int8_t;
typedef signed __int16 int16_t;
typedef signed __int32 int32_t;
typedef unsigned __int8 uint8_t;
typedef unsigned __int16 uint16_t;
typedef unsigned __int32 uint32_t;
#endif
typedef signed __int64 int64_t;
typedef unsigned __int64 uint64_t;
// 7.18.1.2 Minimum-width integer types
typedef int8_t int_least8_t;
typedef int16_t int_least16_t;
typedef int32_t int_least32_t;
typedef int64_t int_least64_t;
typedef uint8_t uint_least8_t;
typedef uint16_t uint_least16_t;
typedef uint32_t uint_least32_t;
typedef uint64_t uint_least64_t;
// 7.18.1.3 Fastest minimum-width integer types
typedef int8_t int_fast8_t;
typedef int16_t int_fast16_t;
typedef int32_t int_fast32_t;
typedef int64_t int_fast64_t;
typedef uint8_t uint_fast8_t;
typedef uint16_t uint_fast16_t;
typedef uint32_t uint_fast32_t;
typedef uint64_t uint_fast64_t;
// 7.18.1.4 Integer types capable of holding object pointers
#ifdef _WIN64 // [
typedef signed __int64 intptr_t;
typedef unsigned __int64 uintptr_t;
#else // _WIN64 ][
typedef _W64 signed int intptr_t;
typedef _W64 unsigned int uintptr_t;
#endif // _WIN64 ]
// 7.18.1.5 Greatest-width integer types
typedef int64_t intmax_t;
typedef uint64_t uintmax_t;
// 7.18.2 Limits of specified-width integer types
#if !defined(__cplusplus) || defined(__STDC_LIMIT_MACROS) // [ See footnote 220 at page 257 and footnote 221 at page 259
// 7.18.2.1 Limits of exact-width integer types
#define INT8_MIN ((int8_t)_I8_MIN)
#define INT8_MAX _I8_MAX
#define INT16_MIN ((int16_t)_I16_MIN)
#define INT16_MAX _I16_MAX
#define INT32_MIN ((int32_t)_I32_MIN)
#define INT32_MAX _I32_MAX
#define INT64_MIN ((int64_t)_I64_MIN)
#define INT64_MAX _I64_MAX
#define UINT8_MAX _UI8_MAX
#define UINT16_MAX _UI16_MAX
#define UINT32_MAX _UI32_MAX
#define UINT64_MAX _UI64_MAX
// 7.18.2.2 Limits of minimum-width integer types
#define INT_LEAST8_MIN INT8_MIN
#define INT_LEAST8_MAX INT8_MAX
#define INT_LEAST16_MIN INT16_MIN
#define INT_LEAST16_MAX INT16_MAX
#define INT_LEAST32_MIN INT32_MIN
#define INT_LEAST32_MAX INT32_MAX
#define INT_LEAST64_MIN INT64_MIN
#define INT_LEAST64_MAX INT64_MAX
#define UINT_LEAST8_MAX UINT8_MAX
#define UINT_LEAST16_MAX UINT16_MAX
#define UINT_LEAST32_MAX UINT32_MAX
#define UINT_LEAST64_MAX UINT64_MAX
// 7.18.2.3 Limits of fastest minimum-width integer types
#define INT_FAST8_MIN INT8_MIN
#define INT_FAST8_MAX INT8_MAX
#define INT_FAST16_MIN INT16_MIN
#define INT_FAST16_MAX INT16_MAX
#define INT_FAST32_MIN INT32_MIN
#define INT_FAST32_MAX INT32_MAX
#define INT_FAST64_MIN INT64_MIN
#define INT_FAST64_MAX INT64_MAX
#define UINT_FAST8_MAX UINT8_MAX
#define UINT_FAST16_MAX UINT16_MAX
#define UINT_FAST32_MAX UINT32_MAX
#define UINT_FAST64_MAX UINT64_MAX
// 7.18.2.4 Limits of integer types capable of holding object pointers
#ifdef _WIN64 // [
# define INTPTR_MIN INT64_MIN
# define INTPTR_MAX INT64_MAX
# define UINTPTR_MAX UINT64_MAX
#else // _WIN64 ][
# define INTPTR_MIN INT32_MIN
# define INTPTR_MAX INT32_MAX
# define UINTPTR_MAX UINT32_MAX
#endif // _WIN64 ]
// 7.18.2.5 Limits of greatest-width integer types
#define INTMAX_MIN INT64_MIN
#define INTMAX_MAX INT64_MAX
#define UINTMAX_MAX UINT64_MAX
// 7.18.3 Limits of other integer types
#ifdef _WIN64 // [
# define PTRDIFF_MIN _I64_MIN
# define PTRDIFF_MAX _I64_MAX
#else // _WIN64 ][
# define PTRDIFF_MIN _I32_MIN
# define PTRDIFF_MAX _I32_MAX
#endif // _WIN64 ]
#define SIG_ATOMIC_MIN INT_MIN
#define SIG_ATOMIC_MAX INT_MAX
#ifndef SIZE_MAX // [
# ifdef _WIN64 // [
# define SIZE_MAX _UI64_MAX
# else // _WIN64 ][
# define SIZE_MAX _UI32_MAX
# endif // _WIN64 ]
#endif // SIZE_MAX ]
// WCHAR_MIN and WCHAR_MAX are also defined in <wchar.h>
#ifndef WCHAR_MIN // [
# define WCHAR_MIN 0
#endif // WCHAR_MIN ]
#ifndef WCHAR_MAX // [
# define WCHAR_MAX _UI16_MAX
#endif // WCHAR_MAX ]
#define WINT_MIN 0
#define WINT_MAX _UI16_MAX
#endif // __STDC_LIMIT_MACROS ]
// 7.18.4 Limits of other integer types
#if !defined(__cplusplus) || defined(__STDC_CONSTANT_MACROS) // [ See footnote 224 at page 260
// 7.18.4.1 Macros for minimum-width integer constants
#define INT8_C(val) val##i8
#define INT16_C(val) val##i16
#define INT32_C(val) val##i32
#define INT64_C(val) val##i64
#define UINT8_C(val) val##ui8
#define UINT16_C(val) val##ui16
#define UINT32_C(val) val##ui32
#define UINT64_C(val) val##ui64
// 7.18.4.2 Macros for greatest-width integer constants
// These #ifndef's are needed to prevent collisions with <boost/cstdint.hpp>.
// Check out Issue 9 for the details.
#ifndef INTMAX_C // [
# define INTMAX_C INT64_C
#endif // INTMAX_C ]
#ifndef UINTMAX_C // [
# define UINTMAX_C UINT64_C
#endif // UINTMAX_C ]
#endif // __STDC_CONSTANT_MACROS ]
#endif // _MSC_VER >= 1600 ]
#endif // _MSC_STDINT_H_ ]
右键保存上面代码就好了(四)GCC
理由如下:
Theano C code compiler currently requires a GCC installation. We have used the build TDM GCC which is provided for both 32- and 64-bit platforms.
然后注意一下几个要点:
1.安装路径不要带有空格官方文档是这样说的所以大家没事不要作死;
2.记得勾选openmp support option(这个我装的时候并没有得选,然后我安装选了64位的all-package,反正没什么问题)
TDM-GCC传送门:http://tdm-gcc.tdragon.net/
(四)、Scientific Python Distribution
这个distribution有很多选择,但坑爹的是官方网站上只给出了WinPython的配置方案,其它的都没给,感谢这位博主的blog:http://blog.youkuaiyun.com/richard2357/article/details/16963187,给出了使用Anaconda的配置文件,简直是黑暗中的一盏明灯Orz。
1.使用WinPython
这个大家可以移步到官网去看,但是WinPython下我是没有成功使用gpu的,具体原因也没有深究懒癌再次发作;
2.使用Anaconda
下载直接下就好了,Anaconda传送门:https://store.continuum.io/cshop/anaconda/,然后安装后(我是直接安装在C盘下)添加下列环境变量到Path里:
(1)C:\Anaconda;
(2)C:\Anaconda\Scripts;
我只加了这两个,但是我忘了这是手动加还是安装Anaconda时自动加的,大家看看自己的环境变量自己决定吧~配置完环境变量记得重新启动一下。
二、配置环境字面意思直译过来的
官方网站上指出将下面这段代码复制到一个env.bat的批处理文件里,放在一个C:\scisoft文件夹里,我的dependencies除了Anaconda装得比较早,直接装在C盘下,其它的都放在了C:\Theano_General这个文件夹里,所以这个.bat文件也相应放进这个文件夹里。这个文件的位置会不会对Theano能否运行造成影响我是不知道的,如果有人知道,求不吝赐教。
REM configuration of paths
set VSFORPYTHON="C:\Program Files (x86)\Common Files\Microsoft\Visual C++ for Python\9.0"
set SCISOFT=%~dp0
REM add tdm gcc stuff
set PATH=%SCISOFT%\TDM-GCC-64\bin;%SCISOFT%\TDM-GCC-64\x86_64-w64-mingw32\bin;%PATH%
REM add winpython stuff
CALL %SCISOFT%\WinPython-64bit-2.7.9.4\scripts\env.bat
REM configure path for msvc compilers
REM for a 32 bit installation change this line to
REM CALL %VSFORPYTHON%\vcvarsall.bat
CALL %VSFORPYTHON%\vcvarsall.bat amd64
REM return a shell
cmd.exe /k
有个问题是这里的CALL %SCISOFT%\WinPython-64bit-2.7.9.4\scripts\env.bat这句话,这个路径里面并没有所说的env.bat文件,运行时会说找不到文件,但是并没有对安装造成实质性影响。最后只需要运行我们创建的env.bat文件,就进入到所谓的“Python shell”,然后执行下面命令来为GCC创建链接库:
gendef WinPython-64bit-2.7.10.1\python-2.7.10.amd64\python27.dll
dlltool --dllname python27.dll --def python27.def --output-lib WinPython-64bit-2.7.10.1\python-2.7.10.amd64\libs\libpython27.a
(大家记得根据自己python版本号来进行相应修改)
三、Theano
(一)下载和安装Theano
官方网站推荐了使用MYSGIT来下载和安装theano,打开MYSGIT后请先进到自己想要安装theano的文件夹,然后输入下列命令:
git clone https://github.com/Theano/Theano.git --branch rel-0.7
就可以安装了。
(二)配置Theano
进入“Python shell”(e.g. C:\Theano_General\env.bat),然后进入到放有setup.py文件的文件夹里(由于git安装Theano后会产生一个"Theano"的文件夹,setup.py就在里面),然后输入如下:
Python setup.py develop
四、GPU加速
要使用GPU请确保你的电脑有GPU这不是废话嘛,然后在变量%USERPROFILE%所指向的路径下建立一个.theanorc.txt的文件(我的%USERPROFILE%是C:\Users\Daniel),里面的内容参考了官方文档和其他博主的:
[global]
openmp=False
[blas]
ldflags=
[gcc]
cxxflags = -IC:\Anaconda\MinGW\x86_64-w64-mingw32
[nvcc]
fastmath = True
flags = -LC:\Anaconda\libs
compiler-bindir=C:\Program Files (x86)\Microsoft Visual Studio 12.0\VC\bin
base_compiledir=path_to_a_directory_without_such_characters
[global]
floatX = float32
device = gpu
然后我们就完成了所有工作了啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊
五、测试
并没有完成所有工作呢,还是要测试一下的,代码如下:
from theano import function, config, shared, sandbox
import theano.tensor as T
import numpy
import time
vlen = 10 * 30 * 768 # 10 x #cores x # threads per core
iters = 1000
rng = numpy.random.RandomState(22)
x = shared(numpy.asarray(rng.rand(vlen), config.floatX))
f = function([], T.exp(x))
print f.maker.fgraph.toposort()
t0 = time.time()
for i in xrange(iters):
r = f()
t1 = time.time()
print 'Looping %d times took' % iters, t1 - t0, 'seconds'
print 'Result is', r
if numpy.any([isinstance(x.op, T.Elemwise) for x in f.maker.fgraph.toposort()]):
print 'Used the cpu'
else:
print 'Used the gpu'
运行结果:
好了,大功告成~