Theano在win8.1下配置以及测试

最新推荐文章于 2022-01-14 11:57:04 发布

原创最新推荐文章于 2022-01-14 11:57:04 发布 · 1k 阅读

0 ·

CC 4.0 BY-SA版权

文章标签：

#windows #Theano

Theano 专栏收录该内容

0 篇文章

订阅专栏

本文详细介绍了如何在Windows环境下安装和配置Theano深度学习框架，包括依赖库安装、环境配置、GPU加速设置及测试流程。通过遵循教程，读者可以顺利在本地环境中搭建起Theano开发环境。

由于最近想到tracking的特征要用到深度学习，于是就想装个深度学习框架来耍耍，~~其实就是懒得想特征0.0~~，那么比较常用的有caffe、theano还有torch等，由于不太会linux~~懒癌发作不想学~~，于是就想在windows下来装个试试，一开始用的是caffe，还没开始装前是这个表情，debug时我是这个表情，装完心太累了只剩这个表情。后面会po出caffe的安装过程以及一些巨坑，免得自己以后换电脑要重新再来一次这个磨人的过程。但是今天我要讲一下theano，相比caffe的debug过程，theano粑粑还是充满人情味和良心的。（我的安装过程大部分基于官方文档照葫芦画瓢）

一、安装dependencies

（一）VS2013+CUDA7.0

VS2013嘛，宇宙最强大的工具包，懒癌发作程序员必备，所以就不说怎么装了~~地球人都懂~~。CUDA7.0的话请到官方网站下载（传送门在此），如果超链接失效了请随手百度一下就有了。

（二）Microsoft Visual C++ Compiler for Python 2.7

传送门：http://www.microsoft.com/en-us/download/details.aspx?id=44266，然后下载安装这个东西，这个东西将会被安装在C:\Program Files (x86)\Common Files\Microsoft\Visual C++ for Python\9.0这个路径里，这个东西后面要用到。

（三）stdint.h

下载stdint.h文件，放入C:\Program Files (x86)\Common Files\Microsoft\Visual C++ for Python\9.0\VC\include这个文件夹里，由于这个文件官方网站只是给出了代码，但是要翻墙才能看到，我就小小地在这里粘贴一下：

// ISO C9x  compliant stdint.h for Microsoft Visual Studio
// Based on ISO/IEC 9899:TC2 Committee draft (May 6, 2005) WG14/N1124 
// 
//  Copyright (c) 2006-2013 Alexander Chemeris
// 
// Redistribution and use in source and binary forms, with or without
// modification, are permitted provided that the following conditions are met:
// 
//   1. Redistributions of source code must retain the above copyright notice,
//      this list of conditions and the following disclaimer.
// 
//   2. Redistributions in binary form must reproduce the above copyright
//      notice, this list of conditions and the following disclaimer in the
//      documentation and/or other materials provided with the distribution.
// 
//   3. Neither the name of the product nor the names of its contributors may
//      be used to endorse or promote products derived from this software
//      without specific prior written permission.
// 
// THIS SOFTWARE IS PROVIDED BY THE AUTHOR ``AS IS'' AND ANY EXPRESS OR IMPLIED
// WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
// MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO
// EVENT SHALL THE AUTHOR BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
// SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
// PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS;
// OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, 
// WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR
// OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF
// ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
// 
///////////////////////////////////////////////////////////////////////////////

#ifndef _MSC_VER // [
#error "Use this header only with Microsoft Visual C++ compilers!"
#endif // _MSC_VER ]

#ifndef _MSC_STDINT_H_ // [
#define _MSC_STDINT_H_

#if _MSC_VER > 1000
#pragma once
#endif

#if _MSC_VER >= 1600 // [
#include <stdint.h>
#else // ] _MSC_VER >= 1600 [

#include <limits.h>

// For Visual Studio 6 in C++ mode and for many Visual Studio versions when
// compiling for ARM we should wrap <wchar.h> include with 'extern "C++" {}'
// or compiler give many errors like this:
//   error C2733: second C linkage of overloaded function 'wmemchr' not allowed
#ifdef __cplusplus
extern "C" {
#endif
#  include <wchar.h>
#ifdef __cplusplus
}
#endif

// Define _W64 macros to mark types changing their size, like intptr_t.
#ifndef _W64
#  if !defined(__midl) && (defined(_X86_) || defined(_M_IX86)) && _MSC_VER >= 1300
#     define _W64 __w64
#  else
#     define _W64
#  endif
#endif


// 7.18.1 Integer types

// 7.18.1.1 Exact-width integer types

// Visual Studio 6 and Embedded Visual C++ 4 doesn't
// realize that, e.g. char has the same size as __int8
// so we give up on __intX for them.
#if (_MSC_VER < 1300)
   typedef signed char       int8_t;
   typedef signed short      int16_t;
   typedef signed int        int32_t;
   typedef unsigned char     uint8_t;
   typedef unsigned short    uint16_t;
   typedef unsigned int      uint32_t;
#else
   typedef signed __int8     int8_t;
   typedef signed __int16    int16_t;
   typedef signed __int32    int32_t;
   typedef unsigned __int8   uint8_t;
   typedef unsigned __int16  uint16_t;
   typedef unsigned __int32  uint32_t;
#endif
typedef signed __int64       int64_t;
typedef unsigned __int64     uint64_t;


// 7.18.1.2 Minimum-width integer types
typedef int8_t    int_least8_t;
typedef int16_t   int_least16_t;
typedef int32_t   int_least32_t;
typedef int64_t   int_least64_t;
typedef uint8_t   uint_least8_t;
typedef uint16_t  uint_least16_t;
typedef uint32_t  uint_least32_t;
typedef uint64_t  uint_least64_t;

// 7.18.1.3 Fastest minimum-width integer types
typedef int8_t    int_fast8_t;
typedef int16_t   int_fast16_t;
typedef int32_t   int_fast32_t;
typedef int64_t   int_fast64_t;
typedef uint8_t   uint_fast8_t;
typedef uint16_t  uint_fast16_t;
typedef uint32_t  uint_fast32_t;
typedef uint64_t  uint_fast64_t;

// 7.18.1.4 Integer types capable of holding object pointers
#ifdef _WIN64 // [
   typedef signed __int64    intptr_t;
   typedef unsigned __int64  uintptr_t;
#else // _WIN64 ][
   typedef _W64 signed int   intptr_t;
   typedef _W64 unsigned int uintptr_t;
#endif // _WIN64 ]

// 7.18.1.5 Greatest-width integer types
typedef int64_t   intmax_t;
typedef uint64_t  uintmax_t;


// 7.18.2 Limits of specified-width integer types

#if !defined(__cplusplus) || defined(__STDC_LIMIT_MACROS) // [   See footnote 220 at page 257 and footnote 221 at page 259

// 7.18.2.1 Limits of exact-width integer types
#define INT8_MIN     ((int8_t)_I8_MIN)
#define INT8_MAX     _I8_MAX
#define INT16_MIN    ((int16_t)_I16_MIN)
#define INT16_MAX    _I16_MAX
#define INT32_MIN    ((int32_t)_I32_MIN)
#define INT32_MAX    _I32_MAX
#define INT64_MIN    ((int64_t)_I64_MIN)
#define INT64_MAX    _I64_MAX
#define UINT8_MAX    _UI8_MAX
#define UINT16_MAX   _UI16_MAX
#define UINT32_MAX   _UI32_MAX
#define UINT64_MAX   _UI64_MAX

// 7.18.2.2 Limits of minimum-width integer types
#define INT_LEAST8_MIN    INT8_MIN
#define INT_LEAST8_MAX    INT8_MAX
#define INT_LEAST16_MIN   INT16_MIN
#define INT_LEAST16_MAX   INT16_MAX
#define INT_LEAST32_MIN   INT32_MIN
#define INT_LEAST32_MAX   INT32_MAX
#define INT_LEAST64_MIN   INT64_MIN
#define INT_LEAST64_MAX   INT64_MAX
#define UINT_LEAST8_MAX   UINT8_MAX
#define UINT_LEAST16_MAX  UINT16_MAX
#define UINT_LEAST32_MAX  UINT32_MAX
#define UINT_LEAST64_MAX  UINT64_MAX

// 7.18.2.3 Limits of fastest minimum-width integer types
#define INT_FAST8_MIN    INT8_MIN
#define INT_FAST8_MAX    INT8_MAX
#define INT_FAST16_MIN   INT16_MIN
#define INT_FAST16_MAX   INT16_MAX
#define INT_FAST32_MIN   INT32_MIN
#define INT_FAST32_MAX   INT32_MAX
#define INT_FAST64_MIN   INT64_MIN
#define INT_FAST64_MAX   INT64_MAX
#define UINT_FAST8_MAX   UINT8_MAX
#define UINT_FAST16_MAX  UINT16_MAX
#define UINT_FAST32_MAX  UINT32_MAX
#define UINT_FAST64_MAX  UINT64_MAX

// 7.18.2.4 Limits of integer types capable of holding object pointers
#ifdef _WIN64 // [
#  define INTPTR_MIN   INT64_MIN
#  define INTPTR_MAX   INT64_MAX
#  define UINTPTR_MAX  UINT64_MAX
#else // _WIN64 ][
#  define INTPTR_MIN   INT32_MIN
#  define INTPTR_MAX   INT32_MAX
#  define UINTPTR_MAX  UINT32_MAX
#endif // _WIN64 ]

// 7.18.2.5 Limits of greatest-width integer types
#define INTMAX_MIN   INT64_MIN
#define INTMAX_MAX   INT64_MAX
#define UINTMAX_MAX  UINT64_MAX

// 7.18.3 Limits of other integer types

#ifdef _WIN64 // [
#  define PTRDIFF_MIN  _I64_MIN
#  define PTRDIFF_MAX  _I64_MAX
#else  // _WIN64 ][
#  define PTRDIFF_MIN  _I32_MIN
#  define PTRDIFF_MAX  _I32_MAX
#endif  // _WIN64 ]

#define SIG_ATOMIC_MIN  INT_MIN
#define SIG_ATOMIC_MAX  INT_MAX

#ifndef SIZE_MAX // [
#  ifdef _WIN64 // [
#     define SIZE_MAX  _UI64_MAX
#  else // _WIN64 ][
#     define SIZE_MAX  _UI32_MAX
#  endif // _WIN64 ]
#endif // SIZE_MAX ]

// WCHAR_MIN and WCHAR_MAX are also defined in <wchar.h>
#ifndef WCHAR_MIN // [
#  define WCHAR_MIN  0
#endif  // WCHAR_MIN ]
#ifndef WCHAR_MAX // [
#  define WCHAR_MAX  _UI16_MAX
#endif  // WCHAR_MAX ]

#define WINT_MIN  0
#define WINT_MAX  _UI16_MAX

#endif // __STDC_LIMIT_MACROS ]


// 7.18.4 Limits of other integer types

#if !defined(__cplusplus) || defined(__STDC_CONSTANT_MACROS) // [   See footnote 224 at page 260

// 7.18.4.1 Macros for minimum-width integer constants

#define INT8_C(val)  val##i8
#define INT16_C(val) val##i16
#define INT32_C(val) val##i32
#define INT64_C(val) val##i64

#define UINT8_C(val)  val##ui8
#define UINT16_C(val) val##ui16
#define UINT32_C(val) val##ui32
#define UINT64_C(val) val##ui64

// 7.18.4.2 Macros for greatest-width integer constants
// These #ifndef's are needed to prevent collisions with <boost/cstdint.hpp>.
// Check out Issue 9 for the details.
#ifndef INTMAX_C //   [
#  define INTMAX_C   INT64_C
#endif // INTMAX_C    ]
#ifndef UINTMAX_C //  [
#  define UINTMAX_C  UINT64_C
#endif // UINTMAX_C   ]

#endif // __STDC_CONSTANT_MACROS ]

#endif // _MSC_VER >= 1600 ]

#endif // _MSC_STDINT_H_ ]

右键保存上面代码就好了

（四）GCC

理由如下：

Theano C code compiler currently requires a GCC installation. We have used the build TDM GCC which is provided for both 32- and 64-bit platforms.

然后注意一下几个要点：

1.安装路径不要带有空格~~官方文档是这样说的所以大家没事不要作死~~；

2.记得勾选openmp support option（这个我装的时候并没有得选，然后我安装选了64位的all-package，反正没什么问题）

TDM-GCC传送门：http://tdm-gcc.tdragon.net/

（四）、Scientific Python Distribution

这个distribution有很多选择，但坑爹的是官方网站上只给出了WinPython的配置方案，其它的都没给，感谢这位博主的blog：http://blog.youkuaiyun.com/richard2357/article/details/16963187，给出了使用Anaconda的配置文件，简直是黑暗中的一盏明灯Orz。

1.使用WinPython

这个大家可以移步到官网去看，但是WinPython下我是没有成功使用gpu的，具体原因也没有深究~~懒癌再次发作~~；

2.使用Anaconda

下载直接下就好了，Anaconda传送门：https://store.continuum.io/cshop/anaconda/，然后安装后（我是直接安装在C盘下）添加下列环境变量到Path里：

（1）C:\Anaconda;

（2）C:\Anaconda\Scripts;

我只加了这两个，但是我忘了这是手动加还是安装Anaconda时自动加的，大家看看自己的环境变量自己决定吧~配置完环境变量记得重新启动一下。

二、配置环境~~字面意思直译过来的~~

官方网站上指出将下面这段代码复制到一个env.bat的批处理文件里，放在一个C:\scisoft文件夹里，我的dependencies除了Anaconda装得比较早，直接装在C盘下，其它的都放在了C:\Theano_General这个文件夹里，所以这个.bat文件也相应放进这个文件夹里。这个文件的位置会不会对Theano能否运行造成影响我是不知道的，如果有人知道，求不吝赐教。

REM configuration of paths
set VSFORPYTHON="C:\Program Files (x86)\Common Files\Microsoft\Visual C++ for Python\9.0"
set SCISOFT=%~dp0

REM add tdm gcc stuff
set PATH=%SCISOFT%\TDM-GCC-64\bin;%SCISOFT%\TDM-GCC-64\x86_64-w64-mingw32\bin;%PATH%

REM add winpython stuff
CALL %SCISOFT%\WinPython-64bit-2.7.9.4\scripts\env.bat

REM configure path for msvc compilers
REM for a 32 bit installation change this line to
REM CALL %VSFORPYTHON%\vcvarsall.bat
CALL %VSFORPYTHON%\vcvarsall.bat amd64

REM return a shell
cmd.exe /k

有个问题是这里的CALL %SCISOFT%\WinPython-64bit-2.7.9.4\scripts\env.bat这句话，这个路径里面并没有所说的env.bat文件，运行时会说找不到文件，但是并没有对安装造成实质性影响。最后只需要运行我们创建的env.bat文件，就进入到所谓的“Python shell”,然后执行下面命令来为GCC创建链接库：

gendef WinPython-64bit-2.7.10.1\python-2.7.10.amd64\python27.dll

dlltool --dllname python27.dll --def python27.def --output-lib WinPython-64bit-2.7.10.1\python-2.7.10.amd64\libs\libpython27.a

（大家记得根据自己python版本号来进行相应修改）

三、Theano

（一）下载和安装Theano

官方网站推荐了使用MYSGIT来下载和安装theano，打开MYSGIT后请先进到自己想要安装theano的文件夹，然后输入下列命令：

git clone https://github.com/Theano/Theano.git --branch rel-0.7

就可以安装了。

（二）配置Theano

进入“Python shell”(e.g. C:\Theano_General\env.bat)，然后进入到放有setup.py文件的文件夹里（由于git安装Theano后会产生一个"Theano"的文件夹，setup.py就在里面），然后输入如下：

Python setup.py develop

四、GPU加速

要使用GPU请确保你的电脑有GPU~~这不是废话嘛~~，然后在变量%USERPROFILE%所指向的路径下建立一个.theanorc.txt的文件（我的%USERPROFILE%是C:\Users\Daniel）,里面的内容参考了官方文档和其他博主的：

[global]
openmp=False

[blas]
ldflags=

[gcc]
cxxflags = -IC:\Anaconda\MinGW\x86_64-w64-mingw32

[nvcc]
fastmath = True
flags = -LC:\Anaconda\libs
compiler-bindir=C:\Program Files (x86)\Microsoft Visual Studio 12.0\VC\bin
base_compiledir=path_to_a_directory_without_such_characters

[global]
floatX = float32
device = gpu

然后我们就完成了所有工作了啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊

五、测试

并没有完成所有工作呢，还是要测试一下的，代码如下：

from theano import function, config, shared, sandbox
import theano.tensor as T
import numpy
import time

vlen = 10 * 30 * 768  # 10 x #cores x # threads per core
iters = 1000

rng = numpy.random.RandomState(22)
x = shared(numpy.asarray(rng.rand(vlen), config.floatX))
f = function([], T.exp(x))
print f.maker.fgraph.toposort()
t0 = time.time()
for i in xrange(iters):
    r = f()
t1 = time.time()
print 'Looping %d times took' % iters, t1 - t0, 'seconds'
print 'Result is', r
if numpy.any([isinstance(x.op, T.Elemwise) for x in f.maker.fgraph.toposort()]):
    print 'Used the cpu'
else:
    print 'Used the gpu'

运行结果：