scrapy安装

参考文章:http://www.cnblogs.com/txw1958/archive/2012/07/12/scrapy_installation_introduce.html

 

一、 Scrapy简介

Scrapy is a fast high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

官方主页: http://www.scrapy.org/

第三方中文文档:http://scrapy-chs.readthedocs.org/zh_CN/latest/intro/overview.html

 

二、 安装Python2.7

官方主页:http://www.python.org/

下载地址:http://www.python.org/ftp/python/2.7.3/python-2.7.3.msi

 

1) 安装python

安装目录:D:\Python27

 

2) 添加环境变量

path 中添加 D:\Python27\;D:\Python27\Scripts\;

 

3) 验证环境变量

打开命令行,并且运行以下命令来修改 PATH:

c:\python27\python.exe c:\python27\tools\scripts\win_add2path.py
 
  

4) 验证Python

复制代码
T:\>python
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> exit() T:\>
复制代码

 

三、 安装Twisted

Twisted is an event-driven networking engine written in Python and licensed under the open source

 

1) 安装setuptools

Download, build, install, upgrade, and uninstall Python packages -- easily!

官方主页:http://pypi.python.org/pypi/setuptools

下载地址:http://pypi.python.org/packages/2.7/s/setuptools/setuptools-0.6c11.win32-py2.7.exe

安装过程:略

 

2) 安装Zope.Interface

官方主页:http://pypi.python.org/pypi/zope.interface/

下载地址:http://pypi.python.org/packages/2.7/z/zope.interface/zope.interface-4.0.1-py2.7-win32.egg

注意区分64位和32位的程序。命令行要以管理员身份运行,程序应该放在“D:\Python27\Scripts”这个文件夹下。

安装过程:

复制代码
T:\>d:
D:\>cd D:\Python27\Scripts
D:\Python27\Scripts>easy_install.exe zope.interface-4.0.1-py2.7-win32.egg Processing zope.interface-4.0.1-py2.7-win32.egg creating d:\python27\lib\site-packages\zope.interface-4.0.1-py2.7-win32.egg Extracting zope.interface-4.0.1-py2.7-win32.egg to d:\python27\lib\site-packages Adding zope.interface 4.0.1 to easy-install.pth file Installed d:\python27\lib\site-packages\zope.interface-4.0.1-py2.7-win32.egg Processing dependencies for zope.interface==4.0.1 Finished processing dependencies for zope.interface==4.0.1 D:\Python27\Scripts>
复制代码

 

验证安装:

D:\Python27\Scripts>python
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import zope.interface >>>

 

3) 安装Twisted

官方主页:http://twistedmatrix.com/trac/wiki/TwistedProject

下载地址:http://pypi.python.org/packages/2.7/T/Twisted/Twisted-12.1.0.win32-py2.7.msi

安装过程:略

 

四、 安装w3lib

官方主页:http://pypi.python.org/pypi/w3lib

下载地址: http://pypi.python.org/packages/source/w/w3lib/w3lib-1.2.tar.gz

解压过程:略

安装过程:

复制代码
T:\w3lib-1.2>python setup.py install
running install
running build
running build_py
creating build
creating build\lib
creating build\lib\w3lib
copying w3lib\encoding.py -> build\lib\w3lib
copying w3lib\form.py -> build\lib\w3lib
copying w3lib\html.py -> build\lib\w3lib copying w3lib\http.py -> build\lib\w3lib copying w3lib\url.py -> build\lib\w3lib copying w3lib\util.py -> build\lib\w3lib copying w3lib\__init__.py -> build\lib\w3lib running install_lib creating D:\Python27\Lib\site-packages\w3lib copying build\lib\w3lib\encoding.py -> D:\Python27\Lib\site-packages\w3lib copying build\lib\w3lib\form.py -> D:\Python27\Lib\site-packages\w3lib copying build\lib\w3lib\html.py -> D:\Python27\Lib\site-packages\w3lib copying build\lib\w3lib\http.py -> D:\Python27\Lib\site-packages\w3lib copying build\lib\w3lib\url.py -> D:\Python27\Lib\site-packages\w3lib copying build\lib\w3lib\util.py -> D:\Python27\Lib\site-packages\w3lib copying build\lib\w3lib\__init__.py -> D:\Python27\Lib\site-packages\w3lib byte-compiling D:\Python27\Lib\site-packages\w3lib\encoding.py to encoding.pyc byte-compiling D:\Python27\Lib\site-packages\w3lib\form.py to form.pyc byte-compiling D:\Python27\Lib\site-packages\w3lib\html.py to html.pyc byte-compiling D:\Python27\Lib\site-packages\w3lib\http.py to http.pyc byte-compiling D:\Python27\Lib\site-packages\w3lib\url.py to url.pyc byte-compiling D:\Python27\Lib\site-packages\w3lib\util.py to util.pyc byte-compiling D:\Python27\Lib\site-packages\w3lib\__init__.py to __init__.pyc running install_egg_info Writing D:\Python27\Lib\site-packages\w3lib-1.2-py2.7.egg-info T:\w3lib-1.2>
复制代码

 

验证安装:

T:\>python
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import w3lib >>> 

 

五、 安装libxml2

官方主页:http://users.skynet.be/sbi/libxml-python/http://pypi.python.org/pypi/pyOpenSSL

下载地址:http://users.skynet.be/sbi/libxml-python/binaries/libxml2-python-2.7.7.win32-py2.7.exe

安装过程:把程序放到“D:\Python27\Scripts”下,然后用easy_install安装。

D:\Python27\Scripts>easy_install.exe libxml2-python-2.7.7.win32-py2.7.exe

验证安装:

T:\>python
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import libxml2 >>> 

 

 

六、 安装pyOpenSSL

官方主页:http://pypi.python.org/pypi/pyOpenSSL

下载地址:http://pypi.python.org/packages/2.7/p/pyOpenSSL/pyOpenSSL-0.13.winxp32-py2.7.msi

安装过程:用pip安装(没有成功?)

pip install pyopenssl

验证安装:注意验证时import后面的OpenSSL是区分大小写的。

T:\>python
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import OpenSSL >>>

 

七、 安装Scrapy

官方主页:http://scrapy.org/

下载地址:http://pypi.python.org/packages/source/S/Scrapy/Scrapy-0.14.4.tar.gz

解压过程:略

安装过程:

复制代码
T:\Scrapy-0.14.4>python setup.py install

……
Installing easy_install-2.7-script.py script to D:\Python27\Scripts
Installing easy_install-2.7.exe script to D:\Python27\Scripts
Installing easy_install-2.7.exe.manifest script to D:\Python27\Scripts

Using d:\python27\lib\site-packages
Finished processing dependencies for Scrapy==0.14.4 T:\Scrapy-0.14.4>
复制代码

 

验证安装:

复制代码
T:\>scrapy
Scrapy 0.14.4 - no active project

Usage:
  scrapy <command> [options] [args]

Available commands:
  fetch         Fetch a URL using the Scrapy downloader
  runspider     Run a self-contained spider (without creating a project) settings Get settings values shell Interactive scraping console startproject Create new project version Print Scrapy version view Open URL in browser, as seen by Scrapy Use "scrapy <command> -h" to see more info about a command T:\>
复制代码

 

转载于:https://www.cnblogs.com/wy1290939507/p/4685705.html

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值