scrapyd+scrapydweb [ubuntu]
可实现从scrapy的管理部署到监控定时运行等任务,scrapydweb也可以替换为Gerapy或Spiderkeeper。
install
pip3 install scrapyd
pip3 install scrapydweb
run
changebind_addressto0.0.0.0 in /usr/local/lib/python3.6/dist-packages/scrapyd/default_scrapyd.conf
bind_address = 0.0.0.0
run scrapyd
cd [your scrapyd server folder]
scrapyd
then scrapydweb
cd [your scrapydweb server folder]
scrapydweb
visit
for scrapydweb
127.0.0.1:5000
for scrapyd
127.0.0.1:6800
scrapy-client
install [ubuntu]
pip3 install scrapyd-client
scrapy demo:QuotesBot
可从QuotesBot获取
deploy
by scrapydclient
- switch to
QuotesBotproject - append
[deploy]in scrapy.cfg
[deploy]
url = http://[your scrapyd ip]:6800/
username = [your scrapyd username]
password = [your scrapyd password]
project = quotesbot
- deploy
scrapyd-deploy
by scrapydweb
进入127.0.0.1:5000,在Deploy Project页面中上传爬虫压缩包并完成部署
run spider in scrapydweb
you can manage your scrapy project quotesbot


本文介绍如何使用Scrapyd和ScrapydWeb进行Scrapy项目的部署、管理和定时运行。涵盖安装配置、运行scrapyd及scrapydweb服务、通过scrapyd-client部署项目、在scrapydweb上管理爬虫等关键步骤。
3810

被折叠的 条评论
为什么被折叠?



