pyspider 会自动去重,抓过的连接不会重新抓取
可采用如下措施使其重新抓取:
class Handler(BaseHandler):
crawl_config = {
'itag': 'v223'
}
详见http://docs.pyspider.org/en/latest/apis/self.crawl/#itag
原文链接:https://blog.youkuaiyun.com/piyongduo3393/article/details/84403769