模拟浏览器访问:
from selenium import webdriver
from scrapy.selector import Selector
browser = webdriver.Firefox()
browser.get("https://www.planespotters.net/deliveries/1960/01")
res = Selector(text=browser.page_source)
解决requests 乱码问题:
res.encoding = res.apparent_encoding
scrapy在一个parse里解析url:
from scrapy.selector import Selector
res = fetch(url)
Selector(text=res.text)