
爬虫
tianyuan233
这个作者很懒,什么都没留下…
展开
-
UserWarning: Selenium support for PhantomJS has been deprecated, please use headless versions of Chr
In [1]: from selenium import webdriverIn [2]: driver = webdriver.PhantomJS()G:\Anaconda3\lib\sitepackages\selenium\webdriver\phantomjs\webdriver.py:49: UserWarning: Selenium support for PhantomJS h...原创 2018-03-07 22:38:34 · 4275 阅读 · 0 评论 -
正则表达式re库学习笔记
import recontent = 'Hello 123 4567 World_This is a Demo'泛匹配# result = re.match('^Hello\s\d',content)# print(result)# print(result.group())## result1 = re.match('^Hello(.*)mo$',content)# print(res原创 2018-03-14 22:08:29 · 399 阅读 · 1 评论 -
BeautifulSoup库学习笔记
import requestsfrom bs4 import BeautifulSoupimport lxml# data = requests.get('https://book.douban.com/').textdata = '''<ul><li class=""><a data-moreurl-dict='{"from":"top-nav-click-main","uid":"0"原创 2018-03-14 22:12:19 · 252 阅读 · 0 评论 -
pyquery学习笔记
from pyquery import PyQuery as pqdata = '''<ul class="qqq"><li class="1"><a data-moreurl-dict='{"from":"top-nav-click-main","uid":"0"}' href="https://www.douban.com" target="_blank">豆瓣</a></li><li c原创 2018-03-15 22:38:27 · 374 阅读 · 0 评论 -
selenlenium基本用法学习笔记
from selenium import webdriver from selenium.webdriver.common.by import By from selenium.webdriver.common.keys import Keys from selenium.webdriver.support import expected_conditions as EC from sele原创 2018-03-18 16:59:19 · 296 阅读 · 0 评论