selenium 模拟登陆

本文通过实例演示了如何使用Selenium与PhantomJS进行网页自动化操作,包括登录豆瓣网站及抓取简书文章的详细信息,如作者、发表日期、字数、浏览量、评论数和喜欢数。
#Selnium 和 PhantomJS的配合使用
#模拟浏览器操作
from selenium import webdriver
driver = webdriver.PhantomJS()
driver.get('https://www.douban.com/')
driver.implicitly_wait(5)
driver.find_element_by_id('form_email').clear()
driver.find_element_by_id('form_email').send_keys('xxxxxxxxx')
driver.find_element_by_id('form_password').clear()
driver.find_element_by_id('form_password').send_keys('xxxxxxxxxx')
driver.find_element_by_class_name('bn-submit').click()
print(driver.page_source)
with open('hello.html', 'w', encoding='utf-8') as f:
    f.write(driver.page_source)

#获取异步加载数据

from selenium import webdriver
driver = webdriver.PhantomJS()
driver.get('https://www.jianshu.com/p/aa4a1829840f')
include_title = []
driver.implicitly_wait(20)
author = driver.find_element_by_xpath('//span[@class="name"]/a').text
date = driver.find_element_by_xpath('//span[@class="publish-time"]').text
word = driver.find_element_by_xpath('//span[@class="wordage"]').text
view = driver.find_element_by_xpath('//span[@class="views-count"]').text
comment = driver.find_element_by_xpath('//span[@class="comments-count"]').text
like = driver.find_element_by_xpath('//span[@class="likes-count"]').text
included_names = driver.find_elements_by_xpath('//div[@class="include-collection"]/a/div')
for i in included_names:
    include_title.append(i.text)
print(author, date, word, view, comment, like, include_title)

 

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值