记录的身份信息 cookie
已经登陆过的cookie来进入网站

在这里插入代码片
没有登陆过的cookie来进入网站
from urllib.request import Request,urlopen
from fake_useragent import UserAgent
from urllib.parse import urlencode
from urllib.request import HTTPCookieProcessor,build_opener
#登录
login_url = “https://music.163.com/weapi/login/qrcode/client/login?csrf_token=”
headers = {
“User-Agent”:UserAgent
}
form_data = {
“user”:“12312412345”,
“passwword”:“1q3ww131”
}
f_data = urlencode(form_data)
request = Request(login_url,headers = headers,data = f_data)
handler = HTTPCookieProcessor()
opener = build_opener(handler)
response = urlopen(request)
response = opener.open(request)
#访问页面
info_url = “https://music.163.com/”
request = Request(info_url,headers = headers)
response = opener.open(request)
print(response.read().decode())
已登录的源代码
from urllib.request import Request,urlopen
from fake_useragent import UserAgent
url = "https://music.163.com/"
headers = {
"User-Agent":UserAgent
"Cookie":"_ntes_nnid=9cb6ff3314be2f27234c191923be4967,1620718658806"
}
request = Request(url,headers = headers)
response = urlopen(request)
print(response.read().decode())
#未登录的源代码
from urllib.request import Request,urlopen
from fake_useragent import UserAgent
from urllib.parse import urlencode
from urllib.request import HTTPCookieProcessor,build_opener
#登录
login_url = "https://music.163.com/weapi/login/qrcode/client/login?csrf_token="
headers = {
"User-Agent":UserAgent
}
form_data = {
"user":"12312412345",
"passwword":"1q3ww131"
}
f_data = urlencode(form_data)
request = Request(login_url,headers = headers,data = f_data)
handler = HTTPCookieProcessor()
opener = build_opener(handler)
response = urlopen(request)
response = opener.open(request)
#访问页面
info_url = "https://music.163.com/"
request = Request(info_url,headers = headers)
response = opener.open(request)
print(response.read().decode())

本文介绍了如何在Python爬虫中使用cookie,包括已登录和未登录两种情况。通过示例代码展示了登录过程,并利用HTTPCookieProcessor处理cookie,访问音乐网站获取已登录状态的页面源代码。
2万+

被折叠的 条评论
为什么被折叠?



