Hi everyone, I'm 天空之城. Today's small treat: scraping news data from Toutiao (今日头条).
Without further ado, here is the code:
import requests
headers={'user-agent':'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_13_6) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/71.0.3578.98 Safari/537.36'}
url='https://www.toutiao.com/api/search/content/'
offset=0
a='''aid: 24
app_name: web_search
offset: 0
format: json
keyword: 新冠
autoload: true
count: 20
en_qc: 1
cur_tab: 1
from: search_tab
pd: synthesis
timestamp: 1601455124814
_signature: qT.UrgAgEBCHDks5xnZLoKk-lbAAPZafqzWaCfcqzTO.5gltlRobNika-oA4RC4X1n.FANe3Ud1PeuLrZvU6i5sFp50kn8a9Yemog-LBiItItT0cXhEZ4Yuac4IcxFIQ8sj'''
# Use the live Request Headers captured on your own computer
params = dict([line.split(": ",1) for line in a.split("\n")])
res=requests.get(url,headers=headers,params=params)
articles=res.json()
data=articles['data']
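for i in data:
    try:
        # Keep only the title, publisher name, and comment count
        list1=[i['title'],i["media_name"],i["comment_count"]]
        print(list1)
    except KeyError:
        # Some entries lack one of these fields, so skip them
        pass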
Screenshot of the output below:
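
The script above parses the copied parameters in one line and leaves `offset` fixed at 0. Here is a minimal sketch of how the same steps could be split into small reusable functions. The names `parse_params`, `page_params`, and `extract_rows` are hypothetical helpers, not part of the original script. It is also only an assumption that the API pages by advancing `offset` in steps of `count`, and a copied `_signature` may not be accepted for other offsets.

```python
def parse_params(block):
    """Turn 'key: value' lines copied from the browser into a dict."""
    params = {}
    for line in block.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines left over from copy and paste
        key, _, value = line.partition(":")  # split on the first colon only
        params[key.strip()] = value.strip()
    return params


def page_params(base, page, count=20):
    """Return a copy of base with offset set for a zero-based page number.

    Assumption: the API pages by offset = page * count.
    """
    params = dict(base)  # copy so the original dict is not modified
    params["count"] = str(count)
    params["offset"] = str(page * count)
    return params


def extract_rows(payload):
    """Collect [title, media_name, comment_count] from complete entries."""
    wanted = ("title", "media_name", "comment_count")
    rows = []
    for item in payload.get("data") or []:  # 'data' may be missing or None
        if all(key in item for key in wanted):
            rows.append([item[key] for key in wanted])
    return rows
```

To plug these in, build `params = parse_params(a)`, then call `requests.get(url, headers=headers, params=page_params(params, page))` for each page and pass `res.json()` to `extract_rows`.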