Scraping News with Python and BeautifulSoup (bs4)


License: CC 4.0 BY-SA


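This post shows one way to scrape Sina news with Python's BeautifulSoup library. The requests library fetches the page, BeautifulSoup parses the HTML, and a CSS selector picks out every entry in the news list so that headlines and publication times can be collected in one pass.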

Scraping Sina news with BeautifulSoup in Python

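from bs4 import BeautifulSoup
import requests

url = "http://roll.finance.sina.com.cn/finance/zq1/ssgs/index.shtml"
res = requests.get(url, timeout=10)  # a timeout keeps the script from hanging
res.raise_for_status()  # stop early on an HTTP error status
res.encoding = 'gb2312'  # decode the response body as GB2312
soup = BeautifulSoup(res.text, 'html.parser')
for news in soup.select('.list_009 li'):  # each <li> is one entry in the news list
    link = news.select_one('a')
    stamp = news.select_one('span')
    if link is None or stamp is None:  # skip items without a headline or a time
        continue
    title = link.text  # headline
    pub_time = stamp.text  # publication time (renamed so it does not shadow the time module)
    print(title, pub_time)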
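The extraction loop above can also be tried without touching the network, which helps when the live page is unreachable or its layout has changed. The sketch below is only an illustration: the `extract_news` helper and the `SAMPLE_HTML` markup are hypothetical and written to imitate the `.list_009 li` structure the selectors expect. They are not copied from the real Sina page.

```python
from bs4 import BeautifulSoup

# Hypothetical markup imitating the structure the selectors expect;
# it is not taken from the real page.
SAMPLE_HTML = """
<ul class="list_009">
  <li><a href="http://example.com/a.shtml">Company A posts annual results</a><span>(06-19 21:44)</span></li>
  <li><a href="http://example.com/b.shtml">Company B announces buyback</a><span>(06-19 20:10)</span></li>
  <li class="divider"></li>
</ul>
"""


def extract_news(html):
    """Return a list of (title, time, link) tuples from a news-list page."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for item in soup.select(".list_009 li"):
        link = item.select_one("a")
        stamp = item.select_one("span")
        if link is None or stamp is None:
            # Skip list items that lack a headline or a timestamp.
            continue
        rows.append(
            (link.get_text(strip=True), stamp.get_text(strip=True), link.get("href"))
        )
    return rows


if __name__ == "__main__":
    for title, pub_time, href in extract_news(SAMPLE_HTML):
        print(title, pub_time, href)
```

Keeping the parsing in a function that takes an HTML string means the same code can be fed `res.text` from the live request or a saved copy of the page.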