python 3 爬虫图片菜鸟爬实例

最新推荐文章于 2021-02-12 11:03:43 发布

IT_zwf

最新推荐文章于 2021-02-12 11:03:43 发布

阅读量1.4k

点赞数 1

CC 4.0 BY-SA版权

文章标签： python菜鸟爬虫菜鸟爬图片爬虫网站图片

本文链接：https://blog.youkuaiyun.com/qq_33483795/article/details/80697871

这篇博客介绍了Python 3爬虫如何实现网站图片的抓取，包括如何将图片写入指定文件和优化爬虫代码。博主强调关注他以获取更多持续更新的内容。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

爬取网站

http://616pic.com/beijing/

先上码

怎么写入到指定的文件 , 怎么更简洁爬虫 , 关注我!!! 持续更新!!!

原理和前面那个一样

# _*_coding:utf-8_*_
from bs4 import BeautifulSoup
import urllib.request
import requests
header = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko)\
 Chrome/67.0.3396.62 Safari/537.36'}
url = r"http://616pic.com/beijing/"
req = urllib.request.Request(url, headers=header)
response = urllib.request.urlopen(req)
soup = BeautifulSoup(response, 'html.parser')
result = soup.findAll(attrs={'class': 'lazy'})
for i in result:
    i = str(i)
    i = i.split('nal=\"', 1)[1].split('\" src')[0]
    print(i, type(i), '\n')
    res = requests.get(i)
    j = i.split('bg', 1)[1].split('/', 4)[4].split('.jpg', 1)[0]
    print(j)
    new_pic = open('./%s.jpg' % j, 'wb')
    new_pic.write(res.content)
    new_pic.close()
print('finished')