Scraping Those Kinds of Pictures with Python

Scraping Baidu Images

Latest recommended article published on 2024-12-06 15:28:43

_Auggie

License: CC 4.0 BY-SA

Category: Python. Tags: python, web scraping, images

Article link: https://blog.youkuaiyun.com/Kiddie_Lu/article/details/79829928

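This article presents a simple Python program that fetches images from Baidu image search. It sends an HTTP request to retrieve each results page, uses a regular expression to extract the image links from the page source, and downloads each image to a specified local folder.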
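
The parsing step can be tried offline before touching the network. The sketch below applies the same `"objURL":"(.*?)"` pattern to a small hand-written string. `SAMPLE_HTML` and the `example.com` links are made-up stand-ins for a results page, and `extract_image_links` is a hypothetical helper name, not part of the original script.

```python
import re

# Hypothetical fragment imitating the JSON-like data embedded in a results page.
SAMPLE_HTML = (
    '{"thumbURL":"http://example.com/thumb/1.jpg",'
    '"objURL":"http://example.com/img/1.jpg"},'
    '{"objURL":"http://example.com/img/2.png"}'
)

# Same pattern as the script: capture the text between the quotes after "objURL".
OBJ_URL_PATTERN = re.compile(r'"objURL":"(.*?)"')


def extract_image_links(html_code):
    """Return every objURL value found in html_code, in order of appearance."""
    return OBJ_URL_PATTERN.findall(html_code)


if __name__ == "__main__":
    for link in extract_image_links(SAMPLE_HTML):
        print(link)
```

The non-greedy `(.*?)` stops at the first closing quote, so each match holds a single link rather than running on to the end of the line.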

import re
import requests

def get_page():
    """Process each hard-coded Baidu image search results page."""
    urls = [
        'http://image.baidu.com/search/flip?tn=baiduimage&ipn=r&ct=201326592&cl=2&lm=-1&st=-1&fm=result&fr=&sf=1&fmq=1515928360596_R&pv=&ic=0&nc=1&z=&se=1&showtab=0&fb=0&width=&height=&face=0&istype=2&ie=utf-8&ctd=1515928360597%5E00_1288X691&word=%E7%BE%8E%E5%A5%B3',
        'http://image.baidu.com/search/flip?tn=baiduimage&ipn=r&ct=201326592&cl=2&lm=-1&st=-1&fm=result&fr=&sf=1&fmq=1515928395065_R&pv=&ic=0&nc=1&z=&se=1&showtab=0&fb=0&width=&height=&face=0&istype=2&ie=utf-8&ctd=1515928395065%5E00_1288X691&word=%E7%BE%8E',
    ]
    for url in urls:
        get_img_link(url)

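def get_img_link(url):
    """Fetch one results page and download every image it links to."""
    # A timeout keeps the script from hanging on an unresponsive server.
    r = requests.get(url, timeout=10)
    r.encoding = 'utf-8'
    html_code = r.text
    # Each full-size image link appears in the page source as "objURL":"<link>".
    reg = re.compile(r'"objURL":"(.*?)"')
    imgs = reg.findall(html_code)
    for img in imgs:
        print(img)
        down_img(img)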

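def down_img(url):
    """Download one image into the local picture folder."""
    # Skip images that cannot be fetched instead of aborting the whole run.
    try:
        web_data = requests.get(url, timeout=10)
        web_data.raise_for_status()
    except requests.RequestException as exc:
        print('skipped {}: {}'.format(url, exc))
        return
    # Use the last path segment as the file name, dropping any query string.
    filename = url.split('/')[-1].split('?')[0]
    # The target folder is hard-coded and must already exist.
    targetfile = 'C:/Users/Xuze_Lu/Desktop/picture/{}'.format(filename)
    with open(targetfile, 'wb') as f:
        f.write(web_data.content)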

if __name__ == '__main__':
    get_page()
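
The script derives each file name with `url.split('/')[-1]` and writes into a fixed folder. Image links often carry query strings or end in a slash, and the folder may not exist yet. A more defensive variant might look like the sketch below. `filename_from_url` and `build_target_path` are hypothetical helpers, and the fallback name `image.jpg` is an assumption chosen for illustration, not something the original script defines.

```python
import posixpath
from pathlib import Path
from urllib.parse import unquote, urlsplit


def filename_from_url(url, default="image.jpg"):
    """Return the last path segment of url, ignoring query string and fragment."""
    path = urlsplit(url).path
    name = posixpath.basename(unquote(path))
    # Fall back to the (assumed) default when the path ends in a slash.
    return name or default


def build_target_path(url, folder):
    """Create folder if it is missing and return the path to save the image to."""
    target_dir = Path(folder)
    target_dir.mkdir(parents=True, exist_ok=True)
    return target_dir / filename_from_url(url)


if __name__ == "__main__":
    print(filename_from_url("http://example.com/img/photo.jpg?size=large"))
    print(filename_from_url("http://example.com/gallery/"))
```

In `down_img`, the result of `build_target_path(url, folder)` could be passed to `open(..., 'wb')` in place of the formatted string. Two links that share a file name would still overwrite each other, so a counter or hash suffix may be worth adding.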