Fetching a web page with Python and writing it to a file

csdn565973850

Published 2019-12-31 16:33:32

License: CC 4.0 BY-SA

Category: python. Tags: python

Article link: https://blog.youkuaiyun.com/csdn565973850/article/details/103785443

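# urllib.request sends the request and returns the response
import urllib.request

# chardet is a third-party package; it is only needed for the optional encoding check below
import chardet

# urlopen() takes the URL to request and returns a response object.
# The HTTP RFC expects a path of "/" even for the site root, so the URL ends with a slash.
page = urllib.request.urlopen('http://www.dongao.com/')
# read() reads the response body.
# The result is bytes (binary data), not a str.
html = page.read()
# Optional: print the encoding that chardet detects for the page
# print(chardet.detect(html))
# print(html)
# To turn the bytes into a str, call decode() with the page's encoding.
# utf-8 is assumed here; decode() raises UnicodeDecodeError if the page uses another encoding.
data = html.decode('utf-8')
# print(data)
# Open dongao.txt for writing in binary mode ('wb'), because html is bytes.
# The with block closes the file automatically, even if the write fails.
with open('D:/360Browser/dongao.txt', 'wb') as file:
    # Write the raw response bytes
    file.write(html)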
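
The script above writes the raw bytes and leaves the decoded text unused. If the goal is to save readable text instead, one possible variation is sketched below. It is only a sketch under a few assumptions: the helper names `decode_body`, `save_text`, and `fetch_and_save` are made up for this example, the charset is taken from the response's Content-Type header when the server sends one, and utf-8 is used as a fallback guess.

```python
import urllib.request


def decode_body(raw, declared_charset=None, fallback="utf-8"):
    """Decode response bytes, preferring the charset the server declared.

    Hypothetical helper for this example, not part of urllib.
    """
    try:
        # errors="replace" substitutes U+FFFD for undecodable bytes instead of raising
        return raw.decode(declared_charset or fallback, errors="replace")
    except LookupError:
        # The declared charset name is not a codec Python knows; use the fallback
        return raw.decode(fallback, errors="replace")


def save_text(path, text):
    """Write text to path as UTF-8; the with block closes the file."""
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)


def fetch_and_save(url, path):
    """Download url and store its decoded text at path."""
    # The response object is a context manager, so the connection is closed on exit
    with urllib.request.urlopen(url, timeout=10) as page:
        raw = page.read()
        # Charset from the Content-Type header, or None if the server sent none
        charset = page.headers.get_content_charset()
    save_text(path, decode_body(raw, charset))
```

With the URL and path from the script above, a call would look like `fetch_and_save('http://www.dongao.com/', 'D:/360Browser/dongao.txt')`. Keeping the decoding and the file writing in separate functions makes each step easy to try without a network connection.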