Python爬虫入门<五>—DebugLog

最新推荐文章于 2025-06-21 23:36:37 发布

后台服务器开发

最新推荐文章于 2025-06-21 23:36:37 发布

阅读量2.4k

点赞数

CC 4.0 BY-SA版权

分类专栏： Python Python成长之路文章标签： python bug DebugLog 爬虫调试

本文链接：https://blog.youkuaiyun.com/Travelerwz/article/details/79780916

Python 同时被 2 个专栏收录

27 篇文章

订阅专栏

Python成长之路

18 篇文章

订阅专栏

本文介绍了如何通过Python的urllib库开启DebugLog，帮助在爬虫开发过程中调试错误并打印运行日志。通过设置HTTPHandler和HTTPSHandler的debuglevel为1，并创建自定义opener对象，最终实现对网站请求过程的详细记录。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

Python爬虫入门<五>——DebugLog

标签：DebugLog

DebugeLog

1.首先，我们先先来了解一下DebugLog是什么？
我们经常说的BUG就是错误，Debug就是调试错误，LOG代表日志，所以，，，DebugLog就是调试日志，这个可以帮助我们在运行程序的过程中打印日志，接下来，我们就开始开启DebugLog：
1.使用urllib.request.HTTPHandler()和urllib.request.HTTPSHandler()将debuglevel是指为1；
2.使用urllib.request.build_opener()创建自定义opener对象，使用1中的值作为对象的参数
3.urllib.request.install_opener()创建全局默认的opener对象，我们使用urlopen()的时候，就会使用opener对象。
4.继续后面操作
代码：

import urllib.request

httphd = urllib.request.HTTPHandler(debuglevel=1)
httpshd = urllib.request.HTTPSHandler(debuglevel=1)

opener = urllib.request.build_opener(httphd,httpshd)
urllib.request.install_opener(opener)
data = urllib.request.urlopen("http://www.baidu.com")

结果：

C:\Users\Administrator\PycharmProjects\untitled2\venv\Scripts\python.exe F:/python/python代码/venv/python入门五.py
send: b'GET / HTTP/1.1\r\nAccept-Encoding: identity\r\nHost: www.baidu.com\r\nUser-Agent: Python-urllib/3.6\r\nConnection: close\r\n\r\n'
reply: 'HTTP/1.1 200 OK\r\n'
header: Date header: Content-Type header: Transfer-Encoding header: Connection header: Vary header: Set-Cookie header: Set-Cookie header: Set-Cookie header: Set-Cookie header: Set-Cookie header: Set-Cookie header: P3P header: Cache-Control header: Cxy_all header: Expires header: X-Powered-By header: Server header: X-UA-Compatible header: BDPAGETYPE header: BDQID header: BDUSERID 
Process finished with exit code 0