Python网络爬虫学习

最新推荐文章于 2023-07-20 15:25:31 发布

ffdfffxfxf

最新推荐文章于 2023-07-20 15:25:31 发布

阅读量182

点赞数

CC 4.0 BY-SA版权

本文链接：https://blog.youkuaiyun.com/ffdfffxfxf/article/details/103351892

   最近有时间学习在慕课网上跟着嵩天老师上他的Python网络爬虫与信息提取这门课，想着可以写些博客将学的爬虫知识总结起来。

1 Requests库入门

1.1 Requests库的安装

    Win平台下：前提是安装好Python，在cmd中执行“pip installl requests”。其他方法的话可以在网上搜索。

1.2 Requests库的一些主要方法及其使用

在这里插入图片描述

1.2.1 Requests库的get方法

r=requests.get(url) 其中get返回的是response对象。

response对象的属性：
在这里插入图片描述
其中，status_code为200是表示正常，404或其他为异常。
另外，encoding和apparent_encoding的区别在于：
get方法的使用
requests.request(method, url, **kwargs)

1.2.2 request方法的使用。

requests.request() 构造一个请求，支撑put、head等其他方法。
requests.request(method, url, **kwargs）
在这里插入图片描述
url：页面的url链接。
**kwargs：控制访问项。可以为：params data json headers cookies auth files timeout proxies allow_redirects stream verify cert 。