1、使用API请求数据:
初识API:浏览器中输入地址查看: https://api.github.com/search/repositories?q=language:python&sort=stars
2、安装requests:
urllib.request库 是 Python3 自带的模块(不需要下载,导入即可使用)
request = urllib.request.Request("https://www.baidu.com/")
# Request对象作为urlopen()方法的参数,发送给服务器并接收响应
response = urllib.request.urlopen(request)
html = response.read()
print(html)
返回中文乱码:
response=requests.get('www.test.com') response.encoding = response.apparent_encoding |
获取网页不完整:
headers = { 'User-Agent': 'Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:46.0) Gecko/20100101 Firefox/46.0', 'Content-Type': 'application/x-www-form-urlencoded', 'Connection': 'Keep-Alive', 'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8' } html = requests.get('https://www.biqukan.com/1_1094/5403177.html', headers=headers) html.encoding = html.apparent_encoding print(html.text)