使用requests请求网页时,返回的页面信息有时是乱码,如下代码
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/65.0.3325.181 Safari/537.36'
}
def get_all(url,key):
params = {
'keyword':key,
'enc':'utf-8'
}
response = requests.get(url=url,params=params,headers=headers)
with open('jd.html','w',encoding='utf-8') as f:
f.write(response.text)
if __name__ == '__main__':
key = input('输入搜索内容:')
url = 'https://search.jd.com/Search?'
get_all(url,key)
返回的部分内容;
--------------------------------------------------------------------
注:如果你对python感兴趣,我这有个学习Python基地,里面有很多学习资料,感兴趣的+Q群:895817687
--------------------------------------------------------------------
<div class="p-name p-name-type-2">
<a target="_blank" title="æžåœ°ä¼ 说çŸè¢–Tæ¤ç”·å¤å£éŸ©ç‰ˆæ½®æµçŸè¢–男士åŠè¢–tæ¤åœ†é¢†ä¼‘闲修身大ç 五分袖潮牌é’å¹´å¦ç”Ÿtæ¤è¡£æœç”·è£
430黄色 M" href="//item.jd.com/51029271063.html" οnclick="searchlog(1,51029271063,8,1,'','flagsClk=1077936264')">
<