In the following code, the URL contains Chinese characters:
class Spider(object):
    def __init__(self):
        self.begin_page = int(input("Start page: "))
        self.end_page = int(input("End page: "))
        # The URL path contains non-ASCII (Chinese) characters
        self.base_url = 'https://www.lagou.com/zhaopin/大数据/'
Running the code raises this error:
UnicodeEncodeError: 'ascii' codec can't encode characters in position 13-14: ordinal not in range(128)
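The same kind of failure can be reproduced without any network access. This is a minimal sketch, assuming the error comes from the URL being encoded as ASCII when the request is sent; the names path and error_name are illustrative and not part of the original code.

```python
# Chinese characters have no ASCII representation, so encoding the
# URL path as ASCII fails with the same exception type as the request.
path = '/zhaopin/大数据/'

try:
    path.encode('ascii')
    error_name = None
except UnicodeEncodeError as exc:
    error_name = type(exc).__name__
    error_reason = exc.reason
    print(error_name, error_reason)
```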
Solution:
Use urllib.parse.quote() to percent-encode the Chinese part of the URL.
Change the code as shown below, and it runs successfully:
import urllib.parse

class Spider(object):
    def __init__(self):
        self.begin_page = int(input("Start page: "))
        self.end_page = int(input("End page: "))
        word = '大数据'
        # Percent-encode the Chinese keyword so the URL is pure ASCII
        word = urllib.parse.quote(word)
        self.base_url = 'https://www.lagou.com/zhaopin/' + word + '/'
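
To check what quote() actually produces, the encoding step can be tried on its own. This is only a sketch: build_url and BASE are hypothetical helper names introduced here, and the URL pattern is the one from the example above.

```python
import urllib.parse

# URL pattern from the example above; {} is where the keyword goes.
BASE = 'https://www.lagou.com/zhaopin/{}/'

def build_url(keyword):
    """Percent-encode the keyword (UTF-8 by default) and insert it into the path."""
    return BASE.format(urllib.parse.quote(keyword))

url = build_url('大数据')
print(url)

# Alternative: quote the whole URL while leaving ':' and '/' unescaped.
full = urllib.parse.quote('https://www.lagou.com/zhaopin/大数据/', safe=':/')

# unquote() reverses the encoding, which is handy for debugging.
decoded = urllib.parse.unquote(url)
```

Quoting only the keyword is usually the safer choice, since quoting a whole URL with the default safe characters would also escape the ':' after the scheme.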