UnicodeEncodeError: 'gbk' codec can't encode character u'\xa0' in position 3621: illegal multibyte sequence
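import urllib2
from bs4 import BeautifulSoup

# Fetch the page and parse it (Python 2)
response = urllib2.urlopen('http://www.baidu.com')
html = response.read()
soup = BeautifulSoup(html, 'html.parser')
# prettify() returns a unicode string; printing it is what triggers the error
a = soup.prettify()
print a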
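Running this code produces the error shown at the top.
The cause is that the script is running on Windows, where the console uses the GBK encoding and cannot represent the non-breaking space u'\xa0'.
The fix is to run the script in Cygwin or IDLE instead.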
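
If switching terminals is not an option, another possible workaround (not from the original note) is to encode the text yourself and let the codec substitute characters it cannot represent. The sketch below assumes the console encoding is GBK; encode_for_console is a hypothetical helper name, and the sample string is made up for illustration.

```python
# -*- coding: utf-8 -*-
# Hypothetical helper, not part of the original script.
# Assumption: the target console encoding is GBK.

def encode_for_console(text, encoding="gbk"):
    """Encode unicode text, replacing unencodable characters with '?'."""
    return text.encode(encoding, "replace")

# Made-up sample containing the non-breaking space from the error message.
sample = u"foo\xa0bar"

try:
    sample.encode("gbk")  # strict mode, the default: raises UnicodeEncodeError
    strict_failed = False
except UnicodeEncodeError:
    strict_failed = True

# With the "replace" error handler the call succeeds.
safe = encode_for_console(sample)
```

In the Python 2 script above, this would probably be used as print encode_for_console(soup.prettify()). The trade-off is that every character missing from GBK is shown as a question mark, so the output is lossy.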