Python爬虫

最新推荐文章于 2025-09-09 11:31:23 发布

weixin_30509393

最新推荐文章于 2025-09-09 11:31:23 发布

阅读量53

点赞数

CC 4.0 BY-SA版权

文章标签：爬虫 python

原文链接：http://www.cnblogs.com/Jims2016/p/5660834.html

1.需求：从网站上获取整个页面

import urllib
import re
import sys
 
def downloadPage(url,name):
    html = urllib.urlopen(url).read()
    fp = open(name+".html","w")
    fp.write(html)
    fp.close()
    return     html

html= downloadPage("http://hao123.com","hao123")
web = downloadPage("http://www.innocellence.com","Innocellence")
print html