python一个URL拼接的写法

最新推荐文章于 2024-06-17 22:04:05 发布

原创最新推荐文章于 2024-06-17 22:04:05 发布 · 1w 阅读

1 ·

CC 4.0 BY-SA版权

本文介绍了一种通过网络获取的强大URL拼接方法，包括如何使用python的urlparse和urljoin库进行URL路径的规范化和拼接。通过示例演示了在不同场景下如何正确地拼接URL，确保网页资源的准确引用。

此方法来自网络，很强大

from urlparse import urljoin
from urlparse import urlparse
from urlparse import urlunparse
from posixpath import normpath

def myjoin(base, url):
    url1 = urljoin(base, url)
    arr = urlparse(url1)
    path = normpath(arr[2])
    return urlunparse((arr.scheme, arr.netloc, path, arr.params, arr.query, arr.fragment))

经测试可正确拼接下列形式

base1 = 'http://www.bagtree.com/' url1 = '../../themes/bagtree_2011/images/pinzhi.gif'
base2 = 'http://info.ceo.hc360.com/list/qygl-ldl.shtml' url2 = '/2011/11/250020188368.shtml'
base3 = 'http://info.ceo.hc360.com/2012/07/190833206838.shtml' url3 = '190833206838-2.shtml'