Re 正则提取字符串中所有的链接-优快云博客

本文链接：https://blog.youkuaiyun.com/songhao8080/article/details/103669900

提取规则如下

根据URL的规则，设计一段正则表达式，提取出一段文本中的所有网址

Python

import <a href="https://www.168seo.cn/tag/re" title="View all posts in re" target="_blank">re</a> <a href="https://www.168seo.cn/tag/re" title="View all posts in re" target="_blank">re</a>.findall('(https?://[a-zA-Z0-9\.\?/%-_]*)',r.text)

import re

re . findall ( '(https?://[a-zA-Z0-9\.\?/%-_]*)' , r . text )

测试：

Python

In [13]: import re,<a href="https://www.168seo.cn/tag/requests" title="View all posts in requests" target="_blank">requests</a> In [14]: url = "https://www.168seo.cn" In [15]: r = <a href="https://www.168seo.cn/tag/requests" title="View all posts in requests" target="_blank">requests</a>.get(url) In [16]: r Out[16]: <Response [200]> In [17]: re.findall('(https?://[a-zA-Z0-9\.\?/%-_]*)',r.text)