10个python中使用正则表达式的场景！

原创已于 2024-11-19 17:47:02 修改 · 1.1k 阅读

8 ·

CC 4.0 BY-SA版权

文章标签：

#python #正则表达式 #开发语言

于 2024-10-08 14:46:43 首次发布

在Python编程中,正则表达式（regex）是强大的工具,能够处理各种复杂的字符串操作.以下是10个常见的正则表达式使用场景,并附上代码示例.

1. 验证电子邮件地址

验证用户输入的电子邮件地址是否符合格式要求.

import re

def validate_email(email):
    pattern = r"^[a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+\.[a-zA-Z0-9-.]+$"
    return re.match(pattern, email) is not None

# 示例
print(validate_email("test@example.com"))  # True
print(validate_email("invalid-email"))     # False

2. 匹配电话号码

提取或验证电话号码格式,如国际电话格式 +1-800-555-1234.


def validate_phone(phone):
    pattern = r"^\+?\d{1,3}[-.\s]?\(?\d{1,4}\)?[-.\s]?\d{1,4}[-.\s]?\d{1,9}$"
    return re.match(pattern, phone) is not None

# 示例
print(validate_phone("+1-800-555-1234"))  # True
print(validate_phone("555-1234"))         # True

3. 查找所有URL

从文本中提取所有的URL.

def find_urls(text):
    pattern = r"https?://[^\s]+"
    return re.findall(pattern, text)

# 示例
text = "Check out https://example.com and http://google.com."
print(find_urls(text))  # ['https://example.com', 'http://google.com']

4. 验证日期格式

验证日期是否符合 YYYY-MM-DD 的格式.

def validate_date(date):
    pattern = r"^\d{4}-\d{2}-\d{2}$"
    return re.match(pattern, date) is not None

# 示例
print(validate_date("2024-09-30"))  # True
print(validate_date("30-09-2024"))  # False

5. 从字符串中提取所有数字

提取文本中所有的数字.

def extract_numbers(text):
    return re.findall(r"\d+", text)

# 示例
text = "I have 3 apples, 4 bananas, and 5 oranges."
print(extract_numbers(text))  # ['3', '4', '5']

6. 替换敏感词汇

使用正则表达式替换文本中的敏感词汇,常用于内容审核.

def extract_numbers(text):
    return re.findall(r"\d+", text)

# 示例
text = "I have 3 apples, 4 bananas, and 5 oranges."
print(extract_numbers(text))  # ['3', '4', '5']

7. 提取文件扩展名

从文件名中提取扩展名.

def get_extension(filename):
    pattern = r"\.([a-zA-Z0-9]+)$"
    match = re.search(pattern, filename)
    return match.group(1) if match else None

# 示例
print(get_extension("document.pdf"))  # 'pdf'
print(get_extension("archive.tar.gz"))  # 'gz'

8. 验证密码强度

使用正则表达式验证密码是否满足复杂性要求,比如至少包含一个大写字母、一个小写字母、一个数字和一个特殊字符.

def validate_password(password):
    pattern = r"^(?=.*[a-z])(?=.*[A-Z])(?=.*\d)(?=.*[@$!%*?&])[A-Za-z\d@$!%*?&]{8,}$"
    return re.match(pattern, password) is not None

# 示例
print(validate_password("StrongPass1!"))  # True
print(validate_password("weakpass"))      # False

9. 拆分字符串

使用正则表达式按照多个分隔符拆分字符串.

def split_text(text):
    pattern = r"[ ,;]+"
    return re.split(pattern, text)

# 示例
text = "apple, orange; banana grape"
print(split_text(text))  # ['apple', 'orange', 'banana', 'grape']

10. 删除多余的空格

通过正则表达式删除字符串中的多余空格,只保留一个空格.

def clean_whitespace(text):
    pattern = r"\s+"
    return re.sub(pattern, " ", text).strip()

# 示例
text = "  This  is   a  messy   sentence.   "
cleaned_text = clean_whitespace(text)
print(cleaned_text)  # "This is a messy sentence."