Python 分割字符串时有多个分隔符怎么处理？

原创已于 2023-02-27 22:43:21 修改 · 4.4k 阅读

10 ·

CC 4.0 BY-SA版权

文章标签：

#正则表达式

于 2021-07-27 22:05:52 首次发布

Python 专栏收录该内容

215 篇文章

订阅专栏

本文介绍如何使用Python的字符串方法str.split()及正则表达式re模块进行字符串分割，包括处理多种分隔符的情况，并提供了详细的示例代码。

Python3.11

Conda

Python

Python 是一种高级、解释型、通用的编程语言，以其简洁易读的语法而闻名，适用于广泛的应用，包括Web开发、数据分析、人工智能和自动化脚本

用法

帮助

正则

用法

string.split( sep = None, maxsplit = -1)
string 要操作字符串
sep 分隔符，默认值为whitespace空白符
maxsplit 最大分割次数，默认值为-1，表示无限制

如果同时有多个分隔符怎么分割呢？
可以用循环多次分割来实现，例如：

>>> s = '6[5,12]3[2,6]1;35]67[8;9;11]12'
							      
>>> for j in '[],;':
	t=[i.split(j) for i in t]
	t=[i for j in t for i in j]

>>> t			      
['6', '5', '12', '3', '2', '6', '1', '35', '67', '8', '9', '11', '12']
>>>

帮助

字符串方法str.split()帮助：

Help on method_descriptor:

split(self, /, sep=None, maxsplit=-1)
Return a list of the words in the string, using sep as the delimiter string.

sep
The delimiter according which to split the string.
None (the default value) means split according to any whitespace,
and discard empty strings from the result.
maxsplit
Maximum number of splits to do.
-1 (the default value) means no limit.

正则

懂正则表达式的可以一步到位：

>>> import re
>>> s = '6[5,12]3[2,6]1;35]67[8;9;11]12'
>>> re.split('\[|\]|,|;',s)
['6', '5', '12', '3', '2', '6', '1', '35', '67', '8', '9', '11', '12']

注：竖线 | 是分隔符的分隔；也可用[]包括分隔符，[]中的不用竖线分开；两种方法都要注意转义符的使用。

>>> import re
>>> s = '6[5,12]3[2,6]1;35]67[8;9;11]12'
>>> re.split('[\[\],;]',s)
['6', '5', '12', '3', '2', '6', '1', '35', '67', '8', '9', '11', '12']

附：常见正则

1.测试正则表达式是否匹配字符串的全部或部分

regex=ur"" #正则表达式
if re.search(regex, subject):
do_something()
else:
do_anotherthing()

2.测试正则表达式是否匹配整个字符串

regex=ur"\Z" #正则表达式末尾以\Z结束
if re.match(regex, subject):
do_something()
else:
do_anotherthing()

3.创建一个匹配对象，然后通过该对象获得匹配细节(Create an object with details about how the regex matches (part of) a string)

regex=ur"" #正则表达式
match = re.search(regex, subject)
if match:
# match start: match.start()
# match end (exclusive): atch.end()
# matched text: match.group()
do_something()
else:
do_anotherthing()

4.获取正则表达式所匹配的子串(Get the part of a string matched by the regex)

regex=ur"" #正则表达式
match = re.search(regex, subject)
if match:
result = match.group()
else:
result = ""

5. 获取捕获组所匹配的子串(Get the part of a string matched by a capturing group)

regex=ur"" #正则表达式
match = re.search(regex, subject)
if match:
result = match.group(1)
else:
result = ""

6. 获取有名组所匹配的子串(Get the part of a string matched by a named group)
regex=ur"" #正则表达式
match = re.search(regex, subject)
if match:
result = match.group"groupname")
else:
result = ""

7. 将字符串中所有匹配的子串放入数组中(Get an array of all regex matches in a string)

result = re.findall(regex, subject)

8.遍历所有匹配的子串(Iterate over all matches in a string)

for match in re.finditer(r"<(.*?)\s*.*?/\1>", subject)
# match start: match.start()
# match end (exclusive): atch.end()
# matched text: match.group()

9.通过正则表达式字符串创建一个正则表达式对象(Create an object to use the same regex for many operations)

reobj = re.compile(regex)
10.用法１的正则表达式对象版本（use regex object for if/else branch whether (part of) a string can be matched）

reobj = re.compile(regex)
if reobj.search(subject):
do_something()
else:
do_anotherthing()

11.用法２的正则表达式对象版本（use regex object for if/else branch whether a string can be matched entirely）

reobj = re.compile(r"\Z")　＃正则表达式末尾以\Z 结束
if reobj.match(subject):
do_something()
else:
do_anotherthing()

12.创建一个正则表达式对象，然后通过该对象获得匹配细节（Create an object with details about how the regex object matches (part of) a string）

reobj = re.compile(regex)
match = reobj.search(subject)
if match:
# match start: match.start()
# match end (exclusive): atch.end()
# matched text: match.group()
do_something()
else:
do_anotherthing()

13.用正则表达式对象获取匹配子串（Use regex object to get the part of a string matched by the regex）文章来源地址https://www.yii666.com/blog/126608.html

reobj = re.compile(regex)
match = reobj.search(subject)
if match:
result = match.group()
else:
result = ""

14.用正则表达式对象获取捕获组所匹配的子串（Use regex object to get the part of a string matched by a capturing group）

reobj = re.compile(regex)
match = reobj.search(subject)
if match:
result = match.group(1)
else:
result = ""

15.用正则表达式对象获取有名组所匹配的子串（Use regex object to get the part of a string matched by a named group）

reobj = re.compile(regex)
match = reobj.search(subject)
if match:
result = match.group("groupname")
else:
result = ""

16.用正则表达式对象获取所有匹配子串并放入数组（Use regex object to get an array of all regex matches in a string）

reobj = re.compile(regex)
result = reobj.findall(subject)

17.通过正则表达式对象遍历所有匹配子串（Use regex object to iterate over all matches in a string）

reobj = re.compile(regex)
for match in reobj.finditer(subject):
# match start: match.start()
# match end (exclusive): match.end()
# matched text: match.group()

18.字符串替换
1).替换所有匹配的子串

#用newstring替换subject中所有与正则表达式regex匹配的子串
result = re.sub(regex, newstring, subject)

2).替换所有匹配的子串（使用正则表达式对象）文章来源站点https://www.yii666.com/

reobj = re.compile(regex)
result = reobj.sub(newstring, subject)

19.字符串拆分
1).字符串拆分

result = re.split(regex, subject)

2).字符串拆分（使用正则表示式对象）
reobj = re.compile(regex)
result = reobj.split(subject)

您可能感兴趣的与本文相关的镜像

Python3.11

Conda

Python

Python 是一种高级、解释型、通用的编程语言，以其简洁易读的语法而闻名，适用于广泛的应用，包括Web开发、数据分析、人工智能和自动化脚本