oracle 正则表达式

最新推荐文章于 2023-05-30 23:32:08 发布

转载最新推荐文章于 2023-05-30 23:32:08 发布 · 790 阅读

文章标签：

#oracle #正则表达式 #email #sql #newline #insert

Oracle 专栏收录该内容

20 篇文章

订阅专栏

本文详细介绍了SQL中正则表达式的使用方法，包括REGEXP_LIKE、REGEXP_INSTR、REGEXP_SUBSTR和REGEXP_REPLACE四个函数的功能及具体用法，并通过实例展示了如何运用这些函数解决实际问题。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

实战演习

--以A开头
--select t.*, t.rowid from test t where regexp_like(name,'^a');

--以a结尾
--select t.*, t.rowid from test t where regexp_like(name,'a$');

--以A开头并且以a结尾
--select t.*, t.rowid from test t where regexp_like(name,'^a.*a$');

--查找1第一个字符开始第2个非数字的位置
--select regexp_instr(name, '[^[:digit:]]',1,2) as position from test;

--查找1第一个字符开始第2个非数字的位置下一个字符的位置
--select regexp_instr(name, '[^[:digit:]]',1,2,1) as position from test;

--返回以数字开头的2位之后数5位值是多少
--select id,regexp_substr(name,'[[:digit:]]',2,5) from test

--把所有的飞数字字符替换为4
--update test set name=regexp_replace(name, '[^[:digit:]]', '4')

^            使表达式定位至一行的开头
$            使表达式定位至一行的末尾
*            匹配 0 次或更多次
?            匹配 0 次或 1 次
+            匹配 1 次或更多次
{m}          正好匹配 m 次
{m,}         至少匹配 m 次
{m, n}       至少匹配 m 次但不超过 n 次
[:alpha:]    字母字符
[:lower:]    小写字母字符
[:upper:]    大写字母字符
[:digit:]    数字
[:alnum:]    字母数字字符
[:space:]    空白字符（禁止打印），如回车符、换行符、竖直制表符和换页符[:punct:]    标点字符
[:cntrl:]    控制字符（禁止打印）
[:print:]    可打印字符 | 分隔替换选项，通常与分组操作符 () 一起使用
( )          将子表达式分组为一个替换单元、量词单元或后向引用单元
[char]       字符列表

Oracle 10g提供了四个regexp function: REGEXP_LIKE , REGEXP_REPLACE , REGEXP_INSTR , REGEXP_SUBSTR 。

Sql代码
REGEXP_LIKE：比较一个字符串是否与正则表达式匹配
(srcstr, pattern [, match_option])

REGEXP_INSTR：在字符串中查找正则表达式，并且返回匹配的位置
(srcstr, pattern [, position [, occurrence [, return_option [, match_option]]]])

REGEXP_SUBSTR：返回与正则表达式匹配的子字符串
(srcstr, pattern [, position [, occurrence [, match_option]]])

REGEXP_REPLACE：搜索并且替换匹配的正则表达式
(srcstr, pattern [, replacestr [, position [, occurrence [, match_option]]]])

REGEXP_LIKE：比较一个字符串是否与正则表达式匹配
(srcstr, pattern [, match_option])

REGEXP_INSTR：在字符串中查找正则表达式，并且返回匹配的位置
(srcstr, pattern [, position [, occurrence [, return_option [, match_option]]]])

REGEXP_SUBSTR：返回与正则表达式匹配的子字符串
(srcstr, pattern [, position [, occurrence [, match_option]]])

REGEXP_REPLACE：搜索并且替换匹配的正则表达式
(srcstr, pattern [, replacestr [, position [, occurrence [, match_option]]]])
其中各参数的含义为:

Sql代码
srcstr:        被查找的字符数据。
pattern:       正则表达式。
occurrence:    出现的次数。默认为1。
position:      开始位置
return_option: 默认值为0，返回该模式的起始位置；值为1则返回符合匹配条件的下一个字符的起始位置。
replacestr:    用来替换匹配模式的字符串。
match_option: 匹配方式选项。缺省为c。
               c：case sensitive
               I：case insensitive
               n：(.)匹配任何字符（包括newline)
               m：字符串存在换行的时候被作为多行处理

srcstr:        被查找的字符数据。
pattern:       正则表达式。
occurrence:    出现的次数。默认为1。
position:      开始位置
return_option: 默认值为0，返回该模式的起始位置；值为1则返回符合匹配条件的下一个字符的起始位置。
replacestr:    用来替换匹配模式的字符串。
match_option: 匹配方式选项。缺省为c。
               c：case sensitive
               I：case insensitive
               n：(.)匹配任何字符（包括newline)
               m：字符串存在换行的时候被作为多行处理
下面通过一些具体的例子来说明如何使用这四个函数。首先创建一个测试数据表，

Sql代码
SQL> create table person (
2 first_name varchar(20),
3 last_name varchar(20),
4 email varchar(100),
5 zip varchar(6));

Table created.

SQL> insert into person values ('Steven', 'Chen', 'steven@hp.com', '123456');
1 row created.

SQL> insert into person values ('James', 'Li', 'jamesli@sun.com' || chr(10) || 'lijames@oracle.com', '1b3d5f');
1 row created.

SQL> commit;
Commit complete.

SQL> select * from person;

FIRST_NAME LAST_NAME EMAIL                ZIP
---------- ---------- -------------------- ------
Steven     Chen       steven@hp.com        123456
James      Li         jamesli@sun.com      1b3d5f
                      lijames@oracle.com

SQL> create table person (
2 first_name varchar(20),
3 last_name varchar(20),
4 email varchar(100),
5 zip varchar(6));

Table created.

SQL> insert into person values ('Steven', 'Chen', 'steven@hp.com', '123456');
1 row created.

SQL> insert into person values ('James', 'Li', 'jamesli@sun.com' || chr(10) || 'lijames@oracle.com', '1b3d5f');
1 row created.

SQL> commit;
Commit complete.

SQL> select * from person;

FIRST_NAME LAST_NAME EMAIL                ZIP
---------- ---------- -------------------- ------
Steven     Chen       steven@hp.com        123456
James      Li         jamesli@sun.com      1b3d5f
                      lijames@oracle.com

1。REGEXP_LIKE

Sql代码
SQL> select zip as invalid_zip from person where regexp_like(zip, '[^[:digit:]]');
INVALID_ZIP
--------------------
1b3d5f

SQL> select first_name from person where regexp_like(first_name, '^S.*n$');
FIRST_NAME
----------
Steven

SQL> select first_name from person where regexp_like(first_name, '^s.*n$');
no rows selected

SQL> select first_name from person where regexp_like(first_name, '^s.*n$', 'c');
no rows selected

SQL> select first_name from person where regexp_like(first_name, '^s.*n$', 'i');
FIRST_NAME
----------
Steven

SQL> select email from person where regexp_like(email, '^james.*com$');
no rows selected

SQL> select email from person where regexp_like(email, '^james.*com$', 'n');
EMAIL
--------------------
jamesli@sun.com
lijames@oracle.com

SQL> select email from person where regexp_like(email, '^li.*com$');
no rows selected

SQL> select email from person where regexp_like(email, '^li.*com$', 'm');
EMAIL
--------------------
jamesli@sun.com
lijames@oracle.com

SQL> select zip as invalid_zip from person where regexp_like(zip, '[^[:digit:]]');
INVALID_ZIP
--------------------
1b3d5f

SQL> select first_name from person where regexp_like(first_name, '^S.*n$');
FIRST_NAME
----------
Steven

SQL> select first_name from person where regexp_like(first_name, '^s.*n$');
no rows selected

SQL> select first_name from person where regexp_like(first_name, '^s.*n$', 'c');
no rows selected

SQL> select first_name from person where regexp_like(first_name, '^s.*n$', 'i');
FIRST_NAME
----------
Steven

SQL> select email from person where regexp_like(email, '^james.*com$');
no rows selected

SQL> select email from person where regexp_like(email, '^james.*com$', 'n');
EMAIL
--------------------
jamesli@sun.com
lijames@oracle.com

SQL> select email from person where regexp_like(email, '^li.*com$');
no rows selected

SQL> select email from person where regexp_like(email, '^li.*com$', 'm');
EMAIL
--------------------
jamesli@sun.com
lijames@oracle.com

注意上面分别测试了不同的match_option对结果的影响。

2。REGEXP_INSTR

Sql代码
查找zip中第一个非数字字符的位置
SQL> select regexp_instr(zip, '[^[:digit:]]') as position from person;
POSITION
----------
         0
         2

从第三个字符开始，查找zip中第二个非数字字符的位置
SQL> select regexp_instr(zip, '[^[:digit:]]', 3, 2) as position from person;
POSITION
----------
         0
         6

从第三个字符开始，查找zip中第二个非数字字符的下一个字符位置
SQL> select regexp_instr(zip, '[^[:digit:]]', 3, 2, 1) as position from person;
POSITION
----------
         0
         7

查找zip中第一个非数字字符的位置
SQL> select regexp_instr(zip, '[^[:digit:]]') as position from person;
POSITION
----------
0
2

从第三个字符开始，查找zip中第二个非数字字符的位置
SQL> select regexp_instr(zip, '[^[:digit:]]', 3, 2) as position from person;
POSITION
----------
0
6

从第三个字符开始，查找zip中第二个非数字字符的下一个字符位置
SQL> select regexp_instr(zip, '[^[:digit:]]', 3, 2, 1) as position from person;
POSITION
----------
0
7

3。REGEXP_SUBSTR

Sql代码
SQL> select regexp_substr(zip, '[^[:digit:]]') as zip from person;
ZIP
------------------
b

SQL> select regexp_substr(zip, '[^[:digit:]]', 3, 2) as zip from person;
ZIP
------------
f

SQL> select regexp_substr(zip, '[^[:digit:]]') as zip from person;
ZIP
------------------
b

SQL> select regexp_substr(zip, '[^[:digit:]]', 3, 2) as zip from person;
ZIP
------------
f
4。REGEXP_REPLACE

Sql代码
把zip中所有非数字字符替换为0
SQL> update person set zip=regexp_replace(zip, '[^[:digit:]]', '0')
2 where regexp_like(zip, '[^[:digit:]]');
1 row updated.

SQL> select zip from person;
ZIP
------
123456
103050

把zip中所有非数字字符替换为0
SQL> update person set zip=regexp_replace(zip, '[^[:digit:]]', '0')
2 where regexp_like(zip, '[^[:digit:]]');
1 row updated.

SQL> select zip from person;
ZIP
------
123456
103050
后向引用（backreference）：

后向引用是一个很有用的特性。它能够把子表达式的匹配部分保存在临时缓冲区中，供以后重用。缓冲区从左至右进行编号，并利用 /digit 符号进行访问。子表达式用一组圆括号来显示。利用后向引用可以实现较复杂的替换功能。

Sql代码
SQL> select regexp_replace('Steven Chen', '(.*) (.*)', '/2, /1') as reversed_name from dual;

REVERSED_NAME
--------------------
Chen, Steven

SQL> select regexp_replace('Steven Chen', '(.*) (.*)', '/2, /1') as reversed_name from dual;

REVERSED_NAME
--------------------
Chen, Steven
在DDL中也可以正则表达式，比如Constraint, index, view

Sql代码
SQL> alter table person add constraint constraint_zip check (regexp_like(zip, '^[[:digit:]]+$'));

SQL> create index person_idx on person(regexp_substr(last_name, '^[[:upper:]]'));