Data Structures and Algorithms: Strings

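This post covers string matching algorithms, which find the position of a pattern string within a text string. It walks through two ways of implementing the brute-force algorithm and how to optimize them, then introduces the KMP algorithm and compares it with brute force: KMP reuses the results of earlier comparisons to skip several positions at once, and it does so with a next table.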

String (Sequence)

Here, "string" means a character string: a finite sequence made up of characters.


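String matching algorithms

A string matching algorithm finds the position of a pattern string (pattern) within a text string (text).
For example: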

String text = "Hello World";
String pattern = "or";
text.indexOf(pattern);//7
text.indexOf("other");//-1
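To make the return value concrete, here is a minimal sketch that relies only on the standard `String.indexOf`; the class name `IndexOfDemo` is made up for illustration. It shows that the result is a zero-based index, that it refers to the first occurrence, and that -1 signals no match.

```java
public class IndexOfDemo {
    public static void main(String[] args) {
        String text = "Hello World";
        System.out.println(text.indexOf("or"));    // 7: zero-based index of the 'o' in "World"
        System.out.println(text.indexOf("o"));     // 4: the first 'o', the one in "Hello"
        System.out.println(text.indexOf("other")); // -1: the pattern does not occur
    }
}
```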

Several classic string matching algorithms:
Brute force
KMP
Boyer-Moore
Karp-Rabin
Sunday

tlen = length of text
plen = length of pattern

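Brute Force

As the name suggests, brute force compares characters one by one: the pattern string (the string being searched for) is shifted from left to right, one character at a time, until a match is found or the text runs out.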


There are two common ways to implement the brute-force algorithm.

Brute Force 1


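pi is the current index into the pattern; it ranges over [0, plen)
ti is the current index into the text; it ranges over [0, tlen)

If patternChars[pi] == textChars[ti], then
pi++;
ti++;
and the next pair of characters is compared.

If patternChars[pi] != textChars[ti], then
ti = ti - (pi - 1);// do the subtraction first, then reset pi to 0; do not swap the order
pi = 0;// restart from the beginning of the pattern

If pi == plen,
the match has succeeded, and it starts at index ti - pi in the text.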
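The rollback rule is easier to see when the start of each attempted alignment is recorded. The sketch below is only an illustration: the class `RollbackTrace` and the helper `alignmentStarts` are hypothetical names, and both arguments are assumed to be non-null. It applies the same pi/ti updates and collects every alignment start (ti - pi) that the search tries.

```java
import java.util.ArrayList;
import java.util.List;

public class RollbackTrace {
    // Hypothetical helper: returns, in order, every text index at which
    // the search starts comparing the pattern from pi = 0.
    public static List<Integer> alignmentStarts(String text, String pattern) {
        List<Integer> starts = new ArrayList<>();
        int tlen = text.length();
        int plen = pattern.length();
        if (plen == 0 || tlen < plen) return starts;

        int pi = 0;
        int ti = 0;
        starts.add(0); // the first alignment always begins at index 0
        while (pi < plen && ti < tlen) {
            if (text.charAt(ti) == pattern.charAt(pi)) {
                ti++;
                pi++;
            } else {
                // The current alignment began at ti - pi, so the next one
                // begins at ti - pi + 1, which is ti - (pi - 1).
                ti = ti - (pi - 1);
                pi = 0;
                if (ti < tlen) starts.add(ti);
            }
        }
        return starts;
    }

    public static void main(String[] args) {
        System.out.println(alignmentStarts("abcabd", "abd")); // [0, 1, 2, 3]
    }
}
```

For the text "abcabd" and the pattern "abd", the first alignment matches "ab" and then fails on 'c' with ti = 2 and pi = 2, so ti rolls back to 2 - (2 - 1) = 1 rather than continuing from 3. Resetting pi before the subtraction would lose that offset, which is why the order of the two assignments matters.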

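public class BruteForce01 {
	public static void main(String[] args)
	{
		System.out.println(indexOf("Hello World", "or")); // 7
		System.out.println(indexOf("Hello World", "ww")); // -1
	}
	
	// Returns the index of the first occurrence of pattern in text, or -1 if there is none.
	public static int indexOf(String text, String pattern)
	{
		if(text == null || pattern == null) return -1;
		int tlen = text.length();
		int plen = pattern.length();
		// This version treats an empty text, an empty pattern,
		// or a pattern longer than the text as "not found".
		if (tlen == 0 || plen == 0 || tlen < plen) return -1;
		
		char[] textChars = text.toCharArray();
		char[] patternChars = pattern.toCharArray();
		
		int pi = 0; // current index into the pattern
		int ti = 0; // current index into the text
		while (pi < plen && ti < tlen) {
			if (textChars[ti] == patternChars[pi]) {
				// Characters match: advance both indexes.
				ti++;
				pi++;
			}else {
				// Mismatch: move ti to one past the start of this attempt,
				// then restart the pattern from its first character.
				ti = ti - (pi - 1);
				pi = 0;
			}
		}
		// pi == plen means the whole pattern matched, starting at ti - pi.
		return pi == plen ? ti - pi : -1;
	}
}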