Symbols of String Pattern Matching

最新推荐文章于 2025-01-13 15:16:10 发布

转载最新推荐文章于 2025-01-13 15:16:10 发布 · 110 阅读

CC 4.0 BY-SA版权

原文链接：http://www.cnblogs.com/kid551/p/4386119.html

本文深入解析了《算法导论》中用于讨论字符串匹配问题的严谨符号，包括文本、模式、字母表、空字符串、字符串偏移等概念，并通过实例阐述了如何基于这些符号重新表述匹配问题。

Symbols of String Pattern Matching in Introduction to Algorithms.

As it's important to be clear when discussing the problem of string matching, we can use the meticulous symbols used in Introduction to Algorithms.

Text:　　　 $T[1, ..., n]$.

Pattern:　　$P[1, ..., m]$.

Thus, as $T[i], P[j] \in \Sigma$, the array of letters, like $T[1, ..., n]$ or $P[1, ..., m]$, is called string.

Alphabet:　　$\Sigma$.

Set of all finite length of strings:　　$\Sigma^*$.

Define the empty string $\epsilon$, as it is a string, we also have $\epsilon \in \Sigma^*$.

Shifting $s$ matching:　　$T[s+1, s+2, ..., s+m] = P[1, 2, ..., m]$.

$\omega$ is the prefix of string x:　　Existing a string $y\in \Sigma^*$, such that $x=\omega y$, marked as $\omega \sqsubset x$.

$\omega$ is the suffix of string x:　　Existing a string $y\in \Sigma^*$, such that $x=y \omega$, marked as $\omega \sqsupset x$.

Define $P_k$ as the prefix $P[1, ..., k]$ of string $P[1, ..., m]$. Thus we have: $P_0 = \epsilon, P_m = P = P[1, ..., m]$.

Based on the above symbols, the matching problem can be restated as:

Finding all the possible shifting values $s$, such that $P \sqsupset T_{m+s}$.

转载于:https://www.cnblogs.com/kid551/p/4386119.html