LeetCode Practice No. 1078: Occurrences After Bigram

流牧

Article link: https://blog.youkuaiyun.com/qq_37638909/article/details/122155126

Source: LeetCode (力扣)
Link: https://leetcode-cn.com/problems/occurrences-after-bigram

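【Problem Description】
Given a first word first and a second word second, consider occurrences in some text of the form "first second third", where second comes immediately after first and third comes immediately after second.

For each such occurrence, add the third word "third" to the answer, and return the answer.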

【Examples】
Example 1:

Input: text = "alice is a good girl she is a good student", first = "a", second = "good"
Output: ["girl","student"]

Example 2:

Input: text = "we will we will rock you", first = "we", second = "will"
Output: ["we","rock"]

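Constraints:

1 <= text.length <= 1000
text consists of lowercase English letters and spaces
All words in text are separated by a single space character
1 <= first.length, second.length <= 10
first and second consist of lowercase English letters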

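【Approach】
From the problem statement, whenever first and second appear consecutively in text, the word that follows second must be added to the output list (provided a word actually follows second). The steps are:

Split text on the space character " " to tokenize it, and count the resulting words. Call the token list words and the number of tokens word_num.
Define the output list output.
Iterate over words with i = 0, 1, 2, ..., word_num-3. If words[i]==first and words[i+1]==second, then words[i+2] is the word we are looking for, so append it to output. In other words, when the current word equals first and the next word equals second, the word after second goes into the output list.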
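To make the index range concrete, here is a small sketch that lists every three-word window the loop inspects for Example 2. The helper name trace_windows is hypothetical and exists only for illustration; it is not part of the submitted solution.

```python
def trace_windows(text, first, second):
    """Return one (i, words[i], words[i+1], words[i+2], matched) row per window."""
    words = text.split(" ")
    rows = []
    # i runs over 0 .. len(words) - 3, so words[i + 2] is always in range
    for i in range(len(words) - 2):
        matched = words[i] == first and words[i + 1] == second
        rows.append((i, words[i], words[i + 1], words[i + 2], matched))
    return rows


for row in trace_windows("we will we will rock you", "we", "will"):
    print(row)
# (0, 'we', 'will', 'we', True)
# (1, 'will', 'we', 'will', False)
# (2, 'we', 'will', 'rock', True)
# (3, 'will', 'rock', 'you', False)
```

The windows at i = 0 and i = 2 overlap: the third word of the first match is also the first word of the second match, which is why the expected output contains "we". A text with fewer than three words produces no windows, because range() of a non-positive number is empty.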

【Submitted Code】

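from typing import List


class Solution:
    def findOcurrences(self, text: str, first: str, second: str) -> List[str]:
        # Words are separated by exactly one space, so split(" ") tokenizes the text
        words = text.split(" ")
        word_num = len(words)

        output = []
        # The last valid start index is word_num - 3, so words[i + 2] always exists
        for i in range(word_num - 2):
            if words[i] == first and words[i + 1] == second:
                output.append(words[i + 2])

        return output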
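
As an alternative to explicit indexing, the same scan can be sketched by zipping the word list with two shifted copies of itself. This is a variant for comparison, not the submitted solution, and the function name find_occurrences_zip is an assumption made for this example.

```python
from typing import List


def find_occurrences_zip(text: str, first: str, second: str) -> List[str]:
    words = text.split(" ")
    # zip stops at the shortest input, so only complete triples are produced
    return [c for a, b, c in zip(words, words[1:], words[2:])
            if a == first and b == second]


print(find_occurrences_zip("alice is a good girl she is a good student", "a", "good"))
# ['girl', 'student']
print(find_occurrences_zip("we will we will rock you", "we", "will"))
# ['we', 'rock']
```

Both versions visit each word a constant number of times, so the running time is linear in the number of words. The zip variant creates two extra slices of the list, which is negligible for text of at most 1000 characters.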