6、自然语言处理中的统计与神经模型

最新推荐文章于 2025-11-24 16:27:57 发布

redis7keeper

最新推荐文章于 2025-11-24 16:27:57 发布

阅读量46

点赞数

CC 4.0 BY-SA版权

分类专栏： NLP的过去、现在与未来文章标签：自然语言处理统计语言模型神经语言模型

本文链接：https://blog.youkuaiyun.com/redis7keeper/article/details/151096087

NLP的过去、现在与未来专栏收录该内容

44 篇文章 ¥499.90

订阅专栏¥69.90

会员秒杀 ¥9.9 重磅福利

超级会员免费看

自然语言处理中的统计与神经模型

在自然语言处理（NLP）领域，有多种技术和模型用于处理和理解文本。下面将介绍一些关键的概念和模型。

1. 统计语言模型的计算与应用

首先来看一段代码，它展示了如何计算句子的概率：

c = res.get(words[i-1] +" "+words[i], 0)
N = res.get(words[i], 0)
print("["+words[i-1] + " " + words[i] + "] : " + str(c)
+ " & " + words[i] +" : " + str(N))
prob = (c + 1)/(N + 8) # V = 8
prob_sentence *= prob
return prob_sentence
res = compute_model(sentences)
print(res)
test_sentence = ("I skipped my breakfast yesterday")
prob = test_probability(test_sentence)
print(test_sentence, ": ", prob)

这里的 compute_model 函数会创建一元语法（unigrams）和二元语法（bigrams）的计数，并将它们存储在 res 中，使用了 scikitlearn 的 CountVectorizer 。 test_probability