Contents
Problems with POS Tagging
- Exponentially many combinations: |Tags|^M tag sequences for a sentence of length M
- Tag sequences have different lengths (one per sentence length)
- Tagging is a sentence-level task, but as humans we decompose it into small word-level tasks
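To make the first point concrete, a quick back-of-the-envelope count; the 45-tag Penn Treebank tagset is assumed purely as an illustration:

```python
# Number of candidate tag sequences grows exponentially with sentence length:
# |Tags|^M. Assuming a 45-tag tagset (Penn Treebank size) for illustration.
num_tags = 45
for length in (5, 10, 20):
    print(f"length {length}: {num_tags ** length} sequences")
```

Even for a 10-word sentence this is already over 3 × 10^16 sequences, which is why brute-force enumeration is hopeless and a decomposed model is needed.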
Solution:
- Define a model that decomposes the process into individual word-level steps, but still takes the whole sequence into account when learning and predicting
- This is called sequence labelling, or structured prediction
Probabilistic Model of HMM
- Goal: obtain the best tag sequence t for a sentence w
- The formulation:
  t̂ = argmax_t P(t|w)
- Applying Bayes' rule (the denominator P(w) is constant over t, so it can be dropped):
  t̂ = argmax_t P(w|t)P(t) / P(w) = argmax_t P(w|t)P(t)
- Decomposing the elements:
  - The probability of a word depends only on its tag: P(w|t) = ∏_i P(w_i|t_i)
  - The probability of a tag depends only on the previous tag: P(t) = ∏_i P(t_i|t_{i-1})
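The decomposition above can be sketched as a scoring function over one candidate tag sequence. The probability tables here are illustrative toy values, not trained parameters, and "<s>" is an assumed start-of-sentence marker:

```python
# Score one tag sequence under the HMM decomposition:
# P(t, w) = prod_i P(t_i | t_{i-1}) * P(w_i | t_i)
transition = {  # A: P(tag | previous tag); "<s>" marks the sentence start
    ("<s>", "DT"): 0.6, ("DT", "NN"): 0.7, ("NN", "VB"): 0.3,
}
emission = {  # O: P(word | tag)
    ("DT", "the"): 0.5, ("NN", "dog"): 0.01, ("VB", "barks"): 0.02,
}

def sequence_probability(words, tags):
    prob = 1.0
    prev = "<s>"
    for word, tag in zip(words, tags):
        # One transition and one emission factor per word position
        prob *= transition.get((prev, tag), 0.0) * emission.get((tag, word), 0.0)
        prev = tag
    return prob

print(sequence_probability(["the", "dog", "barks"], ["DT", "NN", "VB"]))
```

The argmax over all |Tags|^M sequences is not computed here; in practice that search is done efficiently with dynamic programming rather than enumeration.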
Two Assumptions of HMM
- Output independence: an observed event (word) depends only on the hidden state (tag):
  P(w_i | t_1, ..., t_i, w_1, ..., w_{i-1}) = P(w_i | t_i)
- Markov assumption: the current state (tag) depends only on the previous state:
  P(t_i | t_1, ..., t_{i-1}) = P(t_i | t_{i-1})
Training HMM
- Parameters are the individual probabilities:
  - Emission probabilities (O): P(w_i | t_i)
  - Transition probabilities (A): P(t_i | t_{i-1})
- Training uses Maximum Likelihood Estimation: done by simply counting tag-word and tag-tag frequencies in a tagged corpus and normalising