PCFG
Need for PCFG
- Example: "Time flies like an arrow"
- The sentence has many parses
- Some parses are more likely than others
- We need a probabilistic method to rank them
Definition
Just like a CFG, a PCFG is a 4-tuple (N, Σ, R, S)
- N: non-terminal symbols
- Σ: terminal symbols (disjoint from N)
- R: rules of the form A → β [p]
  - β ∈ (Σ ∪ N)*
  - p is the probability p(β | A)
- S: start symbol (from N)
Rules with the same left-hand side must have probabilities that sum to 1.
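A minimal sketch of this definition in Python, storing a toy grammar fragment as (A, β, p) triples and checking the constraint above; the non-terminals, rules, and probabilities are made up for illustration.

```python
from collections import defaultdict

# Toy PCFG fragment as (lhs, rhs, probability) triples.
# The rules and probabilities are illustrative, not estimated from data.
rules = [
    ("S",  ("NP", "VP"), 1.0),
    ("NP", ("N",),       0.4),
    ("NP", ("N", "N"),   0.3),
    ("NP", ("Det", "N"), 0.3),
    ("VP", ("V", "NP"),  0.6),
    ("VP", ("V", "PP"),  0.4),
    ("PP", ("P", "NP"),  1.0),
]

# PCFG constraint: for every non-terminal A, the probabilities of all
# rules A -> beta must sum to 1, i.e. the p(beta | A) form a distribution.
totals = defaultdict(float)
for lhs, _rhs, p in rules:
    totals[lhs] += p
for lhs, total in totals.items():
    assert abs(total - 1.0) < 1e-9, f"rules for {lhs} sum to {total}"
```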
Probability of a parse tree
$p(t) = \prod_{i=1}^{n} p(\alpha_i \to \beta_i)$
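A small sketch of this product, under the assumption that a parse tree is encoded as nested tuples such as ("NP", ("N", "time"), ("N", "flies")) with terminal words as plain strings, and that rule_prob maps (lhs, rhs) pairs, including lexical rules like ("N", ("time",)), to their probabilities.

```python
# p(t) = product of the probabilities of all rules used in the tree t.
def tree_probability(tree, rule_prob):
    label, *children = tree
    # Right-hand side of the rule applied at this node.
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    p = rule_prob[(label, rhs)]
    # Multiply in the probabilities of the rules used in the subtrees.
    for child in children:
        if not isinstance(child, str):
            p *= tree_probability(child, rule_prob)
    return p
```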
Most likely parse tree
$\arg\max_{t \in T(s)} p(t)$
Probability of the sentence
$p(s) = \sum_{i=1}^{n} p(t_i)$
Main tasks for PCFGs
Given a grammar G and a sentence s, let T(s) be the set of all parse trees of s
- Task 1: find the most likely parse tree t
- Task 2: find p(s) as the sum of p(t) over all trees in T(s) (both tasks are sketched below)
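If T(s) is given as an explicit list of candidate trees (an assumption made here to keep the sketch short), both tasks reduce to a max and a sum over the tree probabilities, reusing tree_probability from the sketch above.

```python
# Task 1: the most likely parse tree among the candidates in T(s).
def most_likely_tree(trees, rule_prob):
    return max(trees, key=lambda t: tree_probability(t, rule_prob))

# Task 2: the sentence probability p(s), summing over all trees in T(s).
def sentence_probability(trees, rule_prob):
    return sum(tree_probability(t, rule_prob) for t in trees)
```

In practice T(s) is not enumerated explicitly; the dynamic-programming parsers in the next section compute the max and the sum directly over a chart.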
Probabilistic parsing methods
- Probabilistic Earley algorithm
  - A top-down parser with a dynamic programming table
- Probabilistic CKY algorithm (sketched below)
  - A bottom-up parser with a dynamic programming table
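A minimal sketch of the probabilistic (Viterbi) CKY recursion for a grammar in Chomsky normal form. The function name pcky and the two dictionaries, lexical with (A, word) → p(A → word) and binary with (A, B, C) → p(A → B C), are assumptions for illustration; back-pointers for recovering the best tree are omitted.

```python
def pcky(words, lexical, binary, start="S"):
    n = len(words)
    # table[i][j] holds, for each non-terminal A, the probability of the
    # best parse of words[i:j] rooted in A.
    table = [[{} for _ in range(n + 1)] for _ in range(n + 1)]
    # Fill length-1 spans from the lexical rules A -> word.
    for i, w in enumerate(words):
        for (A, word), p in lexical.items():
            if word == w and p > table[i][i + 1].get(A, 0.0):
                table[i][i + 1][A] = p
    # Combine smaller spans bottom-up with the binary rules A -> B C.
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for (A, B, C), p in binary.items():
                    cand = p * table[i][k].get(B, 0.0) * table[k][j].get(C, 0.0)
                    if cand > table[i][j].get(A, 0.0):
                        table[i][j][A] = cand
    # Probability of the most likely parse of the whole sentence.
    return table[0][n].get(start, 0.0)
```

Replacing the max update with a sum over all split points and rules turns this into the inside algorithm, which computes p(s) for Task 2.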
Probabilistic grammars
- Probabilities can be learned from training data (a treebank)
- Possible to do reranking
- Possible to combine with other stages
MLE (maximum likelihood estimation)
$p_{ML}(\alpha \to \beta) = \frac{\text{Count}(\alpha \to \beta)}{\text{Count}(\alpha)}$
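A sketch of this estimator, assuming the treebank has already been flattened into a list of (α, β) rule occurrences read off its trees (that extraction step is not shown).

```python
from collections import Counter

def mle_rule_probs(rule_occurrences):
    """rule_occurrences: list of (lhs, rhs_tuple) pairs read off the treebank trees."""
    occurrences = list(rule_occurrences)
    rule_counts = Counter(occurrences)                   # Count(alpha -> beta)
    lhs_counts = Counter(lhs for lhs, _ in occurrences)  # Count(alpha)
    return {rule: count / lhs_counts[rule[0]] for rule, count in rule_counts.items()}
```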