Kaldi HCLG 深入理解

人工智能商机

于 2017-12-15 22:37:09 发布

阅读量2k

点赞数

分类专栏：语音识别

语音识别专栏收录该内容

11 篇文章

订阅专栏

本文详细介绍了HCLG图的构建过程及其组成部分，包括语言模型FST (G.fst)、词典FST (L.fst)、上下文FST (C.fst) 和HMM FST (H.fst) 的作用与工作原理。并通过具体步骤解释了如何从G到L再到C和H构建最终的HCLG图。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

1. 相关部分包含的主要任务

1.1 WFST Key Concepts

determinization
minimization
composition
equivalent
epsilon-free
functional
on-demand algorithm
weight-pushing
epsilon removal

1.2 HMM Key Concepts

Markov Chain
Hidden Markov Model
Forward-backward algorithm
Viterbi algorithm
E-M for mixture of Gaussians

2. HCLG

L.fst: The Phonetic Dictionary FST

L maps monophone sequences to words.

The file L.fst is the Finite State Transducer form of the lexicon with phone symbols on the input and word symbols on the output.

L_disambig.fst:The Phonetic Dictionary with Disambiguation Symbols FST

A lexicon with disambiguation symbols

G.fst:The Language Model FST

FSA grammar (can be built from an n-gram grammar).

C.fst:The Context FST

C maps triphone sequences to monophones.

Expands the phones into context-dependent phones.

H.fst:The HMM FST

H maps multiple HMM states (a.k.a. transition-ids in Kaldi-speak) to context-dependent triphones.

Expands out the HMMs. On the right are the context-dependent phones and on the left are the pdf-ids.

HCLG.fst: final graph

总结一下：

构图过程 G -> L -> C -> H

G: 作为 acceptor (输入 symbol 与输出相同)，用于对grammar 或者 language model 进行编码

L:Lexicon, 其输出 symbol 是 words, 输入 symbol 是 phones

C:context-dependency 其输出 symbol 是 phones, 其输入 symbol 为表示context-dependency phones

如： vector<int32> ctx_window = { 12, 15, 21 };

含义：id = 15 的 phone 为中心 phone, left phone id = 12, right phone id = 21

H: 包括HMM definitions,其输出 symbol 为 context-dependency phones, 其输入 symbol 为 transitions-ids(即对 pdf-id 和其它信息编码后的 id)

asl=="add-self-loops”

rds=="remove-disambiguation-symbols”,

and H' is H without the self-loops:

HCLG = asl(min(rds(det(H' o min(det(C o min(det(L o G))))))))

转自：http://blog.youkuaiyun.com/dearwind153/article/details/70053704

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。