Latent Dirichlet allocation
In statistics, latent Dirichlet allocation (LDA) is a generative model that allows sets of observations to be explained by unobserved groups that explain why some parts of the data are similar. For example, if observations are words collected into documents, it posits that each document is a mixture of a small number of topics and that each word's creation is attributable to one of the document's topics. LDA is an example of a topic model and was first presented as a graphical model for topic discovery by David Blei, Andrew Ng, and Michael Jordan in 2002.[1]

本文详细介绍了Latent Dirichlet Allocation (LDA),一种用于文本数据的主题发现统计模型。LDA能够将文档集合分解为一系列潜在话题,并为每个文档分配话题分布。文中还对比了MCMC与Gibbs Sampling在推理过程中的应用,并简要概述了几种其他主题模型。
7609

被折叠的 条评论
为什么被折叠?



