0


潜在狄利克雷分配中推理的复杂性

Complexity of Inference in Latent Dirichlet Allocation
课程网址: http://videolectures.net/nips2011_sontag_allocation/  
主讲教师: David Sontag
开课单位: 纽约大学
开课时间: 2012-09-06
课程语种: 英语
中文简介:
我们考虑潜在Dirichlet分配(LDA)中概率推理的计算复杂性。首先,我们研究了找到主题的最大后验(MAP)分配问题的问题,其中文档的主题分布被整合出来。我们表明,当每个文档的有效主题数量很小时,精确推断需要多项式时间。相反,我们表明,当文档具有大量主题时,在LDA中找到主题到主题的MAP分配是NP难的。接下来,我们考虑找到文档的MAP主题分布的问题,其中集成了主题词分配。我们证明这个问题也很难NP。最后,我们简要地讨论了从后验中采样的问题,表明这在一个受限制的环境中是NP难的,但是留下一般问题。
课程简介: We consider the computational complexity of probabilistic inference in Latent Dirichlet Allocation (LDA). First, we study the problem of finding the maximum a posteriori (MAP) assignment of topics to words, where the document's topic distribution is integrated out. We show that, when the effective number of topics per document is small, exact inference takes polynomial time. In contrast, we show that, when a document has a large number of topics, finding the MAP assignment of topics to words in LDA is NP-hard. Next, we consider the problem of finding the MAP topic distribution for a document, where the topic-word assignments are integrated out. We show that this problem is also NP-hard. Finally, we briefly discuss the problem of sampling from the posterior, showing that this is NP-hard in one restricted setting, but leaving open the general question.
关 键 词: 概率推理; 后验分配问题; 多项式时间
课程来源: 视频讲座网
最后编审: 2019-07-26:cwx
阅读次数: 72