首页物理学
   首页化学
   首页地球科学
0


分层Poisson-Dirichlet过程的采样表配置

Sampling Table Configurations for the Hierarchical Poisson-Dirichlet Process
课程网址: http://videolectures.net/ecmlpkdd2011_chen_hierarchical/  
主讲教师: Changyou Chen
开课单位: 澳大利亚国立大学
开课时间: 2011-11-30
课程语种: 英语
中文简介:
分层建模和推理是机器智能的基础,为此,两个参数Poisson Dirichlet过程(PDP)起着重要作用。用于分层PDP和分层Dirichlet过程的最流行的MCMC采样算法是基于中国餐馆过程(CRP)的中国餐馆隐喻进行增量抽样。在本文中,使用相同的比喻,我们通过引入一个辅助潜在变量(称为表指示符)来建议分层PDP的新表表示,以记录哪个客户负责启动新表。通过这种方式,新表示允许完全可交换性,这是正确的Gibbs采样算法的必要条件。基于此表示,我们开发了块吉布斯采样算法,该算法可以联合采样数据项及其表格贡献。我们在由Teh,Jordan,Beal和Blei开发的潜在Dirichlet分配(HDP LDA)的分层Dirichlet过程变体上进行测试。实验结果表明,该算法在样本困惑度和收敛速度方面均优于“直接分配后验值”算法。该表示可以与许多其他分层PDP模型一起使用。
课程简介: Hierarchical modeling and reasoning are fundamental in machine intelligence, and for this the two-parameter Poisson-Dirichlet Process (PDP) plays an important role. The most popular MCMC sampling algorithm for the hierarchical PDP and hierarchical Dirichlet Process is to conduct an incremental sampling based on the Chinese restaurant metaphor, which originates from the Chinese restaurant process (CRP). In this paper, with the same metaphor, we propose a new table representation for the hierarchical PDPs by introducing an auxiliary latent variable, called table indicator, to record which customer takes responsibility for starting a new table. In this way, the new representation allows full exchangeability that is an essential condition for a correct Gibbs sampling algorithm. Based on this representation, we develop a block Gibbs sampling algorithm, which can jointly sample the data item and its table contribution. We test this out on the hierarchical Dirichlet process variant of latent Dirichlet allocation (HDP-LDA) developed by Teh, Jordan, Beal and Blei. Experiment results show that the proposed algorithm outperforms their "posterior sampling by direct assignment" algorithm in both out-of-sample perplexity and convergence speed. The representation can be used with many other hierarchical PDP models.
关 键 词: 分层建模; 参数; 推理
课程来源: 视频讲座网
最后编审: 2019-04-02:cwx
阅读次数: 37