0


MedLDA:回归和分类的最大保证金监督主题模型

MedLDA: Maximum Margin Supervised Topic Models for Regression and Classification
课程网址: http://videolectures.net/icml09_zhu_mlda/  
主讲教师: Jun Zhu
开课单位: 清华大学
开课时间: 2009-08-26
课程语种: 英语
中文简介:
监督主题模型利用文档的辅助信息来发现文档的预测低维表示;现有模型应用基于似然的估计。在本文中,我们提出了连续和分类响应变量的最大边际监督主题模型。我们的方法,即最大熵判别潜在Dirichlet分配(MedLDA),利用最大边际原则来训练监督主题模型并估计可能更适合于预测的预测主题表示。我们为后验推理开发​​了有效的变分方法,并定性和定量地证明了MedLDA优于基于可能性的主题模型对电影评论和20个新闻组数据集的优势。
课程简介: Supervised topic models utilize document’s side information for discovering predictive low dimensional representations of documents; and existing models apply likelihood-based estimation. In this paper, we present a max-margin supervised topic model for both continuous and categorical response variables. Our approach, the maximum entropy discrimination latent Dirichlet allocation (MedLDA), utilizes the max-margin principle to train supervised topic models and estimate predictive topic representations that are arguably more suitable for prediction. We develop efficient variational methods for posterior inference and demonstrate qualitatively and quantitatively the advantages of MedLDA over likelihood-based topic models on movie review and 20 Newsgroups data sets.
关 键 词: 监督主题模型; 预测低维; 似然
课程来源: 视频讲座网
最后编审: 2019-04-25:cwx
阅读次数: 127