0


组合空间上的变分推理

Variational Inference over Combinatorial Spaces
课程网址: http://videolectures.net/nips2010_bouchard_cote_vic/  
主讲教师: Alexandre Bouchard-Côté
开课单位: 不列颠哥伦比亚大学
开课时间: 2010-03-25
课程语种: 英语
中文简介:
自从发现了一系列复杂的完全多项式随机算法(Karzanov等人,1991年;Jerrum等人,2001年;Wilson,2004年),关于组合空间中近似推理的理论研究集中在马尔可夫链蒙特卡罗方法上。尽管这些随机算法有很强的理论保证,但它们的运行时间很慢,并且对其潜力的限制性假设也阻碍了这些算法在机器学习中的应用。因此,在组合空间的应用中,简单精确模型通常比需要近似推理的更复杂模型更受欢迎(Siepel等人,2004)。考虑到图形模型变分方法的成功,变分推理似乎提供了一种吸引人的替代方法(Wainwright等人,2008);然而,不幸的是,如何为组合对象(如匹配、部分阶、平面划分和序列对齐)开发变分近似并不明显。我们提出了一个新的框架,将变分推理扩展到广泛的组合空间。我们的方法基于一个简单的假设:存在一个可跟踪的测度因子分解,我们在许多例子中都证明了这一点。对一系列匹配模型的模拟表明,该算法比一个流行的完全多项式随机算法更通用,更具经验上的速度。我们还将该框架应用于蛋白质序列的多重排列问题,在Balibase数据集上获得最先进的结果(Thompson等人,1999)。
课程简介: Since the discovery of sophisticated fully polynomial randomized algorithms for a range of #P problems (Karzanov et al., 1991; Jerrum et al., 2001; Wilson, 2004), theoretical work on approximate inference in combinatorial spaces has focused on Markov chain Monte Carlo methods. Despite their strong theoretical guarantees, the slow running time of many of these randomized algorithms and the restrictive assumptions on the potentials have hindered the applicability of these algorithms to machine learning. Because of this, in applications to combinatorial spaces simple exact models are often preferred to more complex models that require approximate inference (Siepel et al., 2004). Variational inference would appear to provide an appealing alternative, given the success of variational methods for graphical models (Wainwright et al., 2008); unfortunately, however, it is not obvious how to develop variational approximations for combinatorial objects such as matchings, partial orders, plane partitions and sequence alignments. We propose a new framework that extends variational inference to a wide range of combinatorial spaces. Our method is based on a simple assumption: the existence of a tractable measure factorization, which we show holds in many examples.Simulations on a range of matching models show that the algorithm is more general and empirically faster than a popular fully polynomial randomized algorithm. We also apply the framework to the problem of multiple alignment of protein sequences, obtaining state-of-the-art results on the BAliBASE dataset (Thompson et al., 1999).
关 键 词: 统计; 数学; 组合
课程来源: 视频讲座网
最后编审: 2020-06-01:wuyq
阅读次数: 44