0


具有局部和全局一致性的概率二元数据分析

Probabilistic Dyadic Data Analysis with Local and Global Consistency
课程网址: http://videolectures.net/icml09_raykar_pdd/  
主讲教师: Vikas Raykar
开课单位: 马里兰大学
开课时间: 2009-08-26
课程语种: 英语
中文简介:
二元数据出现在许多现实世界的应用中,如社交网络分析和信息检索。为了发现二元数据中的潜在隐藏结构,提出了许多主题建模技术。典型算法包括概率潜在语义分析(PLSA)和潜在DirichletAllocation(LDA)。欧几里德空间支持由这两种算法得到的概率密度函数。然而,许多先前的研究表明,天然存在的数据可能存在于或低于底层数据。我们引入了概率框架,用于对二元数据的局部和几何结构进行建模,该结构明确地考虑了局部流形结构。具体地,局部流形结构由图形建模。拉普拉斯算子图,类似于流形上的拉普拉斯贝尔特拉米算子,用于平滑概率密度函数。结果,所获得的概率分布集中在数据流形上。实际数据集的实验结果证明了该方法的有效性。
课程简介: Dyadic data arises in many real world applications such as social network analysis and information retrieval. In order to discover the underlying or hidden structure in the dyadic data, many topic modeling techniques were proposed. The typical algorithms include Probabilistic Latent Semantic Analysis (PLSA) and Latent Dirichlet Allocation (LDA). The probability density functions obtained by both of these two algorithms are supported on the Euclidean space. However, many previous studies have shown naturally occurring data may reside on or close to an underlying submanifold. We introduce a probabilistic framework for modeling both the topical and geometrical structure of the dyadic data that explicitly takes into account the local manifold structure. Specifically, the local manifold structure is modeled by a graph. The graph Laplacian, analogous to the Laplace-Beltrami operator on manifolds, is applied to smooth the probability density functions. As a result, the obtained probabilistic distributions are concentrated around the data manifold. Experimental results on real data sets demonstrate the effectiveness of the proposed approach.
关 键 词: 二元数据; 欧几里德空间; 概率
课程来源: 视频讲座网
最后编审: 2019-04-24:lxf
阅读次数: 49