首页函数论
   首页数学分析
   首页函数论
0


精确推理和累积学习

Exact inference and learning for cumulative
课程网址: http://videolectures.net/nips2010_huang_eil/  
主讲教师: Huang Jim C
开课单位: 亚马逊公司
开课时间: 2011-03-25
课程语种: 英语
中文简介:
概率图形模型使用局部因素来表示变量集之间的依赖性。对于许多问题领域,例如气候学和流行病学,除了局部依赖性之外,我们可能还希望对重尾统计进行建模,其中极端偏差不应被视为异常值。使用概率密度函数(PDF)的图形模型指定这种分布通常会导致难以处理的推理和学习。累积分布网络(CDN)提供了一种方法,可以将多变量重尾模型作为累积分布函数(CDF)的乘积轻松地指定。目前,推理和学习算法只适用于树形图,与混合导数的计算相对应。对于任意拓扑图,需要一种有效的算法,利用模型的稀疏结构,而不像Mathematica和D*这样的符号微分程序。我们提出了一种递归分解任意拓扑的Cdn导数计算的算法,其中的分解是用连接树自然描述的。我们将所得算法的性能与Mathematica和D*进行了比较,并将我们的方法应用于降雨和H1N1数据的学习模型,其中我们表明,与树结构和非结构cdn以及其他重尾多变量分布相比,具有循环的cdn能够提供更好的数据拟合。作为多变量连接和逻辑模型。
课程简介: Probabilistic graphical models use local factors to represent dependence among sets of variables. For many problem domains, for instance climatology and epidemiology, in addition to local dependencies, we may also wish to model heavy-tailed statistics, where extreme deviations should not be treated as outliers. Specifying such distributions using graphical models for probability density functions (PDFs) generally lead to intractable inference and learning. Cumulative distribution networks (CDNs) provide a means to tractably specify multivariate heavy-tailed models as a product of cumulative distribution functions (CDFs). Currently, algorithms for inference and learning, which correspond to computing mixed derivatives, are exact only for tree-structured graphs. For graphs of arbitrary topology, an efficient algorithm is needed that takes advantage of the sparse structure of the model, unlike symbolic differentiation programs such as Mathematica and D* that do not. We present an algorithm for recursively decomposing the computation of derivatives for CDNs of arbitrary topology, where the decomposition is naturally described using junction trees. We compare the performance of the resulting algorithm to Mathematica and D*, and we apply our method to learning models for rainfall and H1N1 data, where we show that CDNs with cycles are able to provide a significantly better fits to the data as compared to tree-structured and unstructured CDNs and other heavy-tailed multivariate distributions such as the multivariate copula and logistic models.
关 键 词: 概率图模型; 概率密度函数; 累积分布网络
课程来源: 视频讲座网
最后编审: 2020-06-06:zyk
阅读次数: 33