0


对数线性模型和条件随机场

Log-linear Models and Conditional Random Fields
课程网址: http://videolectures.net/cikm08_elkan_llmacrf/  
主讲教师: Charles Elkan
开课单位: 加州大学圣地亚哥分校
开课时间: 2008-11-19
课程语种: 英语
中文简介:
对数线性模型是逻辑回归的深远扩展,而条件随机场(CRF)是适用于所谓的结构化学习任务的对数线性模型的特例。结构化学习意味着学习预测具有内部结构的输出。例如,当相邻字母之间的相关性用于重新预测时,识别手写单词更准确。本教程将简要但全面地介绍机器学习中的这些新发展,这些新发展对许多新颖的应用程序具有巨大的潜力。本教程将首先解释什么是对数线性模型,具体示例还有数学通用性。接下来,将解释特征函数;这些是对数线性模型的知识表示技术。然后,本教程将介绍线性链CRF,从它们是对数线性模型的特殊情况的角度来看。将讨论Viterbi算法,该算法使线性链CRF易于推断,然后讨论一般CRF的推断。该演示文稿将继续推导对数线性模型的梯度;这是所有对数线性训练算法的数学基础。然后,本教程将讨论两个重要的特殊案例CRF训练算法,一个是感知器方法的变体,另一个是称为对比分歧。最后但同样重要的是,tu torial将引入公开的培训和使用CRF的软件,并将详细解释CRF的实际应用。
课程简介: Log-linear models are a far-reaching extension of logistic regression, while con- ditional random fields (CRFs) are a special case of log-linear models suitable for so-called structured learning tasks. Structured learning means learning to predict outputs that have internal structure. For example, recognizing handwritten words is more accurate when the correlations between neighboring letters are used to reÞne predictions. This tutorial will provide a simple but thorough introduction to these new developments in machine learning that have great potential for many novel applications. The tutorial will first explain what log-linear models are, with with concrete examples but also with mathematical generality. Next, feature-functions will be explained; these are the knowledge-representation technique underlying log-linear models. The tutorial will then present linear-chain CRFs, from the point of view that they are a special case of log-linear models. The Viterbi algorithm that makes inference tractable for linear-chain CRFs will be covered, followed by a discus- sion of inference for general CRFs. The presentation will continue with a general derivation of the gradient of log-linear models; this is the mathematical foundation of all log-linear training algorithms. Then, the tutorial will discuss two impor- tant special-case CRF training algorithms, one that is a variant of the perceptron method, and another one called contrastive divergence. Last but not least, the tu- torial will introduce publicly available software for training and using CRFs, and will explain a practical application of CRFs with hands-on detail.
关 键 词: 计算机科学; 对数线性模型; 逻辑回归; 条件随机场; 特征函数; CRF训练算法
课程来源: 视频讲座网
最后编审: 2019-10-17:cwx
阅读次数: 23