0


松弛和随机化:从值到算法

Relax and Randomize: From Value to Algorithms
课程网址: http://videolectures.net/nips2012_sridharan_relax_randomize/  
主讲教师: Karthik Sridharan
开课单位: 康奈尔大学
开课时间: 2013-01-16
课程语种: 英语
中文简介:
我们展示了一种从最小极大分析中推导出在线学习算法的原则方法。先前被认为是非构造性的各种上界值minimax值被证明可以产生算法。这允许usto无缝地恢复已知方法并推导出新的,也可以捕获诸如跟随扰动领导者和R ^ 2预测者之类的“非正统”方法。因此,理解学习问题的固有复杂性会导致算法的发展。为了说明我们的方法,我们提出了几种新的算法,包括一系列使用“随机播出”概念的随机方法。介绍了跟随扰动Leaderalgorithms的新版本,以及基于Littlestone维数的方法,基于跟踪范数的矩阵完备的有效方法,以及静态专家对转导学习和预测问题的算法。
课程简介: We show a principled way of deriving online learning algorithms from a minimax analysis. Various upper bounds on the minimax value, previously thought to be nonconstructive, are shown to yield algorithms. This allows us to seamlessly recover known methods and to derive new ones, also capturing such "unorthodox" methods as Follow the Perturbed Leader and the R^2 forecaster. Understanding the inherent complexity of the learning problem thus leads to the development of algorithms. To illustrate our approach, we present several new algorithms, including a family of randomized methods that use the idea of a "random play out". New versions of the Follow-the-Perturbed-Leader algorithms are presented, as well as methods based on the Littlestone’s dimension, efficient methods for matrix completion with trace norm, and algorithms for the problems of transductive learning and prediction with static experts.
关 键 词: 最小极大分析; 在线学习算法; 随机播出
课程来源: 视频讲座网
最后编审: 2019-07-26:cwx
阅读次数: 86