首页数学
0


敌对的土匪问题:随机化的权力

Adversarial bandit problems: the power of randomization
课程网址: http://videolectures.net/ecmlpkdd2010_lugosi_abp/  
主讲教师: Gabor Lugosi
开课单位: 培法布拉大学
开课时间: 2010-11-16
课程语种: 英语
中文简介:
在本教程中,我们将讨论顺序预测问题,其中预报员对序列的过去结果的信息有限。我们专注于所谓的“对抗性”框架中没有概率可用于序列。我们描述了有限反馈的各种模型,并特别关注所谓的“多臂强盗”。问题。我们讨论各种随机预测方法并分析它们的行为。
课程简介: In this tutorial we discuss sequential prediction problems in which the forecaster has limited information about the past outcomes of the sequence. We concentrate on the so-called "adversarial" framework in which no probabilistic is available for the sequence. We describe various models of limited feedback and pay special attention to the so-called "multi-armed bandit" problem. We discuss various randomized prediction methods and analyze their behavior.
关 键 词: 敌对”的框架; 可用的序列; 多武装土匪问题
课程来源: 视频讲座网
最后编审: 2020-06-15:wuyq
阅读次数: 65