敌对的土匪问题:随机化的权力Adversarial bandit problems: the power of randomization |
|
课程网址: | http://videolectures.net/ecmlpkdd2010_lugosi_abp/ |
主讲教师: | Gabor Lugosi |
开课单位: | 培法布拉大学 |
开课时间: | 2010-11-16 |
课程语种: | 英语 |
中文简介: | 在本教程中,我们将讨论顺序预测问题,其中预报员对序列的过去结果的信息有限。我们专注于所谓的“对抗性”框架中没有概率可用于序列。我们描述了有限反馈的各种模型,并特别关注所谓的“多臂强盗”。问题。我们讨论各种随机预测方法并分析它们的行为。 |
课程简介: | In this tutorial we discuss sequential prediction problems in which the forecaster has limited information about the past outcomes of the sequence. We concentrate on the so-called "adversarial" framework in which no probabilistic is available for the sequence. We describe various models of limited feedback and pay special attention to the so-called "multi-armed bandit" problem. We discuss various randomized prediction methods and analyze their behavior. |
关 键 词: | 敌对”的框架; 可用的序列; 多武装土匪问题 |
课程来源: | 视频讲座网 |
最后编审: | 2020-06-15:wuyq |
阅读次数: | 76 |