对抗性土匪问题:随机化的力量Adversarial bandit problems: the power of randomization |
|
课程网址: | http://videolectures.net/ecmlpkdd2010_lugosi_abp/ |
主讲教师: | http://videolectures.net/ecmlpkdd2010_lugosi_abp/ |
开课单位: | 庞贝法布拉大学 |
开课时间: | 2010-11-16 |
课程语种: | 英语 |
中文简介: | 在本教程中,我们讨论序列预测问题,其中预测者对序列过去结果的信息有限。我们专注于所谓的“对抗性”框架,其中没有可用的序列概率。我们描述了各种有限反馈模型,并特别关注所谓的“多武装强盗”问题。我们讨论了各种随机预测方法并分析了它们的行为。 |
课程简介: | In this tutorial we discuss sequential prediction problems in which the forecaster has limited information about the past outcomes of the sequence. We concentrate on the so-called "adversarial" framework in which no probabilistic is available for the sequence. We describe various models of limited feedback and pay special attention to the so-called "multi-armed bandit" problem. We discuss various randomized prediction methods and analyze their behavior. |
关 键 词: | 序列概率; 随机预测; 有限反馈模型 |
课程来源: | 视频讲座网 |
数据采集: | 2021-06-09:zyk |
最后编审: | 2021-06-09:zyk |
阅读次数: | 72 |