
Adversarial bandit problems: the power of randomization
Institution: Pompeu Fabra University
Date: 2010-11-16
Course description: In this tutorial we discuss sequential prediction problems in which the forecaster has limited information about the past outcomes of the sequence. We concentrate on the so-called "adversarial" framework, in which no probabilistic model is available for the sequence. We describe various models of limited feedback and pay special attention to the so-called "multi-armed bandit" problem. We discuss various randomized prediction methods and analyze their behavior.
Keywords: sequence probability; randomized prediction; limited-feedback models
