单调的多武器强盗分配Monotone multi-armed bandit allocations |
|
课程网址: | http://videolectures.net/colt2011_slivkins_monotone/ |
主讲教师: | Aleksandrs Slivkins |
开课单位: | 微软公司 |
开课时间: | 2011-08-02 |
课程语种: | 英语 |
中文简介: | 我们提出了一个针对多武装匪徒的新视角(以下简称MAB),该视角来自于最近关于MAB机制的研究(Devanur and Kakade, 2009, Babaio et al., 2009, 2010)。新的问题本质上是关于在MAB机制应用的附加约束下设计MAB算法。虽然假设您对MAB有一定的了解,但是本文是自包含的;我们请读者参考Cesa-Bianchi和Lugosi(2006)的更多背景资料。 |
课程简介: | We present a novel angle for multi-armed bandits (henceforth abbreviated MAB) which follows from the recent work on MAB mechanisms (Devanur and Kakade, 2009, Babaio et al., 2009, 2010). The new problem is, essentially, about designing MAB algorithms under an additional constraint motivated by their application to MAB mechanisms. This note is self-contained, although some familiarity with MAB is assumed; we refer the reader to Cesa-Bianchi and Lugosi (2006) for more background. |
关 键 词: | 数学思维; 数学分析; 新视角分析 |
课程来源: | 视频讲座网 |
最后编审: | 2020-06-08:吴雨秋(课程编辑志愿者) |
阅读次数: | 54 |