0


匪徒问题,查询学习,和haystack维度

Bandits, Query Learning, and the Haystack Dimension
课程网址: http://videolectures.net/colt2011_amin_bandits/  
主讲教师: Kareem Amin
开课单位: 宾夕法尼亚大学
开课时间: 2011-08-02
课程语种: 英语
中文简介:
由于武装匪徒(MAB)存在很大甚至有限数量武器的问题,我们通过查询所选输入(或武器)的功能来考虑寻找最大未知目标函数的问题。我们在假设收益的基础上给出了这个问题的查询复杂性的分析?每个臂的一个由属于已知的,有限的但是任意的函数类的函数给出。我们的分析集中在一个新的函数类复杂概念上,我们将其称为haystack维度,用于证明简单贪婪算法的近似最优性。然后将该算法用作功能性MAB算法中的子例程,从而产生可证明接近最优的遗憾。我们提供了对无限基数设置的推广,并评论了我们的分析如何与查询学习和广义二进制搜索的现有结果相关联和改进。
课程简介: Motivated by multi-armed bandits (MAB) problems with a very large or even in finite number of arms, we consider the problem of finding a maximum of an unknown target function by querying the function at chosen inputs (or arms). We give an analysis of the query complexity of this problem, under the assumption that the payoff of each arm is given by a function belonging to a known, finite, but otherwise arbitrary function class. Our analysis centers on a new notion of function class complexity that we call the haystack dimension, which is used to prove the approximate optimality of a simple greedy algorithm. This algorithm is then used as a subroutine in a functional MAB algorithm, yielding provably near-optimal regret. We provide a generalization to the infinite cardinality setting, and comment on how our analysis is connected to, and improves upon, existing results for query learning and generalized binary search.
关 键 词: 函数; 简单贪婪算法; 查询学习
课程来源: 视频讲座网
最后编审: 2019-03-05:lxf
阅读次数: 66