Bandit子集选择中的信息复杂性][Information Complexity in Bandit Subset Selection]_MOOC(慕课)境外开放课程

   首页 → 计算机软件
   首页 → 计算机应用
   首页 → 计算机科学技术基础学科

Bandit子集选择中的信息复杂性 Information Complexity in Bandit Subset Selection


课程网址:	http://videolectures.net/colt2013_kaufmann_information/
主讲教师:	Emilie Kaufmann
开课单位:	国立高等电信学校
开课时间:	2013-08-09
课程语种:	英语
中文简介:	研究了the arms of a stochastic bandit 的有效搜索问题，以确定最优子集。在PAC和固定预算公式下，利用基于kl -分歧的置信区间，得到了改进的界。在后悔设置中应用类似的思想已经产生了关于武器之间的kl发散的界限，而我们在纯探索设置中的界限涉及武器之间的Chernoff信息。除了将这一新颖的数量引入到bandits算法文学中，我们还对纯探索问题的“竞速”策略和“智能抽样”策略进行了比较，发现了支持后者的有力证据。
课程简介:	We consider the problem of efficiently exploring the arms of a stochastic bandit to identify the best subset. Under the PAC and the fixed-budget formulations, we derive improved bounds by using KL-divergence-based confidence intervals. Whereas the application of a similar idea in the regret setting has yielded bounds in terms of the KL-divergence between the arms, our bounds in the pure-exploration setting involve the Chernoff information between the arms. In addition to introducing this novel quantity to the bandits literature, we contribute a comparison between the “racing” and “smart sampling” strategies for pure-exploration problems, finding strong evidence in favor of the latter.
关键词:	有效搜索; 最优子集; 竞速策略; 智能抽样
课程来源:	视频讲座网
最后编审:	2019-10-17：cwx
阅读次数:	101

服务热线：0574-88229129
电子邮件：info_lib@nbt.edu.cn
信息服务：图书馆305室
系统研发：图书馆303室

图书馆学生服务群：437507696
图书馆教工服务群：1038697975
QQ在线咨询
2013-2025 © 浙大宁波理工学院图书馆