学习何时停止思考和做某事!][Learning When to Stop Thinking and Do Something!]_MOOC(慕课)境外开放课程

   首页 → 商业经济学
   首页 → 应用数学
   首页 → 计算数学

学习何时停止思考和做某事! Learning When to Stop Thinking and Do Something!


课程网址:	http://videolectures.net/icml09_poczos_lwts/
主讲教师:	Barnabás Póczos
开课单位:	卡内基梅隆大学
开课时间:	2009-08-26
课程语种:	英语
中文简介:	任何时候的算法基本上都能够在任何时间返回对给定任务的响应；通常，响应的质量随着时间的增加而提高。在这里，我们考虑学习的挑战，当我们在一系列的IID任务上终止这样的算法，以优化每单位时间的预期平均回报。我们提供了一个算法来回答这个问题。将全局优化交叉熵法与局部梯度上升法相结合，从理论上研究了估计梯度与真实梯度的距离。我们通过实验证明了该算法在一个玩具问题和一个真实的人脸检测任务中的适用性。
课程简介:	An anytime algorithm is capable of returning a response to the given task at essentially any time; typically the quality of the response improves as the time increases. Here, we consider the challenge of learning when we should terminate such algorithms on each of a sequence of iid tasks, to optimize the expected average reward per unit time. We provide an algorithm for answering this question. We combine the global optimizer Cross Entropy method and the local gradient ascent, and theoretically investigate how far the estimated gradient is from the true gradient. We empirically demonstrate the applicability of the proposed algorithm on a toy problem, as well as on a real-world face detection task.
关键词:	随时算法; 预期平均报酬; 全局优化交叉熵方法; 局部梯度
课程来源:	视频讲座网
最后编审:	2019-11-30：lxf
阅读次数:	68

服务热线：0574-88229129
电子邮件：info_lib@nbt.edu.cn
信息服务：图书馆305室
系统研发：图书馆303室

图书馆学生服务群：437507696
图书馆教工服务群：1038697975
QQ在线咨询
2013-2025 © 浙大宁波理工学院图书馆