0


学习何时停止思考和做某事!

Learning When to Stop Thinking and Do Something!
课程网址: http://videolectures.net/icml09_poczos_lwts/  
主讲教师: Barnabás Póczos
开课单位: 卡内基梅隆大学
开课时间: 2009-08-26
课程语种: 英语
中文简介:
任何时候的算法基本上都能够在任何时间返回对给定任务的响应;通常,响应的质量随着时间的增加而提高。在这里,我们考虑学习的挑战,当我们在一系列的IID任务上终止这样的算法,以优化每单位时间的预期平均回报。我们提供了一个算法来回答这个问题。将全局优化交叉熵法与局部梯度上升法相结合,从理论上研究了估计梯度与真实梯度的距离。我们通过实验证明了该算法在一个玩具问题和一个真实的人脸检测任务中的适用性。
课程简介: An anytime algorithm is capable of returning a response to the given task at essentially any time; typically the quality of the response improves as the time increases. Here, we consider the challenge of learning when we should terminate such algorithms on each of a sequence of iid tasks, to optimize the expected average reward per unit time. We provide an algorithm for answering this question. We combine the global optimizer Cross Entropy method and the local gradient ascent, and theoretically investigate how far the estimated gradient is from the true gradient. We empirically demonstrate the applicability of the proposed algorithm on a toy problem, as well as on a real-world face detection task.
关 键 词: 随时算法; 预期平均报酬; 全局优化交叉熵方法; 局部梯度
课程来源: 视频讲座网
最后编审: 2019-11-30:lxf
阅读次数: 48