0


自适应k-气象学问题及其在强化学习中结构学习与特征选择中的应用

The Adaptive k-Meteorologists Problem and Its Application to Structure Learning and Feature Selection in Reinforcement Learning
课程网址: http://videolectures.net/icml09_diuk_akmp/  
主讲教师: Carlos Diuk
开课单位: 新泽西州立大学
开课时间: 2009-08-26
课程语种: 英语
中文简介:
本文的目的是三折。首先,我们在最近提出的KWIK框架中形式化并研究了学习概率概念的问题。我们给出一种算法的细节,称为自适应k气象学家算法,分析其样本复杂性上限,并给出匹配的下界。其次,该算法用于为因子状态问题创建新的强化学习算法,该算法比先前的现有技术算法有显着改进。最后,我们应用自适应k气象学家算法来消除现有强化学习算法中的极限假设。我们的方法的有效性在几个基准测试领域以及机器人导航问题中经验证明。
课程简介: The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an algorithm, known as the Adaptive k-Meteorologists Algorithm, analyze its sample complexity upper bound, and give a matching lower bound. Second, this algorithm is used to create a new reinforcement learning algorithm for factoredstate problems that enjoys significant improvement over the previous state-of-the-art algorithm. Finally, we apply the Adaptive k-Meteorologists Algorithm to remove a limiting assumption in an existing reinforcement-learning algorithm. The effectiveness of our approaches are demonstrated empirically in a couple benchmark domains as well as a robotics navigation problem.
关 键 词: 概率概念; 自适应k气象学家算法; 强化学习算法
课程来源: 视频讲座网
最后编审: 2019-04-21:lxf
阅读次数: 70