0


MDP导航的自主探索

Autonomous Exploration For Navigating In MDPs
课程网址: http://videolectures.net/machine_auer_autonomous_exploration/  
主讲教师: Peter Auer
开课单位: 莱奥本矿业大学
开课时间: 2013-08-06
课程语种: 英语
中文简介:
虽然内在动机的学习代理人具有克服更多监督学习系统限制的相当大的希望,但这些代理的定量评估和理论分析是困难的。 我们建议考虑对自主学习进行限制性设置,以便对学习成绩进行系统评估。 在此设置中,代理需要学习导航马尔可夫决策过程,其中外部奖励不存在或被忽略。 我们为这种情景提出了一种学习算法,并根据它用于学习环境的探索量来评估它。
课程简介: While intrinsically motivated learning agents hold considerable promise to overcome limitations of more supervised learning systems, quantitative evaluation and theoretical analysis of such agents are difficult. We propose to consider a restricted setting for autonomous learning where systematic evaluation of learning performance is possible. In this setting the agent needs to learn to navigate in a Markov Decision Process where extrinsic rewards are not present or are ignored. We present a learning algorithm for this scenario and evaluate it by the amount of exploration it uses to learn the environment.
关 键 词: 内在动机; 定量评估; 理论分析; 马尔可夫决策过程
课程来源: 视频讲座网
最后编审: 2019-05-15:cjy
阅读次数: 18