0


强化学习简介

Introduction to Reinforcement Learning
课程网址: http://videolectures.net/mlss08au_szepesvari_rele/  
主讲教师: Csaba Szepesvári
开课单位: 阿尔伯塔大学
开课时间: 2008-03-17
课程语种: 英语
中文简介:
本教程将介绍   强化学习,即学习采取什么行动,何时采取它们,以优化长期性能。 这可能涉及牺牲即时奖励以获得更大的奖励长期或只是为了获得更多有关环境的信息。该本教程的第一部分将介绍基础知识,例如Markov决策过程,动态规划,时差学习,蒙特卡罗方法,资格痕迹,功能的作用近似。 在第二部分,我们将介绍最近的一些发展,即政策梯度和二阶方法,如LSPI和改进的Bellman残差最小化算法
课程简介: The tutorial will introduce Reinforcement Learning, that is, learning what actions to take, and when to take them, so as to optimize long-term performance. This may involve sacrificing immediate reward to obtain greater reward in the long-term or just to obtain more information about the environment. The first part of the tutorial will cover the basics, such as Markov decision processes, dynamic programming, temporal-difference learning, Monte Carlo methods, eligibility traces, the role of function approximation. In the second part we cover some recent developments, namely policy gradient and second order methods, such as LSPI and the modified Bellman residual minimization algorithm.
关 键 词: 强化学习; 长期性能; 蒙特卡罗方法
课程来源: 视频讲座网
最后编审: 2019-07-17:cjy
阅读次数: 92