强化学习的线性模型，线性值 - 函数逼近和特征选择分析][An Analysis of Linear Models, Linear Value-Function Approximation, and Feature Selection for Reinforcement Learning]

强化学习的线性模型，线性值 - 函数逼近和特征选择分析 An Analysis of Linear Models, Linear Value-Function Approximation, and Feature Selection for Reinforcement Learning


课程网址:	http://videolectures.net/icml08_parr_alm/
主讲教师:	Ronald Parr
开课单位:	杜克大学
开课时间:	2008-08-12
课程语种:	英语
中文简介:	我们证明线性值函数逼近等价于线性模型近似的一种形式。我们推导出模型逼近误差与Bellman误差之间的关系，并展示了这种关系如何指导特征选择以进行模型改进和/或改进价值函数。我们还展示了这些结果如何深入了解现有特征选择算法的行为。
课程简介:	We show that linear value function approximation is equivalent to a form of linear model approximation. We derive a relationship between the model approximation error and the Bellman error, and show how this relationship can guide feature selection for model improvement and/or value function improvement. We also show how these results give insight into the behavior of existing feature-selection algorithms.
关键词:	线性值函数; 逼近误差; 特征选择算法
课程来源:	视频讲座网
最后编审:	2019-04-19：lxf
阅读次数:	130

境外开放课程导航