强化学习的线性模型,线性值 - 函数逼近和特征选择分析An Analysis of Linear Models, Linear Value-Function Approximation, and Feature Selection for Reinforcement Learning |
|
课程网址: | http://videolectures.net/icml08_parr_alm/ |
主讲教师: | Ronald Parr |
开课单位: | 杜克大学 |
开课时间: | 2008-08-12 |
课程语种: | 英语 |
中文简介: | 我们证明线性值函数逼近等价于线性模型近似的一种形式。我们推导出模型逼近误差与Bellman误差之间的关系,并展示了这种关系如何指导特征选择以进行模型改进和/或改进价值函数。我们还展示了这些结果如何深入了解现有特征选择算法的行为。 |
课程简介: | We show that linear value function approximation is equivalent to a form of linear model approximation. We derive a relationship between the model approximation error and the Bellman error, and show how this relationship can guide feature selection for model improvement and/or value function improvement. We also show how these results give insight into the behavior of existing feature-selection algorithms. |
关 键 词: | 线性值函数; 逼近误差; 特征选择算法 |
课程来源: | 视频讲座网 |
最后编审: | 2019-04-19:lxf |
阅读次数: | 102 |