On-line Discovery of Temporal-Difference Networks |
|
Course URL: | http://videolectures.net/icml08_makino_odt/ |
Lecturer: | Takaki Makino |
Institution: | University of Tokyo |
Date: | 2008-08-04 |
Language: | English |
Abstract: | We present an algorithm for on-line, incremental discovery of temporal-difference (TD) networks. The key contribution is a set of three criteria for expanding a node in a TD network: a node is expanded when it is well-known, independent, and has a prediction error that requires further explanation. Since none of these criteria requires centralized computation, they are easily computed in a parallel and distributed manner, and the approach scales to larger problems better than other discovery methods for predictive state representations. Through computer experiments, we demonstrate the empirical effectiveness of our algorithm. |
Keywords: | prediction error; computer experiments; temporal-difference networks |
Source: | VideoLectures.NET |
Last reviewed: | 2019-04-19:lxf |
Views: | 55 |
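
The abstract names three expansion criteria but this listing does not give their definitions. The following is only a minimal sketch of how such a per-node test might look, assuming each node keeps its own running statistics. The `NodeStats` fields, the `should_expand` function, and all threshold values are hypothetical and are not taken from the talk.

```python
from dataclasses import dataclass


@dataclass
class NodeStats:
    """Hypothetical per-node running statistics (illustrative names only)."""
    visits: int            # number of updates this node's prediction has received
    max_abs_corr: float    # largest |correlation| with any other node's prediction
    mean_sq_error: float   # running mean of the squared prediction error


def should_expand(stats: NodeStats,
                  min_visits: int = 1000,
                  max_corr: float = 0.9,
                  min_error: float = 0.01) -> bool:
    """Return True when all three (assumed) criteria hold for one node.

    The thresholds are placeholders chosen for illustration.
    """
    well_known = stats.visits >= min_visits        # enough experience with the node
    independent = stats.max_abs_corr < max_corr    # not redundant with another node
    unexplained = stats.mean_sq_error > min_error  # error still needs explaining
    return well_known and independent and unexplained


if __name__ == "__main__":
    print(should_expand(NodeStats(visits=5000, max_abs_corr=0.2, mean_sq_error=0.05)))
    print(should_expand(NodeStats(visits=10, max_abs_corr=0.2, mean_sq_error=0.05)))
```

Because the test reads only one node's statistics, it can be evaluated for every node independently, which is the property the abstract points to when it says no centralized computation is required.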