

Efficient Policy Construction for MDPs Represented in Probabilistic
Course URL: http://videolectures.net/icaps2011_lesner_probabilistic/
Lecturer: Boris Lesner
Institution: University of Caen Basse-Normandie
Date: 2011-07-21
Language: English
Course description: We present a novel dynamic programming approach to computing optimal policies for Markov Decision Processes (MDPs) compactly represented in grounded Probabilistic PDDL (PPDDL). Unlike other approaches, which use Dynamic Bayesian Networks (DBNs) as an intermediate representation, we exploit the PPDDL description directly by introducing dedicated backup rules. This provides an alternative to DBNs, especially when actions have highly correlated effects on variables. We show improvements on several planning domains from the International Planning Competition. Finally, we exploit the incremental nature of our backup rules to design promising approaches to policy revision.
Keywords: dynamic programming approach; Probabilistic PDDL; Dynamic Bayesian Networks; backup rules
Source: VideoLectures.NET
Last reviewed: 2020-06-29:zyk
Views: 73