首页概率论
0


PDDL在概率问题PDDL表示有效的政策建设

Efficient Policy Construction for MDPs Represented in Probabilistic
课程网址: http://videolectures.net/icaps2011_lesner_probabilistic/  
主讲教师: Boris Lesner
开课单位: 卡昂·巴斯·诺曼底大学
开课时间: 2011-07-21
课程语种: 英语
中文简介:
我们提出了一种新颖的动态规划方法来计算马尔可夫决策过程的最优策略,该过程在接地的概率PDDL中紧凑地表示。与使用中间表示作为动态贝叶斯网络的其他方法不同,我们通过引入专用备份规则直接利用PPDDL描述。这为DBN提供了另一种方法,特别是当操作对变量具有高度相关的影响时。实际上,我们在国际规划竞赛中对几个规划领域进行了有趣的改进。最后,我们利用备份规则的增量风格来设计有希望的策略修订方法。
课程简介: We present a novel dynamic programming approach to computing optimal policies for Markov Decision Processes compactly represented in grounded Probabilistic PDDL. Unlike other approaches, which use an intermediate representation as Dynamic Bayesian Networks, we directly exploit the PPDDL description by introducing dedicated backup rules. This provides an alternative approach to DBNs, especially when actions have highly correlated effects on variables. Indeed, we show interesting improvements on several planning domains from the International Planning Competition. Finally, we exploit the incremental flavor of our backup rules for designing promising approaches to policy revision.
关 键 词: 动态规划方法; 概率PDDL; 动态贝叶斯网络; 备份规则
课程来源: 视频讲座网
最后编审: 2020-06-29:zyk
阅读次数: 67