0


贪婪搜索的加权规则集迭代学习

Iterative Learning of Weighted Rule Sets for Greedy Search
课程网址: http://videolectures.net/icaps2010_xu_greedysearch/  
主讲教师: Yuehua Xu
开课单位: 俄勒冈州立大学
开课时间: 2010-05-10
课程语种: 英语
中文简介:
贪婪搜索通常用于以牺牲完整性和最优性为代价快速生成解决方案。在本文中,我们考虑一组加权行动选择规则的学习集合,用以指导贪婪搜索,并将其应用于自动规划。我们在贪婪搜索学习方面比以前的工作做出了两个主要贡献。首先,我们引入加权动作选择规则集作为贪婪搜索的一种新的控制知识形式。以前的工作已经表明了行为选择规则对于贪婪搜索的实用性,但是已经将这些规则视为硬约束,从而导致了脆弱性。我们的加权规则集允许多个规则投票,有助于提高对噪声规则的鲁棒性。其次,给出了一种基于RankBoost的加权规则集迭代学习算法。每次迭代都考虑当前规则集的实际性能,并基于观察到的搜索错误指导学习。这与大多数先前的方法形成对比,后者独立于搜索过程学习控制知识。我们的实证结果已经表明这种方法在很多领域具有重大的前景。
课程简介: Greedy search is commonly used in an attempt to generate solutions quickly at the expense of completeness and optimality. In this work, we consider learning sets of weighted action-selection rules for guiding greedy search with application to automated planning. We make two primary contributions over prior work on learning for greedy search. First, we introduce weighted sets of action-selection rules as a new form of control knowledge for greedy search. Prior work has shown the utility of action-selection rules for greedy search, but has treated the rules as hard constraints, resulting in brittleness. Our weighted rule sets allow multiple rules to vote, helping to improve robustness to noisy rules. Second, we give a new iterative learning algorithm for learning weighted rule sets based on RankBoost, an efficient boosting algorithm for ranking. Each iteration considers the actual performance of the current rule set and directs learning based on the observed search errors. This is in contrast to most prior approaches, which learn control knowledge independently of the search process. Our empirical results have shown significant promise for this approach in a number of domains.
关 键 词: 计算机科学; 机器学习; 贪婪搜索
课程来源: 视频讲座网
最后编审: 2019-11-17:cwx
阅读次数: 30