0


机器学习与机构设计的相互作用

The Interplay of Machine Learning and Mechanism Design
课程网址: http://videolectures.net/nips2010_parkes_iml/  
主讲教师: David C. Parkes
开课单位: 哈佛大学
开课时间: 2011-01-12
课程语种: 英语
中文简介:
在机制设计的经济理论中,目标是从多个代理中的每一个中获取私人信息,以便选择理想的系统范围的结果,并且尽管代理人自己有兴趣促进个体有益的结果。拍卖提供了一个典型的例子,以投标的形式引出信息,以及定义结果的资源和支付的分配。实际上,机器学习(ML)和机制设计(MD)之间新兴相互作用的一个方面是通过将拍卖解释为学习代理商评估功能的方法而产生的。除了寻求足够的准确性以支持最佳资源分配之外,我们还需要激励兼容性,即价格对任何单个代理的输入都不敏感,并且在统计ML中找到与正规化有趣的联系。更广泛地说,ML可用于从头设计,学习具有适当激励属性的支付规则。 MD的想法也流入了ML。一个例子考虑使用机制引出私有状态,奖励和过渡模型,以实现多代理系统中的协调探索和利用,尽管自身利益。另一个应用是监督学习,其中标记的训练数据是从自感兴趣的代理引出的,每个代理都有自己的客观标准,该假设是由机制学习的。展望未来,一个诱人的挑战问题是采用激励机制来设计强大的代理体系结构,例如分配促进模块化智能系统的内部奖励。
课程简介: In the economic theory of mechanism design, the goal is to elicit private information from each of multiple agents in order to select a desirable system wide outcome, and despite agent self-interest in promoting individually beneficial outcomes. Auctions provide a canonical example, with information elicited in the form of bids, and an allocation of resources and payments defining an outcome. Indeed, one aspect of the emerging interplay between machine learning (ML) and mechanism design (MD) arises by interpreting auctions as a method for learning agent valuation functions. In addition to seeking sufficient accuracy to support optimal resource allocation, we require for incentive compatibility that prices are insensitive to the inputs of any individual agent and find an interesting connection with regularization in statistical ML. More broadly, ML can be used for de novo design, in learning payment rules with suitable incentive properties. Ideas from MD are also flowing into ML. One example considers the use of mechanisms to elicit private state, reward and transition models, in enabling coordinated exploration and exploitation in multi-agent systems despite self-interest. Another application is to supervised learning, where labeled training data is elicited from self-interested agents, each with its own objective criterion on the hypothesis learned by the mechanism. Looking ahead, a tantalizing challenge problem is to adopt incentive mechanisms for the design of robust agent architectures, for example in assigning internal rewards that promote modular intelligent systems.
关 键 词: 机制设计; 监督学习; 代理体系结构
课程来源: 视频讲座网
最后编审: 2019-07-25:cwx
阅读次数: 59