0


落实“群众智慧”

Implementing the "Wisdom of the Crowd"
课程网址: http://videolectures.net/colt2014_mansour_implementing/  
主讲教师: Yishay Mansour
开课单位: 特拉维夫大学
开课时间: 2014-07-15
课程语种: 英语
中文简介:

随着互联网的迅速适应,“人群的智慧”已成为过去十年的热门话题。现象的核心是用户不仅消耗信息,而且产生信息。用户的双重角色导致了一个基本的设计问题,即如何激励用户去探索(产生新信息)而不是去利用(使用现有信息)。

我们提供了理解这一新特性的第一步。面对代理人的激励,探索与剥削之间经典权衡的一个方面。我们的抽象研究了一种新颖的模型,在该模型中,特工一个接一个地依次到达,而每个代理又从一组固定的动作中选择一个动作,以根据到达时所拥有的信息最大化其预期的回报。 (更具体地讲,每个特工都是两只土匪,鉴于他所观察到的信息,因此最大化了他自己的效用。)

可获得的信息会影响特工探索和产生新信息的动机。我们描述了以最大化社会福利为目标的计划者的最佳披露政策。规划者的最佳策略的特点是直观且易于实施。随着代理人数量的增加,社会福利会收敛到不受约束机制的最佳福利,而遗憾的局限是一个常数。

[基于与Ilan Kremer和Motty Perry的共同工作。]

课程简介: The “wisdom of the crowds” has become a hot topic in the last decade with the rapid adaptation of the Internet. At the core of the phenomena is that users do not only consume information but also produce it. This dual role of the users leads to a fundamental design question, how to incentivize the users to explore (produce new information) rather than exploit (use existing information). We provide a first step in understanding this new aspect of the classical tradeoff between exploration and exploitation in the face of agents’ incentives. Our abstraction studies a novel model in which agents arrive sequentially one after the other and each in turn chooses one action from a fixed set of actions to maximize his expected rewards given the information he possesses at the time of arrival. (More concretely, each agent is a two-arm bandit, maximizing his own utility given the information he has observed.) The information that becomes available affects the incentives of an agent to explore and generate new information. We characterize the optimal disclosure policy of a planner whose goal is to maximizes social welfare. The planner's optimal policy is characterized and shown to be intuitive and very simple to implement. As the number of agents increases the social welfare converges to the optimal welfare of the unconstrained mechanism and the regret is bounded by a constant. [Based on a joint work with Ilan Kremer and Motty Perry.]
关 键 词: 模型代理; 信息探索
课程来源: 视频讲座网
数据采集: 2020-11-26:zyk
最后编审: 2020-11-28:zyk
阅读次数: 47