0


学习多种强化学习的不同排名

Learning Diverse Rankings with Multi-Armed Bandits
课程网址: http://videolectures.net/icml08_kleinberg_ldr/  
主讲教师: Robert Kleinberg
开课单位: 康奈尔大学
开课时间: 2008-08-06
课程语种: 英语
中文简介:
用于学习对Web文档进行排名的算法通常假设文档的相关性独立于其他文档。这导致学习的排名函数产生具有冗余结果的排名。相比之下,用户研究表明,高等级的多样性通常是首选。我们提出了两种新的学习算法,可以根据用户的点击行为直接学习各种文档排名。我们证明这些算法最小化放弃,或者最大化在排名的前k个位置找到相关文档的概率。我们证明,即使用户的兴趣发生变化,我们的算法之一渐近地实现了在多项式时间内可获得的最佳可能收益。当用户兴趣是静态的时,另一个在经验上表现更好,并且在那种情况下理论上仍然接近最优。
课程简介: Algorithms for learning to rank Web documents usually assume a document's relevance is independent of other documents. This leads to learned ranking functions that produce rankings with redundant results. In contrast, user studies have shown that diversity at high ranks is often preferred. We present two new learning algorithms that directly learn a diverse ranking of documents based on users' clicking behavior. We show that these algorithms minimize abandonment, or alternatively, maximize the probability that a relevant document is found in the top k positions of a ranking. We show that one of our algorithms asymptotically achieves the best possible payoff obtainable in polynomial time even as user's interests change. The other performs better empirically when user interests are static, and is still theoretically near-optimal in that case.
关 键 词: Web文档; 排名函数; 多项式时间
课程来源: 视频讲座网
最后编审: 2019-04-19:lxf
阅读次数: 77