0


Web应用程序的推荐程序问题

Recommender Problems for Web Applications
课程网址: http://videolectures.net/kdd2010_agarwal_chen_rpwa/  
主讲教师: Bee-Chung Chen, Deepak Agarwal
开课单位: 领英公司
开课时间: 2010-10-01
课程语种: 英语
中文简介:
在这个为期半天的教程中,我们深入介绍了在Web应用程序的推荐问题中出现的数据挖掘挑战。自Netflix发布大型电影评级数据集以来,推荐人问题受到了相当多的关注。然而,本教程的重点是Web应用程序,我们将讨论在Netflix竞赛环境中讨论的重要扩展主题。虽然在Netflix竞赛中所进行的研究在“离线研究”(从推荐系统获得的历史数据拟合模型)方面取得了很大进展,但本教程超越了这一点并讨论了“在线研究”中的重要问题(设计科学)以最佳方式向用户提供项目的顺序方案。在抽象层面,推荐系统的目标是在每个用户访问某个网站时显示库存中的项目。每个显示都会产生一些用户的响应(点击次数,评分等),这些响应会更新我们对用户偏好的看法,并在将来提供更好的推荐。数据挖掘挑战是构建服务方案或顺序设计,其通过与项目的交互来学习用户偏好,以便在长时间范围内最大化一些效用函数。例如,像Yahoo!这样的门户网站也许有兴趣构建一个服务方案,向访问其​​首页的用户显示文章以最大化点击率。本教程将从实际应用程序中的实际示例开始,对问题进行正式定义。我们详细讨论了可能需要提出建议的各种场景,包括不同的项目池大小,动态与静态项目池,用户和项目的冷启动程度,每个显示器要选择的项目数,用户疲劳等。这些场景远远超出了经典电影推荐问题。然后,我们对该领域的最新技术进行了全面的概述,并详细讨论了几个我们希望将进一步研究的开放性问题。特别地,我们描述了时间序列模型,多臂匪盗方案,回归方法,矩阵分解方法,冷启动,基于相似性的方法,通过现实世界的例子来举例说明。最后,我们将详细讨论该领域的若干技术挑战。
课程简介: In this half-day tutorial, we provide an in-depth introduction of data mining challenges that arise in the context of recommender problems for web applications. Since Netflix released a large movie ratings dataset, recommender problems have received considerable attention. The focus of this tutorial however is on web applications and we will cover topics that are significant extensions of those discussed in the context of Netflix contest. While the research pursued in the context of Netflix contest made great advances in “offline-research” (fitting model to historic data obtained from a recommender system), this tutorial goes beyond this and discusses important issues in “online-research” (the science of designing sequential schemes that serve items to users in an optimal fashion). At an abstract level, the goal of recommender systems is to display items from an inventory for each user visit to some website. Each display results in some response from the user (clicks, ratings and so on) that updates our belief about user preferences and results in better recommendations in the future. The data mining challenge is to construct a serving scheme or sequential design that learns user preferences through interactions with items in order to maximize some utility function over a long time horizon. For instance, a portal like Yahoo! maybe interested in constructing a serving scheme that displays articles to users visiting their front page to maximize click rates. The tutorial will begin with a formal definition of the problem through real life examples drawn from actual applications. We provide a detailed discussion of various scenarios under which recommendations may have to be made, including varying item pool size, dynamic versus static item pool, degrees of cold-start both for users and items, number of items to be selected for each display, user fatigue and so on. These scenarios go significantly beyond the classical movie recommendation problem. We then provide a comprehensive overview of state-of-the-art techniques in this area with detailed discussion of several open problems that we hope will stimulate further research. In particular, we describe time-series models, multi-armed bandit schemes, regression approaches, matrix factorization approaches, cold-start, similarity-based approaches exemplified through real world examples. We will end with detailed discussion of several technical challenges in this area.
关 键 词: 应用程序; 数据挖掘; 扩展主题
课程来源: 视频讲座网
最后编审: 2020-01-13:chenxin
阅读次数: 61