0


跟踪多个主题以查找有趣的文章

Tracking Multiple Topics for Finding Interesting Articles
课程网址: http://videolectures.net/kdd07_pon_tmt/  
主讲教师: Raymond Pon
开课单位: 加州大学洛杉矶分校
开课时间: 2007-09-14
课程语种: 英语
中文简介:
我们为iScore引入了多主题跟踪(MTT),以便为具有多种兴趣的用户更好地推荐新闻文章,并解决用户利益随时间的变化。作为基本Rocchio算法,传统主题检测和跟踪以及单通道聚类的扩展,MTT维护多个兴趣简档,以便在给定用户反馈的情况下为特定用户识别有趣的文章。仅关注有趣的主题使iScore能够丢弃无用的配置文件,以解决用户兴趣的变化,并在资源消耗和分类准确性之间取得平衡。此外,通过将主题的趣味性与文章的趣味性联系起来,iScore能够获得比传统方法(如Rocchio算法)更高质量的结果。我们确定了几个适用于MTT的操作参数。使用相同的参数,我们表明MTT单独产生高质量的结果,用于推荐来自几个语料库的有趣文章。在推荐来自雅虎的新闻报道时,包含MTT可将iScore的表现提高9%。新闻RSS提要和TREC11自适应滤波器文章集。通过一项小型用户调查,我们发现iScore在仅提供少量用户反馈时仍能表现良好。
课程简介: We introduce multiple topic tracking (MTT) for iScore to better recommend news articles for users with multiple interests and to address changes in user interests over time. As an extension of the basic Rocchio algorithm, traditional topic detection and tracking, and single-pass clustering, MTT maintains multiple interest profiles to identify interesting articles for a specific user given user-feedback. Focusing on only interesting topics enables iScore to discard useless profiles to address changes in user interests and to achieve a balance between resource consumption and classification accuracy. Also by relating a topic’s interestingness to an article’s interestingness, iScore is able to achieve higher quality results than traditional methods such as the Rocchio algorithm. We identify several operating parameters that work well for MTT. Using the same parameters, we show that MTT alone yields high quality results for recommending interesting articles from several corpora. The inclusion of MTT improves iScore’s performance by 9% in recommending news articles from the Yahoo! News RSS feeds and the TREC11 adaptive filter article collection. And through a small user study, we show that iScore can still perform well when only provided with little user feedback.
关 键 词: 多主题跟踪; Rocchio算法; 语料库
课程来源: 视频讲座网
最后编审: 2019-05-09:lxf
阅读次数: 29