0


雷夫兰克:选择最有帮助的书完全无监督的算法综述

RevRank: a Fully Unsupervised Algorithm for Selecting the Most Helpful Book Reviews
课程网址: http://videolectures.net/icwsm09_tsur_rr/  
主讲教师: Oren Tsur
开课单位: 哈佛大学
开课时间: 2009-06-24
课程语种: 英语
中文简介:
我们提出了一种根据评论的有用性自动对用户生成的图书评论进行排序的算法。给出了一组评论,我们的revrank算法识别了构成虚拟最优评论核心的显性词汇。这个词汇定义了一个特征向量表示。然后将评论转换为这种表示,并根据它们与“虚拟核心”评论向量之间的距离进行排名。该算法完全不受监督,从而避免了昂贵且容易出错的人工训练注释。我们的实验表明,Revrank明显优于模仿亚马逊用户投票评论排名系统的基线。
课程简介: We present an algorithm for automatically ranking usergenerated book reviews according to review helpfulness. Given a collection of reviews, our REVRANK algorithm identifies a lexicon of dominant terms that constitutes the core of a virtual optimal review. This lexicon defines a feature vector representation. Reviews are then converted to this representation and ranked according to their distance from a ‘virtual core’ review vector. The algorithm is fully unsupervised and thus avoids costly and error-prone manual training annotations. Our experiments show that REVRANK clearly outperforms a baseline imitating the Amazon user vote review ranking system.
关 键 词: 自动排序算法; 特征向量; 排名系统
课程来源: 视频讲座网
最后编审: 2019-11-30:lxf
阅读次数: 46