0


用主题模型描述微博

Characterizing Microblogs with Topic Models
课程网址: http://videolectures.net/icwsm2010_ramage_cmt/  
主讲教师: Daniel Ramage
开课单位: 斯坦福大学
开课时间: 2010-06-29
课程语种: 英语
中文简介:
随着微博的普及,像Twitter这样的服务正在支持信息收集需求超越其作为社交网络的传统角色。但大多数用户与Twitter的互动仍然主要集中在他们的社交图上,迫使“我关注的人”与“我想阅读的东西”经常不恰当的混淆。我们描述了当前Twitter界面无法支持的一些信息需求,并争取更好地表达内容以解决这些挑战。我们提出了部分监督学习模型(Labeled LDA)的可扩展实现,该模型将Twitter提要的内容映射到维度。这些维度大致对应于帖子的内容,风格,状态和社交特征。我们使用此模型描述用户和推文的特征,并在两个面向信息消费的任务中呈现结果。
课程简介: As microblogging grows in popularity, services like Twitter are coming to support information gathering needs above and beyond their traditional roles as social networks. But most users’ interaction with Twitter is still primarily focused on their social graphs, forcing the often inappropriate conflation of “people I follow” with “stuff I want to read.” We characterize some information needs that the current Twitter interface fails to support, and argue for better representations of content for solving these challenges. We present a scalable implementation of a partially supervised learning model (Labeled LDA) that maps the content of the Twitter feed into dimensions. These dimensions correspond roughly to substance, style, status, and social characteristics of posts. We characterize users and tweets using this model, and present results on two information consumption oriented tasks.
关 键 词: 信息收集; 社交网络; 部分监督学习
课程来源: 视频讲座网
最后编审: 2019-04-26:lxf
阅读次数: 36