0


宠物:一个社会团体中的社区流行事件跟踪的统计模型

PET: A Statistical Model for Popular Events Tracking in Social Communities
课程网址: http://videolectures.net/kdd2010_xide_lin_pet/  
主讲教师: Cindy Xide Lin
开课单位: 伊利诺伊大学
开课时间: 2010-10-01
课程语种: 英语
中文简介:
在线社区中用户生成的信息具有文本流和网络结构的混合特征,两者都随时间变化。一个很好的例子是一个拥有每日博客帖子和博客社交网络的网络博客社区。分析在线社区的一个重要任务是观察和跟踪社区中随时间演变的流行事件或主题。现有的研究方法通常侧重于主题的粗犷或网络的发展,而忽略了文本主题与网络结构之间的相互作用。在本文中,我们正式定义了在线社区中流行事件跟踪的问题,重点讨论文本和网络之间的相互作用。我们提出了一种新的统计方法,该方法考虑到用户兴趣的突发性、网络结构上的信息扩散以及文本主题的演变,对事件随时间的流行进行建模。具体地说,定义了一个吉布斯随机字段来模拟历史状态的影响和图中的依赖关系;之后,主题模型生成事件文本内容中的单词,并由吉布斯随机字段进行正则化。我们证明了两个经典的信息扩散模型和文本模糊模型是我们模型在特定情况下的特例。对两个不同社区和数据集(即Twitter和DBLP)进行的实验表明,我们的方法是有效的,并且优于现有的方法。
课程简介: User generated information in online communities has been characterized with the mixture of a text stream and a network structure both changing over time. A good example is a web-blogging community with the daily blog posts and a social network of bloggers. An important task of analyzing an online community is to observe and track the popular events, or topics that evolve over time in the community. Existing approaches usually focus on either the burstiness of topics or the evolution of networks, but ignoring the interplay between textual topics and network structures. In this paper, we formally define the problem of popular event tracking in online communities (PET), focusing on the interplay between texts and networks. We propose a novel statistical method that models the the popularity of events over time, taking into consideration the burstiness of user interest, information diffusion on the network structure, and the evolution of textual topics. Specifically, a Gibbs Random Field is defined to model the influence of historic status and the dependency relationships in the graph; thereafter a topic model generates the words in text content of the event, regularized by the Gibbs Random Field. We prove that two classic models in information diffusion and text burstiness are special cases of our model under certain situations. Empirical experiments with two different communities and datasets (i.e., Twitter and DBLP) show that our approach is effective and outperforms existing approaches.
关 键 词: 网络社区; 跟踪流行事件; 突发性话题; 信息扩散; 数据实证实验
课程来源: 视频讲座网
最后编审: 2020-07-13:yumf
阅读次数: 101