0


一个以社区为基础的社会网络中的标签pseudolikelihood方法的关系

A Community-Based Pseudolikelihood Approach for Relationship Labeling in Social Networks
课程网址: http://videolectures.net/ecmlpkdd2011_wan_community/  
主讲教师: Huaiyu Wan
开课单位: 北京交通大学
开课时间: 2011-10-03
课程语种: 汉简
中文简介:
社会网络由一组社会关系连接的人(或其他社会实体)组成。对关系类型的认识有助于我们了解社会网络的结构和特征。传统的分类器对于关系标记不够精确,因为它们假定所有的标签都是独立的,并且分布相同。将关系概率模型——关系马尔可夫网络(rmns)引入到关系的标识中,但由于参数估计的效率不高,使得在大规模的社会网络中很难进行配置。本文提出了一种基于社区的关系标记伪似然方法。利用社会网络的社区结构来辅助构建条件随机场,使我们的方法合理、准确。另外,伪似然的计算简单有效地解决了RMN所面临的时间复杂度问题。我们将我们的方法应用于两个现实世界的社交网络,一个是恐怖分子关系网络,另一个是我们从加密呼叫详细记录中收集的电话呼叫网络。在我们的实验中,为了避免在将一个紧密相连的社交网络分割成单独的训练和测试子集时丢失链接,我们根据链接而不是个人分割数据集。实验结果表明,该方法在精度和效率方面都取得了良好的效果。
课程简介: A social network consists of people (or other social entities) connected by a set of social relationships. Awareness of the relationship types is very helpful for us to understand the structure and the characteristics of the social network. Traditional classifiers are not accurate enough for relationship labeling since they assume that all the labels are independent and identically distributed. A relational probabilistic model, relational Markov networks (RMNs), is introduced to labeling relationships, but the inefficient parameter estimation makes it difficult to deploy in large-scale social networks. In this paper, we propose a community-based pseudolikelihood (CBPL) approach for relationship labeling. The community structure of a social network is used to assist in constructing the conditional random field, and this makes our approach reasonable and accurate. In addition, the computational simplicity of pseudolikelihood effectively resolves the time complexity problem which RMNs are suffering. We apply our approach on two real-world social networks, one is a terrorist relation network and the other is a phone call network we collected from encrypted call detail records. In our experiments, for avoiding losing links while splitting a closely connected social network into separate training and test subsets, we split the datasets according to the links rather than the individuals. The experimental results show that our approach performs well in terms of accuracy and efficiency.
关 键 词: 社会网络; 概率模型; 参数估计
课程来源: 视频讲座网
最后编审: 2019-11-30:lxf
阅读次数: 37