0


线路之间的连接:充实社会网络与文本

Connections between the Lines: Augmenting Social Networks with Text
课程网址: http://videolectures.net/kdd09_chang_cbtl/  
主讲教师: Jonathan Chang
开课单位: 脸书公司
开课时间: 2009-09-14
课程语种: 英语
中文简介:
网络数据无处不在, 编码人员、地点、基因或公司等实体之间关系的集合。虽然有趣实体网络的许多资源正在出现, 但其中大多数只能以有限的方式注释连接。尽管实体之间的关系很丰富, 但手动为大型真实实体上的每一对实体设计这些关系的完整特征是不切实际的。 本文提出了一种新的概率主题模型来分析文本语料库, 并推断其实体及其实体之间的关系描述。我们开发了变分方法来执行模型的近似推理, 并证明我们的模型可以实际部署在维基百科等大型语料库上。我们从质量和数量上表明, 我们的模型可以构造和注释关系图, 并做出有用的预测。
课程简介: Network data is ubiquitous, encoding collections of relationships between entities such as people, places, genes, or corporations. While many resources for networks of interesting entities are emerging, most of these can only annotate connections in a limited fashion. Although relationships between entities are rich, it is impractical to manually devise complete characterizations of these relationships for every pair of entities on large, real-world corpora. In this paper we present a novel probabilistic topic model to analyze text corpora and infer descriptions of its entities and of relationships between those entities. We develop variational methods for performing approximate inference on our model and demonstrate that our model can be practically deployed on large corpora such as Wikipedia. We show qualitatively and quantitatively that our model can construct and annotate graphs of relationships and make useful predictions.
关 键 词: 线路; 连接; 社会网络; 网络数据
课程来源: 视频演讲网
最后编审: 2020-06-19:cxin
阅读次数: 44