0


一目了然的博客:基于内容的简单结构

The Blogosphere at a glance: Content-based structures made simple
课程网址: http://videolectures.net/socialweb2011_gornerup_blogosphere/  
主讲教师: Olof Görnerup
开课单位: 瑞典计算机科学研究所
开课时间: 2011-08-04
课程语种: 英语
中文简介:
介绍了一种基于基本词汇重叠相似性测度的网络表示方法。这种表示的简单性使得它在计算上易于处理、透明并且对表示依赖的工件不敏感。使用瑞典的博客数据,我们证明了尽管表示简单,但它仍然能够捕获博客圈中内容的重要结构属性。首先,处理类似主题的博客被组织在不同的网络集群中。第二,网络按层次结构组织为集群,然后形成高阶集群:一种类似于博客分类法的复合结构。
课程简介: A network representation based on a basic wordoverlap similarity measure between blogs is introduced. The simplicity of the representation renders it computationally tractable, transparent and insensitive to representation-dependent artifacts. Using Swedish blog data, we demonstrate that the representation, in spite of its simplicity, manages to capture important structural properties of the content in the blogosphere. First, blogs that treat similar subjects are organized in distinct network clusters. Second, the network is hierarchically organized as clusters in turn form higher-order clusters: a compound structure reminiscent of a blog taxonomy.
关 键 词: 社会化媒体; 机器学习; 聚类; 计算机科学; Web挖掘
课程来源: 视频讲座网
最后编审: 2020-12-19:yumf
阅读次数: 38