0


使用指向结构化数据源的超链接改进社交媒体中的分类

Improving Categorisation in Social Media using Hyperlinks to Structured Data Sources
课程网址: http://videolectures.net/eswc2011_kinsella_improving/  
主讲教师: Sheila Kinsella
开课单位: 爱尔兰国立大学
开课时间: 2011-07-07
课程语种: 英语
中文简介:
社交媒体为主题分类提出了独特的挑战,包括帖子的简洁性,对话的非正式性以及频繁依赖外部超链接为对话提供背景。在本文中,我们调查这些外部超链接的用途,以确定个别帖子的主题。我们将分析重点放在具有相关元数据的对象上,这些对象可以通过API或链接数据在Web上获得。我们的实验表明,除了原始帖子内容之外,包含超链接对象的元数据显着提高了两个不同数据集的分类器性能。我们发现,包含API和关联数据中的选定元数​​据比包含HTML页面中的文本提供了更好的结果。我们还利用数据的语义来比较社交媒体数据集中主题分类的不同类型的外部元数据的有用性。
课程简介: Social media presents unique challenges for topic classification, including the brevity of posts, the informal nature of conversations, and the frequent reliance on external hyperlinks to give context to a conversation. In this paper we investigate the usefulness of these external hyperlinks for determining the topic of individual posts. We focus our analysis on objects which have related metadata available on the Web, either via APIs or as Linked Data. Our experiments show that the inclusion of metadata from hyperlinked objects in addition to the original post content significantly improved classifier performance on two disparate datasets. We found that including selected metadata from APIs and Linked Data gave better results than including text from HTML pages. We also make use of the semantics of the data to compare the usefulness of different types of external metadata for topic classification in a social media dataset.
关 键 词: 社交媒体; 非正式性; 外部超链接
课程来源: 视频讲座网
最后编审: 2020-06-15:heyf
阅读次数: 50