

TRUMIT: A Tool to Support Large-Scale Mining of Text Association Rules
Course URL: http://videolectures.net/ecmlpkdd2011_tsatsaronis_mining/
Lecturer: George Tsatsaronis
Institution: Norwegian University of Science and Technology
Date: 2011-10-03
Language: English
Course description: Because of the nature of textual data, the application of association rule mining to text corpora has attracted the attention of the research community for years. In this paper we demonstrate a system that can efficiently mine association rules from text. The system annotates terms using several annotators and extracts text association rules between terms or categories of terms. An additional contribution of this work is the inclusion of novel unsupervised evaluation measures for weighting and ranking the importance of the text rules. We demonstrate the functionality of our system with two text collections: a set of Wikileaks documents and one from TREC-7.
Keywords: text data; efficient mining; text collections
Source: VideoLectures.NET
Last reviewed: 2019-04-07:cwx
Views: 64
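
As a rough illustration of the text association rules mentioned in the course description, the sketch below mines term-to-term rules from a tiny, made-up corpus using the standard support and confidence measures. It is not the TRUMIT implementation: the function name `mine_term_rules`, the thresholds, and the sample documents are assumptions made for this example, and the annotators and novel unsupervised ranking measures described in the talk are not reproduced here.

```python
from collections import Counter
from itertools import combinations


def mine_term_rules(docs, min_support=0.5, min_confidence=0.6):
    """Return (antecedent, consequent, support, confidence) tuples for term pairs.

    Hypothetical helper for illustration only; each document is treated as a
    set of lowercase whitespace-separated terms.
    """
    n = len(docs)
    term_sets = [set(d.lower().split()) for d in docs]

    # Document frequency of each term and of each unordered term pair.
    term_count = Counter(t for s in term_sets for t in s)
    pair_count = Counter(
        pair for s in term_sets for pair in combinations(sorted(s), 2)
    )

    rules = []
    for (a, b), c in pair_count.items():
        support = c / n  # fraction of documents containing both terms
        if support < min_support:
            continue
        # Each frequent pair yields two candidate rules: a -> b and b -> a.
        for x, y in ((a, b), (b, a)):
            confidence = c / term_count[x]  # estimate of P(y | x)
            if confidence >= min_confidence:
                rules.append((x, y, support, confidence))

    # Rank by confidence, then support, then alphabetically for stable output.
    return sorted(rules, key=lambda r: (-r[3], -r[2], r[0], r[1]))


# Toy documents invented for this example (not taken from Wikileaks or TREC-7).
docs = [
    "embassy cable visa",
    "embassy cable trade",
    "embassy visa",
    "trade tariff",
]
rules = mine_term_rules(docs)
for antecedent, consequent, support, confidence in rules:
    print(f"{antecedent} -> {consequent}  "
          f"support={support:.2f} confidence={confidence:.2f}")
```

Confidence is asymmetric, so a pair such as "cable" and "embassy" can produce a strong rule in one direction and a weaker one in the other. A system like the one described would presumably replace the raw whitespace tokens with annotated terms or term categories before counting.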