0


语义数据挖掘

Semantic Data Mining
课程网址: http://videolectures.net/ecmlpkdd2011_lavrac_vavpetic_mining/  
主讲教师: Alexandros Kalousis; Jędrzej Potoniec
开课单位: 日内瓦大学;波兰波兹南工业大学
开课时间: 2011-11-29
课程语种: 英语
中文简介:
术语语义数据挖掘表示一种数据挖掘方法,其中领域本体被用作背景知识。这种方法的动机是大量的数据,这些数据越来越公开,并使用以语义网语言表示的真实本体进行描述,可以说是生物学领域中最广泛的。这最近为有趣的大规模和真实世界的语义应用开辟了可能性。 语义注释数据的可用性对新类型的数据挖掘方法提出了要求,这些方法将能够处理语义表示语言的复杂性和表达性,利用所描述资源的本体和显式语义的可用性,并解释了作为利用本体的推理服务的基础的新颖假设(例如开放世界)。 本教程解决了上述问题,重点讨论了机器学习技术如何直接在结构丰富的语义网数据上工作、利用本体和语义网技术、利用本体的机器学习方法的附加值是什么以及语义数据挖掘方法开发人员面临的挑战。它还包含支持语义数据挖掘的工具演示。 本教程从三个互补的角度介绍了语义数据挖掘的主题。 首先,在[NVTL09]工作的基础上,提出了语义数据挖掘的通用框架。本教程的第一部分还讨论了一种新的语义子群发现方法:g-SEGS。它还附带了开发工具的演示,该工具是Orange4WS环境的一部分。 本教程的第二部分涵盖了从描述逻辑学习(DL学习)的主题,其动机是标准Web本体语言OWL在理论上是基于描述逻辑的。其中包括一个支持DL学习的工具演示(Rapid Miner系统的插件)。 最后,本教程的第三部分介绍了语义元挖掘的主题。这种方法有三个特点,使其与以前的方法不同。首先,与之前的工作相比,它采用了一种面向过程的方法,其中元学习被应用于支持整个数据挖掘过程或工作流的不同阶段的设计选择。其次,它通过深入分析和表征算法——它们的基本假设、优化目标和策略,以及它们生成的模型和模式,来补充数据集描述。最后,它依赖于一个数据挖掘本体,该本体提取了与知识发现本身有关的大量背景知识。
课程简介: The term semantic data mining denotes a data mining approach where domain ontologies are used as background knowledge. Such approach is motivated by large amounts of data that are increasingly becoming openly available and described using real-life ontologies represented in Semantic Web languages, arguably most extensively in the domain of biology. This recently opened up the possibility for interesting large-scale and real-world semantic applications. The availability of semantically annotated data poses requirements for new kinds of approaches for data mining that would be able to deal with the complexity, and expressivity of the semantic representation languages, leverage on availability of ontologies and explicit semantics of the described resources, and account for novel assumptions (e.g., open world) that underlie reasoning services exploiting ontologies. The tutorial addresses the above issues, focusing on the problems of how machine learning techniques can work directly on the richly structured Semantic Web data, exploit ontologies, and the Semantic Web technologies, what is the value added of machine learning methods exploiting ontologies, and what are the challenges for developers of semantic data mining methods. It also contains demonstrations of tools supporting semantic data mining. The tutorial addresses the above issues, focusing on the problems of how machine learning techniques can work directly on the richly structured Semantic Web data, exploit ontologies, and the Semantic Web technologies, what is the value added of machine learning methods exploiting ontologies, and what are the challenges for developers of semantic data mining methods. It also contains demonstrations of tools supporting semantic data mining. The tutorial presents the topic of semantic data mining from three complementary perspectives. Firstly, it presents a general framework for semantic data mining, following the work [NVTL09]. The first part of the tutorial also discusses a new method for semantic subgroup discovery: g-SEGS. It is accompanied with a presentation of the developed tool, a part of Orange4WS environment. The second part of tutorial covers the topic of learning from description logics (DL-learning), motivated by the fact that the standard Web ontology language, OWL, is theoretically based on description logics. This includes a demo of a tool supporting DL-learning (a plugin to the Rapid Miner system). Finally, the third part of the tutorial covers the topic of semantic meta-mining. This approach has three features that distinguish it from its predecessors. First, more than in previous work, it adopts a process-oriented approach where meta-learning is applied to support design choices at different stages of the complete data mining process or workflow. Second, it complements dataset descriptions with an in-depth analysis and characterization of algorithms—their underlying assumptions, optimization goals and strategies, the models and patterns they generate. Finally, it relies on a data mining ontology which distills extensive background knowledge concerning knowledge discovery itself.
关 键 词: 术语语义; 数据挖掘; 本体领域; 背景知识
课程来源: 视频讲座网
数据采集: 2023-05-24:chenxin01
最后编审: 2023-05-24:chenxin01
阅读次数: 16