0


数据挖掘中的随机化方法

Randomization Methods in Data Mining
课程网址: http://videolectures.net/kdd09_mannila_rmdm/  
主讲教师: Heikki Mannila
开课单位: 赫尔辛基大学
开课时间: 2009-09-14
课程语种: 英语
中文简介:
数据挖掘研究为大型和复杂数据集的各种分析任务开发了许多算法。然而,评估数据挖掘结果的重要性却很少受到关注。分析方法很少可用,因此必须使用计算密集型方法。基于空模型的随机化方法至少在原则上提供了一种通用方法,可用于获得各种数据挖掘方法的经验p值。我回顾了最近在这方面的一些工作,概述了一些开放的问题和问题。
课程简介: Data mining research has developed many algorithms for various analysis tasks on large and complex datasets. However, assessing the significance of data mining results has received less attention. Analytical methods are rarely available, and hence one has to use computationally intensive methods. Randomization approaches based on null models provide, at least in principle, a general approach that can be used to obtain empirical p-values for various types of data mining approaches. I review some of the recent work in this area, outlining some of the open questions and problems.
关 键 词: 数据挖掘研究; 分析任务; 计算密集型
课程来源: 视频讲座网
最后编审: 2019-12-20:lxf
阅读次数: 57