0


将新颖的类检测与概念漂移数据流的分类相结合

Integrating Novel Class Detection with Classification for Concept-Drifting Data Streams
课程网址: http://videolectures.net/ecmlpkdd09_khan_incdccdds/  
主讲教师: Latifur Khan
开课单位: 德克萨斯大学
开课时间: 2009-10-20
课程语种: 英语
中文简介:
在典型的数据流分类任务中,假设类的总数是固定的。这种假设在新的类可能发展的真实流式环境中可能无效。传统的数据流分类技术不能识别新的类实例,直到手动识别新类的出现,并且该类的标记实例被呈现给学习算法用于训练。当基础数据分布随时间变化时,存在概念漂移时问题变得更具挑战性。我们提出了一种新颖有效的技术,通过量化未标记的测试实例之间的内聚力,以及将测试实例与训练实例分离,可以在存在概念漂移的情况下自动检测新类的出现。我们的方法是非参数的,意思是,它不假设任何潜在的数据分布。与现有技术流分类技术的比较证明了我们的方法的优越性。
课程简介: In a typical data stream classification task, it is assumed that the total number of classes are fixed. This assumption may not be valid in a real streaming environment, where new classes may evolve. Traditional data stream classification techniques are not capable of recognizing novel class instances until the appearance of the novel class is manually identified, and labeled instances of that class are presented to the learning algorithm for training. The problem becomes more challenging in the presence of concept-drift, when the underlying data distribution changes over time. We propose a novel and efficient technique that can automatically detect the emergence of a novel class in the presence of concept-drift by quantifying cohesion among unlabeled test instances, and separation of the test instances from training instances. Our approach is non-parametric, meaning, it does not assume any underlying distributions of data. Comparison with the state-of-the-art stream classification techniques prove the superiority of our approach.
关 键 词: 数据流; 学习算法; 概念漂移
课程来源: 视频讲座网
最后编审: 2020-07-31:yumf
阅读次数: 66