工业数据挖掘、挑战和前景Industrial data mining, Challenges and perspectives |
|
课程网址: | http://videolectures.net/ecmlpkdd08_soulie_idmc/ |
主讲教师: | Françoise Fogelman Soulié |
开课单位: | KXEN |
开课时间: | 2008-10-10 |
课程语种: | 英语 |
中文简介: | 商业智能在所有工业领域都是非常活跃的行业。主要涉及呈现数据的经典技术(报告和Olap)已经被广泛应用。同时,数据挖掘长期以来一直作为一种利基技术在公司中使用,只为专家和非常具体的问题(例如信用评分、欺诈检测)保留。但随着大数据量(尤其是但不仅仅是来自Web的)的可用性不断增加,公司越来越多地转向数据挖掘,为其提供高附加值的预测分析。然而,在工业环境中大量生产模型,利用大量数据量,只有在找到理论和操作挑战的解决方案时才能实现:当数据集有数千个变量和数百万个观测值时,我们需要可以用于生产模型的算法;我们需要学习如何运行和控制数百个模型的正确执行;我们需要自动化数据挖掘过程的方法。我将在工业背景下介绍这些限制,并展示KXEN如何利用理论结果(来自Vladimir Vapnik的工作)为上述挑战提供答案。我将举几个实际应用的例子,最后将对工业领域数据挖掘的未来发表一些看法。 |
课程简介: | Business Intelligence is a very active sector in all industrial domains. Classical techniques (reporting and Olap), mainly concerned with presenting data, are already widely deployed. Meanwhile, Data Mining has long been used in companies as a niche-technique, reserved for experts only and for very specific problems (credit scoring, fraud detection for example). But with the increasing availability of large data volumes (in particular, but not only, from the Web), companies are more and more turning to data mining to provide them with high added-value predictive analytics. However producing models in large numbers, making use of large data volumes in an industrial context can only happen if solutions to challenges, both theoretic and operational, are found: we need algorithms which can be used to produce models when datasets have thousands of variables and millions of observations; we need to learn how to run and control the correct execution of hundreds of models; we need ways to automate the data mining process. I will present these constraints in industrial contexts and show how KXEN has exploited theoretical results (coming from Vladimir Vapnik's work) to provide answers to the above-mentioned challenges. I will give a few examples of real-life applications and will conclude with some remarks on the future of data mining in the industrial domain. |
关 键 词: | 商业智能; 工业领域; 数据挖掘 |
课程来源: | 视频讲座网 |
数据采集: | 2023-03-06:chenjy |
最后编审: | 2023-03-06:chenjy |
阅读次数: | 24 |