0


不匹配的模型、错误的结果和可怕的决策

Mismatched Models, Wrong Results, and Dreadful Decisions
课程网址: http://videolectures.net/kdd09_hand_mmwrdd/  
主讲教师: David Hand
开课单位: 伦敦帝国学院
开课时间: 2009-09-14
课程语种: 英语
中文简介:
数据挖掘技术使用分数函数来量化模型对给定数据集的拟合程度。参数通过优化拟合来估计,由所选的分数函数来衡量,模型选择由不同模型的分数大小来指导。由于不同的评分函数以不同的方式总结拟合,因此选择一个与数据挖掘练习的目标匹配的函数是很重要的。对于预测分类问题,存在各种各样的评分函数,包括诸如精度和召回率、F测量、误分类率、ROC曲线下面积(AUC)等测量。前四项要求选择分类阈值,这一选择可能并不容易,甚至可能是不可能的,特别是当将来要应用分类规则时。相比之下,AUC不需要指定分类阈值,而是在可能的阈值选择范围内总结性能。然而,不幸的是,尽管AUC被广泛使用,但在其定义的核心上存在着以前未被认识到的基本不一致性。这意味着使用AUC会导致糟糕的模型选择和不必要的错误分类。AUC是在上下文中设定的,解释了其不足之处,并说明了其含义——底线是不应使用AUC。描述了一组相干替代分数。这些想法通过银行贷款、欺诈、人脸识别和健康筛查的例子来说明。
课程简介: Data mining techniques use score functions to quantify how well a model fits a given data set. Parameters are estimated by optimising the fit, as measured by the chosen score function, and model choice is guided by the size of the scores for the different models. Since different score functions summarise the fit in different ways, it is important to choose a function which matches the objectives of the data mining exercise. For predictive classification problems, a wide variety of score functions exist, including measures such as precision and recall, the F measure, misclassification rate, the area under the ROC curve (the AUC), and others. The first four of these require a classification threshold to be chosen, a choice which may not be easy, or may even be impossible, especially when the classification rule is to be applied in the future. In contrast, the AUC does not require the specification of a classification threshold, but summarises performance over the range of possible threshold choices. However, unfortunately, and despite the widespread use of the AUC, it has a previously unrecognised fundamental incoherence lying at the core of its definition. This means that using the AUC can lead to poor model choice and unecessary misclassifications. The AUC is set in context, its deficiency explained and the implications illustrated - with the bottom line being that the AUC should not be used. A family of coherent alternative scores is described. The ideas are illustrated with examples from bank loans, fraud, face recognition, and health screening.
关 键 词: 数据挖掘技术; 量化模型; 相干替代
课程来源: 视频讲座网
数据采集: 2022-12-12:chenjy
最后编审: 2022-12-12:chenjy
阅读次数: 28