0


关于大量量词的机会准确性

On the Chance Accuracies of Large Collections of Classifiers
课程网址: http://videolectures.net/icml08_palatucci_oca/  
主讲教师: Mark Palatucci
开课单位: 卡内基梅隆大学
开课时间: 2008-08-29
课程语种: 英语
中文简介:
我们提供了大量分类器的机会准确性的理论分析。我们证明了在少数例子的问题上,一些分类器可以通过随机机会很好地执行,并且我们推导出一个定理来明确地计算这个精度。我们使用该定理为稀疏的高维问题提供原则特征选择标准。我们在微阵列和fMRI数据集上评估这种方法,并表明它的表现非常接近从oracle获得的最佳准确度。我们还表明,在fMRI数据集上,该技术成功地选择了相关特征,而另一种最先进的方法,即虚假发现率(FDR),在标准显着性水平下完全失败。
课程简介: We provide a theoretical analysis of the chance accuracies of large collections of classifiers. We show that on problems with small numbers of examples, some classifier can perform well by random chance, and we derive a theorem to explicitly calculate this accuracy. We use this theorem to provide a principled feature selection criteria for sparse, high-dimensional problems. We evaluate this method on both microarray and fMRI datasets and show that it performs very close to the optimal accuracy obtained from an oracle. We also show that on the fMRI dataset this technique chooses relevant features successfully while another state-of-the-art method, the False Discovery Rate (FDR), completely fails at standard significance levels.
关 键 词: 分类器; 高维问题; 微阵列
课程来源: 视频讲座网
最后编审: 2020-06-22:chenxin
阅读次数: 57