Learning with Millions of Examples and Dimensions - Competition Proposal
Course URL: http://videolectures.net/eml07_sonnenburg_lme/
Lecturer: Sören Sonnenburg
Institution: Fraunhofer Institute for Intelligent Analysis and Information Systems
Date: 2008-02-01
Language: English
Course description: Over the years, many different classification methods have been proposed in machine learning. However, it is currently very difficult to judge which method is the most efficient with respect to training time, memory requirements, and classification performance, which are the practically relevant criteria. A possible explanation for this difficulty is that methods are often evaluated under different conditions: for instance, different datasets, evaluation criteria, model parameters, and stopping conditions are used. We would therefore like to organize a competition that is designed to be fair and enables a direct comparison of current large-scale classifiers. To this end, we plan to provide a generic evaluation framework tailored to the specifics of the competing methods; for example, for Support Vector Machine classifiers, one would record the objective value of the primal problem in addition to the test error. Providing a wide range of datasets, each with specific properties (extremely sparse, dense, high- or low-dimensional), we propose to evaluate the methods based on the following figures: training time vs. test error, dataset size vs. test error, and dataset size vs. training time. We seek help from the community to gather relevant large-scale real-world datasets and to critically review and discuss fair evaluation criteria, and we invite researchers to co-organize and participate in this challenge.
Keywords: machine learning; large-scale classifiers; support vector machine classifiers
Source: VideoLectures.NET
Last edited: 2019-04-10:lxf
Views: 38