0


成本敏感不确定性抽样的自训练方法

A Self-Training Approach to Cost Sensitive Uncertainty Sampling
课程网址: http://videolectures.net/ecmlpkdd09_ghosh_asta/  
主讲教师: Joydeep Ghosh
开课单位: 德克萨斯大学
开课时间: 2009-10-20
课程语种: 英语
中文简介:
不确定性抽样是一种有效的主动学习方法,与其他主动学习方法(如损失减少法)相比,它具有计算效率高的特点。然而,与损耗减少法不同,不确定性抽样法不能在误差产生不同成本时,将总误分类成本最小化。本文介绍了一种利用自训练进行成本敏感不确定度抽样的方法。我们表明,即使错误分类成本相等,与标准不确定性抽样相比,这种自我训练方法也能更快地减少损失,因为标记点的数量和更可靠的后验概率估计。我们还说明了为什么其他更幼稚的方法,修改不确定性抽样,以尽量减少总误分类成本,将不会总是很好地工作。
课程简介: Uncertainty sampling is an effective method for performing active learning that is computationally efficient compared to other active learning methods such as loss-reduction methods. However, unlike lossreduction methods, uncertainty sampling cannot minimize total misclassification costs when errors incur different costs. This paper introduces a method for performing cost-sensitive uncertainty sampling that makes use of self-training. We show that, even when misclassification costs are equal, this self-training approach results in faster reduction of loss as a function of number of points labeled and more reliable posterior probability estimates as compared to standard uncertainty sampling. We also show why other more naive methods of modifying uncertainty sampling to minimize total misclassification costs will not always work well.
关 键 词: 不确定性采样; 误判成本; 后验概率估计
课程来源: 视频讲座网
最后编审: 2020-06-08:yumf
阅读次数: 136