0


通过光谱正则化选择不同的特征

Selecting Diverse Features via Spectral Regularization
课程网址: http://videolectures.net/machine_das_spectral/  
主讲教师: Abhimanyu Das
开课单位: 微软公司
开课时间: 2013-06-15
课程语种: 英语
中文简介:
我们研究了线性回归中不同特征选择的问题:选择一小部分可以预测给定目标的不同特征。由于诸如可解释性,对噪声的鲁棒性等几个原因,多样性是有用的。我们提出了几个频谱正则化器,它们捕获特征多样性的概念,并表明这些都是子模块集函数。当这些正则化器被添加到线性回归的目标函数时,会产生近似的子模块函数,然后可以通过有效的贪婪和局部搜索算法来近似最大化,并具有可证明的保证。我们将我们的算法与传统的贪婪和ℓ1正则化方案进行比较,并表明我们获得了更多样化的特征集,这些特征导致回归问题在扰动下是稳定的。
课程简介: We study the problem of diverse feature selection in linear regression: selecting a small subset of diverse features that can predict a given objective. Diversity is useful for several reasons such as interpretability, robustness to noise, etc. We propose several spectral regularizers that capture a notion of diversity of features and show that these are all submodular set functions. These regularizers, when added to the objective function for linear regression, result in approximately submodular functions, which can then be maximized approximately by efficient greedy and local search algorithms, with provable guarantees. We compare our algorithms to traditional greedy and ℓ1-regularization schemes and show that we obtain a more diverse set of features that result in the regression problem being stable under perturbations.
关 键 词: 线性回归; 特征选择; 可解释性
课程来源: 视频讲座网
最后编审: 2020-04-23:chenxin
阅读次数: 48