0


应邀演讲:对领域适应学习的理论理解

Invited Talk: Towards Theoretical Understanding of Domain Adaptation Learning
课程网址: http://videolectures.net/ecmlpkdd09_ben_david_ttu/  
主讲教师: Shai Ben-David
开课单位: 滑铁卢大学
开课时间: 2009-10-20
课程语种: 英语
中文简介:
机器学习享有深刻而强大的理论,这导致了各种非常成功的实用工具。然而,这一理论的大部分是在一些简化的假设下发展的,这些假设在现实世界中显然是失败的。特别是,该理论的基本假设是可用于培训的数据和目标应用的数据来自同一来源。当这种假设失败时,学习者将面临“领域适应”挑战。在过去的几年中,机器学习应用程序的范围已经扩展到包括需要域适应的各种任务。这种应用已经通过几种启发式范例来解决。然而,共同的理论模型不能提供对这些技术的有用分析。域适应的关键是训练域和目标域之间的相似性。在本次演讲中,我将讨论可以定义和测量任务相似性的几个参数,并讨论它们在多大程度上可用于指导学习算法并保证其成功。最近的工作可以为一些现有的实用启发式方法提供理论依据,并指导开发用于处理某些类型的数据差异的新算法。但是,我们目前的理解还有很多不足之处。我将把演讲的最后部分用于描述一些挑战和开放式问题,然后才能在存在训练测试差异的情况下对学习提出理解。该演讲基于与John Blitzer,Koby Crammer和Fernando Pereira以及我的学生David Pal,Teresa Luu和Tyler Lu的联合作品。
课程简介: Machine learning enjoys a deep and powerful theory that has led to a wide variety of highly successful practical tools. However, most of this theory is developed under some simplifying assumptions that clearly fail in the real world. In particular, a fundamental assumption of the theory is that the data available for training and the data of the target application come from the same source. When this assumption fails, the learner is faced with a “domain adaptation” challenge. In the past few years, the range of machine learning applications have been expanded to include various tasks requiring domain adaptation. Such application have been addressed by several heuristic paradigms. However, the common theoretical models fall short of providing useful analysis of these techniques. The key to domain adaptation is the similarity between the training and target domains. In this talk I will discuss several parameters along which task similarity can be defined and measured and discuss to what extent can they be utilized to direct learning algorithms and guarantee their success. Recent work can provide theoretical justification to some existing practical heuristics, as well as guide the development of novel algorithms for handling some types of data discrepancies. However, our current understanding leaves much to be desired. I shall devote the last part of the talk to describing some of the challenges and open questions that will have to be addressed before one can claim satisfactory understanding of learning in the presence of training-test discrepancies. The talk is based on joint works with John Blitzer, Koby Crammer and Fernando Pereira and with my students, David Pal, Teresa Luu and Tyler Lu.
关 键 词: 机器学习; 训练测试差异; 域适应
课程来源: 视频讲座网
最后编审: 2019-03-23:lxf
阅读次数: 77