小数据问题的大数据学习On Big Data Learning for Small Data Problems |
|
课程网址: | http://videolectures.net/kdd2018_teh_big_data_learning/ |
主讲教师: | Yee Whye Teh |
开课单位: | 牛津大学统计系 |
开课时间: | 2018-11-23 |
课程语种: | 英语 |
中文简介: | 机器学习最近取得的许多进展是由可用数据数量和多样性的爆炸性增长以及处理数据所需的计算资源所推动的。这就引出了一个问题:机器学习系统是否需要大量数据才能很好地解决任务。在元学习、终身学习、学会学习、多任务学习等旗帜下,最近的一个令人兴奋的发展是观察到,手边的数据集往往存在异质性,事实上,一个大数据集可以更有效地视为许多小数据集,每个数据集都属于不同的任务。例如,在推荐系统中,每个用户都可以说是一个不同的任务,具有一个小的关联数据集,而在人工智能中,一个圣杯是如何开发能够学会从少量数据中快速解决新任务的系统。在这种情况下,问题是如何通过利用任务之间的相似性来“快速学习”。如何实现这一点的一个观点是,暴露于大量以前的任务允许系统学习关于任务采样的世界的丰富的先验知识,并且系统能够快速解决新任务,这是因为具有丰富的世界知识。这是一个非常活跃、充满活力和多样化的研究领域,最近提出了许多不同的方法。在本次演讲中,我将从概率和深度学习的角度描述这个问题的观点,并描述我最近在这个方向上所做的一些努力。 |
课程简介: | Much recent progress in machine learning have been fuelled by the explosive growth in the amount and diversity of data available, and the computational resources needed to crunch through the data. This begs the question of whether machine learning systems necessarily need large amounts of data to solve a task well. An exciting recent development, under the banners of meta-learning, lifelong learning, learning to learn, multitask learning etc, has been the observation that often there is heterogeneity within the data sets at hand, and in fact a large data set can be viewed more productively as many smaller data sets, each pertaining to a different task. For example, in recommender systems each user can be said to be a different task with a small associated data set, and in AI one holy grail is how to develop systems that can learn to solve new tasks quickly from small amounts of data. In such settings, the problem is then how to “learn to learn quickly”, by making use of similarities among tasks. One perspective for how this is achievable is that exposure to lots of previous tasks allows the system to learn a rich prior knowledge about the world in which tasks are sampled from, and it is with rich world knowledge that the system is able to solve new tasks quickly. This is a very active, vibrant and diverse area of research, with many different approaches proposed recently. In this talk I will describe a view of this problem from probabilistic and deep learning perspectives, and describe a number of efforts in this direction that I have recently been involved in. |
关 键 词: | 机器学习系统; 多任务学习; 人工智能; 小的关联数据集 |
课程来源: | 视频讲座网 |
数据采集: | 2023-01-29:cyh |
最后编审: | 2023-01-30:cyh |
阅读次数: | 18 |