0


学习表述的深层次体系

Learning Deep Hierarchies of Representations
课程网址: http://videolectures.net/okt09_bengio_ldhr/  
主讲教师: Yoshua Bengio; Samy Bengio
开课单位: 蒙特利尔大学
开课时间: 2009-10-07
课程语种: 英语
中文简介:
虽然理论工作表明,深层架构在表示高度变化的功能方面可能在计算和统计上更有效,但在最近基于分层结构模型的每个级别的无监督预培训的算法出现之前,深层架构培训是不成功的。为此,本文提出了几种无监督准则和程序,从限制玻尔兹曼机器(RBM)开始,当叠加后产生深信网络(DBN)。虽然RBMS的配分函数是难以处理的,但是推理是可处理的,我们回顾了几种成功的学习算法,特别是那些在学习过程中使用权值变化很快而不是收敛的算法。基于RBMS和其他无监督学习方法的深层结构除了作为生成模型令人印象深刻外,还通过用于初始化深层监督神经网络而产生了影响。尽管这些新的算法能够训练深入的模型,但关于这一困难学习问题的本质仍然存在许多问题。我们试图通过比较不同的成功方法来训练深层架构,并通过广泛的模拟来研究解释性假设,从而揭示这些问题。最后,我们描述了我们当前的研究计划、目标和挑战,关于在多个抽象层次上学习表示,比较网络对象,如图像、文档和搜索引擎请求,比较是几个信息检索应用程序的核心。
课程简介: Whereas theoretical work suggests that deep architectures might be computationally and statistically more efficient at representing highly-varying functions, training deep architectures was unsuccessful until the recent advent of algorithms based on unsupervised pre-training of each level of a hierarchically structured model. Several unsupervised criteria and procedures were proposed for this purpose, starting with the Restricted Boltzmann Machine (RBM), which when stacked gives rise to Deep Belief Networks (DBN). Although the partition function of RBMs is intractable, inference is tractable and we review several successful learning algorithms that have been proposed, in particular those using weights that change quickly during learning instead of converging. In addition to being impressive as generative models, deep architectures based on RBMs and other unsupervised learning methods have made an impact by being used to initialize deep supervised neural networks. Even though these new algorithms have enabled training deep models, many questions remain as to the nature of this difficult learning problem. We attempt to shed some light on these questions by comparing different successful approaches to training deep architectures and through extensive simulations investigating explanatory hypotheses. Finally, we describe our current research program, objectives and challenges, regarding learning representations at multiple levels of abstraction, to compare web objects such as images, documents, and search engine requests, comparisons that are at the core of several information retrieval applications.
关 键 词: 深层架构; 层次结构模型; 模拟调查; 解释性假设
课程来源: 视频讲座网
最后编审: 2020-06-02:张荧(课程编辑志愿者)
阅读次数: 22