0


神经程序解释器

Neural Programmer-Interpreters
课程网址: http://videolectures.net/iclr2016_reed_neural_programmer/  
主讲教师: Scott Reed
开课单位: 视频讲座网
开课时间: 2016-05-27
课程语种: 英语
中文简介:
我们提出了神经编程解释器(NPI):一种学习表示和执行程序的循环和合成神经网络。NPI有三个可学习的组件:一个任务无关的循环核心,一个持久的键值程序存储器,以及特定于领域的编码器,使单个NPI能够在多个感知上不同的具有不同功能支持的环境中运行。通过学习组合较低级别的程序来表达较高级别的程序,NPI降低了样本的复杂性,并与序列到序列的lstm相比增加了泛化能力。程序存储器允许在现有程序的基础上高效地学习附加任务。NPI还可以利用环境(例如一个带有读写指针的便笺板)来缓存中间的计算结果,减少循环隐藏单元的长期内存负担。在这项工作中,我们用完全监督的执行轨迹来训练NPI;每个程序都有以输入为条件调用直接子程序的示例序列。NPI不是在大量相对较弱的标签上进行训练,而是从少量丰富的示例中学习。我们演示了我们的模型学习几种类型的组合程序的能力:添加、排序和规范化3D模型。此外,单个新pi学习执行这些程序和所有21个相关的子程序。
课程简介: We propose the neural programmer-interpreter (NPI): a recurrent and compositional neural network that learns to represent and execute programs. NPI has three learnable components: a task-agnostic recurrent core, a persistent key-value program memory, and domain-specific encoders that enable a single NPI to operate in multiple perceptually diverse environments with distinct affordances. By learning to compose lower-level programs to express higher-level programs, NPI reduces sample complexity and increases generalization ability compared to sequence-to-sequence LSTMs. The program memory allows efficient learning of additional tasks by building on existing programs. NPI can also harness the environment (e.g. a scratch pad with read-write pointers) to cache intermediate results of computation, lessening the long-term memory burden on recurrent hidden units. In this work we train the NPI with fully-supervised execution traces; each program has example sequences of calls to the immediate subprograms conditioned on the input. Rather than training on a huge number of relatively weak labels, NPI learns from a small number of rich examples. We demonstrate the capability of our model to learn several types of compositional programs: addition, sorting, and canonicalizing 3D models. Furthermore, a single NPI learns to execute these programs and all 21 associated subprograms.
关 键 词: 神经网络; 循环核心; 程序存储
课程来源: 视频讲座网
数据采集: 2022-12-02:chenxin01
最后编审: 2022-12-02:chenxin01
阅读次数: 38