0


加强人类和其他动物的学习

Reinforcement Learning in Humans and Other Animals
课程网址: http://videolectures.net/nips2010_daw_rlh/  
主讲教师: Nathaniel Daw
开课单位: 纽约大学
开课时间: 2011-01-12
课程语种: 英语
中文简介:
计算机科学的算法可以作为大脑如何处理困难的信息处理问题的详细过程级假设。本教程回顾了强化学习计算研究中的想法如何被用于生物学中,以概念化大脑的试验和错误决策机制,借鉴神经科学,心理学和行为经济学的证据。我们从时间差异学习和神经调节剂多巴胺之间激烈争论的关系开始,然后考虑RL中更复杂的方法和概念,包括部分可观察性,分层RL,函数逼近和各种基于模型的方法,可以为理解其他问题提供框架。自适应行为的生物学。除了帮助组织和概念化来自许多不同层次的数据之外,计算模型可以在实验数据的分析中更加定量地使用。本教程的第二个目的是再次使用强化学习的例子来回顾和演示使用计算模型分析实验数据的最新方法学进展。 RL算法可以被视为原始的试验模型,通过试验实验数据进行试验,例如受试者的选择或多巴胺能神经元的加标反应;估计模型参数或比较候选模型的问题然后减少到贝叶斯推断中的常见问题。从这个角度来看,神经科学数据的分析已经成熟,可以应用NIPS社区在其他问题领域中充分研究的许多相同类型的推理和机器学习技术。
课程简介: Algorithms from computer science can serve as detailed process-level hypotheses for how the brain might approach difficult information processing problems. This tutorial reviews how ideas from the computational study of reinforcement learning have been used in biology to conceptualize the brain's mechanisms for trial-and-error decision making, drawing on evidence from neuroscience, psychology, and behavioral economics. We begin with the much-debated relationship between temporal-difference learning and the neuromodulator dopamine, and then consider how more sophisticated methods and concepts from RL -- including partial observability, hierarchical RL, function approximation, and various model-based approaches -- can provide frameworks for understanding additional issues in the biology of adaptive behavior. In addition to helping to organize and conceptualize data from many different levels, computational models can be employed more quantitatively in the analysis of experimental data. The second aim of this tutorial is to review and demonstrate, again using the example of reinforcement learning, recent methodological advances in analyzing experimental data using computational models. An RL algorithm can be viewed as generative model for raw, trial-by-trial experimental data such as a subject's choices or a dopaminergic neuron's spiking responses; the problems of estimating model parameters or comparing candidate models then reduce to familiar problems in Bayesian inference. Viewed this way, the analysis of neuroscientific data is ripe for the application of many of the same sorts of inferential and machine learning techniques well studied by the NIPS community in other problem domains.
关 键 词: 计算机科学; 强化学习; 神经调节
课程来源: 视频讲座网
最后编审: 2019-07-25:cwx
阅读次数: 56