
Online Reinforcement Learning from Concurrent Customer Interaction Sequences
Course URL: http://videolectures.net/onlinelearning2012_silver_reinforcement_...  
Lecturer: David Silver
Institution: University College London
Course date: 2013-05-28
Language: English
Course description: This talk explores applications in which a company interacts with many customers. The company has an objective function, such as maximising revenue, customer satisfaction, or customer loyalty, which depends primarily on the sequence of interactions between company and customer. A key aspect of this setting is that interactions with different customers occur asynchronously and in parallel. As a result, it is imperative to learn online from partial interaction sequences, so that information acquired from one customer is efficiently assimilated and applied in subsequent interactions with other customers. I will present the first framework for reinforcement learning in this setting, using an asynchronous variant of temporal-difference learning to learn efficiently from partial interaction sequences.
Keywords: applications; revenue maximisation; interaction sequences; online learning
Source: VideoLectures.NET
Last reviewed: 2020-10-22:chenxin
Views: 80
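
The course description mentions learning online from partial, interleaved customer interaction sequences with an asynchronous variant of temporal-difference learning. The sketch below is only an illustration of that general idea, not the method presented in the talk: it swaps in plain tabular TD(0) with a single value table shared by all customers, updated each time any customer takes a step. The class name `AsyncTD0`, the `observe` method, the state labels, and the values of `alpha` and `gamma` are all hypothetical.

```python
from collections import defaultdict


class AsyncTD0:
    """Tabular TD(0) with one value table shared across all customers.

    Illustrative assumption: not the algorithm from the talk.
    """

    def __init__(self, alpha=0.1, gamma=0.9):
        self.alpha = alpha                  # step size (assumed value)
        self.gamma = gamma                  # discount factor (assumed value)
        self.values = defaultdict(float)    # shared state-value estimates
        self.last_state = {}                # customer_id -> most recent state

    def observe(self, customer_id, state, reward=0.0, done=False):
        """Process one step of one customer's sequence as it arrives.

        `reward` is the reward received on the transition into `state`;
        `done` marks `state` as terminal for this customer.
        """
        prev = self.last_state.get(customer_id)
        if prev is not None:
            # Bootstrap from the shared estimate; terminal states are worth 0.
            bootstrap = 0.0 if done else self.gamma * self.values[state]
            td_error = reward + bootstrap - self.values[prev]
            self.values[prev] += self.alpha * td_error
        if done:
            self.last_state.pop(customer_id, None)
        else:
            self.last_state[customer_id] = state


# Interleaved events from two hypothetical customers.
learner = AsyncTD0(alpha=0.5, gamma=1.0)
learner.observe("alice", "visit")
learner.observe("bob", "visit")
learner.observe("alice", "purchase", reward=1.0, done=True)
print(dict(learner.values))
```

Because the table is shared and updated per step, what is learned from one customer's completed purchase is already reflected in the estimate for "visit" while the other customer's sequence is still in progress.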