开课单位--麦吉尔大学
<<<12 2/2

11
Direction Fields[方向场]
  David Shirokoff(麦吉尔大学)
热度:34

12
Probabilistic Decision-Making Under Model Uncertainty[不确定性条件下模型的概率决策]
  Joelle Pineau(麦吉尔大学) Partially Observable Markov Decision Processes offer a rich mathematical framework for decision-making under uncertainty. In recent years, a number of...
热度:38

13
Temporal Motifs Reveal the Dynamics of Editor Interactions in Wikipedia[时间主题揭示维基百科编辑互动的动态]
  David Jurgens(麦吉尔大学) Wikipedia is a collaborative setting with both combative and cooperative editing. We propose a new method for investigating the types of editor intera...
热度:67

14
Reinforcement Learning in the Presence of Rare Events[在罕见事件的存在下强化学习]
  Jordan Frank(麦吉尔大学) We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the c...
热度:43

17
Active Learning in Network Monitoring[网络监控中的主动学习]
  Mark Coates(麦吉尔大学) In several latency in obtaining information about the state of the network. The adoption of active learning techniques can result in a dramatic reduct...
热度:38

18
Piecewise-Stationary Bandit Problems with Side Observations[具有侧面观测的分段 - 平稳强盗问题]
  Jia Yuan Yu(麦吉尔大学) We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distri...
热度:31

19
Piecewise-Stationary Bandit Problems with Side Information[边信息的分段 - 平稳强盗问题]
  Jia Yuan Yu(麦吉尔大学) We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distri...
热度:28

20
Welcome to the Multidisciplinary Symposium on Reinforcement Learning[欢迎参加关于加强学习的多学科研讨会]
  Doina Precup(麦吉尔大学) In the last 25 years, reinforcement learning research has made great strides and has had a significant impact within several fields, including * ...
热度:28
<<<12 2/2