Offering institution: Technical University of Crete
Model-Free Reinforcement Learning as Mixture Learning
Nikos Vlassis (Technical University of Crete) We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both t...
Popularity: 122
Binary Action Search for Learning Continuous-Action Control Policies
Jason Pazis (Technical University of Crete) Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces...
Popularity: 34