Offering institution: Eötvös Loránd University (洛夫大学)

1. The Many Faces of Optimism: a Unifying Approach
  Istvan Szita (Eötvös Loránd University). The exploration-exploitation dilemma has been an intriguing and unsolved problem within the framework of reinforcement learning. Optimism in the face ...
Popularity: 49