开课单位--微软印度研究院
1 1/1

1
Thompson Sampling: a provably good Bayesian heuristic for bandit problems[汤普森抽样:一个可证明的强盗问题的良好贝叶斯启发式]
  Shipra Agrawal(微软印度研究院) Multi-armed bandit problem is a basic model for managing the exploration/exploitation trade-off that arises in many situations. Thompson Sampling [Tho...
热度:38
1 1/1