算法介绍:
1.课程两节 Tutorial: Introduction to Bandits: Algorithms and Theory
http://techtalks.tv/talks/54451/
http://techtalks.tv/talks/54455/
2.博文介绍 Multi_armed bandit
https://mpatacchiola.github.io/blog/2017/08/14/dissecting-reinforcement-learning-6.html
toolbox:</