//header("Content-type: application/xhtml+xml"); /* echo "\n"; */ /*echo "\n"; */ ?>
The Red Bible N. Cesa-Bianchi, and G. Lugosi, Prediction, Learning, and Games. Cambridge University Press, New York, 2006. ISBN 0521841089. Table of Contents| Errata | Amazon |
|
|
|
Rémi Munos The optimistic principle applied to games, optimization and planning: Towards foundations of Monte-Carlo Tree Search [Submitted to Foundations and Trends in Machine Learning] |
|
Jean-Yves Audibert, Rémi Munos ICML 2010 Tutorial on bandits [video] |
Nicolò Cesa-Bianchi Bandit Algorithms for Online Linear Optimization |
|
Gabor Lugosi Adversarial bandit problems: the power of randomization |
|
Kamalika Chaudhuri (UCSD) CSE291W11 |
|
Andreas Krause (Caltech) Caltech CS 101.2 |
|
Kamesh Munagala (Duke) CPS296.04 Sequential Decision Theory: Algorithms, Policies, and Games |