Reinforcement Learning

“How hard is my MDP?” Distribution-norm to the rescue, Odalric-Ambrym Maillard, Timothy A. Mann and Shie Mannor in Proceedings of the 27th conference on advances in Neural Information Processing Systems (NIPS), 2014. Publisher website HaL

Selecting Near-Optimal Approximate State Representations in Reinforcement Learning, R.Ortner, O.-A. Maillard and D. Ryabko, in Proceedings of the International Conference on Algorithmic Learning Theory (ALT), 2014. Publisher website HaL

Competing with an infinite set of models in reinforcement learning, P. Nguyen, O.-A. Maillard, D. Ryabko, and R. Ortner, in Proceedings of the International Conference on Artificial Intelligence and Statistics (AI&STATS), volume 31 of JMLR W&CP , pages 463–471, Arizona, USA, 2013. Publisher website HaL

Optimal regret bounds for selecting the state representation in reinforcement learning, O.-A. Maillard, P. Nguyen, R. Ortner, and D. Ryabko, in Proceedings of the International conference on Machine Learning (ICML), volume 28 of JMLR W&CP, pages 543–551, Atlanta, USA, 2013. Publisher website HaL

Hierarchical optimistic region selection driven by curiosity, O.-A. Maillard, in P. Bartlett, F.C.N. Pereira, C.J.C. Burges, L. Bottou, and K.Q. Weinberger, editors, Proceedings of the conference on advances in Neural Information Processing Systems 25 (NIPS), pages 1457–1465, 2012. Publisher website HaL

Selecting the state-representation in reinforcement learning O.-A. Maillard, D. Ryabko, and R. Munos, in Proceedings of the 24th conference on advances in Neural Information Processing Systems (NIPS), pages 2627–2635, 2011. Publisher website HaL

Finite sample analysis of bellman residual minimization, O.-A. Maillard, R. Munos, A. Lazaric, and M. Ghavamzadeh, in Proceedings of the Asian Conference on Machine Learning (ACML), 2010. Publisher website HaL

LSTD with random projections, M. Ghavamzadeh, A. Lazaric, O.-A. Maillard, and R. Munos, In J. Lafferty, C. K. I. Williams, J. Shawe-Taylor, R.S. Zemel, and A. Culotta, editors, Proceedings of 23rd conference on advances in Neural Information Processing Systems (NIPS) (NIPS), pages 721–729, 2010. Publisher website HaL

Les commentaires sont fermés.