Combating the Compounding-Error Problem with a Multi-step Model

30 May 2019

Michel L. Littman

Papers citing "Combating the Compounding-Error Problem with a Multi-step Model"

27 / 27 papers shown

Title
Diffusion Model Predictive Control Guangyao Zhou Sivaramakrishnan Swaminathan Rajkumar Vasudeva Raju J. S. Guntupalli Wolfgang Lehrach Joseph Ortiz Antoine Dedieu Miguel Lázaro-Gredilla Kevin P. Murphy 71 11 0 07 Oct 2024
DeepMDP: Learning Continuous Latent Space Models for Representation Learning Carles Gelada Saurabh Kumar Jacob Buckman Ofir Nachum Marc G. Bellemare BDL 81 287 0 06 Jun 2019
Model-based RL in Contextual Decision Processes: PAC bounds and Exponential Improvements over Model-free Approaches Wen Sun Nan Jiang A. Krishnamurthy Alekh Agarwal John Langford OffRL 40 16 0 21 Nov 2018
Model-Based Active Exploration Pranav Shyam Wojciech Ja'skowski Faustino J. Gomez 86 179 0 29 Oct 2018
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees Yuping Luo Huazhe Xu Yuanzhi Li Yuandong Tian Trevor Darrell Tengyu Ma OffRL 107 226 0 10 Jul 2018
Equivalence Between Wasserstein and Value-Aware Loss for Model-based Reinforcement Learning Kavosh Asadi Evan Cater Dipendra Kumar Misra Michael L. Littman OffRL 62 11 0 01 Jun 2018
Lipschitz Continuity in Model-based Reinforcement Learning Kavosh Asadi Dipendra Kumar Misra Michael L. Littman KELM 86 155 0 19 Apr 2018
Model-Ensemble Trust-Region Policy Optimization Thanard Kurutach I. Clavera Yan Duan Aviv Tamar Pieter Abbeel 84 452 0 28 Feb 2018
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents Marlos C. Machado Marc G. Bellemare Erik Talvitie J. Veness Matthew J. Hausknecht Michael Bowling 83 554 0 18 Sep 2017
Mean Actor Critic Cameron Allen Kavosh Asadi Melrose Roderick Abdel-rahman Mohamed George Konidaris Michael Littman 61 44 0 01 Sep 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning Anusha Nagabandi G. Kahn R. Fearing Sergey Levine 91 974 0 08 Aug 2017
Safe Model-based Reinforcement Learning with Stability Guarantees Felix Berkenkamp M. Turchetta Angela P. Schoellig Andreas Krause 176 852 0 23 May 2017
Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning Thomas M. Moerland Joost Broekens Catholijn M. Jonker OffRL 71 31 0 01 May 2017
Multi-step Reinforcement Learning: A Unifying Algorithm Kristopher De Asis Fernando Hernandez-Garcia Zach Holland R. Sutton 40 121 0 03 Mar 2017
The Predictron: End-To-End Learning and Planning David Silver H. V. Hasselt Matteo Hessel Tom Schaul A. Guez ... Gabriel Dulac-Arnold David P. Reichert Neil C. Rabinowitz André Barreto T. Degris 71 291 0 28 Dec 2016
Self-Correcting Models for Model-Based Reinforcement Learning Erik Talvitie LRM 68 94 0 19 Dec 2016
A vector-contraction inequality for Rademacher complexities Andreas Maurer 74 260 0 01 May 2016
Asynchronous Methods for Deep Reinforcement Learning Volodymyr Mnih Adria Puigdomenech Badia M. Berk Mirza Alex Graves Timothy Lillicrap Tim Harley David Silver Koray Kavukcuoglu 197 8,859 0 04 Feb 2016
Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning Christoph Dann Emma Brunskill 69 249 0 29 Oct 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games Junhyuk Oh Xiaoxiao Guo Honglak Lee Richard L. Lewis Satinder Singh 103 853 0 31 Jul 2015
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks Samy Bengio Oriol Vinyals Navdeep Jaitly Noam M. Shazeer 145 2,034 0 09 Jun 2015
Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra Martin Riedmiller 127 12,231 0 19 Dec 2013
The Arcade Learning Environment: An Evaluation Platform for General Agents Marc G. Bellemare Yavar Naddaf J. Veness Michael Bowling 117 3,006 0 19 Jul 2012
On the Sample Complexity of Reinforcement Learning with a Generative Model M. G. Azar Rémi Munos H. Kappen 71 156 0 27 Jun 2012
Compositional Planning Using Optimal Option Models David Silver K. Ciosek 99 74 0 27 Jun 2012
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping R. Sutton Csaba Szepesvári A. Geramifard Michael Bowling OffRL 83 204 0 13 Jun 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning Stéphane Ross Geoffrey J. Gordon J. Andrew Bagnell OffRL 220 3,221 0 02 Nov 2010