ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.13320
  4. Cited By
Combating the Compounding-Error Problem with a Multi-step Model

Combating the Compounding-Error Problem with a Multi-step Model

30 May 2019
Kavosh Asadi
Dipendra Kumar Misra
Seungchan Kim
Michel L. Littman
    LRM
ArXivPDFHTML

Papers citing "Combating the Compounding-Error Problem with a Multi-step Model"

27 / 27 papers shown
Title
Diffusion Model Predictive Control
Diffusion Model Predictive Control
Guangyao Zhou
Sivaramakrishnan Swaminathan
Rajkumar Vasudeva Raju
J. S. Guntupalli
Wolfgang Lehrach
Joseph Ortiz
Antoine Dedieu
Miguel Lázaro-Gredilla
Kevin P. Murphy
68
11
0
07 Oct 2024
DeepMDP: Learning Continuous Latent Space Models for Representation
  Learning
DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Carles Gelada
Saurabh Kumar
Jacob Buckman
Ofir Nachum
Marc G. Bellemare
BDL
78
287
0
06 Jun 2019
Model-based RL in Contextual Decision Processes: PAC bounds and
  Exponential Improvements over Model-free Approaches
Model-based RL in Contextual Decision Processes: PAC bounds and Exponential Improvements over Model-free Approaches
Wen Sun
Nan Jiang
A. Krishnamurthy
Alekh Agarwal
John Langford
OffRL
40
16
0
21 Nov 2018
Model-Based Active Exploration
Model-Based Active Exploration
Pranav Shyam
Wojciech Ja'skowski
Faustino J. Gomez
86
179
0
29 Oct 2018
Algorithmic Framework for Model-based Deep Reinforcement Learning with
  Theoretical Guarantees
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
Yuping Luo
Huazhe Xu
Yuanzhi Li
Yuandong Tian
Trevor Darrell
Tengyu Ma
OffRL
107
226
0
10 Jul 2018
Equivalence Between Wasserstein and Value-Aware Loss for Model-based
  Reinforcement Learning
Equivalence Between Wasserstein and Value-Aware Loss for Model-based Reinforcement Learning
Kavosh Asadi
Evan Cater
Dipendra Kumar Misra
Michael L. Littman
OffRL
62
11
0
01 Jun 2018
Lipschitz Continuity in Model-based Reinforcement Learning
Lipschitz Continuity in Model-based Reinforcement Learning
Kavosh Asadi
Dipendra Kumar Misra
Michael L. Littman
KELM
84
155
0
19 Apr 2018
Model-Ensemble Trust-Region Policy Optimization
Model-Ensemble Trust-Region Policy Optimization
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
84
452
0
28 Feb 2018
Revisiting the Arcade Learning Environment: Evaluation Protocols and
  Open Problems for General Agents
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
81
554
0
18 Sep 2017
Mean Actor Critic
Mean Actor Critic
Cameron Allen
Kavosh Asadi
Melrose Roderick
Abdel-rahman Mohamed
George Konidaris
Michael Littman
61
44
0
01 Sep 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with
  Model-Free Fine-Tuning
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Anusha Nagabandi
G. Kahn
R. Fearing
Sergey Levine
91
974
0
08 Aug 2017
Safe Model-based Reinforcement Learning with Stability Guarantees
Safe Model-based Reinforcement Learning with Stability Guarantees
Felix Berkenkamp
M. Turchetta
Angela P. Schoellig
Andreas Krause
176
852
0
23 May 2017
Learning Multimodal Transition Dynamics for Model-Based Reinforcement
  Learning
Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning
Thomas M. Moerland
Joost Broekens
Catholijn M. Jonker
OffRL
71
31
0
01 May 2017
Multi-step Reinforcement Learning: A Unifying Algorithm
Multi-step Reinforcement Learning: A Unifying Algorithm
Kristopher De Asis
Fernando Hernandez-Garcia
Zach Holland
R. Sutton
37
121
0
03 Mar 2017
The Predictron: End-To-End Learning and Planning
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
71
291
0
28 Dec 2016
Self-Correcting Models for Model-Based Reinforcement Learning
Self-Correcting Models for Model-Based Reinforcement Learning
Erik Talvitie
LRM
68
94
0
19 Dec 2016
A vector-contraction inequality for Rademacher complexities
A vector-contraction inequality for Rademacher complexities
Andreas Maurer
74
260
0
01 May 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
197
8,859
0
04 Feb 2016
Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning
Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning
Christoph Dann
Emma Brunskill
69
249
0
29 Oct 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games
Action-Conditional Video Prediction using Deep Networks in Atari Games
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
103
853
0
31 Jul 2015
Scheduled Sampling for Sequence Prediction with Recurrent Neural
  Networks
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Samy Bengio
Oriol Vinyals
Navdeep Jaitly
Noam M. Shazeer
145
2,034
0
09 Jun 2015
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
127
12,231
0
19 Dec 2013
The Arcade Learning Environment: An Evaluation Platform for General
  Agents
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
117
3,006
0
19 Jul 2012
On the Sample Complexity of Reinforcement Learning with a Generative
  Model
On the Sample Complexity of Reinforcement Learning with a Generative Model
M. G. Azar
Rémi Munos
H. Kappen
71
156
0
27 Jun 2012
Compositional Planning Using Optimal Option Models
Compositional Planning Using Optimal Option Models
David Silver
K. Ciosek
99
74
0
27 Jun 2012
Dyna-Style Planning with Linear Function Approximation and Prioritized
  Sweeping
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
R. Sutton
Csaba Szepesvári
A. Geramifard
Michael Bowling
OffRL
83
204
0
13 Jun 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret
  Online Learning
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
220
3,221
0
02 Nov 2010
1