
On the role of planning in model-based deep reinforcement learning

8 November 2020

Jessica B. Hamrick

A. Friesen

Feryal M. P. Behbahani

Papers citing "On the role of planning in model-based deep reinforcement learning"


Bootstrap Off-policy with World Model

505

01 Nov 2025

Path Channels and Plan Extension Kernels: a Mechanistic Description of Planning in a Sokoban RNN

322

11 Jun 2025

Trust-Region Twisted Policy Improvement

579

08 Apr 2025

Extendable Planning via Multiscale Diffusion

541

25 Mar 2025

On-line Policy Improvement using Monte-Carlo Search

Neural Information Processing Systems (NeurIPS), 1996

Gerald Tesauro

Gregory R. Galperin

475

276

09 Jan 2025

Demystifying MuZero Planning: Interpreting the Learned Model

IEEE Transactions on Artificial Intelligence (IEEE TAI), 2024

329

07 Nov 2024

Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning

501

15 Oct 2024

How to Choose a Reinforcement-Learning Algorithm

Julian Rodemann

239

30 Jul 2024

Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

Qian Liu

341

136

13 Jun 2024

Learning to Play Atari in a World of Tokens

Pranav Agarwal

Sheldon Andrews

Samira Ebrahimi Kahou

OffRL

273

03 Jun 2024

Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning

238

22 May 2024

How does the primate brain combine generative and discriminative computations in vision?

Todd Gureckis

...

274

11 Jan 2024

Simple Hierarchical Planning with Diffusion

294

05 Jan 2024

Predictive auxiliary objectives in deep RL mimic learning in the brain

International Conference on Learning Representations (ICLR), 2023

Ching Fang

Kimberly L. Stachenfeld

317

09 Oct 2023

Efficient Planning with Latent Diffusion

International Conference on Learning Representations (ICLR), 2023

Wenhao Li

DiffM

456

30 Sep 2023

Thinker: Learning to Plan and Act

Neural Information Processing Systems (NeurIPS), 2023

353

27 Jul 2023

What model does MuZero learn?

European Conference on Artificial Intelligence (ECAI), 2023

Jinke He

Thomas M. Moerland

F. Oliehoek

370

01 Jun 2023

Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse

Information Sciences (Inf. Sci.), 2023

221

29 May 2023

The Update-Equivalence Framework for Decision-Time Planning

International Conference on Learning Representations (ICLR), 2023

J. Zico Kolter

366

25 Apr 2023

236

09 Feb 2023

Learning Interaction-aware Motion Prediction Model for Decision-making in Autonomous Driving

251

08 Feb 2023

PushWorld: A benchmark for manipulation planning with tools and movable obstacles

Miguel Lazaro-Gredilla

Dileep George

365

24 Jan 2023

Safe Reinforcement Learning using Data-Driven Predictive Control

International Conference on Communications, Signal Processing, and their Applications (ICCSPA), 2022

252

20 Nov 2022

Continuous Monte Carlo Graph Search

Adaptive Agents and Multi-Agent Systems (AAMAS), 2022

984

04 Oct 2022

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective

International Conference on Learning Representations (ICLR), 2022

Homanga Bharadhwaj

413

18 Sep 2022

A model-based approach to meta-Reinforcement Learning: Transformers and tree search

The European Symposium on Artificial Neural Networks (ESANN), 2022

Brieuc Pinon

Jean-Charles Delvenne

Raphaël Jungers

OffRL

234

24 Aug 2022

Efficient Planning in a Compact Latent Action Space

International Conference on Learning Representations (ICLR), 2022

Tianjun Zhang

351

22 Aug 2022

Intelligent problem-solving as integrated hierarchical reinforcement learning

Nature Machine Intelligence (Nat. Mach. Intell.), 2022

298

18 Aug 2022

Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation

IEEE International Conference on Robotics and Automation (ICRA), 2022

266

06 May 2022

Physical Design using Differentiable Learned Simulators

Kelsey R. Allen

Tatiana López-Guevara

Kimberly L. Stachenfeld

Alvaro Sanchez-Gonzalez

285

01 Feb 2022

Inferring perceptual decision making parameters from behavior in production and reproduction tasks

Nils Neupärtl

Constantin Rothkopf

192

31 Dec 2021

Learning Generalizable Behavior via Visual Rewrite Rules

Michael Littman

273

09 Dec 2021

Procedural Generalization by Planning with Self-Supervised World Models

International Conference on Learning Representations (ICLR), 2021

197

02 Nov 2021

Self-Consistent Models and Values

Neural Information Processing Systems (NeurIPS), 2021

David Silver

259

25 Oct 2021

Model-based Reinforcement Learning for Service Mesh Fault Resiliency in a Web Application-level

Applied and Computational Engineering (ACE), 2021

128

21 Oct 2021

Neural Algorithmic Reasoners are Implicit Planners

Neural Information Processing Systems (NeurIPS), 2021

Andreea Deac

Petar Veličković

Ognjen Milinković

Pierre-Luc Bacon

Jian Tang

Mladen Nikolic

OffRL

178

11 Oct 2021

Evaluating model-based planning and planner amortization for continuous control

Alessandro Davide Ialongo

...

Jost Tobias Springenberg

A. Abdolmaleki

N. Heess

J. Merel

Martin Riedmiller

200

07 Oct 2021

Potential-based Reward Shaping in Sokoban

180

10 Sep 2021

Subgoal Search For Complex Reasoning Tasks

Neural Information Processing Systems (NeurIPS), 2021

276

25 Aug 2021

Deep Multiagent Reinforcement Learning: Challenges and Directions

Artificial Intelligence Review (AIR), 2021

Thomas Bäck

313

161

29 Jun 2021

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2021

Sitao Luan

475

03 Jun 2021

Towards Deeper Deep Reinforcement Learning with Spectral Normalization

Neural Information Processing Systems (NeurIPS), 2021

Johan Bjorck

Daniel Schwalbe-Koda

Kilian Q. Weinberger

394

02 Jun 2021

Learning Neuro-Symbolic Relational Transition Models for Bilevel Planning

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021

Tomas Lozano-Perez

387

28 May 2021

Transfer Learning and Curriculum Learning in Sokoban

293

25 May 2021

MBRL-Lib: A Modular Library for Model-based Reinforcement Learning

392

20 Apr 2021

Muesli: Combining Improvements in Policy Optimization

International Conference on Machine Learning (ICML), 2021

Ivo Danihelka

David Silver

281

13 Apr 2021

Planning and Learning Using Adaptive Entropy Tree Search

IEEE International Joint Conference on Neural Networks (IJCNN), 2021

Piotr Kozakowski

Mikolaj Pacek

Piotr Miłoś

218

12 Feb 2021

Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey

Journal of Artificial Intelligence Research (JAIR), 2020

908

125

17 Dec 2020

On the model-based stochastic value gradient for continuous reinforcement learning

Conference on Learning for Dynamics & Control (L4DC), 2020

432

28 Aug 2020

A Unifying Framework for Reinforcement Learning and Planning

544

26 Jun 2020