Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction

4 July 2021

Gal Dalal

Papers citing "Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction"

23 / 23 papers shown

Title
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation Jongmin Lee Wonseok Jeon Byung-Jun Lee J. Pineau Kee-Eung Kim OffRL 170 99 0 21 Jun 2021
Online and Offline Reinforcement Learning by Planning with a Learned Model Julian Schrittwieser Thomas Hubert Amol Mandhane M. Barekatain Ioannis Antonoglou David Silver OffRL 66 116 0 13 Apr 2021
Learning to Simulate Dynamic Environments with GameGAN Seung Wook Kim Yuhao Zhou Jonah Philion Antonio Torralba Sanja Fidler GAN 68 103 0 25 May 2020
Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning Thomas M. Moerland Anna Deichler S. Baldi Joost Broekens Catholijn M. Jonker OffRL 22 10 0 15 May 2020
Model-Based Reinforcement Learning for Atari Lukasz Kaiser Mohammad Babaeizadeh Piotr Milos B. Osinski R. Campbell ... Sergey Levine Afroz Mohiuddin Ryan Sepassi George Tucker Henryk Michalewski OffRL 129 861 0 01 Mar 2019
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning Jacky Liang Viktor Makoviychuk Ankur Handa N. Chentanez Miles Macklin Dieter Fox AI4CE 67 182 0 12 Oct 2018
Generalization and Regularization in DQN Jesse Farebrother Marlos C. Machado Michael Bowling 87 205 0 29 Sep 2018
How to Combine Tree-Search Methods in Reinforcement Learning Yonathan Efroni Gal Dalal B. Scherrer Shie Mannor 51 32 0 06 Sep 2018
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion Jacob Buckman Danijar Hafner George Tucker E. Brevdo Honglak Lee 91 332 0 04 Jul 2018
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning Vladimir Feinberg Alvin Wan Ion Stoica Michael I. Jordan Joseph E. Gonzalez Sergey Levine OffRL 62 317 0 28 Feb 2018
Addressing Function Approximation Error in Actor-Critic Methods Scott Fujimoto H. V. Hoof David Meger OffRL 172 5,187 0 26 Feb 2018
Beyond the One Step Greedy Approach in Reinforcement Learning Yonathan Efroni Gal Dalal B. Scherrer Shie Mannor OffRL 80 50 0 10 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures L. Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymyr Mnih ... Vlad Firoiu Tim Harley Iain Dunning Shane Legg Koray Kavukcuoglu 215 1,600 0 05 Feb 2018
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm David Silver Thomas Hubert Julian Schrittwieser Ioannis Antonoglou Matthew Lai ... D. Kumaran T. Graepel Timothy Lillicrap Karen Simonyan Demis Hassabis 141 1,775 0 05 Dec 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning Matteo Hessel Joseph Modayil H. V. Hasselt Tom Schaul Georg Ostrovski Will Dabney Dan Horgan Bilal Piot M. G. Azar David Silver OffRL 107 2,265 0 06 Oct 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents Marlos C. Machado Marc G. Bellemare Erik Talvitie J. Veness Matthew J. Hausknecht Michael Bowling 83 554 0 18 Sep 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning Anusha Nagabandi G. Kahn R. Fearing Sergey Levine 91 974 0 08 Aug 2017
Imagination-Augmented Agents for Deep Reinforcement Learning T. Weber S. Racanière David P. Reichert Lars Buesing A. Guez ... Razvan Pascanu Peter W. Battaglia Demis Hassabis David Silver Daan Wierstra LM&Ro 97 557 0 19 Jul 2017
Safe and Efficient Off-Policy Reinforcement Learning Rémi Munos T. Stepleton Anna Harutyunyan Marc G. Bellemare OffRL 138 615 0 08 Jun 2016
Deep Kalman Filters Rahul G. Krishnan Uri Shalit David Sontag BDL AI4TS 70 374 0 16 Nov 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games Junhyuk Oh Xiaoxiao Guo Honglak Lee Richard L. Lewis Satinder Singh 103 853 0 31 Jul 2015
cuDNN: Efficient Primitives for Deep Learning Sharan Chetlur Cliff Woolley Philippe Vandermersch Jonathan M. Cohen J. Tran Bryan Catanzaro Evan Shelhamer 133 1,848 0 03 Oct 2014
Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra Martin Riedmiller 127 12,231 0 19 Dec 2013