ResearchTrend.AI
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning

12 June 2019
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
    OffRL

Papers citing "Search on the Replay Buffer: Bridging Planning and Reinforcement Learning"

50 / 173 papers shown
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
M. Torné
Max Balsells
Zihan Wang
Samedh Desai
Tao Chen
Pulkit Agrawal
Abhishek Gupta
26
8
0
20 Jul 2023
Goal-Conditioned Reinforcement Learning with Disentanglement-based Reachability Planning
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Bin He
26
3
0
20 Jul 2023
Maximizing Seaweed Growth on Autonomous Farms: A Dynamic Programming Approach for Underactuated Systems Navigating on Uncertain Ocean Currents
Matthias Killer
Marius Wiggert
Hanna Krasowski
M. Doshi
Pierre FJ Lermusiaux
Claire Tomlin
18
2
0
04 Jul 2023
Reinforcement Learning in Robotic Motion Planning by Combined Experience-based Planning and Self-Imitation Learning
Sha Luo
Lambert Schomaker
19
9
0
11 Jun 2023
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
Chongyi Zheng
Benjamin Eysenbach
Homer Walke
Patrick Yin
Kuan Fang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
39
4
0
06 Jun 2023
Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving
Xueliang Zhao
Wenda Li
Lingpeng Kong
35
28
0
25 May 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
12
0
26 Apr 2023
Errors are Useful Prompts: Instruction Guided Task Programming with Verifier-Assisted Iterative Prompting
Marta Skreta
Naruki Yoshikawa
Sebastian Arellano-Rubach
Zhi Ji
L. B. Kristensen
Kourosh Darvish
Alán Aspuru-Guzik
Florian Shkurti
Animesh Garg
78
56
0
24 Mar 2023
Neural Constraint Satisfaction: Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement
Michael Chang
Alyssa Dayan
Franziska Meier
Thomas L. Griffiths
Sergey Levine
Amy Zhang
OCL
OffRL
27
9
0
20 Mar 2023
Imitating Graph-Based Planning with Goal-Conditioned Policies
Junsup Kim
Younggyo Seo
Sungsoo Ahn
Kyunghwan Son
Jinwoo Shin
26
9
0
20 Mar 2023
Efficient Learning of High Level Plans from Play
Núria Armengol Urpí
Marco Bagatella
Otmar Hilliges
Georg Martius
Stelian Coros
OffRL
21
3
0
16 Mar 2023
One-4-All: Neural Potential Fields for Embodied Navigation
Sacha Morin
Miguel A. Saavedra-Ruiz
Liam Paull
29
5
0
07 Mar 2023
Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents
Wenlong Huang
Fei Xia
Dhruv Shah
Danny Driess
Andy Zeng
...
Pete Florence
Igor Mordatch
Sergey Levine
Karol Hausman
Brian Ichter
LM&Ro
27
42
0
01 Mar 2023
Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning
Yu Lu
Arunesh Sinha
Pradeep Varakantham
24
0
0
21 Feb 2023
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
37
4
0
17 Feb 2023
Graph schemas as abstractions for transfer learning, inference, and planning
J. S. Guntupalli
Rajkumar Vasudeva Raju
Shrinu Kushagra
Carter Wendelken
Daniel P. Sawyer
Ishani Deshpande
Guangyao Zhou
Miguel Lazaro-Gredilla
Dileep George
37
9
0
14 Feb 2023
GePA*SE: Generalized Edge-Based Parallel A* for Slow Evaluations
Shohin Mukherjee
Maxim Likhachev
21
2
0
24 Jan 2023
Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results
Sergey Levine
Dhruv Shah
SSL
37
21
0
13 Dec 2022
PALMER: Perception-Action Loop with Memory for Long-Horizon Planning
Onur Beker
Mohammad Mohammadi
Amir Zamir
34
2
0
08 Dec 2022
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance
C. Chang
Ni Mu
Jiajun Wu
Ling Pan
Huazhe Xu
50
7
0
05 Dec 2022
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson
A. H. Qureshi
24
7
0
11 Nov 2022
Emergency action termination for immediate reaction in hierarchical reinforcement learning
Michal Bortkiewicz
Jakub Lyskawa
Pawel Wawrzyński
M. Ostaszewski
Artur Grudkowski
Tomasz Trzciński
21
0
0
11 Nov 2022
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
17
4
0
09 Nov 2022
Reducing Collision Checking for Sampling-Based Motion Planning Using Graph Neural Networks
Chen-Ping Yu
Sicun Gao
34
47
0
17 Oct 2022
Generalization with Lossy Affordances: Leveraging Broad Offline Data for Learning Visuomotor Tasks
Kuan Fang
Patrick Yin
Ashvin Nair
Homer Walke
Ge Yan
Sergey Levine
OffRL
31
22
0
12 Oct 2022
DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning
Seungjae Lee
Jigang Kim
Inkyu Jang
H. J. Kim
OffRL
27
10
0
11 Oct 2022
Generating Executable Action Plans with Environmentally-Aware Language Models
Maitrey Gramopadhye
D. Szafir
LM&Ro
LLMAG
20
22
0
10 Oct 2022
Multi-Task Option Learning and Discovery for Stochastic Path Planning
Naman Shah
Siddharth Srivastava
19
2
0
30 Sep 2022
Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning
Firas Jarboui
Ahmed Akakzia
16
0
0
26 Sep 2022
ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh
Valts Blukis
Arsalan Mousavian
Ankit Goyal
Danfei Xu
Jonathan Tremblay
D. Fox
Jesse Thomason
Animesh Garg
LM&Ro
LLMAG
120
624
0
22 Sep 2022
Cell-Free Latent Go-Explore
Quentin Gallouedec
Emmanuel Dellandréa
12
1
0
31 Aug 2022
Inner Monologue: Embodied Reasoning through Planning with Language Models
Wenlong Huang
F. Xia
Ted Xiao
Harris Chan
Jacky Liang
...
Tomas Jackson
Linda Luu
Sergey Levine
Karol Hausman
Brian Ichter
LLMAG
LM&Ro
LRM
41
856
0
12 Jul 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
50
101
0
19 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
28
139
0
15 Jun 2022
State Supervised Steering Function for Sampling-based Kinodynamic Planning
P. Atreya
Joydeep Biswas
LLMSV
8
7
0
15 Jun 2022
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning
Deyao Zhu
Erran L. Li
Mohamed Elhoseiny
OffRL
34
8
0
09 Jun 2022
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
53
57
0
24 May 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
30
0
0
20 May 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
55
29
0
17 May 2022
Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios
Ugo Lecerf
Christelle Yemdji Tchassi
S. Aubert
Pietro Michiardi
21
0
0
11 Apr 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng-Tao Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
45
1,845
0
04 Apr 2022
Topological Experience Replay
Zhang-Wei Hong
Tao Chen
Yen-Chen Lin
Joni Pajarinen
Pulkit Agrawal
16
16
0
29 Mar 2022
Demonstration-Bootstrapped Autonomous Practicing via Multi-Task Reinforcement Learning
Abhishek Gupta
Corey Lynch
Brandon Kinman
Garrett Peake
Sergey Levine
Karol Hausman
OffRL
19
17
0
29 Mar 2022
Continual Auxiliary Task Learning
Matt McLeod
Chun-Ping Lo
M. Schlegel
Andrew Jacobsen
Raksha Kumaraswamy
Martha White
Adam White
CLL
16
8
0
22 Feb 2022
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
32
53
0
17 Feb 2022
Contextualize Me -- The Case for Context in Reinforcement Learning
C. Benjamins
Theresa Eimer
Frederik Schubert
Aditya Mohan
Sebastian Dohler
André Biedenkapp
Bodo Rosenhahn
Frank Hutter
Marius Lindauer
OffRL
30
29
0
09 Feb 2022
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei-ping Xu
Haonan Yu
38
10
0
24 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
33
132
0
20 Jan 2022
Automated Reinforcement Learning: An Overview
Reza Refaei Afshar
Yingqian Zhang
Joaquin Vanschoren
U. Kaymak
OffRL
28
16
0
13 Jan 2022
A Research Agenda for AI Planning in the Field of Flexible Production Systems
Aljosha Kocher
René Heesch
Niklas Widulle
Anna Nordhausen
Julian Putzke
Alexander Windmann
Oliver Niggemann
13
5
0
31 Dec 2021