Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.05253
Cited By
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning
12 June 2019
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Search on the Replay Buffer: Bridging Planning and Reinforcement Learning"
50 / 173 papers shown
Title
Efficient Robotic Policy Learning via Latent Space Backward Planning
Dongxiu Liu
Haoyi Niu
Zhihao Wang
Jinliang Zheng
Yinan Zheng
Zhonghong Ou
Jianming Hu
Jianxiong Li
Xianyuan Zhan
28
0
0
11 May 2025
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning
Simo Alami C.
Rim Kaddah
Jesse Read
Marie-Paule Cani
46
0
0
07 May 2025
Reward Shaping to Mitigate Reward Hacking in RLHF
Jiayi Fu
Xuandong Zhao
Chengyuan Yao
Hairu Wang
Qi Han
Yanghua Xiao
84
6
0
26 Feb 2025
DHP: Discrete Hierarchical Planning for Hierarchical Reinforcement Learning Agents
Shashank Sharma
Janina Hoffmann
Vinay P. Namboodiri
88
0
0
04 Feb 2025
Episodic memory in AI agents poses risks that should be studied and mitigated
Chad DeChant
67
2
0
20 Jan 2025
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep P. Chinchali
Ufuk Topcu
OffRL
92
0
0
02 Dec 2024
Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance
Zhe Wang
Haozhu Wang
Yanjun Qi
OffRL
81
0
0
01 Dec 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
50
8
0
26 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
38
1
0
07 Oct 2024
ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation
Abrar Anwar
John Welsh
Joydeep Biswas
Soha Pouya
Yan Chang
LM&Ro
34
9
0
20 Sep 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
40
0
0
11 Aug 2024
Offline Imitation Learning Through Graph Search and Retrieval
Zhao-Heng Yin
Pieter Abbeel
OffRL
48
3
0
22 Jul 2024
Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Hao-Tien Lewis Chiang
Zhuo Xu
Zipeng Fu
M. Jacob
Tingnan Zhang
...
Carolina Parada
Chelsea Finn
Peng Xu
Sergey Levine
Jie Tan
LM&Ro
51
20
0
10 Jul 2024
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
Yu-Juan Luo
Fuchun Sun
Tianying Ji
Xianyuan Zhan
32
0
0
26 Jun 2024
To Err is Robotic: Rapid Value-Based Trial-and-Error during Deployment
Maximilian Du
Alexander Khazatsky
Tobias Gerstenberg
Chelsea Finn
49
0
0
22 Jun 2024
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning
Yuhui Wang
Qingyuan Wu
Weida Li
Dylan R. Ashley
Francesco Faccio
Chao Huang
Jürgen Schmidhuber
AI4CE
26
0
0
12 Jun 2024
What Matters in Hierarchical Search for Combinatorial Reasoning Problems?
Michał Zawalski
Gracjan Góral
Michał Tyrolski
Emilia Wisnios
Franciszek Budrowski
Marek Cygan
Łukasz Kuciński
Piotr Miłoś
47
0
0
05 Jun 2024
RoboMP
2
^2
2
: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models
Qi Lv
Haochuan Li
Xiang Deng
Rui Shao
Michael Yu Wang
Liqiang Nie
LRM
LM&Ro
42
1
0
07 Apr 2024
Waypoint-Based Reinforcement Learning for Robot Manipulation Tasks
Shaunak A. Mehta
Soheil Habibian
Dylan P. Losey
SSL
73
2
0
20 Mar 2024
Probabilistic World Modeling with Asymmetric Distance Measure
Meng Song
33
0
0
16 Mar 2024
Meta-operators for Enabling Parallel Planning Using Deep Reinforcement Learning
Ángel Aso-Mollar
Eva Onaindia
OffRL
24
0
0
13 Mar 2024
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
Benjamin Eysenbach
Vivek Myers
Ruslan Salakhutdinov
Sergey Levine
AI4TS
41
8
0
06 Mar 2024
Feudal Networks for Visual Navigation
Faith Johnson
Bryan Bo Cao
Kristin J. Dana
Shubham Jain
Ashwin Ashok
34
2
0
19 Feb 2024
Single-Reset Divide & Conquer Imitation Learning
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
40
0
0
14 Feb 2024
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents
Jae-Woo Choi
Youngwoo Yoon
Hyobin Ong
Jaehong Kim
Minsu Jang
19
13
0
13 Feb 2024
Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim
Yunseon Choi
Daiki E. Matsunaga
Kee-Eung Kim
OffRL
46
6
0
11 Feb 2024
Offline Deep Model Predictive Control (MPC) for Visual Navigation
Taha Bouzid
Youssef Alj
26
0
0
07 Feb 2024
TopoNav: Topological Navigation for Efficient Exploration in Sparse Reward Environments
Jumman Hossain
A. Faridee
Nirmalya Roy
Jade Freeman
Timothy Gregory
Theron T. Trout
28
3
0
06 Feb 2024
Reconciling Spatial and Temporal Abstractions for Goal Representation
Mehdi Zadem
Sergio Mover
Sao Mai Nguyen
18
3
0
18 Jan 2024
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse Reward
Fangyuan Wang
Anqing Duan
Peng Zhou
Shengzeng Huo
Guodong Guo
Chenguang Yang
D. Navarro-Alarcon
OffRL
VLM
33
0
0
25 Dec 2023
ReRoGCRL: Representation-based Robustness in Goal-Conditioned Reinforcement Learning
Xiangyu Yin
Sihao Wu
Jiaxu Liu
Meng Fang
Xingyu Zhao
Xiaowei Huang
Wenjie Ruan
AAML
38
5
0
12 Dec 2023
Look Before You Leap: Unveiling the Power of GPT-4V in Robotic Vision-Language Planning
Yingdong Hu
Fanqi Lin
Tong Zhang
Li Yi
Yang Gao
LM&Ro
91
101
0
29 Nov 2023
Goal-conditioned Offline Planning from Curious Exploration
Marco Bagatella
Georg Martius
OffRL
21
1
0
28 Nov 2023
Imagination-Augmented Hierarchical Reinforcement Learning for Safe and Interactive Autonomous Driving in Urban Environments
Sang-Hyun Lee
Yoonjae Jung
Seung-Woo Seo
32
1
0
17 Nov 2023
Large Language Models for Robotics: A Survey
Fanlong Zeng
Wensheng Gan
Yongheng Wang
Ning Liu
Philip S. Yu
LM&Ro
124
125
0
13 Nov 2023
Contrastive Difference Predictive Coding
Chongyi Zheng
Ruslan Salakhutdinov
Benjamin Eysenbach
AI4TS
OffRL
28
11
0
31 Oct 2023
CQM: Curriculum Reinforcement Learning with a Quantized World Model
Seungjae Lee
Daesol Cho
Jonghae Park
H. J. Kim
28
6
0
26 Oct 2023
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving
Xueliang Zhao
Xinting Huang
Wei Bi
Lingpeng Kong
LRM
48
0
0
19 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models
Mengkang Hu
Yao Mu
Xinmiao Yu
Mingyu Ding
Shiguang Wu
Wenqi Shao
Qiguang Chen
Bin Wang
Yu Qiao
Ping Luo
LLMAG
42
33
0
12 Oct 2023
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Zichen Zhang
Yunshuang Li
Osbert Bastani
Abhishek Gupta
Dinesh Jayaraman
Yecheng Jason Ma
Luca Weihs
37
17
0
12 Oct 2023
Roadmaps with Gaps over Controllers: Achieving Efficiency in Planning under Dynamics
A. Sivaramakrishnan
Sumanth Tangirala
Edgar Granados
Noah R. Carver
Kostas E. Bekris
25
3
0
05 Oct 2023
Efficient Planning with Latent Diffusion
Wenhao Li
DiffM
40
4
0
30 Sep 2023
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning
Mingde Zhao
Safa Alver
H. V. Seijen
Romain Laroche
Doina Precup
Yoshua Bengio
15
3
0
30 Sep 2023
PlaceNav: Topological Navigation through Place Recognition
Liangyu Zhang
Jiadong Liang
Harry Edelman
Zhihua Zhang
30
6
0
29 Sep 2023
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout
Haoran Wang
Zeshen Tang
Leya Yang
Yaoru Sun
Fang Wang
Siyu Zhang
Ye-Ting Chen
30
2
0
24 Sep 2023
Goal Space Abstraction in Hierarchical Reinforcement Learning via Set-Based Reachability Analysis
Mehdi Zadem
Sergio Mover
S. Nguyen
18
5
0
14 Sep 2023
Learning Team-Based Navigation: A Review of Deep Reinforcement Learning Techniques for Multi-Agent Pathfinding
Jaeho Chung
Jamil Fayyad
Younes Al Younes
H. Najjaran
27
13
0
11 Aug 2023
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Qingyang Zhang
Yiming Yang
Jingqing Ruan
Xuantang Xiong
Dengpeng Xing
Bo Xu
33
0
0
22 Jul 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Seohong Park
Dibya Ghosh
Benjamin Eysenbach
Sergey Levine
OffRL
30
44
0
22 Jul 2023
1
2
3
4
Next