arXiv: 2006.03647
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
5 June 2020
T. Matsushima
Hiroki Furuta
Y. Matsuo
Ofir Nachum
S. Gu
OffRL
Papers citing
"Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization"
34 / 34 papers shown
Title
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
64
0
0
03 Apr 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
72
1
0
17 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
89
1
0
13 Feb 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feifei Feng
OffRL
102
1
0
22 Dec 2024
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
33
1
0
12 Oct 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
22
10
0
06 Jan 2024
Designing Long-term Group Fair Policies in Dynamical Systems
Miriam Rateike
Isabel Valera
Patrick Forré
33
4
0
21 Nov 2023
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
35
0
0
28 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
36
1
0
26 Sep 2023
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Hiroki Furuta
Kuang-Huei Lee
Ofir Nachum
Yutaka Matsuo
Aleksandra Faust
S. Gu
Izzeddin Gur
LM&Ro
36
93
0
19 May 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRL
OnRL
18
21
0
14 Mar 2023
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OOD
OffRL
27
0
0
27 Nov 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
57
9
0
23 Oct 2022
World Robot Challenge 2020 -- Partner Robot: A Data-Driven Approach for Room Tidying with Mobile Manipulator
T. Matsushima
Yukiyasu Noguchi
Jumpei Arima
Toshiki Aoki
Yuki Okita
...
Yuki Yamashita
Shoichi Seto
S. Gu
Yusuke Iwasawa
Yutaka Matsuo
32
8
0
20 Jul 2022
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
46
2
0
12 Jul 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
19
22
0
24 Jun 2022
Demonstration-Bootstrapped Autonomous Practicing via Multi-Task Reinforcement Learning
Abhishek Gupta
Corey Lynch
Brandon Kinman
Garrett Peake
Sergey Levine
Karol Hausman
OffRL
19
17
0
29 Mar 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
31
132
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost
Dan Qiao
Ming Yin
Ming Min
Yu-Xiang Wang
34
28
0
13 Feb 2022
Model-Based Offline Meta-Reinforcement Learning with Regularization
Sen Lin
Jialin Wan
Tengyu Xu
Yingbin Liang
Junshan Zhang
OffRL
31
17
0
07 Feb 2022
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
35
77
0
16 Sep 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
24
29
0
16 Jun 2021
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
58
780
0
12 Jun 2021
Distributed Learning in Wireless Networks: Recent Progress and Future Challenges
Mingzhe Chen
Deniz Gündüz
Kaibin Huang
Walid Saad
M. Bennis
Aneta Vulgarakis Feljan
H. Vincent Poor
38
401
0
05 Apr 2021
Offline Reinforcement Learning with Fisher Divergence Critic Regularization
Ilya Kostrikov
Jonathan Tompson
Rob Fergus
Ofir Nachum
OffRL
29
300
0
14 Mar 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
219
415
0
16 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
140
6
0
01 Feb 2021
Reliable Off-policy Evaluation for Reinforcement Learning
Jie Wang
Rui Gao
H. Zha
OffRL
22
11
0
08 Nov 2020
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Anurag Ajay
Aviral Kumar
Pulkit Agrawal
Sergey Levine
Ofir Nachum
OffRL
OnRL
34
155
0
26 Oct 2020
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
37
45
0
23 Aug 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
BRPO: Batch Residual Policy Optimization
Kentaro Kanamori
Yinlam Chow
Takuya Takagi
Hiroki Arimura
Honglak Lee
Ken Kobayashi
Craig Boutilier
OffRL
139
46
0
08 Feb 2020
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
73
312
0
06 Jun 2015