Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.04779
Cited By
Conservative Q-Learning for Offline Reinforcement Learning
8 June 2020
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Conservative Q-Learning for Offline Reinforcement Learning"
50 / 429 papers shown
Title
Controlling Commercial Cooling Systems Using Reinforcement Learning
Jerry Luo
Cosmin Paduraru
Octavian Voicu
Yuri Chervonyi
Scott A. Munns
...
Sims Witherspoon
D. Parish
Peter Dolan
Chenyu Zhao
D. Mankowitz
OffRL
AI4CE
28
25
0
11 Nov 2022
Active Example Selection for In-Context Learning
Yiming Zhang
Shi Feng
Chenhao Tan
SILM
LRM
32
187
0
08 Nov 2022
Spatio-temporal Incentives Optimization for Ride-hailing Services with Offline Deep Reinforcement Learning
Yanqiu Wu
Qingyang Li
Zhiwei Qin
OffRL
14
3
0
06 Nov 2022
Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning
D. Elbaz
Gal Novik
Oren Salzman
OffRL
33
0
0
06 Nov 2022
Contrastive Value Learning: Implicit Models for Simple Offline RL
Bogdan Mazoure
Benjamin Eysenbach
Ofir Nachum
Jonathan Tompson
SSL
OffRL
43
8
0
03 Nov 2022
Dual Generator Offline Reinforcement Learning
Q. Vuong
Aviral Kumar
Sergey Levine
Yevgen Chebotar
OffRL
34
1
0
02 Nov 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision
Ashvin Nair
Brian Zhu
Gokul Narayanan
Eugen Solowjow
Sergey Levine
OffRL
OnRL
28
15
0
27 Oct 2022
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
Yi Zhao
Rinu Boney
Alexander Ilin
Arno Solin
Joni Pajarinen
OffRL
OnRL
28
39
0
25 Oct 2022
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Baihan Lin
OffRL
AI4TS
32
27
0
24 Oct 2022
Dichotomy of Control: Separating What You Can Control from What You Cannot
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
30
42
0
24 Oct 2022
Implicit Offline Reinforcement Learning via Supervised Learning
Alexandre Piché
Rafael Pardiñas
David Vazquez
Igor Mordatch
C. Pal
SSL
OffRL
29
4
0
21 Oct 2022
MoCoDA: Model-based Counterfactual Data Augmentation
Silviu Pitis
Elliot Creager
Ajay Mandlekar
Animesh Garg
OffRL
48
33
0
20 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
18
1
0
20 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Zhuowen Tu
OffRL
33
16
0
19 Oct 2022
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Chengqian Gao
Kelvin Xu
Liu Liu
Deheng Ye
P. Zhao
Zhiqiang Xu
OffRL
45
2
0
19 Oct 2022
Boosting Offline Reinforcement Learning via Data Rebalancing
Yang Yue
Bingyi Kang
Xiao Ma
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
26
22
0
17 Oct 2022
Causal Inference for De-biasing Motion Estimation from Robotic Observational Data
Junhong Xu
Kai-Li Yin
Jason M. Gregory
Lantao Liu
CML
23
3
0
17 Oct 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
26
62
0
15 Oct 2022
Sustainable Online Reinforcement Learning for Auto-bidding
Zhiyu Mou
Yusen Huo
Rongquan Bai
Mingzhou Xie
Chuan Yu
Jian Xu
Bo Zheng
OffRL
OnRL
34
15
0
13 Oct 2022
Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning
Tung Nguyen
Qinqing Zheng
Aditya Grover
OffRL
34
6
0
11 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
29
7
0
07 Oct 2022
B2RL: An open-source Dataset for Building Batch Reinforcement Learning
Hsin-Yu Liu
Xiaohan Fu
Bharathan Balaji
Rajesh E. Gupta
Dezhi Hong
OffRL
27
4
0
30 Sep 2022
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning
Daesol Cho
D. Shim
H. J. Kim
OffRL
45
11
0
30 Sep 2022
Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Huayu Chen
Cheng Lu
Chengyang Ying
Hang Su
Jun Zhu
DiffM
OffRL
108
106
0
29 Sep 2022
Programmable and Customized Intelligence for Traffic Steering in 5G Networks Using Open RAN Architectures
Andrea Lacava
Michele Polese
R. Sivaraj
Rahul Soundrarajan
Bhawani Shanker Bhati
Tarunjeet Singh
Tommaso Zugno
F. Cuomo
Tommaso Melodia
35
80
0
28 Sep 2022
Latent Plans for Task-Agnostic Offline Reinforcement Learning
Erick Rosete-Beas
Oier Mees
Gabriel Kalweit
Joschka Boedecker
Wolfram Burgard
OffRL
46
81
0
19 Sep 2022
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning
Xianfu Chen
Zhifeng Zhao
S. Mao
Celimuge Wu
Honggang Zhang
M. Bennis
OffRL
31
3
0
19 Sep 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
40
3
0
15 Sep 2022
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Xiaoteng Ma
Zhipeng Liang
Jose H. Blanchet
MingWen Liu
Li Xia
Jiheng Zhang
Qianchuan Zhao
Zhengyuan Zhou
OOD
OffRL
41
22
0
14 Sep 2022
Task-Agnostic Learning to Accomplish New Tasks
Xianqi Zhang
Xingtao Wang
Xu Liu
Wenrui Wang
Xiaopeng Fan
Debin Zhao
OffRL
91
0
0
09 Sep 2022
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
Taku Yamagata
Ahmed Khalil
Raúl Santos-Rodríguez
OffRL
160
72
0
08 Sep 2022
Concept-modulated model-based offline reinforcement learning for rapid generalization
Nicholas A. Ketz
Praveen K. Pilly
OffRL
27
1
0
07 Sep 2022
Dialogue Evaluation with Offline Reinforcement Learning
Nurul Lubis
Christian Geishauser
Hsien-Chin Lin
Carel van Niekerk
Michael Heck
Shutong Feng
Milica Gavsić
OffRL
27
4
0
02 Sep 2022
Efficient Planning in a Compact Latent Action Space
Zhengyao Jiang
Tianjun Zhang
Michael Janner
Yueying Li
Tim Rocktaschel
Edward Grefenstette
Yuandong Tian
OffRL
24
37
0
22 Aug 2022
Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity
Laixi Shi
Yuejie Chi
OOD
OffRL
31
61
0
11 Aug 2022
Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems
Qihua Zhang
Junning Liu
Yuzhuo Dai
Yiyan Qi
Yifan Yuan
Kunlun Zheng
Fan Huang
Xianfeng Tan
OffRL
35
50
0
09 Aug 2022
A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning
Z. Bukhsh
N. Jansen
Hajo Molegraaf
OffRL
AI4CE
25
6
0
01 Aug 2022
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
OffRL
44
2
0
31 Jul 2022
Differentiable Integrated Motion Prediction and Planning with Learnable Cost Function for Autonomous Driving
Zhiyu Huang
Haochen Liu
Jingda Wu
Chen Lv
37
92
0
21 Jul 2022
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
Adam R. Villaflor
Zheng Huang
Swapnil Pande
John M. Dolan
J. Schneider
OffRL
25
24
0
21 Jul 2022
Human-to-Robot Imitation in the Wild
Shikhar Bahl
Abhi Gupta
Deepak Pathak
30
165
0
19 Jul 2022
Making Linear MDPs Practical via Contrastive Representation Learning
Tianjun Zhang
Tongzheng Ren
Mengjiao Yang
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
25
44
0
14 Jul 2022
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
46
2
0
12 Jul 2022
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning
Homer Walke
Jonathan Yang
Albert Yu
Aviral Kumar
Jedrzej Orbik
Avi Singh
Sergey Levine
OffRL
OnRL
29
32
0
11 Jul 2022
Multi-objective Optimization of Notifications Using Offline Reinforcement Learning
Prakruthi Prabhakar
Yiping Yuan
Guangyu Yang
Wensheng Sun
A. Muralidharan
OffRL
28
6
0
07 Jul 2022
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
35
45
0
05 Jul 2022
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
32
2
0
28 Jun 2022
Learning to Rearrange with Physics-Inspired Risk Awareness
Meng Song
Yuhan Liu
Zhengqin Li
Manmohan Chandraker
32
0
0
26 Jun 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
53
101
0
19 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
26
4
0
17 Jun 2022
Previous
1
2
3
4
5
6
7
8
9
Next