Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.10783
Cited By
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
21 June 2021
Jongmin Lee
Wonseok Jeon
Byung-Jun Lee
J. Pineau
Kee-Eung Kim
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation"
50 / 67 papers shown
Title
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
27
0
0
17 Apr 2025
Dual Alignment Maximin Optimization for Offline Model-based RL
Chi Zhou
Wang Luo
Haoran Li
Congying Han
Tiande Guo
Zicheng Zhang
OffRL
71
0
0
02 Feb 2025
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
97
2
0
28 Jan 2025
Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
Pavel Kolev
Marin Vlastelica
Georg Martius
OffRL
33
0
0
08 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
91
0
0
31 Dec 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
45
6
0
25 Oct 2024
Scalable Offline Reinforcement Learning for Mean Field Games
Axel Brunnbauer
Julian Lemmel
Z. Babaiee
Sophie A. Neubauer
Radu Grosu
OffRL
43
0
0
23 Oct 2024
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
28
0
0
02 Oct 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
40
1
0
07 Sep 2024
Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning
Vanshaj Khattar
Ming Jin
OffRL
16
0
0
27 Aug 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
41
5
0
29 Jul 2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
Yi-Fan Yao
Zhepeng Cen
Wenhao Ding
Hao-ming Lin
Shiqi Liu
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
OnRL
51
1
0
19 Jul 2024
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning
Minjae Cho
Chuangchuang Sun
OffRL
40
0
0
17 Jul 2024
Bellman Diffusion Models
Liam Schramm
Abdeslam Boularias
DiffM
24
2
0
16 Jul 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
51
7
0
13 Jun 2024
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
31
4
0
13 Jun 2024
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
62
3
0
29 May 2024
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Vanshaj Khattar
Yuhao Ding
Bilgehan Sel
Javad Lavaei
Ming Jin
OffRL
32
12
0
26 May 2024
How to Leverage Diverse Demonstrations in Offline Imitation Learning
Sheng Yue
Jiani Liu
Xingyuan Hua
Ju Ren
Sen Lin
Junshan Zhang
Yaoxue Zhang
OffRL
34
3
0
24 May 2024
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
OffRL
41
1
0
20 May 2024
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows
Minjae Cho
Jonathan P. How
Chuangchuang Sun
OODD
OffRL
38
1
0
06 May 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
34
10
0
01 Feb 2024
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
Yinan Zheng
Jianxiong Li
Dongjie Yu
Yujie Yang
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
OffRL
36
24
0
19 Jan 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn
Seong-Woong Shim
Byung-Jun Lee
21
0
0
18 Jan 2024
Learning from Sparse Offline Datasets via Conservative Density Estimation
Zhepeng Cen
Zuxin Liu
Zitong Wang
Yi-Fan Yao
Henry Lam
Ding Zhao
OffRL
23
7
0
16 Jan 2024
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation
Abhinav Jain
Vaibhav Unhelkar
OffRL
19
4
0
17 Dec 2023
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning
Melrose Roderick
Gaurav Manek
Felix Berkenkamp
J. Zico Kolter
OffRL
OnRL
17
0
0
25 Nov 2023
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
Daiki E. Matsunaga
Jongmin Lee
Jaeseok Yoon
Stefanos Leonardos
Pieter Abbeel
Kee-Eung Kim
OODD
OffRL
22
3
0
03 Nov 2023
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
21
0
0
02 Nov 2023
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
27
5
0
02 Nov 2023
Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks
Zihao Li
Xiang Ji
Minshuo Chen
Mengdi Wang
OffRL
23
0
0
16 Oct 2023
Bi-Level Offline Policy Optimization with Limited Exploration
Wenzhuo Zhou
OffRL
34
4
0
10 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
38
19
0
06 Oct 2023
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
32
0
0
28 Sep 2023
Tempo Adaptation in Non-stationary Reinforcement Learning
Hyunin Lee
Yuhao Ding
Jongmin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
9
3
0
26 Sep 2023
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Maryam Zare
P. Kebria
Abbas Khosravi
Saeid Nahavandi
21
81
0
05 Sep 2023
Marginalized Importance Sampling for Off-Environment Policy Evaluation
Pulkit Katdare
Nan Jiang
Katherine Driggs-Campbell
OffRL
22
4
0
04 Sep 2023
Offline Diversity Maximization Under Imitation Constraints
Marin Vlastelica
Jin Cheng
Georg Martius
Pavel Kolev
OffRL
38
0
0
21 Jul 2023
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
42
8
0
26 Jun 2023
Datasets and Benchmarks for Offline Safe Reinforcement Learning
Zuxin Liu
Zijian Guo
Haohong Lin
Yi-Fan Yao
Jiacheng Zhu
...
Hanjiang Hu
Wenhao Yu
Tingnan Zhang
Jie Tan
Ding Zhao
OffRL
24
36
0
15 Jun 2023
A Primal-Dual-Critic Algorithm for Offline Constrained Reinforcement Learning
Kihyuk Hong
Yuhang Li
Ambuj Tewari
OffRL
18
7
0
13 Jun 2023
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
M. Geist
Yuejie Chi
OOD
30
31
0
26 May 2023
Offline Reinforcement Learning with Additional Covering Distributions
Chenjie Mao
OffRL
23
0
0
22 May 2023
Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching
Lantao Yu
Tianhe Yu
Jiaming Song
W. Neiswanger
Stefano Ermon
OffRL
65
16
0
05 Mar 2023
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit S. Sikchi
Qinqing Zheng
Amy Zhang
S. Niekum
OffRL
28
19
0
16 Feb 2023
Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability
Hanlin Zhu
Amy Zhang
OffRL
16
2
0
07 Feb 2023
Reinforcement Learning in Low-Rank MDPs with Density Features
Audrey Huang
Jinglin Chen
Nan Jiang
OffRL
18
14
0
04 Feb 2023
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Qing-Shan Jia
Ya-Qin Zhang
OffRL
38
19
0
03 Feb 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Hanlin Zhu
Paria Rashidinejad
Jiantao Jiao
OffRL
38
15
0
30 Jan 2023
Policy learning "without'' overlap: Pessimism and generalized empirical Bernstein's inequality
Ying Jin
Zhimei Ren
Zhuoran Yang
Zhaoran Wang
OffRL
26
25
0
19 Dec 2022
1
2
Next