OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation

21 June 2021
Jongmin Lee
Wonseok Jeon
Byung-Jun Lee
J. Pineau
Kee-Eung Kim
    OffRL
arXiv:2106.10783

Papers citing "OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation"

50 / 67 papers shown
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
27
0
0
17 Apr 2025
Dual Alignment Maximin Optimization for Offline Model-based RL
Chi Zhou
Wang Luo
Haoran Li
Congying Han
Tiande Guo
Zicheng Zhang
OffRL
71
0
0
02 Feb 2025
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
97
2
0
28 Jan 2025
Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
Pavel Kolev
Marin Vlastelica
Georg Martius
OffRL
33
0
0
08 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
91
0
0
31 Dec 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
45
6
0
25 Oct 2024
Scalable Offline Reinforcement Learning for Mean Field Games
Axel Brunnbauer
Julian Lemmel
Z. Babaiee
Sophie A. Neubauer
Radu Grosu
OffRL
43
0
0
23 Oct 2024
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
28
0
0
02 Oct 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
40
1
0
07 Sep 2024
Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning
Vanshaj Khattar
Ming Jin
OffRL
16
0
0
27 Aug 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
41
5
0
29 Jul 2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
Yi-Fan Yao
Zhepeng Cen
Wenhao Ding
Hao-ming Lin
Shiqi Liu
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
OnRL
51
1
0
19 Jul 2024
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning
Minjae Cho
Chuangchuang Sun
OffRL
40
0
0
17 Jul 2024
Bellman Diffusion Models
Liam Schramm
Abdeslam Boularias
DiffM
24
2
0
16 Jul 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
51
7
0
13 Jun 2024
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
31
4
0
13 Jun 2024
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
62
3
0
29 May 2024
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Vanshaj Khattar
Yuhao Ding
Bilgehan Sel
Javad Lavaei
Ming Jin
OffRL
32
12
0
26 May 2024
How to Leverage Diverse Demonstrations in Offline Imitation Learning
Sheng Yue
Jiani Liu
Xingyuan Hua
Ju Ren
Sen Lin
Junshan Zhang
Yaoxue Zhang
OffRL
34
3
0
24 May 2024
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
OffRL
41
1
0
20 May 2024
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows
Minjae Cho
Jonathan P. How
Chuangchuang Sun
OODD
OffRL
38
1
0
06 May 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
34
10
0
01 Feb 2024
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
Yinan Zheng
Jianxiong Li
Dongjie Yu
Yujie Yang
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
OffRL
36
24
0
19 Jan 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn
Seong-Woong Shim
Byung-Jun Lee
21
0
0
18 Jan 2024
Learning from Sparse Offline Datasets via Conservative Density Estimation
Zhepeng Cen
Zuxin Liu
Zitong Wang
Yi-Fan Yao
Henry Lam
Ding Zhao
OffRL
23
7
0
16 Jan 2024
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation
Abhinav Jain
Vaibhav Unhelkar
OffRL
19
4
0
17 Dec 2023
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning
Melrose Roderick
Gaurav Manek
Felix Berkenkamp
J. Zico Kolter
OffRL
OnRL
17
0
0
25 Nov 2023
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
Daiki E. Matsunaga
Jongmin Lee
Jaeseok Yoon
Stefanos Leonardos
Pieter Abbeel
Kee-Eung Kim
OODD
OffRL
22
3
0
03 Nov 2023
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
21
0
0
02 Nov 2023
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
27
5
0
02 Nov 2023
Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks
Zihao Li
Xiang Ji
Minshuo Chen
Mengdi Wang
OffRL
23
0
0
16 Oct 2023
Bi-Level Offline Policy Optimization with Limited Exploration
Wenzhuo Zhou
OffRL
34
4
0
10 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
38
19
0
06 Oct 2023
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
32
0
0
28 Sep 2023
Tempo Adaptation in Non-stationary Reinforcement Learning
Hyunin Lee
Yuhao Ding
Jongmin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
9
3
0
26 Sep 2023
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Maryam Zare
P. Kebria
Abbas Khosravi
Saeid Nahavandi
21
81
0
05 Sep 2023
Marginalized Importance Sampling for Off-Environment Policy Evaluation
Pulkit Katdare
Nan Jiang
Katherine Driggs-Campbell
OffRL
22
4
0
04 Sep 2023
Offline Diversity Maximization Under Imitation Constraints
Marin Vlastelica
Jin Cheng
Georg Martius
Pavel Kolev
OffRL
38
0
0
21 Jul 2023
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
42
8
0
26 Jun 2023
Datasets and Benchmarks for Offline Safe Reinforcement Learning
Zuxin Liu
Zijian Guo
Haohong Lin
Yi-Fan Yao
Jiacheng Zhu
...
Hanjiang Hu
Wenhao Yu
Tingnan Zhang
Jie Tan
Ding Zhao
OffRL
24
36
0
15 Jun 2023
A Primal-Dual-Critic Algorithm for Offline Constrained Reinforcement Learning
Kihyuk Hong
Yuhang Li
Ambuj Tewari
OffRL
18
7
0
13 Jun 2023
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
M. Geist
Yuejie Chi
OOD
30
31
0
26 May 2023
Offline Reinforcement Learning with Additional Covering Distributions
Chenjie Mao
OffRL
23
0
0
22 May 2023
Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching
Lantao Yu
Tianhe Yu
Jiaming Song
W. Neiswanger
Stefano Ermon
OffRL
65
16
0
05 Mar 2023
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit S. Sikchi
Qinqing Zheng
Amy Zhang
S. Niekum
OffRL
28
19
0
16 Feb 2023
Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability
Hanlin Zhu
Amy Zhang
OffRL
16
2
0
07 Feb 2023
Reinforcement Learning in Low-Rank MDPs with Density Features
Audrey Huang
Jinglin Chen
Nan Jiang
OffRL
18
14
0
04 Feb 2023
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Qing-Shan Jia
Ya-Qin Zhang
OffRL
38
19
0
03 Feb 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Hanlin Zhu
Paria Rashidinejad
Jiantao Jiao
OffRL
38
15
0
30 Jan 2023
Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality
Ying Jin
Zhimei Ren
Zhuoran Yang
Zhaoran Wang
OffRL
26
25
0
19 Dec 2022