Supported Policy Optimization for Offline Reinforcement Learning

13 February 2022
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
    OffRL

Papers citing "Supported Policy Optimization for Offline Reinforcement Learning"

46 / 46 papers shown
ExOSITO: Explainable Off-Policy Learning with Side Information for Intensive Care Unit Blood Test Orders
Zongliang Ji
Andre Carlos Kajdacsy-Balla Amaral
Anna Goldenberg
Rahul G. Krishnan
OffRL
44
0
0
24 Apr 2025
Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning
Ke Jiang
Wen Jiang
Y. Li
Xiaoyang Tan
OffRL
38
0
0
02 Apr 2025
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
Yunkai Gao
Jiaming Guo
Fan Wu
Rui Zhang
OffRL
56
0
0
07 Mar 2025
Average Reward Reinforcement Learning for Wireless Radio Resource Management
Kun Yang
Jing Yang
Cong Shen
46
1
0
12 Jan 2025
Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation
Fei Zhao
Xueliang Zhang
36
0
0
25 Dec 2024
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
OnRL
36
0
0
31 Oct 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
36
0
0
27 Oct 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
48
6
0
25 Oct 2024
Decision-Point Guided Safe Policy Improvement
Abhishek Sharma
Leo Benac
S. Parbhoo
Finale Doshi-Velez
OffRL
30
0
0
12 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
46
4
0
03 Oct 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
34
0
0
06 Sep 2024
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
Shirong Liu
Chenjia Bai
Zixian Guo
Hao Zhang
Gaurav Sharma
Yang Liu
OffRL
32
2
0
04 Aug 2024
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu
Yang Li
Yixing Lan
Hao Gao
Wei Pan
Xin Xu
OffRL
36
5
0
30 May 2024
DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation
Jinxin Liu
Xinghong Guo
Zifeng Zhuang
Donglin Wang
DiffM
OffRL
50
2
0
23 May 2024
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Xiaoyu Wen
Chenjia Bai
Kang Xu
Xudong Yu
Yang Zhang
Xuelong Li
Zhen Wang
41
2
0
10 May 2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
Yunpeng Qing
Shunyu Liu
Jingyuan Cong
Kaixuan Chen
Yihe Zhou
Mingli Song
OffRL
31
1
0
12 Mar 2024
Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
Yanxiao Zhao
Yangge Qian
Tianyi Wang
Jingyang Shan
Xiaolin Qin
21
0
0
01 Mar 2024
Context-Former: Stitching via Latent Conditioned Sequence Modeling
Ziqi Zhang
Jingzehua Xu
Jinxin Liu
Zifeng Zhuang
Donglin Wang
Miao Liu
Shuai Zhang
OffRL
45
4
0
29 Jan 2024
Supported Trust Region Optimization for Offline Reinforcement Learning
Yongyi Mao
Hongchang Zhang
Cheng Chen
Yi Tian Xu
Xiangyang Ji
OffRL
29
14
0
15 Nov 2023
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization
Kun Lei
Zhengmao He
Chenhao Lu
Kaizhe Hu
Yang Gao
Huazhe Xu
OffRL
OnRL
51
13
0
06 Nov 2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
29
13
0
27 Oct 2023
Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning
Jingyun Yang
Max Sobol Mark
Brandon Vu
Archit Sharma
Jeannette Bohg
Chelsea Finn
OffRL
OnRL
29
21
0
23 Oct 2023
Score Regularized Policy Optimization through Diffusion Behavior
Huayu Chen
Cheng Lu
Zhengyi Wang
Hang Su
Jun Zhu
28
20
0
11 Oct 2023
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
Ziqi Zhang
Xiao Xiong
Zifeng Zhuang
Jinxin Liu
Donglin Wang
OffRL
OnRL
42
0
0
07 Oct 2023
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
Xiaoyu Wen
Xudong Yu
Rui Yang
Chenjia Bai
Zhen Wang
OffRL
OnRL
18
10
0
29 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
36
0
0
26 Sep 2023
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning
Jianzhun Shao
Yun Qu
Chen Chen
Hongchang Zhang
Xiangyang Ji
OffRL
17
19
0
22 Sep 2023
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning
Zilai Zeng
Ce Zhang
Shijie Wang
Chen Sun
OffRL
29
5
0
07 Jul 2023
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning
Jinxin Liu
Lipeng Zu
Li He
Donglin Wang
OffRL
45
8
0
23 Jun 2023
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning
Jinxin Liu
Ziqi Zhang
Zhenyu Wei
Zifeng Zhuang
Yachen Kang
Sibo Gai
Donglin Wang
OffRL
20
16
0
22 Jun 2023
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning
Siyuan Guo
Yanchao Sun
Jifeng Hu
Sili Huang
Hechang Chen
Haiyin Piao
Lichao Sun
Yi-Ju Chang
OffRL
OnRL
31
7
0
13 Jun 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
Yuhang Ran
Yi-Chen Li
Fuxiang Zhang
Zongzhang Zhang
Yang Yu
OffRL
21
23
0
11 Jun 2023
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning
Xiao Hu
Yi Ma
Chenjun Xiao
Yan Zheng
Zhaopeng Meng
OffRL
18
4
0
09 Jun 2023
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning
Linjie Xu
Zhengyao Jiang
Jinyu Wang
Lei Song
Jiang Bian
OffRL
33
0
0
06 Jun 2023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Ya-Qin Zhang
OffRL
OnRL
36
19
0
25 May 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
33
36
0
16 May 2023
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Mitsuhiko Nakamoto
Yuexiang Zhai
Anika Singh
Max Sobol Mark
Yi Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
OffRL
OnRL
112
108
0
09 Mar 2023
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
21
26
0
28 Feb 2023
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
Jing Zhang
Chi Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
27
7
0
28 Jan 2023
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
Jiachen Li
Edwin Zhang
Ming Yin
Qinxun Bai
Yu-Xiang Wang
William Yang Wang
OffRL
31
15
0
29 Nov 2022
CORL: Research-oriented Deep Offline Reinforcement Learning Library
Denis Tarasov
Alexander Nikulin
Dmitry Akimov
Vladislav Kurenkov
Sergey Kolesnikov
OffRL
51
78
0
13 Oct 2022
Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Huayu Chen
Cheng Lu
Chengyang Ying
Hang Su
Jun Zhu
DiffM
OffRL
103
105
0
29 Sep 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
843
0
12 Oct 2021
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
143
85
0
22 Sep 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020