ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.14548
  4. Cited By
Offline Reinforcement Learning via High-Fidelity Generative Behavior
  Modeling

Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling

29 September 2022
Huayu Chen
Cheng Lu
Chengyang Ying
Hang Su
Jun Zhu
    DiffM
    OffRL
ArXivPDFHTML

Papers citing "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling"

33 / 83 papers shown
Title
Regularized Conditional Diffusion Model for Multi-Task Preference
  Alignment
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment
Xudong Yu
Chenjia Bai
Haoran He
Changhong Wang
Xuelong Li
32
6
0
07 Apr 2024
Versatile Navigation under Partial Observability via Value-guided
  Diffusion Policy
Versatile Navigation under Partial Observability via Value-guided Diffusion Policy
Gengyu Zhang
Hao Tang
Yan Yan
28
2
0
01 Apr 2024
Disentangling Policy from Offline Task Representation Learning via
  Adversarial Data Augmentation
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Chengxing Jia
Fuxiang Zhang
Yi-Chen Li
Chenxiao Gao
Xu-Hui Liu
Lei Yuan
Zongzhang Zhang
Yang Yu
AAML
37
4
0
12 Mar 2024
Stabilizing Policy Gradients for Stochastic Differential Equations via
  Consistency with Perturbation Process
Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process
Xiangxin Zhou
Liang Wang
Yichi Zhou
DiffM
21
4
0
07 Mar 2024
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
Ge Yan
Yueh-hua Wu
Xiaolong Wang
VGen
32
20
0
07 Mar 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
39
107
0
16 Feb 2024
Contrastive Diffuser: Planning Towards High Return States via
  Contrastive Learning
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Yixiang Shan
Zhengbang Zhu
Ting Long
Qifan Liang
Yi-Ju Chang
Weinan Zhang
Liang Yin
OffRL
34
1
0
05 Feb 2024
Towards Efficient Exact Optimization of Language Model Alignment
Towards Efficient Exact Optimization of Language Model Alignment
Haozhe Ji
Cheng Lu
Yilin Niu
Pei Ke
Hongning Wang
Jun Zhu
Jie Tang
Minlie Huang
50
11
0
01 Feb 2024
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion
  Model
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
Yinan Zheng
Jianxiong Li
Dongjie Yu
Yujie Yang
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
OffRL
34
24
0
19 Jan 2024
Simple Hierarchical Planning with Diffusion
Simple Hierarchical Planning with Diffusion
Chang Chen
Fei Deng
Kenji Kawaguchi
Çağlar Gülçehre
Sungjin Ahn
OffRL
DiffM
38
24
0
05 Jan 2024
Diffusion Models for Reinforcement Learning: A Survey
Diffusion Models for Reinforcement Learning: A Survey
Zhengbang Zhu
Hanye Zhao
Haoran He
Yichao Zhong
Shenyu Zhang
Haoquan Guo
Tingting Chen
Weinan Zhang
41
59
0
02 Nov 2023
Score Regularized Policy Optimization through Diffusion Behavior
Score Regularized Policy Optimization through Diffusion Behavior
Huayu Chen
Cheng Lu
Zhengyi Wang
Hang Su
Jun Zhu
23
20
0
11 Oct 2023
Boosting Continuous Control with Consistency Policy
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
27
19
0
10 Oct 2023
DiffCPS: Diffusion Model based Constrained Policy Search for Offline
  Reinforcement Learning
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
Longxiang He
Li Shen
Linrui Zhang
Junbo Tan
Xueqian Wang
OffRL
28
8
0
09 Oct 2023
Understanding, Predicting and Better Resolving Q-Value Divergence in
  Offline-RL
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
OffRL
27
16
0
06 Oct 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable
  Diffusion Model
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
45
29
0
03 Oct 2023
Efficient Planning with Latent Diffusion
Efficient Planning with Latent Diffusion
Wenhao Li
DiffM
38
4
0
30 Sep 2023
Elastic Decision Transformer
Elastic Decision Transformer
Yueh-hua Wu
Xiaolong Wang
Masashi Hamaya
OffRL
21
39
0
05 Jul 2023
Beyond Conservatism: Diffusion Policies in Offline Multi-agent
  Reinforcement Learning
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning
Zhuoran Li
Ling Pan
Longbo Huang
DiffM
OffRL
20
7
0
04 Jul 2023
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion
  Models
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models
Sheng-Yen Chou
Pin-Yu Chen
Tsung-Yi Ho
DiffM
11
52
0
12 Jun 2023
Decision Stacks: Flexible Reinforcement Learning via Modular Generative
  Models
Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models
Siyan Zhao
Aditya Grover
OffRL
11
7
0
09 Jun 2023
Decoupled Prioritized Resampling for Offline RL
Decoupled Prioritized Resampling for Offline RL
Yang Yue
Bingyi Kang
Xiao Ma
Qisen Yang
Gao Huang
S. Song
Shuicheng Yan
OffRL
25
0
0
08 Jun 2023
Diffusion Model is an Effective Planner and Data Synthesizer for
  Multi-Task Reinforcement Learning
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Haoran He
Chenjia Bai
Kang Xu
Zhuoran Yang
Weinan Zhang
Dong Wang
Bingyan Zhao
Xuelong Li
DiffM
OffRL
30
89
0
29 May 2023
Making Offline RL Online: Collaborative World Models for Offline Visual
  Reinforcement Learning
Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning
Q. Wang
Jun Yang
Yunbo Wang
Xin Jin
Wenjun Zeng
Xiaokang Yang
OffRL
OnRL
33
3
0
24 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement
  Learning
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
25
39
0
22 May 2023
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling
  in Offline Reinforcement Learning
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
Cheng Lu
Huayu Chen
Jianfei Chen
Hang Su
Chongxuan Li
Jun Zhu
DiffM
OffRL
19
58
0
25 Apr 2023
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion
  Policies
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Philippe Hansen-Estruch
Ilya Kostrikov
Michael Janner
J. Kuba
Sergey Levine
OffRL
20
128
0
20 Apr 2023
Constrained Policy Optimization with Explicit Behavior Density for
  Offline Reinforcement Learning
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
Jing Zhang
Chi Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
27
7
0
28 Jan 2023
How to Backdoor Diffusion Models?
How to Backdoor Diffusion Models?
Sheng-Yen Chou
Pin-Yu Chen
Tsung-Yi Ho
DiffM
SILM
14
94
0
11 Dec 2022
Planning with Diffusion for Flexible Behavior Synthesis
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
627
0
20 May 2022
Supported Policy Optimization for Offline Reinforcement Learning
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
27
64
0
13 Feb 2022
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
838
0
12 Oct 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline
  and Online RL
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Previous
12