ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.06193
  4. Cited By
Diffusion Policies as an Expressive Policy Class for Offline
  Reinforcement Learning

Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning

12 August 2022
Zhendong Wang
Jonathan J. Hunt
Mingyuan Zhou
    OffRL
ArXivPDFHTML

Papers citing "Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning"

50 / 69 papers shown
Title
Few-Step Diffusion via Score identity Distillation
Few-Step Diffusion via Score identity Distillation
Mingyuan Zhou
Yi Gu
Zhendong Wang
5
0
0
19 May 2025
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Ningyuan Yang
Jiaxuan Gao
Feng Gao
Yi Wu
Chao Yu
36
0
0
15 May 2025
Adaptive Diffusion Policy Optimization for Robotic Manipulation
Adaptive Diffusion Policy Optimization for Robotic Manipulation
Huiyun Jiang
Zhuang Yang
29
0
0
13 May 2025
Improving Trajectory Stitching with Flow Models
Improving Trajectory Stitching with Flow Models
Reece O'Mahoney
Wanming Yu
Ioannis Havoutis
33
0
0
12 May 2025
CHD: Coupled Hierarchical Diffusion for Long-Horizon Tasks
CHD: Coupled Hierarchical Diffusion for Long-Horizon Tasks
Ce Hao
Anxing Xiao
Zhiwei Xue
Harold Soh
46
0
0
12 May 2025
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Jifeng Hu
Sili Huang
Zhengyuan Yang
Shengchao Hu
Li Shen
H. Chen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
OffRL
149
0
0
03 May 2025
Extendable Long-Horizon Planning via Hierarchical Multiscale Diffusion
Extendable Long-Horizon Planning via Hierarchical Multiscale Diffusion
Chang Chen
Hany Hamed
Doojin Baek
Taegu Kang
Yoshua Bengio
Sungjin Ahn
54
0
0
25 Mar 2025
ARFlow: Human Action-Reaction Flow Matching with Physical Guidance
ARFlow: Human Action-Reaction Flow Matching with Physical Guidance
Wentao Jiang
Jingya Wang
Haotao Lu
Kaiyang Ji
Baoxiong Jia
Siyuan Huang
Ye-ling Shi
44
0
0
21 Mar 2025
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Zihao Liu
Xing Liu
Yizhai Zhang
Zhengxiong Liu
Panfeng Huang
69
0
0
19 Mar 2025
Quantization-Free Autoregressive Action Transformer
Quantization-Free Autoregressive Action Transformer
Ziyad Sheebaelhamd
Michael Tschannen
Michael Muehlebach
Claire Vernade
49
0
0
18 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
58
0
0
17 Mar 2025
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
Jiaming Liu
Hao Chen
Pengju An
Zhuoyang Liu
Renrui Zhang
...
Chengkai Hou
Mengdi Zhao
KC alex Zhou
Pheng-Ann Heng
S. Zhang
72
8
0
13 Mar 2025
Generative Trajectory Stitching through Diffusion Composition
Generative Trajectory Stitching through Diffusion Composition
Yunhao Luo
Utkarsh Aashu Mishra
Yilun Du
Danfei Xu
135
1
0
07 Mar 2025
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
Teng Pang
Bingzheng Wang
Guoqiang Wu
Yilong Yin
OffRL
73
0
0
03 Mar 2025
Uncertainty Comes for Free: Human-in-the-Loop Policies with Diffusion Models
Uncertainty Comes for Free: Human-in-the-Loop Policies with Diffusion Models
Zhanpeng He
Yifeng Cao
M. Ciocarlie
64
0
0
26 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
Mingyang Sun
Pengxiang Ding
Weinan Zhang
Donglin Wang
OT
83
0
0
24 Feb 2025
DDAT: Diffusion Policies Enforcing Dynamically Admissible Robot Trajectories
DDAT: Diffusion Policies Enforcing Dynamically Admissible Robot Trajectories
Jean-Baptiste Bouvier
Kanghyun Ryu
Kartik Nagpal
Qiayuan Liao
K. Sreenath
Negar Mehr
88
2
0
20 Feb 2025
Maximum Entropy Reinforcement Learning with Diffusion Policy
Maximum Entropy Reinforcement Learning with Diffusion Policy
Xiaoyi Dong
Jian Cheng
Xiaotian Zhang
46
0
0
17 Feb 2025
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi Ma
DiffM
93
23
0
17 Feb 2025
Skill Expansion and Composition in Parameter Space
Skill Expansion and Composition in Parameter Space
Tenglong Liu
J. Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
58
4
0
09 Feb 2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
Xintong Duan
Yutong He
Fahim Tajwar
Wen-Tse Chen
Ruslan Salakhutdinov
Jeff Schneider
OffRL
AI4CE
99
0
0
22 Jan 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu
Minghuan Liu
Liyuan Mao
Bingyi Kang
Minkai Xu
Yong Yu
Stefano Ermon
Weinan Zhang
DiffM
OffRL
88
34
0
03 Jan 2025
FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
Yishu Li
Wen Hui Leng
Yiming Fang
Ben Eisner
David Held
AI4CE
47
1
0
31 Dec 2024
Robust Contact-rich Manipulation through Implicit Motor Adaptation
Robust Contact-rich Manipulation through Implicit Motor Adaptation
Teng Xue
Amirreza Razmjoo
Suhan Shetty
Sylvain Calinon
102
1
0
16 Dec 2024
Fast and Robust Visuomotor Riemannian Flow Matching Policy
Fast and Robust Visuomotor Riemannian Flow Matching Policy
Haoran Ding
Noémie Jaquier
Jan Peters
Leonel Rozo
86
2
0
14 Dec 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
111
1
0
22 Nov 2024
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation
Mufei Li
Viraj Shitole
Eli Chien
Changhai Man
Zhaodong Wang
Srinivas Sridharan
Ying Zhang
Tushar Krishna
P. Li
37
0
0
04 Nov 2024
Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation
Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation
Daehee Lee
Minjong Yoo
Woo Kyung Kim
Wonje Choi
Honguk Woo
CLL
93
3
0
30 Oct 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
36
0
0
27 Oct 2024
Solving Continual Offline RL through Selective Weights Activation on
  Aligned Spaces
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
H. Chen
Yi-Ju Chang
Dacheng Tao
Lichao Sun
OffRL
39
0
0
21 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive
  Revaluation
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
33
1
0
15 Oct 2024
Control-oriented Clustering of Visual Latent Representation
Control-oriented Clustering of Visual Latent Representation
Han Qi
Haocheng Yin
Heng Yang
SSL
52
2
0
07 Oct 2024
Joint Localization and Planning using Diffusion
Joint Localization and Planning using Diffusion
Lukas Lao Beyer
S. Karaman
38
0
0
26 Sep 2024
Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance
Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance
Kin Man Lee
Sean Ye
Qingyu Xiao
Zixuan Wu
Z. Zaidi
David B. DÁmbrosio
Pannag R. Sanketi
Matthew Gombolay
80
1
0
23 Sep 2024
Adaptive Planning with Generative Models under Uncertainty
Adaptive Planning with Generative Models under Uncertainty
Pascal Jutras-Dubé
Ruqi Zhang
Aniket Bera
36
2
0
02 Aug 2024
ARDuP: Active Region Video Diffusion for Universal Policies
ARDuP: Active Region Video Diffusion for Universal Policies
Shuaiyi Huang
Mara Levy
Zhenyu Jiang
Anima Anandkumar
Yuke Zhu
Linxi Fan
De-An Huang
Abhinav Shrivastava
VGen
50
2
0
19 Jun 2024
CDSA: Conservative Denoising Score-based Algorithm for Offline
  Reinforcement Learning
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning
Zeyuan Liu
Kai Yang
Xiu Li
OffRL
44
0
0
11 Jun 2024
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary
  Trajectories
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories
Qianlan Yang
Yu-Xiong Wang
OnRL
42
1
0
06 Jun 2024
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function
  in Offline Reinforcement Learning
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
Yu Zhang
Rui Yu
Zhipeng Yao
Wenyuan Zhang
Jun Wang
Liming Zhang
OffRL
53
0
0
05 Jun 2024
What Matters in Hierarchical Search for Combinatorial Reasoning Problems?
What Matters in Hierarchical Search for Combinatorial Reasoning Problems?
Michał Zawalski
Gracjan Góral
Michał Tyrolski
Emilia Wisnios
Franciszek Budrowski
Marek Cygan
Łukasz Kuciński
Piotr Miłoś
47
0
0
05 Jun 2024
ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation
ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation
Guanxing Lu
Zifeng Gao
Tianxing Chen
Wen-Dao Dai
Ziwei Wang
Yansong Tang
Yansong Tang
DiffM
73
14
0
03 Jun 2024
Amortizing intractable inference in diffusion models for vision, language, and control
Amortizing intractable inference in diffusion models for vision, language, and control
S. Venkatraman
Moksh Jain
Luca Scimeca
Minsu Kim
Marcin Sendera
...
Alexandre Adam
Jarrid Rector-Brooks
Yoshua Bengio
Glen Berseth
Nikolay Malkin
68
24
0
31 May 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
56
1
0
31 May 2024
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
Ge Yan
Yueh-hua Wu
Xiaolong Wang
VGen
37
20
0
07 Mar 2024
Align Your Intents: Offline Imitation Learning via Optimal Transport
Align Your Intents: Offline Imitation Learning via Optimal Transport
Maksim Bobrin
N. Buzun
Dmitrii Krylov
Dmitry V. Dylov
OffRL
51
3
0
20 Feb 2024
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning
Weikang Wan
Ziyu Wang
Zackory M. Erickson
David Held
David Held
31
4
0
08 Feb 2024
DDM-Lag : A Diffusion-based Decision-making Model for Autonomous
  Vehicles with Lagrangian Safety Enhancement
DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement
Jiaqi Liu
Peng Hang
Xiaocong Zhao
Jianqiang Wang
Jian Sun
54
10
0
08 Jan 2024
Reinforcement Learning from Diffusion Feedback: Q* for Image Search
Reinforcement Learning from Diffusion Feedback: Q* for Image Search
Aboli Rajan Marathe
VLM
44
0
0
27 Nov 2023
Boosting Continuous Control with Consistency Policy
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
41
20
0
10 Oct 2023
12
Next