Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.10573
Cited By
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
20 April 2023
Philippe Hansen-Estruch
Ilya Kostrikov
Michael Janner
J. Kuba
Sergey Levine
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies"
50 / 116 papers shown
Title
Imagination-Limited Q-Learning for Offline Reinforcement Learning
Wenhui Liu
Zhijian Wu
J. Wang
Dingjiang Huang
Shuigeng Zhou
OffRL
3
0
0
18 May 2025
Robust Planning for Autonomous Driving via Mixed Adversarial Diffusion Predictions
Albert Zhao
Stefano Soatto
DiffM
2
0
0
18 May 2025
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Ningyuan Yang
Jiaxuan Gao
Feng Gao
Yi Wu
Chao Yu
36
0
0
15 May 2025
Adaptive Diffusion Policy Optimization for Robotic Manipulation
Huiyun Jiang
Zhuang Yang
29
0
0
13 May 2025
What Matters for Batch Online Reinforcement Learning in Robotics?
Perry Dong
Suvir Mirchandani
Dorsa Sadigh
Chelsea Finn
OffRL
31
0
0
12 May 2025
You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts
Hongkun Dou
Zeyu Li
Xingyu Jiang
Yiming Li
Lijun Yang
Wen Yao
Yue Deng
DiffM
38
0
0
12 May 2025
CHD: Coupled Hierarchical Diffusion for Long-Horizon Tasks
Ce Hao
Anxing Xiao
Zhiwei Xue
Harold Soh
46
0
0
12 May 2025
Efficient Robotic Policy Learning via Latent Space Backward Planning
Dongxiu Liu
Haoyi Niu
Zhihao Wang
Jinliang Zheng
Yinan Zheng
Zhonghong Ou
Jianming Hu
Jianxiong Li
Xianyuan Zhan
28
0
0
11 May 2025
Wasserstein Convergence of Score-based Generative Models under Semiconvexity and Discontinuous Gradients
Stefano Bruno
Sotirios Sabanis
DiffM
50
0
0
06 May 2025
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Jifeng Hu
Sili Huang
Zhengyuan Yang
Shengchao Hu
Li Shen
H. Chen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
OffRL
152
0
0
03 May 2025
Latent Diffusion Planning for Imitation Learning
Amber Xie
Oleh Rybkin
Dorsa Sadigh
Chelsea Finn
35
1
0
23 Apr 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
27
0
0
17 Apr 2025
Offline Reinforcement Learning with Discrete Diffusion Skills
Ruixi Qiao
Jie Cheng
Xingyuan Dai
Yonglin Tian
Yisheng Lv
OffRL
84
0
0
26 Mar 2025
COLSON: Controllable Learning-Based Social Navigation via Diffusion-Based Reinforcement Learning
Yuki Tomita
Kohei Matsumoto
Yuki Hyodo
Ryo Kurazume
66
0
0
18 Mar 2025
THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks
Chaoran Xiong
Litao Wei
Kehui Ma
Zhen Sun
Yan Xiang
Zihan Nan
Trieu-Kien Truong
Ling Pei
41
0
0
07 Mar 2025
Discrete Contrastive Learning for Diffusion Policies in Autonomous Driving
Kalle Kujanpää
Daulet Baimukashev
Farzeen Munir
Shoaib Azam
Tomasz Piotr Kucner
Joni Pajarinen
Ville Kyrki
41
0
0
07 Mar 2025
What Makes a Good Diffusion Planner for Decision Making?
Haofei Lu
Dongqi Han
Yifei Shen
Dongsheng Li
DiffM
41
3
0
01 Mar 2025
Fast Visuomotor Policies via Partial Denoising
Haojun Chen
Minghao Liu
Xiaojian Ma
Zailin Ma
Huimin Wu
...
Yuanpei Chen
Yifan Zhong
Mingzhi Wang
Qing Li
Yaodong Yang
VGen
34
0
0
01 Mar 2025
Uncertainty Comes for Free: Human-in-the-Loop Policies with Diffusion Models
Zhanpeng He
Yifeng Cao
M. Ciocarlie
64
0
0
26 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
Mingyang Sun
Pengxiang Ding
Weinan Zhang
Donglin Wang
OT
83
0
0
24 Feb 2025
Reward-Safety Balance in Offline Safe RL via Diffusion Regularization
Junyu Guo
Zhi Zheng
Donghao Ying
Ming Jin
Shangding Gu
C. Spanos
Javad Lavaei
OffRL
56
0
0
18 Feb 2025
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi Ma
DiffM
93
23
0
17 Feb 2025
Maximize Your Diffusion: A Study into Reward Maximization and Alignment for Diffusion-based Control
Dom Huh
P. Mohapatra
89
1
0
16 Feb 2025
Habitizing Diffusion Planning for Efficient and Effective Decision Making
Haofei Lu
Yifei Shen
Dongsheng Li
Junliang Xing
Dongqi Han
67
0
0
10 Feb 2025
Skill Expansion and Composition in Parameter Space
Tenglong Liu
J. Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
58
4
0
09 Feb 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
60
2
0
29 Jan 2025
Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation
Fei Zhao
Xueliang Zhang
36
0
0
25 Dec 2024
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
Xiu Yuan
Tongzhou Mu
Stone Tao
Yunhao Fang
Mengke Zhang
H. Su
OffRL
68
3
0
18 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
Mohan Kumar Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL
OnRL
95
4
0
09 Dec 2024
Planning-Guided Diffusion Policy Learning for Generalizable Contact-Rich Bimanual Manipulation
Xuanlin Li
Tong Zhao
Xinghao Zhu
Jiuguang Wang
Tao Pang
Kuan Fang
85
4
0
03 Dec 2024
Unpacking the Individual Components of Diffusion Policy
Xiu Yuan
84
0
0
27 Nov 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
114
1
0
22 Nov 2024
So You Think You Can Scale Up Autonomous Robot Data Collection?
Suvir Mirchandani
Suneel Belkhale
Joey Hejna
Evelyn Choi
Md Sazzad Islam
Dorsa Sadigh
OffRL
40
5
0
04 Nov 2024
One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation
Zhendong Wang
Zeju Li
Ajay Mandlekar
Zhenjia Xu
Jiaojiao Fan
...
Yuke Zhu
Yogesh Balaji
Mingyuan Zhou
Xuan Li
Yu Zeng
37
16
0
28 Oct 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
36
0
0
27 Oct 2024
GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Kyle Hatch
Ashwin Balakrishna
Oier Mees
Suraj Nair
Seohong Park
...
Masha Itkina
Benjamin Eysenbach
Sergey Levine
Thomas Kollar
Benjamin Burchfiel
65
2
0
26 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
57
0
0
23 Oct 2024
On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow
Tonghan Wang
Heng Dong
Yanchen Jiang
David C. Parkes
Milind Tambe
DiffM
47
2
0
17 Oct 2024
Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Mitsuhiko Nakamoto
Oier Mees
Aviral Kumar
Sergey Levine
OffRL
79
13
0
17 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
33
1
0
15 Oct 2024
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning
Jianxiong Li
Zhihao Wang
Jinliang Zheng
Xiaoai Zhou
Guanming Wang
...
Yu Liu
Jingjing Liu
Ya-Qin Zhang
Junzhi Yu
Xianyuan Zhan
40
2
0
02 Oct 2024
Sampling from Energy-based Policies using Diffusion
V. Jain
Tara Akhound-Sadegh
Siamak Ravanbakhsh
DiffM
40
1
0
02 Oct 2024
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Chenyou Fan
Chenjia Bai
Zhao Shan
Haoran He
Yang Zhang
Zhen Wang
33
3
0
30 Sep 2024
Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization
Haoran Li
Zhennan Jiang
Yuhui Chen
Dongbin Zhao
OffRL
25
2
0
28 Sep 2024
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Zhao Shan
Chenyou Fan
Shuang Qiu
Jiyuan Shi
Chenjia Bai
40
4
0
09 Sep 2024
Diffusion Policy Policy Optimization
Allen Z. Ren
Justin Lidard
Lars L. Ankile
Anthony Simeonov
Pulkit Agrawal
Anirudha Majumdar
Benjamin Burchfiel
Hongkai Dai
Max Simchowitz
45
36
0
01 Sep 2024
Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling
Yuejiang Liu
Jubayer Ibn Hamid
Annie Xie
Yoonho Lee
Maximilian Du
Chelsea Finn
OffRL
65
5
0
30 Aug 2024
MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning
Yifu Yuan
Zhenrui Zheng
Zibin Dong
Jianye Hao
OffRL
52
1
0
28 Aug 2024
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
56
3
0
27 Aug 2024
1
2
3
Next