2307.04726
v4 (latest)
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
10 July 2023
S. E. Ada
Erhan Öztop
Emre Ugur
Author Contacts:
ece.ada@bogazici.edu.tr
erhan.oztop@otri.osaka-u.ac.jp
emre.ugur@bogazici.edu.tr
OffRL
Papers citing
"Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning"
40 / 40 papers shown
Title
Diff-DAgger: Uncertainty Estimation with Diffusion Policy for Robotic Manipulation
Sung-Wook Lee
Yen-Ling Kuo
83
4
0
18 Oct 2024
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
95
2
0
12 Oct 2023
Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
Tony Zhao
Vikash Kumar
Sergey Levine
Chelsea Finn
92
629
0
23 Apr 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Cheng Chi
Zhenjia Xu
S. Feng
Eric A. Cousineau
Yilun Du
Benjamin Burchfiel
Russ Tedrake
Shuran Song
347
1,189
0
07 Mar 2023
Rethinking Real-world Image Deraining via An Unpaired Degradation-Conditioned Diffusion Model
Yiyang Shen
Mingqiang Wei
Yongzhen Wang
H. Xie
F. Wang
DiffM
18
7
0
23 Jan 2023
Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion Models
Jun Yue
Leyuan Fang
Shaobo Xia
Yue Deng
Jiayi Ma
DiffM
74
103
0
19 Jan 2023
Denoising Diffusion Probabilistic Models for Generation of Realistic Fully-Annotated Microscopy Image Data Sets
Dennis Eschweiler
Rüveyda Yilmaz
Matisse Baumann
Ina Laube
Rijo Roy
Abin Jose
Daniel Brückner
Johannes Stegmaier
DiffM
MedIm
57
19
0
02 Jan 2023
Lifelong Reinforcement Learning with Modulating Masks
Eseoghene Ben-Iwhiwhu
Saptarshi Nath
Praveen K. Pilly
Soheil Kolouri
Andrea Soltoggio
CLL
OffRL
66
22
0
21 Dec 2022
Schrödinger's Bat: Diffusion Models Sometimes Generate Polysemous Words in Superposition
Jennifer C. White
Ryan Cotterell
DiffM
60
5
0
23 Nov 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Tengjiao Wang
Ming-Hsuan Yang
DiffM
MedIm
357
1,385
0
02 Sep 2022
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
Zhendong Wang
Jonathan J. Hunt
Mingyuan Zhou
OffRL
98
383
0
12 Aug 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
305
686
0
20 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
877
12,973
0
04 Mar 2022
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
71
21
0
09 Nov 2021
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
288
910
0
12 Oct 2021
Implicit Behavioral Cloning
Peter R. Florence
Corey Lynch
Andy Zeng
Oscar Ramirez
Ayzaan Wahid
Laura Downs
Adrian S. Wong
Johnny Lee
Igor Mordatch
Jonathan Tompson
OffRL
115
389
0
01 Sep 2021
Offline RL Without Off-Policy Evaluation
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
OffRL
86
169
0
16 Jun 2021
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
130
822
0
12 Jun 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
136
1,642
0
02 Jun 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
270
433
0
16 Feb 2021
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
645
18,096
0
19 Jun 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
107
612
0
16 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
140
1,824
0
08 Jun 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
799
42,055
0
28 May 2020
MOPO: Model-based Offline Policy Optimization
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
76
770
0
27 May 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
558
2,029
0
04 May 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
223
1,377
0
15 Apr 2020
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Shangtong Zhang
Bo Liu
Shimon Whiteson
OffRL
56
103
0
29 Jan 2020
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
89
687
0
26 Nov 2019
Generalization in Transfer Learning
S. E. Ada
Emre Ugur
H. L. Akin
43
18
0
03 Sep 2019
Generative Modeling by Estimating Gradients of the Data Distribution
Yang Song
Stefano Ermon
SyDa
DiffM
258
3,916
0
12 Jul 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
132
1,060
0
03 Jun 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
234
1,613
0
07 Dec 2018
Improving Reconstruction Autoencoder Out-of-distribution Detection with Mahalanobis Distance
Taylor Denouden
Rick Salay
Krzysztof Czarnecki
Vahdat Abdelzad
Buu Phan
Sachin Vernekar
OODD
58
122
0
06 Dec 2018
Continual Lifelong Learning with Neural Networks: A Review
G. I. Parisi
Ronald Kemker
Jose L. Part
Christopher Kanan
S. Wermter
KELM
CLL
193
2,888
0
21 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
311
8,352
0
04 Jan 2018
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
223
5,077
0
05 Jun 2016
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
170
7,641
0
22 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
320
13,248
0
09 Sep 2015
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDa
DiffM
303
6,949
0
12 Mar 2015