Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.13855
Cited By
Diffusing States and Matching Scores: A New Framework for Imitation Learning
17 October 2024
Runzhe Wu
Yiding Chen
Gokul Swamy
Kianté Brantley
Wen Sun
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Diffusing States and Matching Scores: A New Framework for Imitation Learning"
50 / 65 papers shown
Title
Your Learned Constraint is Secretly a Backward Reachable Tube
Mohamad Qadri
Gokul Swamy
Jonathan Francis
Michael Kaess
Andrea Bajcsy
92
3
0
26 Jan 2025
Diffusion Imitation from Observation
Bo-Ruei Huang
Chun-Kai Yang
Chun-Mao Lai
Dai-Jie Wu
Shao-Hua Sun
78
4
0
07 Oct 2024
The Central Role of the Loss Function in Reinforcement Learning
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
240
10
0
19 Sep 2024
Imitating Language via Scalable Inverse Reinforcement Learning
Markus Wulfmeier
Michael Bloesch
Nino Vieillard
Arun Ahuja
Jorg Bornschein
...
Jost Tobias Springenberg
Nikola Momchev
Olivier Bachem
Matthieu Geist
Martin Riedmiller
82
10
0
02 Sep 2024
EvIL: Evolution Strategies for Generalisable Imitation Learning
Silvia Sapora
Gokul Swamy
Chris Xiaoxuan Lu
Yee Whye Teh
Jakob Nicolaus Foerster
59
6
0
15 Jun 2024
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
71
4
0
13 Jun 2024
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song
J. Andrew Bagnell
Aarti Singh
OffRL
109
2
0
11 Jun 2024
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai
Hsiang-Chun Wang
Ping-Chun Hsieh
Yu-Chiang Frank Wang
Min-Hung Chen
Shao-Hua Sun
67
9
0
25 May 2024
Octo: An Open-Source Generalist Robot Policy
Octo Model Team
Dibya Ghosh
Homer Walke
Karl Pertsch
Kevin Black
...
Quan Vuong
Ted Xiao
Dorsa Sadigh
Chelsea Finn
Sergey Levine
182
422
0
20 May 2024
Adversarial Imitation Learning via Boosting
Jonathan D. Chang
Dhruv Sreenivas
Yingbing Huang
Kianté Brantley
Wen Sun
29
3
0
12 Apr 2024
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
Carmelo Sferrazza
Dun-Ming Huang
Xingyu Lin
Youngwoon Lee
Pieter Abbeel
101
44
0
15 Mar 2024
Hybrid Inverse Reinforcement Learning
Juntao Ren
Gokul Swamy
Zhiwei Steven Wu
J. Andrew Bagnell
Sanjiban Choudhury
59
20
0
13 Feb 2024
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
102
12
0
11 Feb 2024
DiffAIL: Diffusion Adversarial Imitation Learning
Bingzheng Wang
Guoqiang Wu
Teng Pang
Yan Zhang
Yilong Yin
58
12
0
11 Dec 2023
Provable Guarantees for Generative Behavior Cloning: Bridging Low-Level Stability and High-Level Behavior
Adam Block
Ali Jadbabaie
Daniel Pfrommer
Max Simchowitz
Russ Tedrake
DiffM
69
25
0
27 Jul 2023
Provable benefits of score matching
Chirag Pabbaraju
Dhruv Rohatgi
A. Sevekari
Holden Lee
Ankur Moitra
Andrej Risteski
51
9
0
03 Jun 2023
Extracting Reward Functions from Diffusion Models
Felipe Nuti
Tim Franzmeyer
João F. Henriques
64
14
0
01 Jun 2023
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Kaiwen Wang
Kevin Zhou
Runzhe Wu
Nathan Kallus
Wen Sun
OffRL
69
19
0
25 May 2023
Massively Scalable Inverse Reinforcement Learning in Google Maps
Matt Barnes
Matthew Abueg
Oliver F. Lange
Matt Deeds
Jason M. Trader
Denali Molitor
Markus Wulfmeier
S. O’Banion
52
6
0
18 May 2023
Inverse Reinforcement Learning without Reinforcement Learning
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Zhiwei Steven Wu
57
37
0
26 Mar 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Cheng Chi
Zhenjia Xu
S. Feng
Eric A. Cousineau
Yilun Du
Benjamin Burchfiel
Russ Tedrake
Shuran Song
347
1,175
0
07 Mar 2023
Diffusion Models are Minimax Optimal Distribution Estimators
Kazusato Oko
Shunta Akiyama
Taiji Suzuki
DiffM
83
94
0
03 Mar 2023
LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning
Firas Al-Hafez
Davide Tateo
Oleg Arenz
Guoping Zhao
Jan Peters
48
22
0
01 Mar 2023
Diffusion Model-Augmented Behavioral Cloning
Shangcheng Chen
Hsiang-Chun Wang
Ming-Hao Hsu
Chun-Mao Lai
Shao-Hua Sun
DiffM
97
31
0
26 Feb 2023
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit S. Sikchi
Qinqing Zheng
Amy Zhang
S. Niekum
OffRL
70
28
0
16 Feb 2023
Imitating Human Behaviour with Diffusion Models
Tim Pearce
Tabish Rashid
Anssi Kanervisto
David Bignell
Mingfei Sun
...
Sergio Valcarcel Macua
Shan Zheng Tan
Ida Momennejad
Katja Hofmann
Sam Devlin
DiffM
82
217
0
25 Jan 2023
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
68
600
0
10 Jan 2023
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
118
394
0
28 Nov 2022
Improved Analysis of Score-based Generative Modeling: User-Friendly Bounds under Minimal Smoothness Assumptions
Hongrui Chen
Holden Lee
Jianfeng Lu
DiffM
56
138
0
03 Nov 2022
Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions
Sitan Chen
Sinho Chewi
Jungshian Li
Yuanzhi Li
Adil Salim
Anru R. Zhang
DiffM
186
269
0
22 Sep 2022
Convergence for score-based generative modeling with polynomial complexity
Holden Lee
Jianfeng Lu
Yixin Tan
DiffM
49
136
0
13 Jun 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
300
684
0
20 May 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
377
6,859
0
13 Apr 2022
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
270
30,123
0
01 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
410
15,486
0
20 Dec 2021
IQ-Learn: Inverse soft-Q Learning for Imitation
Divyansh Garg
Shuvam Chakraborty
Chris Cundy
Jiaming Song
Matthieu Geist
Stefano Ermon
78
185
0
23 Jun 2021
Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Steven Wu
49
77
0
04 Mar 2021
MobILE: Model-Based Imitation Learning From Observation Alone
Rahul Kidambi
Jonathan D. Chang
Wen Sun
46
40
0
22 Feb 2021
Finite Sample Analysis of Minimax Offline Reinforcement Learning: Completeness, Fast Rates and First-Order Efficiency
Masatoshi Uehara
Masaaki Imaizumi
Nan Jiang
Nathan Kallus
Wen Sun
Tengyang Xie
OffRL
44
53
0
05 Feb 2021
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM
SyDa
330
6,453
0
26 Nov 2020
f-IRL: Inverse Reinforcement Learning via State Marginal Matching
Tianwei Ni
Harshit S. Sikchi
Yufei Wang
Tejus Gupta
Lisa Lee
Benjamin Eysenbach
75
73
0
09 Nov 2020
f
f
f
-GAIL: Learning
f
f
f
-Divergence for Generative Adversarial Imitation Learning
Xin Zhang
Jun Luo
Ziming Zhang
Zhi-Li Zhang
36
33
0
02 Oct 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
147
1,456
0
21 Sep 2020
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Paul Barde
Julien Roy
Wonseok Jeon
Joelle Pineau
C. Pal
Derek Nowrouzezahrai
30
27
0
23 Jun 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
604
18,036
0
19 Jun 2020
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
58
129
0
08 Jun 2020
Energy-Based Imitation Learning
Minghuan Liu
Tairan He
Minkai Xu
Weinan Zhang
45
48
0
20 Apr 2020
A Divergence Minimization Perspective on Imitation Learning Methods
Seyed Kamyar Seyed Ghasemipour
R. Zemel
S. Gu
69
249
0
06 Nov 2019
Generative Modeling by Estimating Gradients of the Data Distribution
Yang Song
Stefano Ermon
SyDa
DiffM
234
3,902
0
12 Jul 2019
Imitation Learning as
f
f
f
-Divergence Minimization
Liyiming Ke
Sanjiban Choudhury
Matt Barnes
Wen Sun
Gilwoo Lee
S. Srinivasa
VLM
58
161
0
30 May 2019
1
2
Next