2112.06054
Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
11 December 2021
Mingfei Sun
Sam Devlin
Katja Hofmann
Shimon Whiteson
Papers citing
"Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency"
14 / 14 papers shown
Title
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
89
129
0
08 Jun 2020
Imitation Learning via Off-Policy Distribution Matching
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
OOD
OffRL
158
205
0
10 Dec 2019
A Divergence Minimization Perspective on Imitation Learning Methods
Seyed Kamyar Seyed Ghasemipour
R. Zemel
S. Gu
83
250
0
06 Nov 2019
Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation
Ruohan Wang
C. Ciliberto
P. Amadori
Y. Demiris
73
62
0
16 May 2019
Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation
Qiang Liu
Lihong Li
Ziyang Tang
Dengyong Zhou
OffRL
174
356
0
29 Oct 2018
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
108
259
0
09 Sep 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
198
5,226
0
26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,432
0
04 Jan 2018
Improved Training of Wasserstein GANs
Ishaan Gulrajani
Faruk Ahmed
Martín Arjovsky
Vincent Dumoulin
Aaron Courville
GAN
230
9,568
0
31 Mar 2017
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
165
3,125
0
10 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
225
5,087
0
05 Jun 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
332
13,295
0
09 Sep 2015
On Convergence of Emphatic Temporal-Difference Learning
Huizhen Yu
OffRL
83
73
0
08 Jun 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
135
3,442
0
08 Jun 2015