ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.02925
  4. Cited By
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward
  Bias in Adversarial Imitation Learning
v1v2 (latest)

Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning

9 September 2018
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
ArXiv (abs)PDFHTML

Papers citing "Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning"

50 / 187 papers shown
Title
Imitation Learning via Differentiable Physics
Imitation Learning via Differentiable Physics
Siwei Chen
Xiao Ma
Zhongwen Xu
PINNAI4CE
51
4
0
10 Jun 2022
Receding Horizon Inverse Reinforcement Learning
Receding Horizon Inverse Reinforcement Learning
Yiqing Xu
Wei Gao
David Hsu
92
14
0
09 Jun 2022
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Fan Luo
Xingchen Cao
Rong-Jun Qin
Yang Yu
111
3
0
01 Jun 2022
Scalable Multi-Agent Model-Based Reinforcement Learning
Scalable Multi-Agent Model-Based Reinforcement Learning
Vladimir Egorov
A. Shpilman
92
27
0
25 May 2022
Learning Energy Networks with Generalized Fenchel-Young Losses
Learning Energy Networks with Generalized Fenchel-Young Losses
Mathieu Blondel
Felipe Llinares-López
Robert Dadashi
Léonard Hussenot
Matthieu Geist
108
7
0
19 May 2022
A State-Distribution Matching Approach to Non-Episodic Reinforcement
  Learning
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning
Archit Sharma
Rehaan Ahmad
Chelsea Finn
OODOffRL
81
21
0
11 May 2022
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
Adam Gleave
Sam Toyer
87
13
0
22 Mar 2022
Vision-Based Manipulators Need to Also See from Their Hands
Vision-Based Manipulators Need to Also See from Their Hands
Kyle Hsu
Moo Jin Kim
Rafael Rafailov
Jiajun Wu
Chelsea Finn
106
49
0
15 Mar 2022
Learning Category-Level Generalizable Object Manipulation Policy via
  Generative Adversarial Self-Imitation Learning from Demonstrations
Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations
Hao Shen
Weikang Wan
He Wang
SSL
84
26
0
04 Mar 2022
Fail-Safe Adversarial Generative Imitation Learning
Fail-Safe Adversarial Generative Imitation Learning
Philipp Geiger
C. Straehle
GAN
69
2
0
03 Mar 2022
LobsDICE: Offline Learning from Observation via Stationary Distribution
  Correction Estimation
LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation
Geon-hyeong Kim
Jongmin Lee
Youngsoo Jang
Hongseok Yang
Kyungmin Kim
OffRL
121
17
0
28 Feb 2022
Efficient Learning of Safe Driving Policy via Human-AI Copilot
  Optimization
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
Quanyi Li
Zhenghao Peng
Bolei Zhou
165
59
0
17 Feb 2022
Rethinking ValueDice: Does It Really Improve Performance?
Rethinking ValueDice: Does It Really Improve Performance?
Ziniu Li
Tian Xu
Yang Yu
Zhimin Luo
OffRL
86
17
0
05 Feb 2022
Versatile Offline Imitation from Observations and Examples via
  Regularized State-Occupancy Matching
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching
Yecheng Jason Ma
Andrew Shen
Dinesh Jayaraman
Osbert Bastani
OffRL
93
36
0
04 Feb 2022
Iterated Reasoning with Mutual Information in Cooperative and Byzantine
  Decentralized Teaming
Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized Teaming
Sachin Konan
Esmaeil Seraj
Matthew C. Gombolay
LRM
115
27
0
20 Jan 2022
Parallelized and Randomized Adversarial Imitation Learning for
  Safety-Critical Self-Driving Vehicles
Parallelized and Randomized Adversarial Imitation Learning for Safety-Critical Self-Driving Vehicles
Won Joon Yun
Myungjae Shin
Soyi Jung
S. Kwon
Joongheon Kim
71
6
0
26 Dec 2021
Direct Behavior Specification via Constrained Reinforcement Learning
Direct Behavior Specification via Constrained Reinforcement Learning
Julien Roy
Roger Girgis
Joshua Romoff
Pierre-Luc Bacon
C. Pal
129
36
0
22 Dec 2021
Deterministic and Discriminative Imitation (D2-Imitation): Revisiting
  Adversarial Imitation for Sample Efficiency
Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Mingfei Sun
Sam Devlin
Katja Hofmann
Shimon Whiteson
58
4
0
11 Dec 2021
Generalized Decision Transformer for Offline Hindsight Information
  Matching
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
118
104
0
19 Nov 2021
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via
  Terminal State Regularization
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization
Youngwoon Lee
Joseph J. Lim
Anima Anandkumar
Yuke Zhu
OffRL
90
41
0
15 Nov 2021
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement
  Learning
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Sabela Ramos
Sertan Girgin
Léonard Hussenot
Damien Vincent
Hanna Yakubovich
...
Piotr Stańczyk
Raphaël Marinier
Jeremiah Harmsen
Olivier Pietquin
Nikola Momchev
OffRL
95
24
0
04 Nov 2021
Continuous Control with Action Quantization from Demonstrations
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
Matthieu Geist
Olivier Pietquin
OffRL
107
23
0
19 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior
  Engineering beyond Reward Maximization
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization
S. Gu
Manfred Diaz
Daniel Freeman
Hiroki Furuta
Seyed Kamyar Seyed Ghasemipour
Anton Raichuk
Byron David
Erik Frey
Erwin Coumans
Olivier Bachem
82
14
0
10 Oct 2021
OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via
  Distribution Matching
OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching
Hanako Hoshino
Keita Ota
Asako Kanezaki
Rio Yokota
OffRLOOD
68
19
0
09 Sep 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A
  Systematic Review and Future Directions
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
79
62
0
08 Sep 2021
A Pragmatic Look at Deep Imitation Learning
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
70
9
0
04 Aug 2021
Demonstration-Guided Reinforcement Learning with Learned Skills
Demonstration-Guided Reinforcement Learning with Learned Skills
Karl Pertsch
Youngwoon Lee
Yue Wu
Joseph J. Lim
OffRL
72
86
0
21 Jul 2021
Visual Adversarial Imitation Learning using Variational Models
Visual Adversarial Imitation Learning using Variational Models
Rafael Rafailov
Tianhe Yu
Aravind Rajeswaran
Chelsea Finn
SSL
118
50
0
16 Jul 2021
Recent Advances in Leveraging Human Guidance for Sequential
  Decision-Making Tasks
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
163
29
0
13 Jul 2021
Imitation by Predicting Observations
Imitation by Predicting Observations
Andrew Jaegle
Yury Sulsky
Arun Ahuja
Jake Bruce
Rob Fergus
Greg Wayne
45
12
0
08 Jul 2021
The MineRL BASALT Competition on Learning from Human Feedback
The MineRL BASALT Competition on Learning from Human Feedback
Rohin Shah
Cody Wild
Steven H. Wang
Neel Alex
Brandon Houghton
...
Stephanie Milani
Nicholay Topin
Pieter Abbeel
Stuart J. Russell
Anca Dragan
99
32
0
05 Jul 2021
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic
  Manipulation via Discretisation
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation
Stephen James
Kentaro Wada
Tristan Laidlow
Andrew J. Davison
106
135
0
23 Jun 2021
IQ-Learn: Inverse soft-Q Learning for Imitation
IQ-Learn: Inverse soft-Q Learning for Imitation
Divyansh Garg
Shuvam Chakraborty
Chris Cundy
Jiaming Song
Matthieu Geist
Stefano Ermon
133
189
0
23 Jun 2021
OptiDICE: Offline Policy Optimization via Stationary Distribution
  Correction Estimation
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Jongmin Lee
Wonseok Jeon
Byung-Jun Lee
J. Pineau
Kee-Eung Kim
OffRL
202
101
0
21 Jun 2021
Sample Efficient Social Navigation Using Inverse Reinforcement Learning
Sample Efficient Social Navigation Using Inverse Reinforcement Learning
Bobak H. Baghi
Gregory Dudek
42
5
0
18 Jun 2021
SoftDICE for Imitation Learning: Rethinking Off-policy Distribution
  Matching
SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching
Min Sun
Anuj Mahajan
Katja Hofmann
Shimon Whiteson
OffRL
60
12
0
06 Jun 2021
What Matters for Adversarial Imitation Learning?
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
Matthieu Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
128
78
0
01 Jun 2021
Q-attention: Enabling Efficient Learning for Vision-based Robotic
  Manipulation
Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation
Stephen James
Andrew J. Davison
96
129
0
31 May 2021
Hyperparameter Selection for Imitation Learning
Hyperparameter Selection for Imitation Learning
Léonard Hussenot
Marcin Andrychowicz
Damien Vincent
Robert Dadashi
Anton Raichuk
...
Sabela Ramos
Manu Orsini
Olivier Bachem
Matthieu Geist
Olivier Pietquin
121
18
0
25 May 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency
  Inference
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
56
8
0
03 May 2021
Reward function shape exploration in adversarial imitation learning: an
  empirical study
Reward function shape exploration in adversarial imitation learning: an empirical study
Yawei Wang
Xiu Li
43
4
0
14 Apr 2021
No Need for Interactions: Robust Model-Based Imitation Learning using
  Neural ODE
No Need for Interactions: Robust Model-Based Imitation Learning using Neural ODE
HaoChih Lin
Baopu Li
Xin Zhou
Jiankun Wang
Max Meng
45
6
0
03 Apr 2021
Scalable Visual Attribute Extraction through Hidden Layers of a Residual
  ConvNet
Scalable Visual Attribute Extraction through Hidden Layers of a Residual ConvNet
Andres Baloian
Garrett A. Warnell
J. M. Saavedra
FAtt
59
5
0
31 Mar 2021
Learning Lipschitz Feedback Policies from Expert Demonstrations:
  Closed-Loop Guarantees, Generalization and Robustness
Learning Lipschitz Feedback Policies from Expert Demonstrations: Closed-Loop Guarantees, Generalization and Robustness
Abed AlRahman Al Makdah
Vishaal Krishnan
Fabio Pasqualetti
38
0
0
30 Mar 2021
Replacing Rewards with Examples: Example-Based Policy Search via
  Recursive Classification
Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
139
50
0
23 Mar 2021
Offline Reinforcement Learning with Fisher Divergence Critic
  Regularization
Offline Reinforcement Learning with Fisher Divergence Critic Regularization
Ilya Kostrikov
Jonathan Tompson
Rob Fergus
Ofir Nachum
OffRL
154
309
0
14 Mar 2021
WFA-IRL: Inverse Reinforcement Learning of Autonomous Behaviors Encoded
  as Weighted Finite Automata
WFA-IRL: Inverse Reinforcement Learning of Autonomous Behaviors Encoded as Weighted Finite Automata
Tianyu Wang
Nikolay Atanasov
85
0
0
10 Mar 2021
Domain-Robust Visual Imitation Learning with Mutual Information
  Constraints
Domain-Robust Visual Imitation Learning with Mutual Information Constraints
Edoardo Cetin
Oya Celiktutan
OODDRL
77
21
0
08 Mar 2021
Off-Policy Imitation Learning from Observations
Off-Policy Imitation Learning from Observations
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
68
86
0
25 Feb 2021
Balancing Rational and Other-Regarding Preferences in
  Cooperative-Competitive Environments
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
74
5
0
24 Feb 2021
Previous
1234
Next