Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning

9 September 2018
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson

Papers citing "Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning"

50 / 187 papers shown
Proxy-Free GFlowNet
Ruishuo Chen
Xun Wang
Rui Hu
Zhuoran Li
Longbo Huang
78
0
0
26 May 2025
PPO-BR: Dual-Signal Entropy-Reward Adaptation for Trust Region Policy Optimization
Ben Rahman
78
0
0
23 May 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
Volkan Cevher
222
1
0
27 Feb 2025
RIZE: Regularized Imitation Learning via Distributional Reinforcement Learning
Adib Karimi
Mohammad Mehdi Ebadzadeh
OOD
85
0
0
27 Feb 2025
RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation
Minwoo Kim
Geunsik Bae
Jinwoo Lee
Woojae Shin
Changseung Kim
Myong-Yol Choi
Heejung Shin
H. Oh
236
0
0
04 Feb 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
191
18
0
28 Jan 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
Yirui Zhou
Xiaowei Liu
Xiaofeng Zhang
Yangchun Zhang
124
0
0
22 Jan 2025
SR-Reward: Taking The Path More Traveled
Seyed Mahdi Basiri Azad
Zahra Padar
Gabriel Kalweit
Joschka Boedecker
OffRL
185
0
0
04 Jan 2025
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
OffRL
135
0
0
03 Jan 2025
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory
Yangchun Zhang
Wang Zhou
Yirui Zhou
102
0
0
31 Dec 2024
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
Xiu Yuan
Tongzhou Mu
Stone Tao
Yunhao Fang
Mengke Zhang
H. Su
OffRL
147
8
0
18 Dec 2024
Inverse Delayed Reinforcement Learning
S. Zhan
Qingyuan Wu
Zhian Ruan
Frank Yang
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
148
0
0
04 Dec 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Tian Xu
Zhilong Zhang
Ruishuo Chen
Yihao Sun
Yang Yu
88
1
0
01 Nov 2024
Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Anit Kumar Sahu
Mubarak Shah
Vinay P. Namboodiri
Amrit Singh Bedi
139
1
0
01 Nov 2024
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Runzhe Wu
Yiding Chen
Gokul Swamy
Kianté Brantley
Wen Sun
DiffM
168
5
0
17 Oct 2024
Diffusion Imitation from Observation
Bo-Ruei Huang
Chun-Kai Yang
Chun-Mao Lai
Dai-Jie Wu
Shao-Hua Sun
91
4
0
07 Oct 2024
Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments
S. Zhan
Qingyuan Wu
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
112
1
0
04 Oct 2024
Imitating Language via Scalable Inverse Reinforcement Learning
Markus Wulfmeier
Michael Bloesch
Nino Vieillard
Arun Ahuja
Jorg Bornschein
...
Jost Tobias Springenberg
Nikola Momchev
Olivier Bachem
Matthieu Geist
Martin Riedmiller
123
10
0
02 Sep 2024
Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
Ashutosh Nayyar
OffRL
75
0
0
17 Aug 2024
On Causally Disentangled State Representation Learning for Reinforcement Learning based Recommender Systems
Siyu Wang
Xiaocong Chen
Lina Yao
CML
71
0
0
18 Jul 2024
Visually Robust Adversarial Imitation Learning from Videos with Contrastive Learning
Vittorio Giammarino
James Queeney
I. Paschalidis
90
2
0
18 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
104
1
0
09 Jun 2024
A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies
Md Mirajul Islam
Xi Yang
J. Hostetter
Adittya Soukarjya Saha
Min Chi
61
1
0
04 Jun 2024
Provably Efficient Off-Policy Adversarial Imitation Learning with Convergence Guarantees
Yilei Chen
Vittorio Giammarino
James Queeney
I. Paschalidis
70
0
0
26 May 2024
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai
Hsiang-Chun Wang
Ping-Chun Hsieh
Yu-Chiang Frank Wang
Min-Hung Chen
Shao-Hua Sun
91
9
0
25 May 2024
Efficient Imitation Learning with Conservative World Models
Victor Kolev
Rafael Rafailov
Kyle Hatch
Jiajun Wu
Chelsea Finn
OffRL
82
5
0
21 May 2024
DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
Tongzhou Mu
Minghua Liu
Hao Su
OffRL
104
4
0
25 Apr 2024
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Utsav Singh
Wesley A Suttle
Brian M Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
75
5
0
20 Apr 2024
Adversarial Imitation Learning via Boosting
Jonathan D. Chang
Dhruv Sreenivas
Yingbing Huang
Kianté Brantley
Wen Sun
61
3
0
12 Apr 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
100
0
0
21 Mar 2024
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory
Tianjiao Luo
Tim Pearce
Huayu Chen
Jianfei Chen
Jun Zhu
119
2
0
26 Feb 2024
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
Catherine Weaver
Chen Tang
Ce Hao
Kenta Kawamoto
Masayoshi Tomizuka
Wei Zhan
OffRL
84
0
0
22 Feb 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
97
10
0
06 Feb 2024
Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning
Chia-Cheng Chiang
Li-Cheng Lan
Wei-Fang Sun
Chien Feng
Cho-Jui Hsieh
Chun-Yi Lee
113
0
0
01 Feb 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn
Seong-Woong Shim
Byung-Jun Lee
56
0
0
18 Jan 2024
DiffAIL: Diffusion Adversarial Imitation Learning
Bingzheng Wang
Guoqiang Wu
Teng Pang
Yan Zhang
Yilong Yin
92
13
0
11 Dec 2023
Working Backwards: Learning to Place by Picking
Oliver Limoyo
Abhisek Konar
Trevor Ablett
Jonathan Kelly
F. Hogan
Gregory Dudek
106
0
0
04 Dec 2023
Domain Adaptive Imitation Learning with Visual Observation
Sungho Choi
Seungyul Han
Woojun Kim
Jongseong Chae
Whiyoung Jung
Young-Jin Sung
OOD
84
7
0
01 Dec 2023
Time-series Generation by Contrastive Imitation
Daniel Jarrett
Ioana Bica
M. Schaar
AI4TS
86
24
0
02 Nov 2023
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
Kai Yan
Alex Schwing
Yu-Xiong Wang
OffRL
107
0
0
02 Nov 2023
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories
Kai Yan
Alex Schwing
Yu-Xiong Wang
OffRL
128
5
0
02 Nov 2023
Explaining by Imitating: Understanding Decisions by Interpretable Policy Learning
Alihan Huyuk
Daniel Jarrett
M. Schaar
78
21
0
28 Oct 2023
Inverse Decision Modeling: Learning Interpretable Representations of Behavior
Daniel Jarrett
Alihan Huyuk
M. Schaar
AI4CE
94
28
0
28 Oct 2023
Robust Visual Imitation Learning with Inverse Dynamics Representations
Siyuan Li
Xun Wang
Rongchang Zuo
Kewu Sun
Lingfei Cui
Jishiyu Ding
Peng Liu
Zhe Ma
70
4
0
22 Oct 2023
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
97
10
0
09 Oct 2023
Imitation Learning from Observation through Optimal Transport
Wei-Di Chang
Scott Fujimoto
David Meger
Gregory Dudek
68
4
0
02 Oct 2023
Adversarial Imitation Learning from Visual Observations using Latent Information
Vittorio Giammarino
Tomas Landelius
I. Paschalidis
97
7
0
29 Sep 2023
HumanMimic: Learning Natural Locomotion and Transitions for Humanoid Robot via Wasserstein Adversarial Imitation
Annan Tang
Takuma Hiraoka
Naoki Hiraoka
Fan Shi
Kento Kawaharazuka
Kunio Kojima
K. Okada
Masayuki Inaba
101
29
0
25 Sep 2023
See to Touch: Learning Tactile Dexterity through Visual Incentives
Irmak Güzey
Yinlong Dai
Ben Evans
Soumith Chintala
Lerrel Pinto
109
37
0
21 Sep 2023
Conditional Kernel Imitation Learning for Continuous State Environments
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
A. Nayyar
75
0
0
24 Aug 2023