1606.03476
Generative Adversarial Imitation Learning
10 June 2016
Jonathan Ho
Stefano Ermon
GAN
Papers citing
"Generative Adversarial Imitation Learning"
26 / 76 papers shown
Title
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
273
52
0
23 May 2024
Knowledge Graph Reasoning with Self-supervised Reinforcement Learning
Ying Ma
Owen Burns
Mingqiu Wang
Gang Li
Nan Du
Laurent El Shafey
Liqiang Wang
Izhak Shafran
H. Soltau
SSL
ReLM
OffRL
LRM
114
0
0
22 May 2024
Learning Manipulation Skills through Robot Chain-of-Thought with Sparse Failure Guidance
Kaifeng Zhang
Zhao-Heng Yin
Weirui Ye
Yang Gao
105
4
0
22 May 2024
ATOM: Attention Mixer for Efficient Dataset Distillation
Samir Khaki
A. Sajedi
Kai Wang
Lucy Z. Liu
Y. Lawryshyn
Konstantinos N. Plataniotis
118
3
0
02 May 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
Yi Xu
Weiran Shen
Xiao Zhang
Jun Xu
OffRL
175
0
0
24 Mar 2024
Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Peihong Yu
Manav Mishra
Alec Koppel
Carl E. Busart
Priya Narayan
Dinesh Manocha
Amrit Singh Bedi
Pratap Tokekar
69
4
0
13 Mar 2024
Generalizable Imitation Learning Through Pre-Trained Representations
Wei-Di Chang
F. Hogan
David Meger
Gregory Dudek
77
1
0
15 Nov 2023
Synthesizing Physically Plausible Human Motions in 3D Scenes
Liang Pan
Jingbo Wang
Buzhen Huang
Junyu Zhang
Haofan Wang
Xu Tang
Yangang Wang
62
29
0
17 Aug 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
OffRL
64
18
0
06 Feb 2023
Goal-conditioned dual-action imitation learning for dexterous dual-arm robot manipulation
Heecheol Kim
Yoshiyuki Ohmura
Yasuo Kuniyoshi
69
28
0
18 Mar 2022
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
50
84
0
12 Aug 2020
Residual Force Control for Agile Human Behavior Imitation and Extended Motion Synthesis
Ye Yuan
Kris Kitani
131
77
0
12 Jun 2020
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSL
OffRL
137
201
0
10 Jun 2020
SQUIRL: Robust and Efficient Learning from Video Demonstration of Long-Horizon Robotic Manipulation Tasks
Bohan Wu
Feng Xu
Zhanpeng He
Abhi Gupta
Peter K. Allen
OffRL
136
13
0
10 Mar 2020
Compensation for undefined behaviors during robot task execution by switching controllers depending on embedded dynamics in RNN
Kanata Suzuki
Hiroki Mori
T. Ogata
68
11
0
10 Mar 2020
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Wen Sun
Arun Venkatraman
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
130
235
0
03 Mar 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
158
472
0
28 Feb 2017
NIPS 2016 Tutorial: Generative Adversarial Networks
Ian Goodfellow
GAN
165
1,725
0
31 Dec 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
223
5,077
0
05 Jun 2016
Model-Free Imitation Learning with Policy Optimization
Jonathan Ho
Jayesh K. Gupta
Stefano Ermon
47
149
0
26 May 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn
Sergey Levine
Pieter Abbeel
108
949
0
01 Mar 2016
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
99
3,414
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
277
6,776
0
19 Feb 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.8K
150,115
0
22 Dec 2014
Continuous Inverse Optimal Control with Locally Optimal Examples
Sergey Levine
V. Koltun
OffRL
85
331
0
18 Jun 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
222
3,221
0
02 Nov 2010