ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.02256
  4. Cited By
A Divergence Minimization Perspective on Imitation Learning Methods

A Divergence Minimization Perspective on Imitation Learning Methods

6 November 2019
Seyed Kamyar Seyed Ghasemipour
R. Zemel
S. Gu
ArXivPDFHTML

Papers citing "A Divergence Minimization Perspective on Imitation Learning Methods"

25 / 25 papers shown
Title
Online Episodic Convex Reinforcement Learning
Online Episodic Convex Reinforcement Learning
B. Moreno
Khaled Eldowa
Pierre Gaillard
Margaux Brégère
Nadia Oudjane
OffRL
94
0
0
12 May 2025
On the Effective Horizon of Inverse Reinforcement Learning
On the Effective Horizon of Inverse Reinforcement Learning
Yiqing Xu
Finale Doshi-Velez
David Hsu
72
0
0
21 Feb 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
105
16
0
28 Jan 2025
SR-Reward: Taking The Path More Traveled
SR-Reward: Taking The Path More Traveled
Seyed Mahdi Basiri Azad
Zahra Padar
Gabriel Kalweit
Joschka Boedecker
OffRL
104
0
0
04 Jan 2025
Few-Shot Task Learning through Inverse Generative Modeling
Few-Shot Task Learning through Inverse Generative Modeling
Aviv Netanyahu
Yilun Du
Antonia Bronars
Jyothish Pari
J. Tenenbaum
Tianmin Shu
Pulkit Agrawal
81
2
0
07 Nov 2024
DITTO: Offline Imitation Learning with World Models
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
Ingmar Posner
OffRL
37
18
0
06 Feb 2023
SQUIRL: Robust and Efficient Learning from Video Demonstration of
  Long-Horizon Robotic Manipulation Tasks
SQUIRL: Robust and Efficient Learning from Video Demonstration of Long-Horizon Robotic Manipulation Tasks
Bohan Wu
Feng Xu
Zhanpeng He
Abhi Gupta
Peter K. Allen
OffRL
115
13
0
10 Mar 2020
Efficient Exploration via State Marginal Matching
Efficient Exploration via State Marginal Matching
Lisa Lee
Benjamin Eysenbach
Emilio Parisotto
Eric Xing
Sergey Levine
Ruslan Salakhutdinov
94
242
0
12 Jun 2019
Imitation Learning as $f$-Divergence Minimization
Imitation Learning as fff-Divergence Minimization
Liyiming Ke
Sanjiban Choudhury
Matt Barnes
Wen Sun
Gilwoo Lee
S. Srinivasa
VLM
50
161
0
30 May 2019
Formal Limitations on the Measurement of Mutual Information
Formal Limitations on the Measurement of Mutual Information
David A. McAllester
K. Stratos
SSL
53
275
0
10 Nov 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial
  and Review
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE
BDL
51
667
0
02 May 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Yuke Zhu
Ziyun Wang
J. Merel
Andrei A. Rusu
Tom Erez
...
S. Tunyasuvunakool
János Kramár
R. Hadsell
Nando de Freitas
N. Heess
SSL
57
317
0
26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
194
8,236
0
04 Jan 2018
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
96
746
0
30 Oct 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning
  and Demonstrations
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
E. Todorov
Sergey Levine
98
1,079
0
28 Sep 2017
Improved Training of Wasserstein GANs
Improved Training of Wasserstein GANs
Ishaan Gulrajani
Faruk Ahmed
Martín Arjovsky
Vincent Dumoulin
Aaron Courville
GAN
126
9,509
0
31 Mar 2017
A Connection between Generative Adversarial Networks, Inverse
  Reinforcement Learning, and Energy-Based Models
A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models
Chelsea Finn
Paul Christiano
Pieter Abbeel
Sergey Levine
OffRL
AI4CE
GAN
44
353
0
11 Nov 2016
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Mohammad Norouzi
Samy Bengio
Zhiwen Chen
Navdeep Jaitly
M. Schuster
Yonghui Wu
Dale Schuurmans
59
253
0
01 Sep 2016
Generative Adversarial Imitation Learning
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
111
3,084
0
10 Jun 2016
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
166
5,048
0
05 Jun 2016
f-GAN: Training Generative Neural Samplers using Variational Divergence
  Minimization
f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization
Sebastian Nowozin
Botond Cseke
Ryota Tomioka
GAN
86
1,648
0
02 Jun 2016
Training generative neural networks via Maximum Mean Discrepancy
  optimization
Training generative neural networks via Maximum Mean Discrepancy optimization
Gintare Karolina Dziugaite
Daniel M. Roy
Zoubin Ghahramani
GAN
73
528
0
14 May 2015
Generative Moment Matching Networks
Generative Moment Matching Networks
Yujia Li
Kevin Swersky
R. Zemel
OOD
GAN
88
844
0
10 Feb 2015
A Reduction of Imitation Learning and Structured Prediction to No-Regret
  Online Learning
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
152
3,196
0
02 Nov 2010
Estimating divergence functionals and the likelihood ratio by convex
  risk minimization
Estimating divergence functionals and the likelihood ratio by convex risk minimization
X. Nguyen
Martin J. Wainwright
Michael I. Jordan
149
799
0
04 Sep 2008
1