A Divergence Minimization Perspective on Imitation Learning Methods

6 November 2019

Papers citing "A Divergence Minimization Perspective on Imitation Learning Methods"

25 / 25 papers shown

Title
Online Episodic Convex Reinforcement Learning B. Moreno Khaled Eldowa Pierre Gaillard Margaux Brégère Nadia Oudjane OffRL 94 0 0 12 May 2025
On the Effective Horizon of Inverse Reinforcement Learning Yiqing Xu Finale Doshi-Velez David Hsu 72 0 0 21 Feb 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning Hao Sun M. Schaar 105 16 0 28 Jan 2025
SR-Reward: Taking The Path More Traveled Seyed Mahdi Basiri Azad Zahra Padar Gabriel Kalweit Joschka Boedecker OffRL 104 0 0 04 Jan 2025
Few-Shot Task Learning through Inverse Generative Modeling Aviv Netanyahu Yilun Du Antonia Bronars Jyothish Pari J. Tenenbaum Tianmin Shu Pulkit Agrawal 81 2 0 07 Nov 2024
DITTO: Offline Imitation Learning with World Models Branton DeMoss Paul Duckworth Nick Hawes Ingmar Posner Ingmar Posner OffRL 37 18 0 06 Feb 2023
SQUIRL: Robust and Efficient Learning from Video Demonstration of Long-Horizon Robotic Manipulation Tasks Bohan Wu Feng Xu Zhanpeng He Abhi Gupta Peter K. Allen OffRL 115 13 0 10 Mar 2020
Efficient Exploration via State Marginal Matching Lisa Lee Benjamin Eysenbach Emilio Parisotto Eric Xing Sergey Levine Ruslan Salakhutdinov 94 242 0 12 Jun 2019
Imitation Learning as $f$ -Divergence Minimization Liyiming Ke Sanjiban Choudhury Matt Barnes Wen Sun Gilwoo Lee S. Srinivasa VLM 50 161 0 30 May 2019
Formal Limitations on the Measurement of Mutual Information David A. McAllester K. Stratos SSL 53 275 0 10 Nov 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review Sergey Levine AI4CE BDL 51 667 0 02 May 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills Yuke Zhu Ziyun Wang J. Merel Andrei A. Rusu Tom Erez ... S. Tunyasuvunakool János Kramár R. Hadsell Nando de Freitas N. Heess SSL 57 317 0 26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine 194 8,236 0 04 Jan 2018
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning Justin Fu Katie Z Luo Sergey Levine 96 746 0 30 Oct 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations Aravind Rajeswaran Vikash Kumar Abhishek Gupta Giulia Vezzani John Schulman E. Todorov Sergey Levine 98 1,079 0 28 Sep 2017
Improved Training of Wasserstein GANs Ishaan Gulrajani Faruk Ahmed Martín Arjovsky Vincent Dumoulin Aaron Courville GAN 126 9,509 0 31 Mar 2017
A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models Chelsea Finn Paul Christiano Pieter Abbeel Sergey Levine OffRL AI4CE GAN 44 353 0 11 Nov 2016
Reward Augmented Maximum Likelihood for Neural Structured Prediction Mohammad Norouzi Samy Bengio Zhiwen Chen Navdeep Jaitly M. Schuster Yonghui Wu Dale Schuurmans 59 253 0 01 Sep 2016
Generative Adversarial Imitation Learning Jonathan Ho Stefano Ermon GAN 111 3,084 0 10 Jun 2016
OpenAI Gym Greg Brockman Vicki Cheung Ludwig Pettersson Jonas Schneider John Schulman Jie Tang Wojciech Zaremba OffRL ODL 166 5,048 0 05 Jun 2016
f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization Sebastian Nowozin Botond Cseke Ryota Tomioka GAN 86 1,648 0 02 Jun 2016
Training generative neural networks via Maximum Mean Discrepancy optimization Gintare Karolina Dziugaite Daniel M. Roy Zoubin Ghahramani GAN 73 528 0 14 May 2015
Generative Moment Matching Networks Yujia Li Kevin Swersky R. Zemel OOD GAN 88 844 0 10 Feb 2015
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning Stéphane Ross Geoffrey J. Gordon J. Andrew Bagnell OffRL 152 3,196 0 02 Nov 2010
Estimating divergence functionals and the likelihood ratio by convex risk minimization X. Nguyen Martin J. Wainwright Michael I. Jordan 149 799 0 04 Sep 2008