Genetic Imitation Learning by Reward Extrapolation

3 January 2023

Jianlong Zhou

Papers citing "Genetic Imitation Learning by Reward Extrapolation"

Title | Authors | Tags | Counts | Date
Quantum Imitation Learning | Zhihao Cheng, Kaining Zhang, Li Shen, Dacheng Tao | | 42, 1, 0 | 04 Apr 2023
Imitation Learning: Progress, Taxonomies and Challenges | Boyuan Zheng, Sunny Verma, Jianlong Zhou, Ivor Tsang, Fang Chen | | 145, 86, 0 | 23 Jun 2021
Off-Policy Imitation Learning from Observations | Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou | OffRL | 47, 86, 0 | 25 Feb 2021
Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator | Roy Eliya, J. Herrmann | | 31, 2, 0 | 17 Sep 2020
Inverse Reinforcement Learning with Multiple Ranked Experts | Pablo Samuel Castro, Shijian Li, Daqing Zhang | | 33, 13, 0 | 31 Jul 2019
Learning Reward Functions by Integrating Human Demonstrations and Preferences | Malayandi Palan, Nicholas C. Landolfi, Gleb Shevchuk, Dorsa Sadigh | | 51, 126, 0 | 21 Jun 2019
Goal-conditioned Imitation Learning | Yiming Ding, Carlos Florensa, Mariano Phielipp, Pieter Abbeel | | 62, 225, 0 | 13 Jun 2019
Imitation Learning from Video by Leveraging Proprioception | F. Torabi, Garrett A. Warnell, Peter Stone | | 48, 35, 0 | 22 May 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations | Daniel S. Brown, Wonjoon Goo, P. Nagarajan, S. Niekum | | 71, 355, 0 | 12 Apr 2019
Hindsight Generative Adversarial Imitation Learning | N. Liu, Tao Lu, Yinghao Cai, Boyao Li, Shuo Wang | | 65, 6, 0 | 19 Mar 2019
Playing hard exploration games by watching YouTube | Y. Aytar, Tobias Pfaff, David Budden, T. Paine, Ziyun Wang, Nando de Freitas | | 63, 270, 0 | 29 May 2018
End-to-end Driving via Conditional Imitation Learning | Felipe Codevilla, Matthias Muller, Antonio M. López, V. Koltun, Alexey Dosovitskiy | | 123, 1,066, 0 | 06 Oct 2017
Proximal Policy Optimization Algorithms | John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov | OffRL | 463, 19,006, 0 | 20 Jul 2017
Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation | YuXuan Liu, Abhishek Gupta, Pieter Abbeel, Sergey Levine | | 103, 380, 0 | 11 Jul 2017
Hindsight Experience Replay | Marcin Andrychowicz, Dwight Crow, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Joshua Tobin, Pieter Abbeel, Wojciech Zaremba | OffRL | 245, 2,326, 0 | 05 Jul 2017
Deep reinforcement learning from human preferences | Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei | | 151, 3,296, 0 | 12 Jun 2017
Time-Contrastive Networks: Self-Supervised Learning from Video | P. Sermanet, Corey Lynch, Yevgen Chebotar, Jasmine Hsu, Eric Jang, S. Schaal, Sergey Levine | SSL | 98, 826, 0 | 23 Apr 2017
One-Shot Imitation Learning | Yan Duan, Marcin Andrychowicz, Bradly C. Stadie, Jonathan Ho, Jonas Schneider, Ilya Sutskever, Pieter Abbeel, Wojciech Zaremba | OffRL | 77, 686, 0 | 21 Mar 2017
Generative Adversarial Imitation Learning | Jonathan Ho, Stefano Ermon | GAN | 131, 3,105, 0 | 10 Jun 2016
OpenAI Gym | Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, Wojciech Zaremba | OffRL, ODL | 214, 5,075, 0 | 05 Jun 2016
Learning to Search Better Than Your Teacher | Kai-Wei Chang, A. Krishnamurthy, Alekh Agarwal, Hal Daumé, John Langford | OffRL | 50, 231, 0 | 08 Feb 2015