ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.07182
  4. Cited By
Genetic Imitation Learning by Reward Extrapolation

Genetic Imitation Learning by Reward Extrapolation

3 January 2023
Boyuan Zheng
Jianlong Zhou
Fang Chen
ArXivPDFHTML

Papers citing "Genetic Imitation Learning by Reward Extrapolation"

21 / 21 papers shown
Title
Quantum Imitation Learning
Quantum Imitation Learning
Zhihao Cheng
Kaining Zhang
Li Shen
Dacheng Tao
42
1
0
04 Apr 2023
Imitation Learning: Progress, Taxonomies and Challenges
Imitation Learning: Progress, Taxonomies and Challenges
Boyuan Zheng
Sunny Verma
Jianlong Zhou
Ivor Tsang
Fang Chen
145
86
0
23 Jun 2021
Off-Policy Imitation Learning from Observations
Off-Policy Imitation Learning from Observations
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
47
86
0
25 Feb 2021
Evolutionary Selective Imitation: Interpretable Agents by Imitation
  Learning Without a Demonstrator
Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator
Roy Eliya
J. Herrmann
31
2
0
17 Sep 2020
Inverse Reinforcement Learning with Multiple Ranked Experts
Inverse Reinforcement Learning with Multiple Ranked Experts
Pablo Samuel Castro
Shijian Li
Daqing Zhang
33
13
0
31 Jul 2019
Learning Reward Functions by Integrating Human Demonstrations and
  Preferences
Learning Reward Functions by Integrating Human Demonstrations and Preferences
Malayandi Palan
Nicholas C. Landolfi
Gleb Shevchuk
Dorsa Sadigh
51
126
0
21 Jun 2019
Goal-conditioned Imitation Learning
Goal-conditioned Imitation Learning
Yiming Ding
Carlos Florensa
Mariano Phielipp
Pieter Abbeel
62
225
0
13 Jun 2019
Imitation Learning from Video by Leveraging Proprioception
Imitation Learning from Video by Leveraging Proprioception
F. Torabi
Garrett A. Warnell
Peter Stone
48
35
0
22 May 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement
  Learning from Observations
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
Daniel S. Brown
Wonjoon Goo
P. Nagarajan
S. Niekum
71
355
0
12 Apr 2019
Hindsight Generative Adversarial Imitation Learning
Hindsight Generative Adversarial Imitation Learning
N. Liu
Tao Lu
Yinghao Cai
Boyao Li
Shuo Wang
65
6
0
19 Mar 2019
Playing hard exploration games by watching YouTube
Playing hard exploration games by watching YouTube
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
63
270
0
29 May 2018
End-to-end Driving via Conditional Imitation Learning
End-to-end Driving via Conditional Imitation Learning
Felipe Codevilla
Matthias Muller
Antonio M. López
V. Koltun
Alexey Dosovitskiy
123
1,066
0
06 Oct 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
463
19,006
0
20 Jul 2017
Imitation from Observation: Learning to Imitate Behaviors from Raw Video
  via Context Translation
Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
YuXuan Liu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
103
380
0
11 Jul 2017
Hindsight Experience Replay
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
245
2,326
0
05 Jul 2017
Deep reinforcement learning from human preferences
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
151
3,296
0
12 Jun 2017
Time-Contrastive Networks: Self-Supervised Learning from Video
Time-Contrastive Networks: Self-Supervised Learning from Video
P. Sermanet
Corey Lynch
Yevgen Chebotar
Jasmine Hsu
Eric Jang
S. Schaal
Sergey Levine
SSL
98
826
0
23 Apr 2017
One-Shot Imitation Learning
One-Shot Imitation Learning
Yan Duan
Marcin Andrychowicz
Bradly C. Stadie
Jonathan Ho
Jonas Schneider
Ilya Sutskever
Pieter Abbeel
Wojciech Zaremba
OffRL
77
686
0
21 Mar 2017
Generative Adversarial Imitation Learning
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
131
3,105
0
10 Jun 2016
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
214
5,075
0
05 Jun 2016
Learning to Search Better Than Your Teacher
Learning to Search Better Than Your Teacher
Kai-Wei Chang
A. Krishnamurthy
Alekh Agarwal
Hal Daumé
John Langford
OffRL
50
231
0
08 Feb 2015
1