ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.04281
  4. Cited By
Integrating Behavior Cloning and Reinforcement Learning for Improved
  Performance in Dense and Sparse Reward Environments
v1v2 (latest)

Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments

9 October 2019
Vinicius G. Goecks
Gregory M. Gremillion
Vernon J. Lawhern
J. Valasek
Nicholas R. Waytowich
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments"

19 / 19 papers shown
Title
Learn What Not to Learn: Action Elimination with Deep Reinforcement
  Learning
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Tom Zahavy
Matan Haroush
Nadav Merlis
D. Mankowitz
Shie Mannor
102
191
0
06 Sep 2018
Cycle-of-Learning for Autonomous Systems from Human Interaction
Cycle-of-Learning for Autonomous Systems from Human Interaction
Nicholas R. Waytowich
Vinicius G. Goecks
Vernon J. Lawhern
45
20
0
28 Aug 2018
Learning Dexterous In-Hand Manipulation
Learning Dexterous In-Hand Manipulation
OpenAI OpenAI
Marcin Andrychowicz
Bowen Baker
Maciek Chociej
Rafal Jozefowicz
...
Szymon Sidor
Joshua Tobin
Peter Welinder
Lilian Weng
Wojciech Zaremba
169
1,884
0
01 Aug 2018
Observe and Look Further: Achieving Consistent Performance on Atari
Observe and Look Further: Achieving Consistent Performance on Atari
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
...
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
63
121
0
29 May 2018
Reinforcement Learning from Imperfect Demonstrations
Reinforcement Learning from Imperfect Demonstrations
Yang Gao
Huazhe Xu
Ji Lin
Feng Yu
Sergey Levine
Trevor Darrell
74
202
0
14 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,432
0
04 Jan 2018
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
72
272
0
28 Sep 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
102
789
0
28 Sep 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning
  and Demonstrations
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
E. Todorov
Sergey Levine
146
1,101
0
28 Sep 2017
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics
  Problems with Sparse Rewards
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Matej Vecerík
Todd Hester
Jonathan Scholz
Fumin Wang
Olivier Pietquin
Bilal Piot
N. Heess
Thomas Rothörl
Thomas Lampe
Martin Riedmiller
OffRL
99
669
0
27 Jul 2017
Trial without Error: Towards Safe Reinforcement Learning via Human
  Intervention
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
72
231
0
17 Jul 2017
AirSim: High-Fidelity Visual and Physical Simulation for Autonomous
  Vehicles
AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles
S. Shah
Debadeepta Dey
Chris Lovett
Ashish Kapoor
96
2,007
0
15 May 2017
Accurately and Efficiently Interpreting Human-Robot Instructions of
  Varying Granularities
Accurately and Efficiently Interpreting Human-Robot Instructions of Varying Granularities
Dilip Arumugam
Siddharth Karamcheti
N. Gopalan
Lawson L. S. Wong
Stefanie Tellex
LM&Ro
61
60
0
21 Apr 2017
Interactive Learning from Policy-Dependent Human Feedback
Interactive Learning from Policy-Dependent Human Feedback
J. MacGlashan
Mark K. Ho
R. Loftin
Bei Peng
Guan Wang
David L. Roberts
Matthew E. Taylor
Michael L. Littman
97
306
0
21 Jan 2017
Generative Adversarial Imitation Learning
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
165
3,125
0
10 Jun 2016
Fast and Accurate Deep Network Learning by Exponential Linear Units
  (ELUs)
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)
Djork-Arné Clevert
Thomas Unterthiner
Sepp Hochreiter
311
5,539
0
23 Nov 2015
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
233
3,796
0
18 Nov 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
330
13,295
0
09 Sep 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,433
0
22 Dec 2014
1