ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.10089
  4. Cited By
Overcoming Exploration in Reinforcement Learning with Demonstrations

Overcoming Exploration in Reinforcement Learning with Demonstrations

28 September 2017
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
    OffRL
ArXivPDFHTML

Papers citing "Overcoming Exploration in Reinforcement Learning with Demonstrations"

50 / 182 papers shown
Title
Natural Language-conditioned Reinforcement Learning with Inside-out Task
  Language Development and Translation
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation
Jing-Cheng Pang
Xinyi Yang
Sibei Yang
Yang Yu
29
8
0
18 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
37
163
0
06 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
40
62
0
02 Feb 2023
On Pathologies in KL-Regularized Reinforcement Learning from Expert
  Demonstrations
On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations
Tim G. J. Rudner
Cong Lu
Michael A. Osborne
Yarin Gal
Yee Whye Teh
OffRL
33
27
0
28 Dec 2022
Cross-Domain Transfer via Semantic Skill Imitation
Cross-Domain Transfer via Semantic Skill Imitation
Karl Pertsch
Ruta Desai
Vikash Kumar
Franziska Meier
Joseph J. Lim
Dhruv Batra
Akshara Rai
LM&Ro
16
18
0
14 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with
  Demonstrations
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
32
49
0
12 Dec 2022
Off-Policy Deep Reinforcement Learning Algorithms for Handling Various
  Robotic Manipulator Tasks
Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks
Altun Rzayev
Vahid Tavakol Aghaei
OffRL
21
0
0
11 Dec 2022
Reinforcement learning with Demonstrations from Mismatched Task under
  Sparse Reward
Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Yanjiang Guo
Jingyue Gao
Zheng Wu
Chengming Shi
Jianyu Chen
OffRL
26
4
0
03 Dec 2022
Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and
  Stable Online Fine-Tuning
Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning
Alex Beeson
Giovanni Montana
OffRL
OnRL
26
22
0
21 Nov 2022
Model-based Trajectory Stitching for Improved Offline Reinforcement
  Learning
Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Charles A. Hepburn
Giovanni Montana
OffRL
32
13
0
21 Nov 2022
Deep Reinforcement Learning Guided Improvement Heuristic for Job Shop
  Scheduling
Deep Reinforcement Learning Guided Improvement Heuristic for Job Shop Scheduling
Cong Zhang
Zhiguang Cao
Wen Song
Puay Siew Tan
Jie Zhang
22
17
0
20 Nov 2022
ToolFlowNet: Robotic Manipulation with Tools via Predicting Tool Flow
  from Point Clouds
ToolFlowNet: Robotic Manipulation with Tools via Predicting Tool Flow from Point Clouds
Daniel Seita
Yufei Wang
Sarthak J. Shetty
Edward Li
Zackory M. Erickson
David Held
3DPC
30
49
0
16 Nov 2022
Reinforcement Learning for Solving Robotic Reaching Tasks in the
  Neurorobotics Platform
Reinforcement Learning for Solving Robotic Reaching Tasks in the Neurorobotics Platform
Márton Szep
Leander Lauenburg
Kevin Farkas
Xiyan Su
Chuanlong Zang
16
0
0
31 Oct 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward
  Long-Horizon Goal-Conditioned Reinforcement Learning
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
34
3
0
28 Oct 2022
Knowledge-Guided Exploration in Deep Reinforcement Learning
Knowledge-Guided Exploration in Deep Reinforcement Learning
Sahisnu Mazumder
Bing-Quan Liu
Shuai Wang
Yingxuan Zhu
Xiaotian Yin
Lifeng Liu
Jian Li
46
4
0
26 Oct 2022
D-Shape: Demonstration-Shaped Reinforcement Learning via Goal
  Conditioning
D-Shape: Demonstration-Shaped Reinforcement Learning via Goal Conditioning
Caroline Wang
Garrett A. Warnell
Peter Stone
40
3
0
26 Oct 2022
LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon
  Manipulation
LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation
Shuo Cheng
Danfei Xu
54
37
0
23 Oct 2022
Cut-and-Approximate: 3D Shape Reconstruction from Planar Cross-sections
  with Deep Reinforcement Learning
Cut-and-Approximate: 3D Shape Reconstruction from Planar Cross-sections with Deep Reinforcement Learning
Azimkhon Ostonov
3DV
29
3
0
22 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
18
1
0
20 Oct 2022
Planning for Sample Efficient Imitation Learning
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
31
21
0
18 Oct 2022
Learning Skills from Demonstrations: A Trend from Motion Primitives to
  Experience Abstraction
Learning Skills from Demonstrations: A Trend from Motion Primitives to Experience Abstraction
Mehrdad Tavassoli
S. Katyara
Maria Pozzi
Nikhil Deshpande
D. Caldwell
D. Prattichizzo
30
11
0
14 Oct 2022
Augmentation for Learning From Demonstration with Environmental
  Constraints
Augmentation for Learning From Demonstration with Environmental Constraints
Xing Li
Manuel Baum
Oliver Brock
38
0
0
13 Oct 2022
Mastering the Game of No-Press Diplomacy via Human-Regularized
  Reinforcement Learning and Planning
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
A. Bakhtin
David J. Wu
Adam Lerer
Jonathan Gray
Athul Paul Jacob
Gabriele Farina
Alexander H. Miller
Noam Brown
35
42
0
11 Oct 2022
Flexible Attention-Based Multi-Policy Fusion for Efficient Deep
  Reinforcement Learning
Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning
Zih-Yun Chiu
Yi-Lin Tuan
William Yang Wang
Michael C. Yip
OffRL
25
3
0
07 Oct 2022
Learning Depth Vision-Based Personalized Robot Navigation From Dynamic
  Demonstrations in Virtual Reality
Learning Depth Vision-Based Personalized Robot Navigation From Dynamic Demonstrations in Virtual Reality
Jorge de Heuvel
Nathan Corral
Benedikt Kreis
Jacobus Conradi
Anne Driemel
Maren Bennewitz
37
13
0
04 Oct 2022
Handling Sparse Rewards in Reinforcement Learning Using Model Predictive
  Control
Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control
Murad Dawood
Nils Dengler
Jorge de Heuvel
Maren Bennewitz
24
9
0
04 Oct 2022
Bayesian Q-learning With Imperfect Expert Demonstrations
Bayesian Q-learning With Imperfect Expert Demonstrations
Fengdi Che
Xiru Zhu
Doina Precup
D. Meger
Gregory Dudek
19
2
0
01 Oct 2022
Enhanced Meta Reinforcement Learning using Demonstrations in Sparse
  Reward Environments
Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
Desik Rengarajan
Sapana Chaudhary
JaeWon Kim
D. Kalathil
S. Shakkottai
OffRL
29
2
0
26 Sep 2022
Minimizing Human Assistance: Augmenting a Single Demonstration for Deep
  Reinforcement Learning
Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning
Abraham George
Alison Bartsch
A. Farimani
OffRL
14
5
0
22 Sep 2022
First-order Policy Optimization for Robust Markov Decision Process
First-order Policy Optimization for Robust Markov Decision Process
Yan Li
Guanghui Lan
Tuo Zhao
77
23
0
21 Sep 2022
Sample-Efficient Multi-Agent Reinforcement Learning with Demonstrations
  for Flocking Control
Sample-Efficient Multi-Agent Reinforcement Learning with Demonstrations for Flocking Control
Yunbo Qiu
Yuzhu Zhan
Yue Jin
Jian Wang
Xudong Zhang
26
6
0
17 Sep 2022
Sub-optimal Policy Aided Multi-Agent Reinforcement Learning for Flocking
  Control
Sub-optimal Policy Aided Multi-Agent Reinforcement Learning for Flocking Control
Yunbo Qiu
Yue Jin
Jian Wang
Xudong Zhang
16
1
0
17 Sep 2022
MetaTrader: An Reinforcement Learning Approach Integrating Diverse
  Policies for Portfolio Optimization
MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization
Hui Niu
Siyuan Li
Jian Li
AIFin
26
30
0
01 Sep 2022
Turning Mathematics Problems into Games: Reinforcement Learning and
  Gröbner bases together solve Integer Feasibility Problems
Turning Mathematics Problems into Games: Reinforcement Learning and Gröbner bases together solve Integer Feasibility Problems
Yue Wu
J. D. Loera
16
4
0
25 Aug 2022
Impact Makes a Sound and Sound Makes an Impact: Sound Guides
  Representations and Explorations
Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations
Xufeng Zhao
C. Weber
Muhammad Burhan Hafez
S. Wermter
25
8
0
04 Aug 2022
Improved Policy Optimization for Online Imitation Learning
Improved Policy Optimization for Online Imitation Learning
J. Lavington
Sharan Vaswani
Mark W. Schmidt
OffRL
21
6
0
29 Jul 2022
Learning Deformable Object Manipulation from Expert Demonstrations
Learning Deformable Object Manipulation from Expert Demonstrations
G. Salhotra
Isabella Liu
Marcus Dominguez-Kuhne
Gaurav Sukhatme
34
27
0
20 Jul 2022
Abstract Demonstrations and Adaptive Exploration for Efficient and
  Stable Multi-step Sparse Reward Reinforcement Learning
Abstract Demonstrations and Adaptive Exploration for Efficient and Stable Multi-step Sparse Reward Reinforcement Learning
Xintong Yang
Ze Ji
Jing Wu
Yunyu Lai
OffRL
27
5
0
19 Jul 2022
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic
  Reinforcement Learning
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning
Homer Walke
Jonathan Yang
Albert Yu
Aviral Kumar
Jedrzej Orbik
Avi Singh
Sergey Levine
OffRL
OnRL
27
32
0
11 Jul 2022
Domain Adapting Deep Reinforcement Learning for Real-world Speech
  Emotion Recognition
Domain Adapting Deep Reinforcement Learning for Real-world Speech Emotion Recognition
Thejan Rajapakshe
R. Rana
Sara Khalifa
Bjorn W. Schuller
19
0
0
07 Jul 2022
Watch and Match: Supercharging Imitation with Regularized Optimal
  Transport
Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Siddhant Haldar
Vaibhav Mathur
Denis Yarats
Lerrel Pinto
58
62
0
30 Jun 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned
  Reinforcement Learning
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
31
22
0
24 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to
  Accelerate Progress
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
Critic Sequential Monte Carlo
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank Wood
Adam Scibior
50
7
0
30 May 2022
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning
  Framework
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
Dian Wang
Colin Kohler
Xu Zhu
Ming Jia
Robert W. Platt
27
9
0
28 May 2022
Efficient Reinforcement Learning from Demonstration Using Local Ensemble
  and Reparameterization with Split and Merge of Expert Policies
Efficient Reinforcement Learning from Demonstration Using Local Ensemble and Reparameterization with Split and Merge of Expert Policies
Yu Wang
Fang Liu
29
0
0
23 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
26
324
0
02 May 2022
Counterfactual harm
Counterfactual harm
Jonathan G. Richens
R. Beard
Daniel H. Thompson
29
27
0
27 Apr 2022
Learning Design and Construction with Varying-Sized Materials via
  Prioritized Memory Resets
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets
Yunfei Li
Tao Kong
Lei Li
Yi Wu
47
4
0
12 Apr 2022
Jump-Start Reinforcement Learning
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
44
109
0
05 Apr 2022
Previous
1234
Next