ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.05421
  4. Cited By
DDCO: Discovery of Deep Continuous Options for Robot Learning from
  Demonstrations

DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations

15 October 2017
S. Krishnan
Roy Fox
Ion Stoica
Ken Goldberg
ArXivPDFHTML

Papers citing "DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations"

26 / 26 papers shown
Title
RT-H: Action Hierarchies Using Language
RT-H: Action Hierarchies Using Language
Suneel Belkhale
Tianli Ding
Ted Xiao
P. Sermanet
Quon Vuong
Jonathan Tompson
Yevgen Chebotar
Debidatta Dwibedi
Dorsa Sadigh
LM&Ro
45
78
0
04 Mar 2024
Multi-Stage Cable Routing through Hierarchical Imitation Learning
Multi-Stage Cable Routing through Hierarchical Imitation Learning
Jianlan Luo
Charles Xu
Xinyang Geng
Gilbert Feng
Kuan Fang
L. Tan
S. Schaal
Sergey Levine
38
52
0
18 Jul 2023
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
Utsav Singh
Vinay P. Namboodiri
31
3
0
07 Apr 2023
Dichotomy of Control: Separating What You Can Control from What You
  Cannot
Dichotomy of Control: Separating What You Can Control from What You Cannot
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
25
42
0
24 Oct 2022
Abstract Demonstrations and Adaptive Exploration for Efficient and
  Stable Multi-step Sparse Reward Reinforcement Learning
Abstract Demonstrations and Adaptive Exploration for Efficient and Stable Multi-step Sparse Reward Reinforcement Learning
Xintong Yang
Ze Ji
Jing Wu
Yunyu Lai
OffRL
27
5
0
19 Jul 2022
A Versatile Agent for Fast Learning from Human Instructors
A Versatile Agent for Fast Learning from Human Instructors
Yiwen Chen
Zedong Zhang
Hao-Kang Liu
Jiayi Tan
C. Chew
Marcelo H. Ang Jr
12
0
0
01 Mar 2022
Transfering Hierarchical Structure with Dual Meta Imitation Learning
Transfering Hierarchical Structure with Dual Meta Imitation Learning
Chongkai Gao
Yizhou Jiang
F. Chen
30
8
0
28 Jan 2022
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
Mengjiao Yang
Sergey Levine
Ofir Nachum
OffRL
41
42
0
27 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
Example-Driven Model-Based Reinforcement Learning for Solving
  Long-Horizon Visuomotor Tasks
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks
Bohan Wu
Suraj Nair
Li Fei-Fei
Chelsea Finn
OffRL
LM&Ro
38
24
0
21 Sep 2021
Recent Advances in Leveraging Human Guidance for Sequential
  Decision-Making Tasks
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
81
28
0
13 Jul 2021
From Motor Control to Team Play in Simulated Humanoid Football
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
31
129
0
25 May 2021
Learning Task Decomposition with Ordered Memory Policy Network
Learning Task Decomposition with Ordered Memory Policy Network
Yucheng Lu
Songlin Yang
Siyuan Zhou
Aaron Courville
J. Tenenbaum
Chuang Gan
19
15
0
19 Mar 2021
Learning Composable Behavior Embeddings for Long-horizon Visual
  Navigation
Learning Composable Behavior Embeddings for Long-horizon Visual Navigation
Xiangyun Meng
Yu Xiang
Dieter Fox
24
3
0
19 Feb 2021
Behavior Priors for Efficient Reinforcement Learning
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
37
39
0
27 Oct 2020
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement
  Learning
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Anurag Ajay
Aviral Kumar
Pulkit Agrawal
Sergey Levine
Ofir Nachum
OffRL
OnRL
34
155
0
26 Oct 2020
Data-efficient Hindsight Off-policy Option Learning
Data-efficient Hindsight Off-policy Option Learning
Markus Wulfmeier
Dushyant Rao
Roland Hafner
Thomas Lampe
A. Abdolmaleki
...
Michael Neunert
Dhruva Tirumala
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
28
47
0
30 Jul 2020
Learning Robot Skills with Temporal Variational Inference
Learning Robot Skills with Temporal Variational Inference
Tanmay Shankar
Abhinav Gupta
DRL
BDL
38
74
0
29 Jun 2020
Modeling Long-horizon Tasks as Sequential Interaction Landscapes
Modeling Long-horizon Tasks as Sequential Interaction Landscapes
Soren Pirk
Karol Hausman
Alexander Toshev
Mohi Khansari
22
27
0
08 Jun 2020
Applying Depth-Sensing to Automated Surgical Manipulation with a da
  Vinci Robot
Applying Depth-Sensing to Automated Surgical Manipulation with a da Vinci Robot
M. Hwang
Daniel Seita
Brijen Thananjeyan
Jeffrey Ichnowski
Samuel Paradis
Danyal Fer
Thomas Low
Ken Goldberg
8
31
0
15 Feb 2020
Few-Shot Bayesian Imitation Learning with Logical Program Policies
Few-Shot Bayesian Imitation Learning with Logical Program Policies
Tom Silver
Kelsey R. Allen
Alexander K. Lew
L. Kaelbling
J. Tenenbaum
LM&Ro
21
50
0
12 Apr 2019
The Termination Critic
The Termination Critic
Anna Harutyunyan
Will Dabney
Diana Borsa
N. Heess
Rémi Munos
Doina Precup
OffRL
16
48
0
26 Feb 2019
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented
  Demonstrations using Directed Information
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information
Arjun Sharma
Mohit Sharma
Nicholas Rhinehart
Kris Kitani
24
68
0
29 Sep 2018
Least-Squares Temporal Difference Learning for the Linear Quadratic
  Regulator
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
Stephen Tu
Benjamin Recht
OffRL
32
130
0
22 Dec 2017
A Berkeley View of Systems Challenges for AI
A Berkeley View of Systems Challenges for AI
Ion Stoica
D. Song
Raluca A. Popa
D. Patterson
Michael W. Mahoney
...
Joseph E. Gonzalez
Ken Goldberg
A. Ghodsi
David Culler
Pieter Abbeel
24
199
0
15 Dec 2017
Using Intermittent Synchronization to Compensate for Rhythmic Body
  Motion During Autonomous Surgical Cutting and Debridement
Using Intermittent Synchronization to Compensate for Rhythmic Body Motion During Autonomous Surgical Cutting and Debridement
Vatsal Patel
S. Krishnan
A. Goncalves
Carolyn L. Chen
W. D. Boyd
Ken Goldberg
22
8
0
08 Dec 2017
1