Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1504.00702
Cited By
v1
v2
v3
v4
v5 (latest)
End-to-End Training of Deep Visuomotor Policies
2 April 2015
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"End-to-End Training of Deep Visuomotor Policies"
50 / 1,177 papers shown
Title
SafePicking: Learning Safe Object Extraction via Object-Level Mapping
Kentaro Wada
Stephen James
Andrew J. Davison
128
13
0
11 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
133
30
0
10 Feb 2022
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence
Dongsheng Ding
Chen-Yu Wei
Jianchao Tan
M. Jovanović
92
69
0
08 Feb 2022
DURableVS: Data-efficient Unsupervised Recalibrating Visual Servoing via online learning in a structured generative model
Nishad Gothoskar
Miguel Lazaro-Gredilla
Yasemin Bekiroglu
A. Agarwal
J. Tenenbaum
Vikash K. Mansinghka
Dileep George
37
2
0
08 Feb 2022
Auto-Lambda: Disentangling Dynamic Task Relationships
Shikun Liu
Stephen James
Andrew J. Davison
Edward Johns
123
78
0
07 Feb 2022
Rethinking ValueDice: Does It Really Improve Performance?
Ziniu Li
Tian Xu
Yang Yu
Zhimin Luo
OffRL
79
17
0
05 Feb 2022
Practical Imitation Learning in the Real World via Task Consistency Loss
Mohi Khansari
Daniel Ho
Yuqing Du
Armando Fuentes
Matthew Bennice
Nicolas Sievers
Sean Kirmani
Yunfei Bai
Eric Jang
SSL
59
8
0
03 Feb 2022
You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration
Bowen Wen
Wenzhao Lian
Kostas Bekris
S. Schaal
106
96
0
30 Jan 2022
GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems
Bhavya Sukhija
M. Turchetta
David Lindner
Andreas Krause
Sebastian Trimpe
Dominik Baumann
133
19
0
24 Jan 2022
DROPO: Sim-to-Real Transfer with Offline Domain Randomization
Gabriele Tiboni
Karol Arndt
Ville Kyrki
63
28
0
20 Jan 2022
Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation
Rishabh Jangir
Nicklas Hansen
Sambaran Ghosal
Mohit Jain
Xiaolong Wang
122
70
0
19 Jan 2022
Neural Circuit Architectural Priors for Embodied Control
Nikhil X. Bhattasali
A. Zador
Tatiana A. Engel
146
5
0
13 Jan 2022
Off Environment Evaluation Using Convex Risk Minimization
Pulkit Katdare
Shuijing Liu
Katherine Driggs-Campbell
58
2
0
21 Dec 2021
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee
Tianhao Wu
Yunchang Yang
Han Zhong
Liwei Wang
S. Du
Jiantao Jiao
129
14
0
21 Dec 2021
Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning
Yunhao Tang
81
7
0
14 Dec 2021
Contact-Rich Manipulation of a Flexible Object based on Deep Predictive Learning using Vision and Tactility
Hideyuki Ichiwara
Hiroshi Ito
Kenjiro Yamamoto
Hiroki Mori
Tetsuya Ogata
78
22
0
13 Dec 2021
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution
Yunru Bai
Chen Gong
Bin Zhang
Guoliang Fan
Xinwen Hou
Yu Liu
71
7
0
09 Dec 2021
CoMPS: Continual Meta Policy Search
Glen Berseth
Zhiwei Zhang
Grace Zhang
Chelsea Finn
Sergey Levine
CLL
OffRL
94
15
0
08 Dec 2021
Policy Search for Model Predictive Control with Application to Agile Drone Flight
Yunlong Song
Davide Scaramuzza
92
86
0
07 Dec 2021
Guided Imitation of Task and Motion Planning
M. McDonald
Dylan Hadfield-Menell
151
21
0
06 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
71
17
0
02 Dec 2021
Learning State Representations via Retracing in Reinforcement Learning
Changmin Yu
Dong Li
Jianye Hao
Jun Wang
Neil Burgess
87
8
0
24 Nov 2021
A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning
Zhaolin Ren
Tianjun Zhang
Csaba Szepesvári
Bo Dai
117
20
0
22 Nov 2021
Improving Learning from Demonstrations by Learning from Experience
Hao-Kang Liu
Yiwen Chen
Jiayi Tan
M. Ang
OffRL
117
1
0
16 Nov 2021
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization
Youngwoon Lee
Joseph J. Lim
Anima Anandkumar
Yuke Zhu
OffRL
90
41
0
15 Nov 2021
Learning Multi-Stage Tasks with One Demonstration via Self-Replay
Norman Di Palo
Edward Johns
SSL
82
25
0
14 Nov 2021
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Wenlong Huang
Igor Mordatch
Pieter Abbeel
Deepak Pathak
136
64
0
04 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning
Sindre Benjamin Remman
Inga Strümke
A. Lekkas
CML
48
7
0
04 Nov 2021
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives
Murtaza Dalal
Deepak Pathak
Ruslan Salakhutdinov
129
95
0
28 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Tung M. Luu
Chang D. Yoo
80
8
0
28 Oct 2021
Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System
M. Schultheis
Dominik Straub
Constantin Rothkopf
50
21
0
21 Oct 2021
Efficient Robotic Manipulation Through Offline-to-Online Reinforcement Learning and Goal-Aware State Information
Jin Li
Xianyuan Zhan
Zixu Xiao
Guyue Zhou
OffRL
OnRL
59
2
0
21 Oct 2021
Dual-Arm Adversarial Robot Learning
Elie Aljalbout
61
1
0
15 Oct 2021
Provable Regret Bounds for Deep Online Learning and Control
Xinyi Chen
Edgar Minasyan
Jason D. Lee
Elad Hazan
115
6
0
15 Oct 2021
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
81
31
0
14 Oct 2021
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation
Junhong Shen
Lin F. Yang
OffRL
51
18
0
09 Oct 2021
Offline Meta-Reinforcement Learning for Industrial Insertion
Tony Zhao
Jianlan Luo
Oleg O. Sushkov
Rugile Pevceviciute
N. Heess
Jonathan Scholz
S. Schaal
Sergey Levine
OffRL
OnRL
100
83
0
08 Oct 2021
Learning to Centralize Dual-Arm Assembly
Marvin Alles
Elie Aljalbout
67
18
0
08 Oct 2021
Cross-Domain Imitation Learning via Optimal Transport
Arnaud Fickinger
Samuel N. Cohen
Stuart J. Russell
Brandon Amos
OT
109
52
0
07 Oct 2021
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations
Sindre Benjamin Remman
A. Lekkas
55
14
0
07 Oct 2021
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Wonjoon Goo
S. Niekum
OffRL
92
8
0
05 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom
Jihoon Kweon
Kyunghwan Kim
Chaehyuk Lee
Hwi Kwon
Jinwoo Park
...
Inwook Back
J. Roh
Y. Moon
Jaesoon Choi
Young-Hak Kim
OnRL
68
34
0
05 Oct 2021
Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization
C. Imai
Minghao Zhang
Yuchen Zhang
Marcin Kierebinski
Ruihan Yang
Yuzhe Qin
Xiaolong Wang
131
33
0
29 Sep 2021
Reinforcement Learning for Quantitative Trading
Shuo Sun
Rongpin Wang
Bo An
OffRL
AIFin
75
55
0
28 Sep 2021
Bridge Data: Boosting Generalization of Robotic Skills with Cross-Domain Datasets
F. Ebert
Yanlai Yang
Karl Schmeckpeper
Bernadette Bucher
G. Georgakis
Kostas Daniilidis
Chelsea Finn
Sergey Levine
253
236
0
27 Sep 2021
The
f
f
f
-Divergence Reinforcement Learning Framework
Chen Gong
Qiang He
Yunpeng Bai
Zhouyi Yang
Xiaoyu Chen
Xinwen Hou
Xianjie Zhang
Yu Liu
Guoliang Fan
68
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
63
34
0
24 Sep 2021
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks
Bohan Wu
Suraj Nair
Li Fei-Fei
Chelsea Finn
OffRL
LM&Ro
162
24
0
21 Sep 2021
Soft Actor-Critic With Integer Actions
Ting-Han Fan
Yubo Wang
69
15
0
17 Sep 2021
Multi-Task Learning with Sequence-Conditioned Transporter Networks
M. H. Lim
Andy Zeng
Brian Ichter
Maryam Bandari
Erwin Coumans
Claire Tomlin
S. Schaal
Aleksandra Faust
65
15
0
15 Sep 2021
Previous
1
2
3
...
5
6
7
...
22
23
24
Next