v1v2v3v4v5 (latest)

End-to-End Training of Deep Visuomotor Policies

2 April 2015

Pieter Abbeel

Papers citing "End-to-End Training of Deep Visuomotor Policies"

50 / 1,177 papers shown

Title
SafePicking: Learning Safe Object Extraction via Object-Level Mapping Kentaro Wada Stephen James Andrew J. Davison 128 13 0 11 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks Maja Franz Lucas Wolf Maniraman Periyasamy Christian Ufrecht Daniel D. Scherer Axel Plinge Christopher Mutschler Wolfgang Mauerer 133 30 0 10 Feb 2022
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence Dongsheng Ding Chen-Yu Wei Jianchao Tan M. Jovanović 92 69 0 08 Feb 2022
DURableVS: Data-efficient Unsupervised Recalibrating Visual Servoing via online learning in a structured generative model Nishad Gothoskar Miguel Lazaro-Gredilla Yasemin Bekiroglu A. Agarwal J. Tenenbaum Vikash K. Mansinghka Dileep George 37 2 0 08 Feb 2022
Auto-Lambda: Disentangling Dynamic Task Relationships Shikun Liu Stephen James Andrew J. Davison Edward Johns 123 78 0 07 Feb 2022
Rethinking ValueDice: Does It Really Improve Performance? Ziniu Li Tian Xu Yang Yu Zhimin Luo OffRL 79 17 0 05 Feb 2022
Practical Imitation Learning in the Real World via Task Consistency Loss Mohi Khansari Daniel Ho Yuqing Du Armando Fuentes Matthew Bennice Nicolas Sievers Sean Kirmani Yunfei Bai Eric Jang SSL 59 8 0 03 Feb 2022
You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration Bowen Wen Wenzhao Lian Kostas Bekris S. Schaal 106 96 0 30 Jan 2022
GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems Bhavya Sukhija M. Turchetta David Lindner Andreas Krause Sebastian Trimpe Dominik Baumann 133 19 0 24 Jan 2022
DROPO: Sim-to-Real Transfer with Offline Domain Randomization Gabriele Tiboni Karol Arndt Ville Kyrki 63 28 0 20 Jan 2022
Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation Rishabh Jangir Nicklas Hansen Sambaran Ghosal Mohit Jain Xiaolong Wang 122 70 0 19 Jan 2022
Neural Circuit Architectural Priors for Embodied Control Nikhil X. Bhattasali A. Zador Tatiana A. Engel 146 5 0 13 Jan 2022
Off Environment Evaluation Using Convex Risk Minimization Pulkit Katdare Shuijing Liu Katherine Driggs-Campbell 58 2 0 21 Dec 2021
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee Tianhao Wu Yunchang Yang Han Zhong Liwei Wang S. Du Jiantao Jiao 129 14 0 21 Dec 2021
Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning Yunhao Tang 81 7 0 14 Dec 2021
Contact-Rich Manipulation of a Flexible Object based on Deep Predictive Learning using Vision and Tactility Hideyuki Ichiwara Hiroshi Ito Kenjiro Yamamoto Hiroki Mori Tetsuya Ogata 78 22 0 13 Dec 2021
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution Yunru Bai Chen Gong Bin Zhang Guoliang Fan Xinwen Hou Yu Liu 71 7 0 09 Dec 2021
CoMPS: Continual Meta Policy Search Glen Berseth Zhiwei Zhang Grace Zhang Chelsea Finn Sergey Levine CLL OffRL 94 15 0 08 Dec 2021
Policy Search for Model Predictive Control with Application to Agile Drone Flight Yunlong Song Davide Scaramuzza 92 86 0 07 Dec 2021
Guided Imitation of Task and Motion Planning M. McDonald Dylan Hadfield-Menell 151 21 0 06 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL Charles Packer Pieter Abbeel Joseph E. Gonzalez OffRL 71 17 0 02 Dec 2021
Learning State Representations via Retracing in Reinforcement Learning Changmin Yu Dong Li Jianye Hao Jun Wang Neil Burgess 87 8 0 24 Nov 2021
A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning Zhaolin Ren Tianjun Zhang Csaba Szepesvári Bo Dai 117 20 0 22 Nov 2021
Improving Learning from Demonstrations by Learning from Experience Hao-Kang Liu Yiwen Chen Jiayi Tan M. Ang OffRL 117 1 0 16 Nov 2021
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization Youngwoon Lee Joseph J. Lim Anima Anandkumar Yuke Zhu OffRL 90 41 0 15 Nov 2021
Learning Multi-Stage Tasks with One Demonstration via Self-Replay Norman Di Palo Edward Johns SSL 82 25 0 14 Nov 2021
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning Wenlong Huang Igor Mordatch Pieter Abbeel Deepak Pathak 136 64 0 04 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning Sindre Benjamin Remman Inga Strümke A. Lekkas CML 48 7 0 04 Nov 2021
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives Murtaza Dalal Deepak Pathak Ruslan Salakhutdinov 129 95 0 28 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment Tung M. Luu Chang D. Yoo 80 8 0 28 Oct 2021
Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System M. Schultheis Dominik Straub Constantin Rothkopf 50 21 0 21 Oct 2021
Efficient Robotic Manipulation Through Offline-to-Online Reinforcement Learning and Goal-Aware State Information Jin Li Xianyuan Zhan Zixu Xiao Guyue Zhou OffRL OnRL 59 2 0 21 Oct 2021
Dual-Arm Adversarial Robot Learning Elie Aljalbout 61 1 0 15 Oct 2021
Provable Regret Bounds for Deep Online Learning and Control Xinyi Chen Edgar Minasyan Jason D. Lee Elad Hazan 115 6 0 15 Oct 2021
Offline Reinforcement Learning with Soft Behavior Regularization Haoran Xu Xianyuan Zhan Jianxiong Li Honglei Yin OffRL 81 31 0 14 Oct 2021
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation Junhong Shen Lin F. Yang OffRL 51 18 0 09 Oct 2021
Offline Meta-Reinforcement Learning for Industrial Insertion Tony Zhao Jianlan Luo Oleg O. Sushkov Rugile Pevceviciute N. Heess Jonathan Scholz S. Schaal Sergey Levine OffRL OnRL 100 83 0 08 Oct 2021
Learning to Centralize Dual-Arm Assembly Marvin Alles Elie Aljalbout 67 18 0 08 Oct 2021
Cross-Domain Imitation Learning via Optimal Transport Arnaud Fickinger Samuel N. Cohen Stuart J. Russell Brandon Amos OT 109 52 0 07 Oct 2021
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations Sindre Benjamin Remman A. Lekkas 55 14 0 07 Oct 2021
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL Wonjoon Goo S. Niekum OffRL 92 8 0 05 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom Jihoon Kweon Kyunghwan Kim Chaehyuk Lee Hwi Kwon Jinwoo Park ... Inwook Back J. Roh Y. Moon Jaesoon Choi Young-Hak Kim OnRL 68 34 0 05 Oct 2021
Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization C. Imai Minghao Zhang Yuchen Zhang Marcin Kierebinski Ruihan Yang Yuzhe Qin Xiaolong Wang 131 33 0 29 Sep 2021
Reinforcement Learning for Quantitative Trading Shuo Sun Rongpin Wang Bo An OffRL AIFin 75 55 0 28 Sep 2021
Bridge Data: Boosting Generalization of Robotic Skills with Cross-Domain Datasets F. Ebert Yanlai Yang Karl Schmeckpeper Bernadette Bucher G. Georgakis Kostas Daniilidis Chelsea Finn Sergey Levine 253 236 0 27 Sep 2021
The $f$ -Divergence Reinforcement Learning Framework Chen Gong Qiang He Yunpeng Bai Zhouyi Yang Xiaoyu Chen Xinwen Hou Xianjie Zhang Yu Liu Guoliang Fan 68 3 0 24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience C. Banerjee Zhiyong Chen N. Noman 63 34 0 24 Sep 2021
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks Bohan Wu Suraj Nair Li Fei-Fei Chelsea Finn OffRL LM&Ro 162 24 0 21 Sep 2021
Soft Actor-Critic With Integer Actions Ting-Han Fan Yubo Wang 69 15 0 17 Sep 2021
Multi-Task Learning with Sequence-Conditioned Transporter Networks M. H. Lim Andy Zeng Brian Ichter Maryam Bandari Erwin Coumans Claire Tomlin S. Schaal Aleksandra Faust 65 15 0 15 Sep 2021