Title
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering Jun Lv Yunhai Feng Cheng Zhang Shu Zhao Lin Shao Cewu Lu 84 26 0 27 Oct 2022
Learning Deep Sensorimotor Policies for Vision-based Autonomous Drone Racing Jiawei Fu Yunlong Song Yongpeng Wu Feng Yu Davide Scaramuzza 114 21 0 26 Oct 2022
Environment Design for Inverse Reinforcement Learning Thomas Kleine Buening Victor Villin Christos Dimitrakakis 102 1 0 26 Oct 2022
Will we run out of data? Limits of LLM scaling based on human-generated data Pablo Villalobos A. Ho J. Sevilla T. Besiroglu Lennart Heim Marius Hobbhahn ALM 102 125 0 26 Oct 2022
DeXtreme: Transfer of Agile In-hand Manipulation from Simulation to Reality Ankur Handa Arthur Allshire Viktor Makoviychuk Aleksei Petrenko Ritvik Singh ... Balakumar Sundaralingam Yashraj S. Narang Jean-Francois Lafleche Dieter Fox Gavriel State 139 157 0 25 Oct 2022
Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation Zoey Qiuyu Chen Karl Van Wyk Yu-Wei Chao Wei Yang Arsalan Mousavian Abhishek Gupta Dieter Fox 91 29 0 24 Oct 2022
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds Joshua Albrecht Abraham J. Fetterman Bryden Fogelman Ellie Kitanidis Bartosz Wróblewski ... Michael Rosenthal Maksis Knutins Zachary Polizzi James B. Simon Kanjun Qiu OffRL 87 23 0 24 Oct 2022
Evaluating Long-Term Memory in 3D Mazes J. Pašukonis Timothy Lillicrap Danijar Hafner 3DV 88 23 0 24 Oct 2022
Co-Training an Observer and an Evading Target André Brandenburger Folker Hoffmann A. Charlish 71 1 0 20 Oct 2022
Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation Peide Huang Mengdi Xu Jiacheng Zhu Laixi Shi Fei Fang Ding Zhao CLL 101 25 0 18 Oct 2022
Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion Zipeng Fu Xuxin Cheng Deepak Pathak 108 159 0 18 Oct 2022
Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity Maxime Allard Simón C. Smith Konstantinos Chatzilygeroudis Bryan Lim Antoine Cully 63 13 0 18 Oct 2022
PI-QT-Opt: Predictive Information Improves Multi-Task Robotic Reinforcement Learning at Scale Kuang-Huei Lee Ted Xiao A. Li Paul Wohlhart Ian S. Fischer Yao Lu 120 10 0 15 Oct 2022
Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion Lev Grossman Brian Plancher MQ 69 4 0 14 Oct 2022
Skill-Based Reinforcement Learning with Intrinsic Reward Matching Ademi Adeniji Amber Xie Pieter Abbeel OffRL 73 5 0 14 Oct 2022
Policy Gradient With Serial Markov Chain Reasoning Edoardo Cetin Oya Celiktutan BDL LRM 58 2 0 13 Oct 2022
Holo-Dex: Teaching Dexterity with Immersive Mixed Reality Sridhar Pandian Arunachalam Irmak Güzey Soumith Chintala Lerrel Pinto 109 74 0 12 Oct 2022
A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based In-Hand Manipulation Lingfeng Tao Jiucai Zhang Michael Bowman Xiaoli Zhang 67 6 0 11 Oct 2022
NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields Arunkumar Byravan Jan Humplik Leonard Hasenclever Arthur Brussee F. Nori ... Ben Moran Steven Bohez Fereshteh Sadeghi Bojan Vujatovic N. Heess 160 57 0 10 Oct 2022
Efficient Learning of Locomotion Skills through the Discovery of Diverse Environmental Trajectory Generator Priors Shikha Surana Bryan Lim Antoine Cully 79 4 0 10 Oct 2022
GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot Tianli Ding L. Graesser Saminda Abeyruwan David B. DÁmbrosio Anish Shankar P. Sermanet Pannag R Sanketi Corey Lynch 125 22 0 07 Oct 2022
DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation Ruicheng Wang Jialiang Zhang Jiayi Chen Yinzhen Xu Puhao Li Tengyu Liu He Wang 145 124 0 06 Oct 2022
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals Rohin Shah Vikrant Varma Ramana Kumar Mary Phuong Victoria Krakovna J. Uesato Zachary Kenton 99 72 0 04 Oct 2022
Hyperbolic Deep Reinforcement Learning Edoardo Cetin B. Chamberlain Michael M. Bronstein Jonathan J. Hunt 93 22 0 04 Oct 2022
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms Fan Chen Yu Bai Song Mei 100 22 0 29 Sep 2022
Learning Low-Frequency Motion Control for Robust and Dynamic Robot Locomotion Siddhant Gangapurwala Luigi Campanaro Ioannis Havoutis 101 13 0 29 Sep 2022
DexTransfer: Real World Multi-fingered Dexterous Grasping with Minimal Human Demonstrations Zoey Qiuyu Chen Karl Van Wyk Yu-Wei Chao Wei Yang Arsalan Mousavian Abhishek Gupta Dieter Fox 101 22 0 28 Sep 2022
Learn what matters: cross-domain imitation learning with task-relevant embeddings Tim Franzmeyer Philip Torr João F. Henriques OOD 90 22 0 24 Sep 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation Kang Xu Yan Ma Bingsheng Wei Wei Li 84 3 0 24 Sep 2022
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels Sai Rajeswar Pietro Mazzaglia Tim Verbelen Alexandre Piché Bart Dhoedt Rameswar Panda Alexandre Lacoste SSL 102 21 0 24 Sep 2022
Grouped Adaptive Loss Weighting for Person Search Yanling Tian Di Chen Yunan Liu Shanshan Zhang Jian Yang 91 5 0 23 Sep 2022
Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps Sudeep Dasari Abhi Gupta Vikash Kumar 113 43 0 22 Sep 2022
Optimizing Crop Management with Reinforcement Learning and Imitation Learning Ran Tao Pan Zhao Jing Wu N. F. Martin M. Harrison C. Ferreira Z. Kalantari N. Hovakimyan OffRL 61 26 0 20 Sep 2022
Robust Reinforcement Learning Algorithm for Vision-based Ship Landing of UAVs Vishnu Saj Bochan Lee D. Kalathil Moble Benedict 58 5 0 17 Sep 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability Mengdi Xu Zuxin Liu Peide Huang Wenhao Ding Zhepeng Cen Yue Liu Ding Zhao 175 47 0 16 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping Hao Sun Lei Han Rui Yang Xiaoteng Ma Jian Guo Bolei Zhou OffRL OnRL 80 11 0 15 Sep 2022
Meta-Reinforcement Learning via Language Instructions Zhenshan Bing A. Koch Xiangtong Yao Kai-Qi Huang Alois C. Knoll LM&Ro 122 19 0 11 Sep 2022
Instruction-driven history-aware policies for robotic manipulations Pierre-Louis Guhur Shizhe Chen Ricardo Garcia Pinel Makarand Tapaswi Ivan Laptev Cordelia Schmid LM&Ro 195 109 0 11 Sep 2022
Real-to-Sim: Predicting Residual Errors of Robotic Systems with Sparse Data using a Learning-based Unscented Kalman Filter Alexander Schperberg Yusuke Tanaka Feng Xu Marcel Menner Dennis W. Hong 77 5 0 07 Sep 2022
What deep reinforcement learning tells us about human motor learning and vice-versa Michele Garibbo Casimir J. H. Ludwig Nathan Lepora Laurence Aitchison 67 0 0 23 Aug 2022
Learning Ball-balancing Robot Through Deep Reinforcement Learning Yifan Zhou Jianghao Lin Shuai Wang Chong Zhang 28 9 0 22 Aug 2022
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning Laura M. Smith Ilya Kostrikov Sergey Levine OffRL 73 105 0 16 Aug 2022
AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning Wangyang Yue Yuan Zhou Xiaochuan Zhang Yuchen Hua Zhiyuan Wang Guang Kou OffRL 45 3 0 03 Aug 2022
Learning Fast and Precise Pixel-to-Torque Control Steffen Bleher Steve Heim Sebastian Trimpe 73 2 0 03 Aug 2022
Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization Y. Kadokawa Lingwei Zhu Yoshihisa Tsurumine Takamitsu Matsubara 60 8 0 29 Jul 2022
Learning Dynamic Manipulation Skills from Haptic-Play Taeyoon Lee D. Sung Kyoung-Whan Choi Choong-Keun Lee Changwoo Park Keunjun Choi 88 3 0 28 Jul 2022
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations Achkan Salehi Steffen Rühl Stéphane Doncieux AI4CE 90 2 0 25 Jul 2022
Towards Using Fully Observable Policies for POMDPs András Attila Sulyok K. Karacs 107 1 0 24 Jul 2022
Incorporating Prior Knowledge into Reinforcement Learning for Soft Tissue Manipulation with Autonomous Grasping Point Selection Xian He Shuai Zhang Shanlin Yang Bo Ouyang 25 0 0 21 Jul 2022
Human-to-Robot Imitation in the Wild Shikhar Bahl Abhi Gupta Deepak Pathak 123 174 0 19 Jul 2022