Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.07113
Cited By
Solving Rubik's Cube with a Robot Hand
16 October 2019
OpenAI
Ilge Akkaya
Marcin Andrychowicz
Maciek Chociej
Ma-teusz Litwin
Bob McGrew
Arthur Petron
Alex Paino
Matthias Plappert
Glenn Powell
Raphael Ribas
Jonas Schneider
Nikolas Tezak
Jerry Tworek
Peter Welinder
Lilian Weng
Qiming Yuan
Wojciech Zaremba
Lei Zhang
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Solving Rubik's Cube with a Robot Hand"
50 / 775 papers shown
Title
i-Sim2Real: Reinforcement Learning of Robotic Policies in Tight Human-Robot Interaction Loops
Saminda Abeyruwan
L. Graesser
David B. DÁmbrosio
Avi Singh
Anish Shankar
Alex Bewley
Deepali Jain
K. Choromanski
Pannag R Sanketi
133
51
0
14 Jul 2022
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL
AI4CE
67
33
0
13 Jul 2022
Learning Continuous Grasping Function with a Dexterous Hand from Human Demonstrations
Jianglong Ye
Jiashun Wang
Binghao Huang
Yuzhe Qin
Xiaolong Wang
211
56
0
11 Jul 2022
State Dropout-Based Curriculum Reinforcement Learning for Self-Driving at Unsignalized Intersections
Shivesh Khaitan
John M. Dolan
SSL
64
18
0
10 Jul 2022
Egocentric Visual Self-Modeling for Autonomous Robot Dynamics Prediction and Adaptation
Yuhang Hu
Boyuan Chen
Hod Lipson
90
2
0
07 Jul 2022
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
Lukas Schafer
Filippos Christianos
Amos Storkey
Stefano V. Albrecht
51
7
0
05 Jul 2022
Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies
Satvik Sharma
Ellen R. Novoseller
Vainavi Viswanath
Zaynah Javed
R. Parikh
Ryan Hoque
Ashwin Balakrishna
Daniel S. Brown
Ken Goldberg
76
8
0
02 Jul 2022
Online vs. Offline Adaptive Domain Randomization Benchmark
Gabriele Tiboni
Karol Arndt
Giuseppe Averta
Ville Kyrki
Tatiana Tommasi
OffRL
51
5
0
29 Jun 2022
DayDreamer: World Models for Physical Robot Learning
Philipp Wu
Alejandro Escontrela
Danijar Hafner
Ken Goldberg
Pieter Abbeel
146
303
0
28 Jun 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
100
22
0
24 Jun 2022
Behavior Transformers: Cloning
k
k
k
modes with one stone
Nur Muhammad (Mahi) Shafiullah
Zichen Jeff Cui
Ariuntuya Altanzaya
Lerrel Pinto
OffRL
78
242
0
22 Jun 2022
Imitation Learning for Generalizable Self-driving Policy with Sim-to-real Transfer
Zoltán LHorincz
Marton Szemenyei
Róbert Moni
39
2
0
22 Jun 2022
Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control
Katie Kang
Paula Gradu
Jason J. Choi
Michael Janner
Claire Tomlin
Sergey Levine
74
24
0
21 Jun 2022
Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum
Junlin Wu
Yevgeniy Vorobeychik
73
23
0
21 Jun 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
125
111
0
19 Jun 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
133
117
0
17 Jun 2022
Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization
Maxim Kaledin
Alexander Golubev
Denis Belomestny
OffRL
88
4
0
14 Jun 2022
Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks
Josip Josifovski
M. Malmir
Noah Klarmann
B. L. Žagar
Nicolás Navarro-Guerrero
Alois C. Knoll
72
18
0
13 Jun 2022
Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives
Dongwon Son
Myungsin Kim
Jaecheol Sim
Wonsik Shin
51
1
0
12 Jun 2022
Simple Kinesthetic Haptics for Object Recognition
A. Sintov
Inbar Meir
85
2
0
11 Jun 2022
Offline Stochastic Shortest Path: Learning, Evaluation and Towards Optimality
Ming Yin
Wenjing Chen
Mengdi Wang
Yu Wang
OffRL
63
4
0
10 Jun 2022
Real2Sim or Sim2Real: Robotics Visual Insertion using Deep Reinforcement Learning and Real2Sim Policy Adaptation
Yiwen Chen
Xue-Yong Li
Sheng Guo
Xiang Yao Ng
Marcelo H. Ang Jr
52
5
0
06 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
126
66
0
03 Jun 2022
Offline Reinforcement Learning with Causal Structured World Models
Zhengbang Zhu
Xiong-Hui Chen
Hong Tian
Kun Zhang
Yang Yu
CML
OffRL
53
17
0
03 Jun 2022
Learning to Use Chopsticks in Diverse Gripping Styles
Zeshi Yang
KangKang Yin
Libin Liu
147
30
0
28 May 2022
MyoSuite -- A contact-rich simulation suite for musculoskeletal motor control
Vittorio Caggiano
Huawei Wang
G. Durandau
Massimo Sartori
Vikash Kumar
80
101
0
26 May 2022
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning
Marco Bagatella
Sammy Christen
Otmar Hilliges
OffRL
123
6
0
26 May 2022
POLTER: Policy Trajectory Ensemble Regularization for Unsupervised Reinforcement Learning
Frederik Schubert
C. Benjamins
Sebastian Dohler
Bodo Rosenhahn
Marius Lindauer
SSL
OffRL
90
4
0
23 May 2022
Concurrent Policy Blending and System Identification for Generalized Assistive Control
Luke Bhan
Marcos Quiñones-Grueiro
G. Biswas
33
1
0
19 May 2022
Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks
Qiang Wang
Francisco Roldan Sanchez
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
M. Wuthrich
Felix Widmaier
Stefan Bauer
S. Redmond
107
15
0
19 May 2022
Collision Detection Accelerated: An Optimization Perspective
Louis Montaut
Quentin Le Lidec
Vladimir Petrik
Josef Sivic
Justin Carpentier
49
20
0
19 May 2022
F1 Hand: A Versatile Fixed-Finger Gripper for Delicate Teleoperation and Autonomous Grasping
Guilherme J. Maeda
Naoki Fukaya
Shintaro Maeda
41
3
0
14 May 2022
Economical Precise Manipulation and Auto Eye-Hand Coordination with Binocular Visual Reinforcement Learning
Yiwen Chen
Shengchao Guo
Zedong Zhang
Lei Zhou
Xiang Yao Ng
Marcelo H. Ang Jr
47
0
0
12 May 2022
On the Verge of Solving Rocket League using Deep Reinforcement Learning and Sim-to-sim Transfer
Marco Pleines
Konstantin Ramthun
Yannik Wegener
Hendrik Meyer
Matthias Pallasch
...
Oliver Chmurzynski
Frederik Rohkrähmer
Roman Kalkreuth
F. Zimmer
Mike Preuss
40
4
0
10 May 2022
Learning A Simulation-based Visual Policy for Real-world Peg In Unseen Holes
Liangru Xie
Hongxiang Yu
Kechun Xu
Tong Yang
Minhang Wang
Haojian Lu
R. Xiong
Yue Wang
67
1
0
09 May 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
76
89
0
08 May 2022
Rapid Locomotion via Reinforcement Learning
G. Margolis
Ge Yang
Kartik Paigwar
Tao Chen
Pulkit Agrawal
117
245
0
05 May 2022
RLFlow: Optimising Neural Network Subgraph Transformation with World Models
Sean Parker
Sami Alabed
Eiko Yoneki
34
0
0
03 May 2022
Goldilocks-curriculum Domain Randomization and Fractal Perlin Noise with Application to Sim2Real Pneumonia Lesion Detection
Takahiro Suzuki
S. Hanaoka
Issei Sato
OOD
MedIm
66
1
0
29 Apr 2022
From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation
Yuzhe Qin
Hao Su
Xiaolong Wang
99
106
0
26 Apr 2022
Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach
Jiaping Xiao
Phumrapee Pisutsin
Mir Feroskhan
110
23
0
26 Apr 2022
Optimizing Nitrogen Management with Deep Reinforcement Learning and Crop Simulations
Jing Wu
Ran Tao
Pan Zhao
N. F. Martin
N. Hovakimyan
OffRL
75
47
0
21 Apr 2022
Relevance-guided Unsupervised Discovery of Abilities with Quality-Diversity Algorithms
Luca Grillotti
Antoine Cully
87
8
0
21 Apr 2022
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Dapeng Li
Bin Zhang
Yuan Zhan
Yunru Bai
Guoliang Fan
OffRL
72
8
0
20 Apr 2022
When Is Partially Observable Reinforcement Learning Not Scary?
Qinghua Liu
Alan Chung
Csaba Szepesvári
Chi Jin
80
98
0
19 Apr 2022
Evaluating the Effectiveness of Corrective Demonstrations and a Low-Cost Sensor for Dexterous Manipulation
A. Jain
Jack Kolb
IV J.M.Abbess
Harish Ravichandar
31
0
0
15 Apr 2022
Divide & Conquer Imitation Learning
Alexandre Chenu
Nicolas Perrin-Gilbert
Olivier Sigaud
115
5
0
15 Apr 2022
Accelerated Policy Learning with Parallel Differentiable Simulation
Jie Xu
Viktor Makoviychuk
Yashraj S. Narang
Fabio Ramos
Wojciech Matusik
Animesh Garg
Miles Macklin
77
95
0
14 Apr 2022
Hierarchical Quality-Diversity for Online Damage Recovery
Maxime Allard
Simón C. Smith
Konstantinos Chatzilygeroudis
Antoine Cully
66
12
0
12 Apr 2022
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets
Yunfei Li
Tao Kong
Lei Li
Yi Wu
95
4
0
12 Apr 2022
Previous
1
2
3
...
8
9
10
...
14
15
16
Next