Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.07113
Cited By
Solving Rubik's Cube with a Robot Hand
16 October 2019
OpenAI
Ilge Akkaya
Marcin Andrychowicz
Maciek Chociej
Ma-teusz Litwin
Bob McGrew
Arthur Petron
Alex Paino
Matthias Plappert
Glenn Powell
Raphael Ribas
Jonas Schneider
Nikolas Tezak
Jerry Tworek
Peter Welinder
Lilian Weng
Qiming Yuan
Wojciech Zaremba
Lei Zhang
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Solving Rubik's Cube with a Robot Hand"
50 / 775 papers shown
Title
Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and Stability
Juan Jose Garau-Luis
Yingjie Miao
John D. Co-Reyes
Aaron T Parisi
Jie Tan
Esteban Real
Aleksandra Faust
103
0
0
08 Apr 2022
Learning Purely Tactile In-Hand Manipulation with a Torque-Controlled Hand
Leon Sievers
Johannes Pitz
Berthold Bäuml
85
40
0
07 Apr 2022
Learning to Walk Autonomously via Reset-Free Quality-Diversity
Bryan Lim
A. Reichenbach
Antoine Cully
49
8
0
07 Apr 2022
Learning Generalizable Dexterous Manipulation from Human Grasp Affordance
Yueh-hua Wu
Jiashun Wang
Xiaolong Wang
138
62
0
05 Apr 2022
Autoencoder for Synthetic to Real Generalization: From Simple to More Complex Scenes
S. Cruz
B. Taetz
Thomas Stifter
D. Stricker
76
2
0
01 Apr 2022
Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning
David Howard
Josh Kannemeyer
Davide Dolcetti
Humphrey Munn
Nicole L. Robinson
87
5
0
29 Mar 2022
Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions
Alejandro Escontrela
Xue Bin Peng
Wenhao Yu
Tingnan Zhang
Atil Iscen
Ken Goldberg
Pieter Abbeel
89
120
0
28 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
107
123
0
25 Mar 2022
Dexterous Imitation Made Easy: A Learning-Based Framework for Efficient Dexterous Manipulation
Sridhar Pandian Arunachalam
Sneha Silwal
Ben Evans
Lerrel Pinto
96
107
0
24 Mar 2022
Teachable Reinforcement Learning via Advice Distillation
Olivia Watkins
Trevor Darrell
Pieter Abbeel
Jacob Andreas
Abhishek Gupta
OffRL
56
3
0
19 Mar 2022
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks
Fan Wu
Linyi Li
Chejian Xu
Huan Zhang
B. Kailkhura
K. Kenthapadi
Ding Zhao
Yue Liu
AAML
OffRL
74
38
0
16 Mar 2022
Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning
Satoshi Kataoka
Seyed Kamyar Seyed Ghasemipour
Daniel Freeman
Igor Mordatch
62
19
0
15 Mar 2022
Designing Underactuated Graspers with Dynamically Variable Geometry Using Potential Energy Map Based Analysis
Connor L. Yako
Shenli Yuan
J. Kenneth Salisbury
74
3
0
14 Mar 2022
Masked Visual Pre-training for Motor Control
Tete Xiao
Ilija Radosavovic
Trevor Darrell
Jitendra Malik
SSL
119
250
0
11 Mar 2022
Context is Everything: Implicit Identification for Dynamics Adaptation
Ben Evans
Abitha Thankaraj
Lerrel Pinto
70
20
0
10 Mar 2022
Safe Reinforcement Learning for Legged Locomotion
Tsung-Yen Yang
Tingnan Zhang
Linda Luu
Sehoon Ha
Jie Tan
Wenhao Yu
104
42
0
05 Mar 2022
AutoDIME: Automatic Design of Interesting Multi-Agent Environments
I. Kanitscheider
Harrison Edwards
58
0
0
04 Mar 2022
Evolving Curricula with Regret-Based Environment Design
Jack Parker-Holder
Minqi Jiang
Michael Dennis
Mikayel Samvelyan
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
122
125
0
02 Mar 2022
Dojo: A Differentiable Physics Engine for Robotics
Taylor A. Howell
Simon Le Cleac'h
Jan Brüdigam
J. Zico Kolter
Mac Schwager
Zachary Manchester
103
36
0
02 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
125
11
0
01 Mar 2022
Neuro-Inspired Deep Neural Networks with Sparse, Strong Activations
Metehan Cekic
Can Bakiskan
Upamanyu Madhow
46
7
0
26 Feb 2022
Improving generalization with synthetic training data for deep learning based quality inspection
Antoine Cordier
Pierre Gutierrez
Victoire Plessis
100
3
0
25 Feb 2022
ReorientBot: Learning Object Reorientation for Specific-Posed Placement
Kentaro Wada
Stephen James
Andrew J. Davison
79
29
0
22 Feb 2022
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation
Yuqing Du
Pieter Abbeel
Aditya Grover
109
18
0
22 Feb 2022
Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training
Peide Huang
Mengdi Xu
Fei Fang
Ding Zhao
155
38
0
19 Feb 2022
Robust Reinforcement Learning via Genetic Curriculum
Yeeho Song
J. Schneider
71
9
0
17 Feb 2022
Benchmarking Robot Manipulation with the Rubik's Cube
Boling Yang
Patrick E. Lancaster
S. Srinivasa
Joshua R. Smith
50
17
0
14 Feb 2022
Compute Trends Across Three Eras of Machine Learning
J. Sevilla
Lennart Heim
A. Ho
T. Besiroglu
Marius Hobbhahn
Pablo Villalobos
116
279
0
11 Feb 2022
Uncertainty Aware System Identification with Universal Policies
B. L. Semage
Thommen George Karimpanal
Santu Rana
Svetha Venkatesh
105
3
0
11 Feb 2022
Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning
Bryon Tjanaka
Matthew C. Fontaine
Julian Togelius
Stefanos Nikolaidis
86
54
0
08 Feb 2022
PolicyCleanse: Backdoor Detection and Mitigation in Reinforcement Learning
Junfeng Guo
Ang Li
Cong Liu
AAML
129
17
0
08 Feb 2022
DexVIP: Learning Dexterous Grasping with Human Hand Pose Priors from Video
Priyanka Mandikal
Kristen Grauman
197
98
0
01 Feb 2022
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Michael Laskin
Hao Liu
Xue Bin Peng
Denis Yarats
Aravind Rajeswaran
Pieter Abbeel
SSL
164
69
0
01 Feb 2022
Explaining Reinforcement Learning Policies through Counterfactual Trajectories
Julius Frost
Olivia Watkins
Eric Weiner
Pieter Abbeel
Trevor Darrell
Bryan A. Plummer
Kate Saenko
OffRL
84
6
0
29 Jan 2022
Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
149
31
0
28 Jan 2022
Closed-Loop Control of Direct Ink Writing via Reinforcement Learning
Michal Piovarči
Michael Foshey
Jie Xu
Timothy Erps
Vahid Babaei
Piotr Didyk
Szymon Rusinkiewicz
Wojciech Matusik
Bernd Bickel
AI4CE
49
23
0
27 Jan 2022
Surprisingly Robust In-Hand Manipulation: An Empirical Study
Aditya Bhatt
Adrian Sieler
Steffen Puhlmann
Oliver Brock
168
71
0
27 Jan 2022
Evolution Gym: A Large-Scale Benchmark for Evolving Soft Robots
Jagdeep Bhatia
Holly Jackson
Yunsheng Tian
Jie Xu
Wojciech Matusik
88
82
0
24 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
116
107
0
11 Jan 2022
Balsa: Learning a Query Optimizer Without Expert Demonstrations
Zongheng Yang
Wei-Lin Chiang
Sifei Luan
Gautam Mittal
Michael Luo
Ion Stoica
36
62
0
05 Jan 2022
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic
Yufeng Zhang
Siyu Chen
Zhuoran Yang
Michael I. Jordan
Zhaoran Wang
128
4
0
27 Dec 2021
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
182
31
0
27 Dec 2021
A Survey on Interpretable Reinforcement Learning
Claire Glanois
Paul Weng
Matthieu Zimmer
Dong Li
Tianpei Yang
Jianye Hao
Wulong Liu
OffRL
116
107
0
24 Dec 2021
Curriculum Learning for Safe Mapless Navigation
Luca Marzari
Davide Corsi
Enrico Marchesini
Alessandro Farinelli
79
15
0
23 Dec 2021
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
Rui Zhao
Jinming Song
Yufeng Yuan
Haifeng Hu
Yang Gao
Yi Wu
Zhongqian Sun
Yang Wei
95
69
0
22 Dec 2021
Off Environment Evaluation Using Convex Risk Minimization
Pulkit Katdare
Shuijing Liu
Katherine Driggs-Campbell
58
2
0
21 Dec 2021
Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization
Yufei Kuang
Miao Lu
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
56
22
0
20 Dec 2021
Learning Connectivity-Maximizing Network Configurations
Daniel Mox
Vijay Kumar
Alejandro Ribeiro
77
18
0
14 Dec 2021
Guided Imitation of Task and Motion Planning
M. McDonald
Dylan Hadfield-Menell
146
21
0
06 Dec 2021
Distilled Domain Randomization
J. Brosseit
Benedikt Hahner
Fabio Muratore
Michael Gienger
Jan Peters
45
4
0
06 Dec 2021
Previous
1
2
3
...
9
10
11
...
14
15
16
Next