Solving Rubik's Cube with a Robot Hand


16 October 2019
OpenAI
Ilge Akkaya
Marcin Andrychowicz
Maciek Chociej
Mateusz Litwin
Bob McGrew
Arthur Petron
Alex Paino
Matthias Plappert
Glenn Powell
Raphael Ribas
Jonas Schneider
Nikolas Tezak
Jerry Tworek
Peter Welinder
Lilian Weng
Qiming Yuan
Wojciech Zaremba
Lei Zhang
    ODL

Papers citing "Solving Rubik's Cube with a Robot Hand"

50 / 775 papers shown
Title
Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Yuping Luo
Tengyu Ma
OffRL
101
44
0
04 Aug 2021
SyDog: A Synthetic Dog Dataset for Improved 2D Pose Estimation
Moira Shooter
Charles Malleson
A. Hilton
35
15
0
31 Jul 2021
Learning more skills through optimistic exploration
D. Strouse
Kate Baumli
David Warde-Farley
Vlad Mnih
Steven Hansen
SSL
105
46
0
29 Jul 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
130
190
0
27 Jul 2021
DR2L: Surfacing Corner Cases to Robustify Autonomous Driving via Domain Randomization Reinforcement Learning
Haoyi Niu
Jianming Hu
Zheyu Cui
Yi Zhang
120
18
0
25 Jul 2021
Unsupervised Skill-Discovery and Skill-Learning in Minecraft
J. J. Nieto
Roger Creus
Xavier Giró-i-Nieto
SSL DRL
71
4
0
18 Jul 2021
An End-to-End Differentiable Framework for Contact-Aware Robot Design
Jie Xu
Tao Chen
Lara Zlokapa
Michael Foshey
Wojciech Matusik
Shinjiro Sueda
Pulkit Agrawal
97
91
0
15 Jul 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
369
118
0
13 Jul 2021
Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning
Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
OffRL
48
1
0
13 Jul 2021
The Role of Pretrained Representations for the OOD Generalization of Reinforcement Learning Agents
Andrea Dittadi
Frederik Trauble
M. Wuthrich
Felix Widmaier
Peter V. Gehler
Ole Winther
Francesco Locatello
Olivier Bachem
Bernhard Schölkopf
Stefan Bauer
OOD
110
16
0
12 Jul 2021
Unity Perception: Generate Synthetic Data for Computer Vision
S. Borkman
A. Crespi
S. Dhakad
Sujoy Ganguly
Jonathan Hogins
...
Cesar Romero
Wesley Smith
Alex Thaman
Samuel Warren
Nupur Yadav
3DV SyDa VLM
81
102
0
09 Jul 2021
Adaptation of Quadruped Robot Locomotion with Meta-Learning
A. Kuzhamuratov
Dmitry Sorokin
Alexander Ulanov
A. Lvovsky
40
0
0
08 Jul 2021
RRL: Resnet as representation for Reinforcement Learning
Rutav Shah
Vikash Kumar
OffRL
107
115
0
07 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
77
6
0
07 Jul 2021
SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo
Thomas Kollar
Michael Laskey
Kevin Stone
Brijen Thananjeyan
Mark Tjersland
126
25
0
30 Jun 2021
Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
I. Kanitscheider
Joost Huizinga
David Farhi
William H. Guss
Brandon Houghton
...
Bowen Baker
Adrien Ecoffet
Jie Tang
Oleg Klimov
Jeff Clune
75
22
0
28 Jun 2021
Discovering Generalizable Skills via Automated Generation of Diverse Tasks
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
82
6
0
26 Jun 2021
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation
C. Freeman
Erik Frey
Anton Raichuk
Sertan Girgin
Igor Mordatch
Olivier Bachem
117
380
0
24 Jun 2021
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Dapeng Li
Yunpeng Bai
Guoliang Fan
32
11
0
22 Jun 2021
Interpretable Model-based Hierarchical Reinforcement Learning using Inductive Logic Programming
Duo Xu
Faramarz Fekri
66
11
0
21 Jun 2021
Cat-like Jumping and Landing of Legged Robots in Low-gravity Using Deep Reinforcement Learning
Nikita Rudin
H. Kolvenbach
Vassilios Tsounis
Marco Hutter
81
86
0
17 Jun 2021
CROP: Certifying Robust Policies for Reinforcement Learning through Functional Smoothing
Fan Wu
Linyi Li
Zijian Huang
Yevgeniy Vorobeychik
Ding Zhao
Yue Liu
AAML OffRL
85
61
0
17 Jun 2021
Mungojerrie: Reinforcement Learning of Linear-Time Objectives
E. M. Hahn
Mateo Perez
S. Schewe
Fabio Somenzi
Ashutosh Trivedi
D. Wojtczak
64
10
0
16 Jun 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
70
30
0
16 Jun 2021
Tactile Sim-to-Real Policy Transfer via Real-to-Sim Image Translation
Alex Church
John Lloyd
R. Hadsell
Nathan Lepora
91
53
0
16 Jun 2021
Online Sub-Sampling for Reinforcement Learning with General Function Approximation
Dingwen Kong
Ruslan Salakhutdinov
Ruosong Wang
Lin F. Yang
OffRL
75
1
0
14 Jun 2021
Deception in Social Learning: A Multi-Agent Reinforcement Learning Perspective
P. Chelarescu
68
7
0
09 Jun 2021
Self-Paced Context Evaluation for Contextual Reinforcement Learning
Theresa Eimer
André Biedenkapp
Frank Hutter
Marius Lindauer
OffRL LRM
94
25
0
09 Jun 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
Kimin Lee
Laura M. Smith
Pieter Abbeel
OffRL
70
289
0
09 Jun 2021
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
Tengyang Xie
Nan Jiang
Huan Wang
Caiming Xiong
Yu Bai
OffRL OnRL
109
165
0
09 Jun 2021
RobustNav: Towards Benchmarking Robustness in Embodied Navigation
Prithvijit Chattopadhyay
Judy Hoffman
Roozbeh Mottaghi
Aniruddha Kembhavi
87
55
0
08 Jun 2021
Towards Learning to Play Piano with Dexterous Hands and Touch
Huazhe Xu
Yuping Luo
Shaoxiong Wang
Trevor Darrell
Roberto Calandra
174
30
0
03 Jun 2021
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
117
134
0
25 May 2021
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Yunru Bai
Dapeng Li
Bin Zhang
Guoliang Fan
74
9
0
13 May 2021
Learning a Skill-sequence-dependent Policy for Long-horizon Manipulation Tasks
Zhihao Li
Zhenglong Sun
Jionglong Su
Jiaming Zhang
35
3
0
12 May 2021
Reinforcement learning of rare diffusive dynamics
Avishek Das
Dominic C. Rose
J. P. Garrahan
David T. Limmer
109
28
0
10 May 2021
Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Niklas Funk
Charles B. Schaff
Rishabh Madan
Takuma Yoneda
Julen Urain De Jesus
...
Stefan Bauer
S. Srinivasa
Tapomayukh Bhattacharjee
Matthew R. Walter
Jan Peters
103
35
0
05 May 2021
Pre-training of Deep RL Agents for Improved Learning under Domain Randomization
Artemij Amiranashvili
Max Argus
Lukás Hermann
Wolfram Burgard
Thomas Brox
39
3
0
29 Apr 2021
Network Defense is Not a Game
Andres Molina-Markham
Ransom K. Winder
Ahmad Ridley
AAML
40
14
0
20 Apr 2021
Learning on a Budget via Teacher Imitation
Ercüment Ilhan
Jeremy Gow
Diego Perez-Liebana
OffRL
58
2
0
17 Apr 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
144
280
0
16 Apr 2021
Auto-Tuned Sim-to-Real Transfer
Yuqing Du
Olivia Watkins
Trevor Darrell
Pieter Abbeel
Deepak Pathak
88
73
0
15 Apr 2021
Curiosity-Driven Exploration via Latent Bayesian Surprise
Pietro Mazzaglia
Ozan Çatal
Tim Verbelen
Bart Dhoedt
108
35
0
15 Apr 2021
Online and Offline Reinforcement Learning by Planning with a Learned Model
Julian Schrittwieser
Thomas Hubert
Amol Mandhane
M. Barekatain
Ioannis Antonoglou
David Silver
OffRL
80
118
0
13 Apr 2021
Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment
Philip J. Ball
Cong Lu
Jack Parker-Holder
Stephen J. Roberts
OffRL
112
45
0
12 Apr 2021
A coevolutionary approach to deep multi-agent reinforcement learning
Daan Klijn
A. E. Eiben
58
8
0
12 Apr 2021
Selection-Expansion: A Unifying Framework for Motion-Planning and Diversity Search Algorithms
Alexandre Chenu
Nicolas Perrin-Gilbert
Stéphane Doncieux
Olivier Sigaud
49
1
0
10 Apr 2021
Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation
Jinyoung Choi
C. Dance
Jung-Eun Kim
Seulbin Hwang
Kyungsik Park
UQCV
58
26
0
07 Apr 2021
Synthetic training data generation for deep learning based quality inspection
Pierre Gutierrez
Maria Luschkova
Antoine Cordier
Mustafa Shukor
Mona Schappert
Tim Dahmen
39
23
0
07 Apr 2021
The Value of Planning for Infinite-Horizon Model Predictive Control
Nathan Hatch
Byron Boots
65
10
0
07 Apr 2021