ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.07113
  4. Cited By
Solving Rubik's Cube with a Robot Hand

Solving Rubik's Cube with a Robot Hand

16 October 2019
OpenAI
Ilge Akkaya
Marcin Andrychowicz
Maciek Chociej
Ma-teusz Litwin
Bob McGrew
Arthur Petron
Alex Paino
Matthias Plappert
Glenn Powell
Raphael Ribas
Jonas Schneider
Nikolas Tezak
Jerry Tworek
Peter Welinder
Lilian Weng
Qiming Yuan
Wojciech Zaremba
Lei Zhang
    ODL
ArXivPDFHTML

Papers citing "Solving Rubik's Cube with a Robot Hand"

50 / 282 papers shown
Title
Implicitly Regularized RL with Implicit Q-Values
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
Chen Wang
Claudia Pérez-DÁrpino
Danfei Xu
Li Fei-Fei
Chenxi Liu
Silvio Savarese
42
33
0
13 Aug 2021
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
Yuzhe Qin
Yueh-hua Wu
Shaowei Liu
Hanwen Jiang
Ruihan Yang
Yang Fu
Xiaolong Wang
134
190
0
12 Aug 2021
Skill Preferences: Learning to Extract and Execute Robotic Skills from
  Human Feedback
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Xiaofei Wang
Kimin Lee
Kourosh Hakhamaneshi
Pieter Abbeel
Michael Laskin
29
42
0
11 Aug 2021
BEHAVIOR: Benchmark for Everyday Household Activities in Virtual,
  Interactive, and Ecological Environments
BEHAVIOR: Benchmark for Everyday Household Activities in Virtual, Interactive, and Ecological Environments
S. Srivastava
Chengshu Li
Michael Lingelbach
Roberto Martín-Martín
Fei Xia
...
Chenxi Liu
Silvio Savarese
H. Gweon
Jiajun Wu
Li Fei-Fei
LM&Ro
151
157
0
06 Aug 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
43
181
0
27 Jul 2021
DR2L: Surfacing Corner Cases to Robustify Autonomous Driving via Domain
  Randomization Reinforcement Learning
DR2L: Surfacing Corner Cases to Robustify Autonomous Driving via Domain Randomization Reinforcement Learning
Haoyi Niu
Jianming Hu
Zheyu Cui
Yi Zhang
36
16
0
25 Jul 2021
An End-to-End Differentiable Framework for Contact-Aware Robot Design
An End-to-End Differentiable Framework for Contact-Aware Robot Design
Jie Xu
Tao Chen
Lara Zlokapa
Michael Foshey
Wojciech Matusik
Shinjiro Sueda
Pulkit Agrawal
27
88
0
15 Jul 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit
  Partial Observability
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
278
109
0
13 Jul 2021
RRL: Resnet as representation for Reinforcement Learning
RRL: Resnet as representation for Reinforcement Learning
Rutav Shah
Vikash Kumar
OffRL
33
111
0
07 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real
  world: aligning domain-agnostic and domain-specific research
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic
  Data via Stereo
SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo
Thomas Kollar
Michael Laskey
Kevin Stone
Brijen Thananjeyan
Mark Tjersland
48
25
0
30 Jun 2021
Discovering Generalizable Skills via Automated Generation of Diverse
  Tasks
Discovering Generalizable Skills via Automated Generation of Diverse Tasks
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
48
6
0
26 Jun 2021
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body
  Simulation
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation
C. Freeman
Erik Frey
Anton Raichuk
Sertan Girgin
Igor Mordatch
Olivier Bachem
48
350
0
24 Jun 2021
Interpretable Model-based Hierarchical Reinforcement Learning using
  Inductive Logic Programming
Interpretable Model-based Hierarchical Reinforcement Learning using Inductive Logic Programming
Duo Xu
Faramarz Fekri
21
10
0
21 Jun 2021
Cat-like Jumping and Landing of Legged Robots in Low-gravity Using Deep
  Reinforcement Learning
Cat-like Jumping and Landing of Legged Robots in Low-gravity Using Deep Reinforcement Learning
Nikita Rudin
H. Kolvenbach
Vassilios Tsounis
Marco Hutter
27
84
0
17 Jun 2021
Mungojerrie: Reinforcement Learning of Linear-Time Objectives
Mungojerrie: Reinforcement Learning of Linear-Time Objectives
E. M. Hahn
Mateo Perez
S. Schewe
F. Somenzi
Ashutosh Trivedi
D. Wojtczak
19
10
0
16 Jun 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain
  Transfer in Offline RL
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
27
29
0
16 Jun 2021
Tactile Sim-to-Real Policy Transfer via Real-to-Sim Image Translation
Tactile Sim-to-Real Policy Transfer via Real-to-Sim Image Translation
Alex Church
John Lloyd
R. Hadsell
Nathan Lepora
35
50
0
16 Jun 2021
RobustNav: Towards Benchmarking Robustness in Embodied Navigation
RobustNav: Towards Benchmarking Robustness in Embodied Navigation
Prithvijit Chattopadhyay
Judy Hoffman
Roozbeh Mottaghi
Aniruddha Kembhavi
25
55
0
08 Jun 2021
From Motor Control to Team Play in Simulated Humanoid Football
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
31
129
0
25 May 2021
Reinforcement learning of rare diffusive dynamics
Reinforcement learning of rare diffusive dynamics
Avishek Das
Dominic C. Rose
J. P. Garrahan
David T. Limmer
16
27
0
10 May 2021
Learning on a Budget via Teacher Imitation
Learning on a Budget via Teacher Imitation
Ercüment Ilhan
Jeremy Gow
Diego Perez-Liebana
OffRL
27
2
0
17 Apr 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
47
271
0
16 Apr 2021
Auto-Tuned Sim-to-Real Transfer
Auto-Tuned Sim-to-Real Transfer
Yuqing Du
Olivia Watkins
Trevor Darrell
Pieter Abbeel
Deepak Pathak
27
69
0
15 Apr 2021
Sim-to-Real for Robotic Tactile Sensing via Physics-Based Simulation and
  Learned Latent Projections
Sim-to-Real for Robotic Tactile Sensing via Physics-Based Simulation and Learned Latent Projections
Yashraj S. Narang
Balakumar Sundaralingam
Miles Macklin
Arsalan Mousavian
Dieter Fox
30
58
0
31 Mar 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
27
21
0
17 Mar 2021
Lyapunov Barrier Policy Optimization
Lyapunov Barrier Policy Optimization
Harshit S. Sikchi
Wenxuan Zhou
David Held
26
14
0
16 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with
  Curiosity Contrastive Forward Dynamics Model
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
23
17
0
15 Mar 2021
Behavior From the Void: Unsupervised Active Pre-Training
Behavior From the Void: Unsupervised Active Pre-Training
Hao Liu
Pieter Abbeel
VLM
SSL
41
195
0
08 Mar 2021
Machine Learning for Mechanical Ventilation Control
Machine Learning for Mechanical Ventilation Control
Daniel Suo
Naman Agarwal
Wenhan Xia
Xinyi Chen
Udaya Ghai
...
J. LaChance
Tom Zadjel
Manuel Schottdorf
Daniel J. Cohen
Elad Hazan
OOD
AI4CE
54
10
0
12 Feb 2021
Embodied Intelligence via Learning and Evolution
Embodied Intelligence via Learning and Evolution
Agrim Gupta
Silvio Savarese
Surya Ganguli
Li Fei-Fei
AI4CE
22
230
0
03 Feb 2021
Asymmetric self-play for automatic goal discovery in robotic
  manipulation
Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI OpenAI
Matthias Plappert
Raul Sampedro
Tao Xu
Ilge Akkaya
...
Hyeonwoo Noh
Lilian Weng
Qiming Yuan
Casey Chu
Wojciech Zaremba
SSL
82
76
0
13 Jan 2021
The Distracting Control Suite -- A Challenging Benchmark for
  Reinforcement Learning from Pixels
The Distracting Control Suite -- A Challenging Benchmark for Reinforcement Learning from Pixels
Austin Stone
Oscar Ramirez
K. Konolige
Rico Jonschkowski
137
101
0
07 Jan 2021
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A
  Detection Approach
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach
Amin Nikanjam
Mohammad Mehdi Morovati
Foutse Khomh
Houssem Ben Braiek
27
30
0
01 Jan 2021
LISPR: An Options Framework for Policy Reuse with Reinforcement Learning
LISPR: An Options Framework for Policy Reuse with Reinforcement Learning
D. Graves
Jun Jin
Jun Luo
38
2
0
29 Dec 2020
Learning Accurate Long-term Dynamics for Model-based Reinforcement
  Learning
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning
Nathan Lambert
Albert Wilcox
Howard Zhang
K. Pister
Roberto Calandra
25
32
0
16 Dec 2020
CARLA Real Traffic Scenarios -- novel training ground and benchmark for
  autonomous driving
CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving
B. Osinski
Piotr Milos
Adam Jakubowski
Pawel Ziecina
Michal Martyniak
Christopher Galias
Antonia Breuer
S. Homoceanu
Henryk Michalewski
19
20
0
16 Dec 2020
TACTO: A Fast, Flexible, and Open-source Simulator for High-Resolution
  Vision-based Tactile Sensors
TACTO: A Fast, Flexible, and Open-source Simulator for High-Resolution Vision-based Tactile Sensors
Shaoxiong Wang
Mike Lambeta
Po-wei Chou
Roberto Calandra
18
135
0
15 Dec 2020
WILDS: A Benchmark of in-the-Wild Distribution Shifts
WILDS: A Benchmark of in-the-Wild Distribution Shifts
Pang Wei Koh
Shiori Sagawa
Henrik Marklund
Sang Michael Xie
Marvin Zhang
...
A. Kundaje
Emma Pierson
Sergey Levine
Chelsea Finn
Percy Liang
OOD
83
1,377
0
14 Dec 2020
Learning from Simulation, Racing in Reality
Learning from Simulation, Racing in Reality
Eugenio Chisari
Alexander Liniger
Alisa Rupenyan
Luc Van Gool
John Lygeros
33
25
0
26 Nov 2020
Regret Bounds for Adaptive Nonlinear Control
Regret Bounds for Adaptive Nonlinear Control
Nicholas M. Boffi
Stephen Tu
Jean-Jacques E. Slotine
41
47
0
26 Nov 2020
REALab: An Embedded Perspective on Tampering
REALab: An Embedded Perspective on Tampering
Ramana Kumar
J. Uesato
Richard Ngo
Tom Everitt
Victoria Krakovna
Shane Legg
27
10
0
17 Nov 2020
Fault-Aware Robust Control via Adversarial Reinforcement Learning
Fault-Aware Robust Control via Adversarial Reinforcement Learning
Fan Yang
Chao Yang
Di Guo
Huaping Liu
F. Sun
42
4
0
17 Nov 2020
Meta Automatic Curriculum Learning
Meta Automatic Curriculum Learning
Rémy Portelas
Clément Romac
Katja Hofmann
Pierre-Yves Oudeyer
35
8
0
16 Nov 2020
Emergent Reciprocity and Team Formation from Randomized Uncertain Social
  Preferences
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Bowen Baker
LRM
18
33
0
10 Nov 2020
RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer
RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer
Daniel Ho
Kanishka Rao
Zhuo Xu
Eric Jang
Mohi Khansari
Yunfei Bai
GAN
LM&Ro
45
97
0
06 Nov 2020
Dynamics Randomization Revisited:A Case Study for Quadrupedal Locomotion
Dynamics Randomization Revisited:A Case Study for Quadrupedal Locomotion
Zhaoming Xie
Xingye Da
M. van de Panne
Buck Babich
Animesh Garg
42
85
0
04 Nov 2020
Affordance as general value function: A computational model
Affordance as general value function: A computational model
D. Graves
Johannes Günther
Jun Luo
AI4CE
21
6
0
27 Oct 2020
High Acceleration Reinforcement Learning for Real-World Juggling with
  Binary Rewards
High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards
Kai Ploeger
M. Lutter
Jan Peters
22
29
0
26 Oct 2020
Previous
123456
Next