Solving Rubik's Cube with a Robot Hand


16 October 2019
OpenAI
Ilge Akkaya
Marcin Andrychowicz
Maciek Chociej
Mateusz Litwin
Bob McGrew
Arthur Petron
Alex Paino
Matthias Plappert
Glenn Powell
Raphael Ribas
Jonas Schneider
Nikolas Tezak
Jerry Tworek
Peter Welinder
Lilian Weng
Qiming Yuan
Wojciech Zaremba
Lei Zhang
    ODL

Papers citing "Solving Rubik's Cube with a Robot Hand"

50 / 775 papers shown
Title
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
84
26
0
27 Oct 2022
Learning Deep Sensorimotor Policies for Vision-based Autonomous Drone Racing
Jiawei Fu
Yunlong Song
Yongpeng Wu
Feng Yu
Davide Scaramuzza
114
21
0
26 Oct 2022
Environment Design for Inverse Reinforcement Learning
Thomas Kleine Buening
Victor Villin
Christos Dimitrakakis
102
1
0
26 Oct 2022
Will we run out of data? Limits of LLM scaling based on human-generated data
Pablo Villalobos
A. Ho
J. Sevilla
T. Besiroglu
Lennart Heim
Marius Hobbhahn
ALM
102
125
0
26 Oct 2022
DeXtreme: Transfer of Agile In-hand Manipulation from Simulation to Reality
Ankur Handa
Arthur Allshire
Viktor Makoviychuk
Aleksei Petrenko
Ritvik Singh
...
Balakumar Sundaralingam
Yashraj S. Narang
Jean-Francois Lafleche
Dieter Fox
Gavriel State
139
157
0
25 Oct 2022
Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation
Zoey Qiuyu Chen
Karl Van Wyk
Yu-Wei Chao
Wei Yang
Arsalan Mousavian
Abhishek Gupta
Dieter Fox
91
29
0
24 Oct 2022
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Joshua Albrecht
Abraham J. Fetterman
Bryden Fogelman
Ellie Kitanidis
Bartosz Wróblewski
...
Michael Rosenthal
Maksis Knutins
Zachary Polizzi
James B. Simon
Kanjun Qiu
OffRL
87
23
0
24 Oct 2022
Evaluating Long-Term Memory in 3D Mazes
J. Pašukonis
Timothy Lillicrap
Danijar Hafner
3DV
88
23
0
24 Oct 2022
Co-Training an Observer and an Evading Target
André Brandenburger
Folker Hoffmann
A. Charlish
71
1
0
20 Oct 2022
Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation
Peide Huang
Mengdi Xu
Jiacheng Zhu
Laixi Shi
Fei Fang
Ding Zhao
CLL
101
25
0
18 Oct 2022
Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion
Zipeng Fu
Xuxin Cheng
Deepak Pathak
108
159
0
18 Oct 2022
Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity
Maxime Allard
Simón C. Smith
Konstantinos Chatzilygeroudis
Bryan Lim
Antoine Cully
63
13
0
18 Oct 2022
PI-QT-Opt: Predictive Information Improves Multi-Task Robotic Reinforcement Learning at Scale
Kuang-Huei Lee
Ted Xiao
A. Li
Paul Wohlhart
Ian S. Fischer
Yao Lu
120
10
0
15 Oct 2022
Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion
Lev Grossman
Brian Plancher
MQ
69
4
0
14 Oct 2022
Skill-Based Reinforcement Learning with Intrinsic Reward Matching
Ademi Adeniji
Amber Xie
Pieter Abbeel
OffRL
73
5
0
14 Oct 2022
Policy Gradient With Serial Markov Chain Reasoning
Edoardo Cetin
Oya Celiktutan
BDL LRM
58
2
0
13 Oct 2022
Holo-Dex: Teaching Dexterity with Immersive Mixed Reality
Sridhar Pandian Arunachalam
Irmak Güzey
Soumith Chintala
Lerrel Pinto
109
74
0
12 Oct 2022
A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based In-Hand Manipulation
Lingfeng Tao
Jiucai Zhang
Michael Bowman
Xiaoli Zhang
67
6
0
11 Oct 2022
NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields
Arunkumar Byravan
Jan Humplik
Leonard Hasenclever
Arthur Brussee
F. Nori
...
Ben Moran
Steven Bohez
Fereshteh Sadeghi
Bojan Vujatovic
N. Heess
160
57
0
10 Oct 2022
Efficient Learning of Locomotion Skills through the Discovery of Diverse Environmental Trajectory Generator Priors
Shikha Surana
Bryan Lim
Antoine Cully
79
4
0
10 Oct 2022
GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot
Tianli Ding
L. Graesser
Saminda Abeyruwan
David B. D'Ambrosio
Anish Shankar
P. Sermanet
Pannag R Sanketi
Corey Lynch
125
22
0
07 Oct 2022
DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation
Ruicheng Wang
Jialiang Zhang
Jiayi Chen
Yinzhen Xu
Puhao Li
Tengyu Liu
He Wang
145
124
0
06 Oct 2022
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals
Rohin Shah
Vikrant Varma
Ramana Kumar
Mary Phuong
Victoria Krakovna
J. Uesato
Zachary Kenton
99
72
0
04 Oct 2022
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
93
22
0
04 Oct 2022
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms
Fan Chen
Yu Bai
Song Mei
100
22
0
29 Sep 2022
Learning Low-Frequency Motion Control for Robust and Dynamic Robot Locomotion
Siddhant Gangapurwala
Luigi Campanaro
Ioannis Havoutis
101
13
0
29 Sep 2022
DexTransfer: Real World Multi-fingered Dexterous Grasping with Minimal Human Demonstrations
Zoey Qiuyu Chen
Karl Van Wyk
Yu-Wei Chao
Wei Yang
Arsalan Mousavian
Abhishek Gupta
Dieter Fox
101
22
0
28 Sep 2022
Learn what matters: cross-domain imitation learning with task-relevant embeddings
Tim Franzmeyer
Philip Torr
João F. Henriques
OOD
90
22
0
24 Sep 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Kang Xu
Yan Ma
Bingsheng Wei
Wei Li
84
3
0
24 Sep 2022
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels
Sai Rajeswar
Pietro Mazzaglia
Tim Verbelen
Alexandre Piché
Bart Dhoedt
Rameswar Panda
Alexandre Lacoste
SSL
102
21
0
24 Sep 2022
Grouped Adaptive Loss Weighting for Person Search
Yanling Tian
Di Chen
Yunan Liu
Shanshan Zhang
Jian Yang
91
5
0
23 Sep 2022
Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps
Sudeep Dasari
Abhi Gupta
Vikash Kumar
113
43
0
22 Sep 2022
Optimizing Crop Management with Reinforcement Learning and Imitation Learning
Ran Tao
Pan Zhao
Jing Wu
N. F. Martin
M. Harrison
C. Ferreira
Z. Kalantari
N. Hovakimyan
OffRL
61
26
0
20 Sep 2022
Robust Reinforcement Learning Algorithm for Vision-based Ship Landing of UAVs
Vishnu Saj
Bochan Lee
D. Kalathil
Moble Benedict
58
5
0
17 Sep 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Yue Liu
Ding Zhao
175
47
0
16 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL OnRL
80
11
0
15 Sep 2022
Meta-Reinforcement Learning via Language Instructions
Zhenshan Bing
A. Koch
Xiangtong Yao
Kai-Qi Huang
Alois C. Knoll
LM&Ro
122
19
0
11 Sep 2022
Instruction-driven history-aware policies for robotic manipulations
Pierre-Louis Guhur
Shizhe Chen
Ricardo Garcia Pinel
Makarand Tapaswi
Ivan Laptev
Cordelia Schmid
LM&Ro
195
109
0
11 Sep 2022
Real-to-Sim: Predicting Residual Errors of Robotic Systems with Sparse Data using a Learning-based Unscented Kalman Filter
Alexander Schperberg
Yusuke Tanaka
Feng Xu
Marcel Menner
Dennis W. Hong
77
5
0
07 Sep 2022
What deep reinforcement learning tells us about human motor learning and vice-versa
Michele Garibbo
Casimir J. H. Ludwig
Nathan Lepora
Laurence Aitchison
67
0
0
23 Aug 2022
Learning Ball-balancing Robot Through Deep Reinforcement Learning
Yifan Zhou
Jianghao Lin
Shuai Wang
Chong Zhang
28
9
0
22 Aug 2022
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
73
105
0
16 Aug 2022
AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning
Wangyang Yue
Yuan Zhou
Xiaochuan Zhang
Yuchen Hua
Zhiyuan Wang
Guang Kou
OffRL
45
3
0
03 Aug 2022
Learning Fast and Precise Pixel-to-Torque Control
Steffen Bleher
Steve Heim
Sebastian Trimpe
73
2
0
03 Aug 2022
Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization
Y. Kadokawa
Lingwei Zhu
Yoshihisa Tsurumine
Takamitsu Matsubara
60
8
0
29 Jul 2022
Learning Dynamic Manipulation Skills from Haptic-Play
Taeyoon Lee
D. Sung
Kyoung-Whan Choi
Choong-Keun Lee
Changwoo Park
Keunjun Choi
88
3
0
28 Jul 2022
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations
Achkan Salehi
Steffen Rühl
Stéphane Doncieux
AI4CE
90
2
0
25 Jul 2022
Towards Using Fully Observable Policies for POMDPs
András Attila Sulyok
K. Karacs
107
1
0
24 Jul 2022
Incorporating Prior Knowledge into Reinforcement Learning for Soft Tissue Manipulation with Autonomous Grasping Point Selection
Xian He
Shuai Zhang
Shanlin Yang
Bo Ouyang
25
0
0
21 Jul 2022
Human-to-Robot Imitation in the Wild
Shikhar Bahl
Abhi Gupta
Deepak Pathak
123
174
0
19 Jul 2022