Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.08973
Cited By
Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree Search
18 September 2021
Fan Bai
Fei Meng
Jianbang Liu
Jiankun Wang
Max Meng
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree Search"
11 / 11 papers shown
Title
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
54
117
0
11 Jun 2021
Visual Foresight Trees for Object Retrieval from Clutter with Nonprehensile Rearrangement
Baichuan Huang
Shuai D. Han
Jingjin Yu
Abdeslam Boularias
77
53
0
06 May 2021
Multi-Object Rearrangement with Monte Carlo Tree Search:A Case Study on Planar Nonprehensile Sorting
Haoran Song
Joshua A. Haustein
Weihao Yuan
Kaiyu Hang
M. Y. Wang
Danica Kragic
J. A. Stork
56
55
0
15 Dec 2019
Simulation-based reinforcement learning for real-world autonomous driving
B. Osinski
Adam Jakubowski
Piotr Milos
Pawel Ziecina
Christopher Galias
S. Homoceanu
Henryk Michalewski
77
122
0
29 Nov 2019
Model-free Deep Reinforcement Learning for Urban Autonomous Driving
Jianyu Chen
Bodi Yuan
Masayoshi Tomizuka
61
265
0
20 Apr 2019
Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning
Weihao Yuan
J. A. Stork
Danica Kragic
M. Y. Wang
Kaiyu Hang
37
55
0
15 Mar 2018
Sim2Real View Invariant Visual Servoing by Recurrent Control
Fereshteh Sadeghi
Alexander Toshev
Eric Jang
Sergey Levine
48
99
0
20 Dec 2017
Virtual to Real Reinforcement Learning for Autonomous Driving
Xinlei Pan
Yurong You
Ziyan Wang
Cewu Lu
OffRL
69
336
0
13 Apr 2017
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates
S. Gu
E. Holly
Timothy Lillicrap
Sergey Levine
OffRL
SSL
114
1,480
0
03 Oct 2016
End to End Learning for Self-Driving Cars
Mariusz Bojarski
D. Testa
Daniel Dworakowski
Bernhard Firner
B. Flepp
...
Urs Muller
Jiakai Zhang
Xin Zhang
Jake Zhao
Karol Zieba
SSL
100
4,170
0
25 Apr 2016
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
466
2,951
0
28 Feb 2010
1