1708.07280
Cited By
Learning Generalized Reactive Policies using Deep Neural Networks
24 August 2017
E. Groshev
Maxwell Goldstein
Aviv Tamar
Siddharth Srivastava
Pieter Abbeel
Papers citing "Learning Generalized Reactive Policies using Deep Neural Networks"
17 / 17 papers shown
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
65
552
0
19 Jul 2017
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
222
2,307
0
05 Jul 2017
Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics
Ken Kansky
Tom Silver
David A. Mély
Mohamed Eldawy
Miguel Lazaro-Gredilla
Xinghua Lou
N. Dorfman
Szymon Sidor
Scott Phoenix
Dileep George
AI4CE
65
233
0
14 Jun 2017
Learning Combinatorial Optimization Algorithms over Graphs
H. Dai
Elias Boutros Khalil
Yuyu Zhang
B. Dilkina
Le Song
86
1,453
0
05 Apr 2017
One-Shot Imitation Learning
Yan Duan
Marcin Andrychowicz
Bradly C. Stadie
Jonathan Ho
Jonas Schneider
Ilya Sutskever
Pieter Abbeel
Wojciech Zaremba
OffRL
61
684
0
21 Mar 2017
Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation
Ashvin Nair
Dian Chen
Pulkit Agrawal
Phillip Isola
Pieter Abbeel
Jitendra Malik
Sergey Levine
SSL
37
306
0
06 Mar 2017
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
47
289
0
28 Dec 2016
From Perception to Decision: A Data-driven Approach to End-to-end Motion Planning for Autonomous Ground Robots
Mark Pfeiffer
Michaela Schaeuble
Juan I. Nieto
Roland Siegwart
Cesar Cadena
68
380
0
26 Sep 2016
Fully Convolutional Networks for Semantic Segmentation
Evan Shelhamer
Jonathan Long
Trevor Darrell
VOS
SSeg
310
37,704
0
20 May 2016
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
59
650
0
09 Feb 2016
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
806
149,474
0
22 Dec 2014
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
275
20,491
0
10 Sep 2014
The LAMA Planner: Guiding Cost-Based Anytime Planning with Landmarks
Silvia Richter
Matthias Westphal
48
734
0
16 Jan 2014
Inductive Policy Selection for First-Order MDPs
S. Yoon
Alan Fern
R. Givan
48
94
0
12 Dec 2012
The Fast Downward Planning System
M. Helmert
47
1,895
0
27 Sep 2011
PDDL2.1: An Extension to PDDL for Expressing Temporal Planning Domains
M. Fox
D. Long
70
2,168
0
22 Jun 2011
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
166
3,196
0
02 Nov 2010