1810.09028
Cited By
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
21 October 2018
Michael Schaarschmidt
Sven Mika
Kai Fricke
Eiko Yoneki
OffRL
Papers citing
"RLgraph: Modular Computation Graphs for Deep Reinforcement Learning"
18 / 18 papers shown
Title
LIFT: Reinforcement Learning in Computer Systems by Learning From Demonstrations
Michael Schaarschmidt
A. Kuhnle
Ben Ellis
Kai Fricke
Felix Gessert
Eiko Yoneki
OffRL
53
41
0
23 Aug 2018
Beyond Data and Model Parallelism for Deep Neural Networks
Zhihao Jia
Matei A. Zaharia
A. Aiken
GNN
AI4CE
64
505
0
14 Jul 2018
Dynamic Control Flow in Large-Scale Machine Learning
Yuan Yu
Martín Abadi
P. Barham
E. Brevdo
M. Burrows
...
Michael Isard
M. Kudlur
R. Monga
D. Murray
Xiaoqiang Zheng
AI4CE
74
106
0
04 May 2018
Unsupervised Predictive Memory in a Goal-Directed Agent
Greg Wayne
Chia-Chun Hung
David Amos
Mehdi Mirza
Arun Ahuja
...
David Silver
Koray Kavukcuoglu
M. Botvinick
Demis Hassabis
Timothy Lillicrap
81
192
0
28 Mar 2018
Simple random search provides a competitive approach to reinforcement learning
Horia Mania
Aurelia Guy
Benjamin Recht
62
316
0
19 Mar 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
151
741
0
02 Mar 2018
Horovod: fast and easy distributed deep learning in TensorFlow
Alexander Sergeev
Mike Del Balso
100
1,221
0
15 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
237
1,605
0
05 Feb 2018
Ray: A Distributed Framework for Emerging AI Applications
Philipp Moritz
Robert Nishihara
Stephanie Wang
Alexey Tumanov
Richard Liaw
...
Melih Elibol
Zongheng Yang
William Paul
Michael I. Jordan
Ion Stoica
GNN
107
1,267
0
16 Dec 2017
Domain Randomization and Generative Models for Robotic Grasping
Joshua Tobin
Lukas Biewald
Rocky Duan
Marcin Andrychowicz
Ankur Handa
...
Bob McGrew
Jonas Schneider
Peter Welinder
Wojciech Zaremba
Pieter Abbeel
OOD
86
175
0
17 Oct 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
123
1,963
0
19 Sep 2017
TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow
Danijar Hafner
James Davidson
Vincent Vanhoucke
OffRL
42
49
0
08 Sep 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
535
19,265
0
20 Jul 2017
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
223
5,086
0
05 Jun 2016
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Zhifeng Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zheng
GNN
AI4CE
433
18,361
0
27 May 2016
MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems
Tianqi Chen
Mu Li
Yutian Li
Min Lin
Naiyan Wang
Minjie Wang
Tianjun Xiao
Bing Xu
Chiyuan Zhang
Zheng Zhang
200
2,248
0
03 Dec 2015
End-to-End Training of Deep Visuomotor Policies
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
BDL
315
3,443
0
02 Apr 2015
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
120
3,021
0
19 Jul 2012