Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.12560
Cited By
An Introduction to Deep Reinforcement Learning
30 November 2018
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An Introduction to Deep Reinforcement Learning"
50 / 178 papers shown
Title
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
389
5,362
0
05 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
87
757
0
03 Nov 2016
TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games
Gabriel Synnaeve
Nantas Nardelli
Alex Auvolat
Soumith Chintala
Timothée Lacroix
Zeming Lin
Florian Richoux
Nicolas Usunier
GNN
25
105
0
01 Nov 2016
Sim-to-Real Robot Learning from Pixels with Progressive Nets
Andrei A. Rusu
Matej Vecerík
Thomas Rothörl
N. Heess
Razvan Pascanu
R. Hadsell
66
532
0
13 Oct 2016
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates
S. Gu
E. Holly
Timothy Lillicrap
Sergey Levine
OffRL
SSL
104
1,477
0
03 Oct 2016
Video Pixel Networks
Nal Kalchbrenner
Aaron van den Oord
Karen Simonyan
Ivo Danihelka
Oriol Vinyals
Alex Graves
Koray Kavukcuoglu
59
423
0
03 Oct 2016
Playing FPS Games with Deep Reinforcement Learning
Guillaume Lample
Devendra Singh Chaplot
OffRL
EgoV
63
584
0
18 Sep 2016
Towards Deep Symbolic Reinforcement Learning
M. Garnelo
Kai Arulkumaran
Murray Shanahan
62
226
0
18 Sep 2016
Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning
Yuke Zhu
Roozbeh Mottaghi
Eric Kolve
Joseph J. Lim
Abhinav Gupta
Li Fei-Fei
Ali Farhadi
VGen
54
1,516
0
16 Sep 2016
The Option-Critic Architecture
Pierre-Luc Bacon
J. Harb
Doina Precup
OffRL
50
1,076
0
16 Sep 2016
Extending the OpenAI Gym for robotics: a toolkit for reinforcement learning using ROS and Gazebo
I. Zamora
N. G. Lopez
Víctor Mayoral-Vilches
A. Cordero
GP
39
159
0
19 Aug 2016
An Actor-Critic Algorithm for Sequence Prediction
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
102
637
0
24 Jul 2016
Concrete Problems in AI Safety
Dario Amodei
C. Olah
Jacob Steinhardt
Paul Christiano
John Schulman
Dandelion Mané
147
2,371
0
21 Jun 2016
Strategic Attentive Writer for Learning Macro-Actions
Alexander
A. Vezhnevets
Volodymyr Mnih
J. Agapiou
Simon Osindero
Alex Graves
Oriol Vinyals
Koray Kavukcuoglu
39
171
0
15 Jun 2016
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
114
3,089
0
10 Jun 2016
Cooperative Inverse Reinforcement Learning
Dylan Hadfield-Menell
Anca Dragan
Pieter Abbeel
Stuart J. Russell
63
643
0
09 Jun 2016
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
122
611
0
08 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
162
1,465
0
06 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
179
5,056
0
05 Jun 2016
Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information
Yoad Lewenberg
Valliappa Chockalingam
Satinder Singh
Honglak Lee
CVBM
46
304
0
29 May 2016
Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
158
1,139
0
25 May 2016
Unsupervised Learning for Physical Interaction through Video Prediction
Chelsea Finn
Ian Goodfellow
Sergey Levine
65
1,042
0
23 May 2016
ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning
Michal Kempka
Marek Wydmuch
Grzegorz Runc
Jakub Toczek
Wojciech Ja'skowski
53
695
0
06 May 2016
End to End Learning for Self-Driving Cars
Mariusz Bojarski
D. Testa
Daniel Dworakowski
Bernhard Firner
B. Flepp
...
Urs Muller
Jiakai Zhang
Xin Zhang
Jake Zhao
Karol Zieba
SSL
58
4,153
0
25 Apr 2016
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Chen Tessler
Shahar Givony
Tom Zahavy
D. Mankowitz
Shie Mannor
CLL
112
378
0
25 Apr 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
66
1,689
0
22 Apr 2016
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Tejas D. Kulkarni
Karthik Narasimhan
A. Saeedi
J. Tenenbaum
51
1,133
0
20 Apr 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
225
573
0
04 Apr 2016
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Martín Abadi
Ashish Agarwal
P. Barham
E. Brevdo
Zhiwen Chen
...
Pete Warden
Martin Wattenberg
Martin Wicke
Yuan Yu
Xiaoqiang Zheng
189
11,135
0
14 Mar 2016
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
62
1,010
0
02 Mar 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn
Sergey Levine
Pieter Abbeel
95
946
0
01 Mar 2016
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Christian Szegedy
Sergey Ioffe
Vincent Vanhoucke
Alexander A. Alemi
299
14,196
0
23 Feb 2016
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
66
1,302
0
15 Feb 2016
Adaptive Skills, Adaptive Partitions (ASAP)
D. Mankowitz
Timothy A. Mann
Shie Mannor
33
58
0
10 Feb 2016
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
61
650
0
09 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
168
8,805
0
04 Feb 2016
Taming the Noise in Reinforcement Learning via Soft Updates
Roy Fox
Ari Pakman
Naftali Tishby
35
336
0
28 Dec 2015
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.4K
192,638
0
10 Dec 2015
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies
Vincent François-Lavet
R. Fonteneau
D. Ernst
33
110
0
07 Dec 2015
Adapting Deep Visuomotor Representations with Weak Pairwise Constraints
Eric Tzeng
Coline Devin
Judy Hoffman
Chelsea Finn
Pieter Abbeel
Sergey Levine
Kate Saenko
Trevor Darrell
OOD
59
139
0
23 Nov 2015
Sequence Level Training with Recurrent Neural Networks
MarcÁurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
75
1,611
0
20 Nov 2015
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
67
3,742
0
20 Nov 2015
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
OffRL
68
594
0
19 Nov 2015
Policy Distillation
Andrei A. Rusu
Sergio Gomez Colmenarejo
Çağlar Gülçehre
Guillaume Desjardins
J. Kirkpatrick
Razvan Pascanu
Volodymyr Mnih
Koray Kavukcuoglu
R. Hadsell
58
687
0
19 Nov 2015
Neural Programmer-Interpreters
Scott E. Reed
Nando de Freitas
70
408
0
19 Nov 2015
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
198
3,781
0
18 Nov 2015
Net2Net: Accelerating Learning via Knowledge Transfer
Tianqi Chen
Ian Goodfellow
Jonathon Shlens
100
663
0
18 Nov 2015
Deep multi-scale video prediction beyond mean square error
Michaël Mathieu
Camille Couprie
Yann LeCun
GAN
109
1,880
0
17 Nov 2015
Neural Programmer: Inducing Latent Programs with Gradient Descent
Arvind Neelakantan
Quoc V. Le
Ilya Sutskever
ODL
62
263
0
16 Nov 2015
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
141
621
0
11 Nov 2015
Previous
1
2
3
4
Next