Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.06887
Cited By
A Distributional Perspective on Reinforcement Learning
21 July 2017
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Distributional Perspective on Reinforcement Learning"
50 / 257 papers shown
Title
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
38
34
0
23 Dec 2019
Worst Cases Policy Gradients
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
21
75
0
09 Nov 2019
Probabilistic Successor Representations with Kalman Temporal Differences
J. Geerts
Kimberly L. Stachenfeld
Neil Burgess
14
13
0
06 Oct 2019
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action
Mathieu Seurin
Philippe Preux
Olivier Pietquin
18
12
0
04 Oct 2019
Benchmarking Batch Deep Reinforcement Learning Algorithms
Shih-Han Chou
Wen-Yen Chang
W. Hsu
Jianlong Fu
OffRL
15
181
0
03 Oct 2019
Task-Relevant Adversarial Imitation Learning
Konrad Zolna
Scott E. Reed
Alexander Novikov
Sergio Gomez Colmenarejo
David Budden
Serkan Cabi
Misha Denil
Nando de Freitas
Ziyun Wang
GAN
28
61
0
02 Oct 2019
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping
Cristian Bodnar
A. Li
Karol Hausman
P. Pastor
Mrinal Kalakrishnan
OffRL
20
50
0
01 Oct 2019
Scaling data-driven robotics with reward sketching and batch reinforcement learning
Serkan Cabi
Sergio Gomez Colmenarejo
Alexander Novikov
Ksenia Konyushkova
Scott E. Reed
...
David Barker
Jonathan Scholz
Misha Denil
Nando de Freitas
Ziyun Wang
OffRL
28
29
0
26 Sep 2019
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch
Adam Stooke
Pieter Abbeel
OffRL
24
96
0
03 Sep 2019
Deep reinforcement learning in World-Earth system models to discover sustainable management strategies
Felix M. Strnad
W. Barfuss
J. Donges
J. Heitzig
30
25
0
15 Aug 2019
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
24
24
0
13 Aug 2019
Accelerating Reinforcement Learning through GPU Atari Emulation
Steven Dalton
I. Frosio
M. Garland
ELM
21
9
0
19 Jul 2019
Capturing Financial markets to apply Deep Reinforcement Learning
Souradeep Chakraborty
AIFin
AI4TS
16
17
0
09 Jul 2019
Modern Deep Reinforcement Learning Algorithms
Sergey Ivanov
A. Dýakonov
OffRL
23
38
0
24 Jun 2019
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
32
285
0
12 Jun 2019
When to use parametric models in reinforcement learning?
H. V. Hasselt
Matteo Hessel
John Aslanides
13
188
0
12 Jun 2019
Learning to Score Behaviors for Guided Policy Optimization
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
A. Choromańska
K. Choromanski
Michael I. Jordan
19
38
0
11 Jun 2019
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
26
51
0
05 May 2019
Deep Reinforcement Learning with Decorrelation
B. Mavrin
Hengshuai Yao
Linglong Kong
32
8
0
18 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
23
17
0
11 Mar 2019
Hyperbolic Discounting and Learning over Multiple Horizons
W. Fedus
Carles Gelada
Yoshua Bengio
Marc G. Bellemare
Hugo Larochelle
32
105
0
19 Feb 2019
Artificial Intelligence for Prosthetics - challenge solutions
L. Kidzinski
Carmichael F. Ong
Sharada Mohanty
Jennifer Hicks
Sean F. Carroll
...
E. Tumer
J. Watson
M. Salathé
Sergey Levine
Scott L. Delp
15
40
0
07 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
24
361
0
30 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
19
7
0
23 Jan 2019
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
19
276
0
14 Dec 2018
Wireless Network Intelligence at the Edge
Jihong Park
S. Samarakoon
M. Bennis
Mérouane Debbah
21
518
0
07 Dec 2018
Relative Entropy Regularized Policy Iteration
A. Abdolmaleki
Jost Tobias Springenberg
Jonas Degrave
Steven Bohez
Yuval Tassa
Dan Belov
N. Heess
Martin Riedmiller
27
72
0
05 Dec 2018
Preparing for the Unexpected: Diversity Improves Planning Resilience in Evolutionary Algorithms
Thomas Gabor
Lenz Belzner
Thomy Phan
Kyrill Schmid
19
14
0
30 Oct 2018
Applications of Deep Reinforcement Learning in Communications and Networking: A Survey
Nguyen Cong Luong
D. Hoang
Shimin Gong
Dusit Niyato
Ping Wang
Ying-Chang Liang
Dong In Kim
OffRL
57
1,422
0
18 Oct 2018
A Practical Approach to Insertion with Variable Socket Position Using Deep Reinforcement Learning
Mel Vecerík
Oleg O. Sushkov
David Barker
Thomas Rothörl
Todd Hester
Jonathan Scholz
19
110
0
02 Oct 2018
Combined Reinforcement Learning via Abstract Representations
Vincent François-Lavet
Yoshua Bengio
Doina Precup
Joelle Pineau
OffRL
30
89
0
12 Sep 2018
Unity: A General Platform for Intelligent Agents
Arthur Juliani
Vincent-Pierre Berges
Esh Vckay
Andrew Cohen
Jonathan Harper
...
Chris Goy
Yuan Gao
Hunter Henry
Marwan Mattar
Danny Lange
16
808
0
07 Sep 2018
Financial Trading as a Game: A Deep Reinforcement Learning Approach
Chien-Yi Huang
AIFin
29
72
0
08 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
30
212
0
20 Jun 2018
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
48
470
0
14 Jun 2018
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCV
BDL
21
372
0
08 Jun 2018
Temporal Difference Variational Auto-Encoder
Karol Gregor
George Papamakarios
F. Besse
Lars Buesing
Theophane Weber
DRL
24
126
0
08 Jun 2018
Equivalence Between Wasserstein and Value-Aware Loss for Model-based Reinforcement Learning
Kavosh Asadi
Evan Cater
Dipendra Kumar Misra
Michael L. Littman
OffRL
11
11
0
01 Jun 2018
Meta-Gradient Reinforcement Learning
Zhongwen Xu
H. V. Hasselt
David Silver
38
324
0
24 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
6
42
0
09 May 2018
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
41
30
0
04 May 2018
Lipschitz Continuity in Model-based Reinforcement Learning
Kavosh Asadi
Dipendra Kumar Misra
Michael L. Littman
KELM
34
150
0
19 Apr 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
78
1,656
0
30 Mar 2018
Policy Search in Continuous Action Domains: an Overview
Olivier Sigaud
F. Stulp
16
72
0
13 Mar 2018
Accelerated Methods for Deep Reinforcement Learning
Adam Stooke
Pieter Abbeel
OffRL
OnRL
25
133
0
07 Mar 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
86
731
0
02 Mar 2018
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Matthias Plappert
Marcin Andrychowicz
Alex Ray
Bob McGrew
Bowen Baker
...
Joshua Tobin
Maciek Chociej
Peter Welinder
Vikash Kumar
Wojciech Zaremba
24
557
0
26 Feb 2018
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Vitchyr H. Pong
S. Gu
Murtaza Dalal
Sergey Levine
OffRL
66
238
0
25 Feb 2018
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
29
686
0
18 Dec 2017
Previous
1
2
3
4
5
6
Next