Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.06110
Cited By
Dopamine: A Research Framework for Deep Reinforcement Learning
14 December 2018
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dopamine: A Research Framework for Deep Reinforcement Learning"
10 / 60 papers shown
Title
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
104
292
0
16 Oct 2019
Benchmarking Batch Deep Reinforcement Learning Algorithms
Shih-Han Chou
Wen-Yen Chang
W. Hsu
Jianlong Fu
OffRL
18
182
0
03 Oct 2019
RecSim: A Configurable Simulation Platform for Recommender Systems
Eugene Ie
Chih-Wei Hsu
Martin Mladenov
Vihan Jain
Sanmit Narvekar
Jing Wang
Rui Wu
Craig Boutilier
30
178
0
11 Sep 2019
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch
Adam Stooke
Pieter Abbeel
OffRL
24
96
0
03 Sep 2019
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
24
25
0
13 Aug 2019
Behaviour Suite for Reinforcement Learning
Ian Osband
Yotam Doron
Matteo Hessel
John Aslanides
Eren Sezener
...
Satinder Singh
Benjamin Van Roy
R. Sutton
David Silver
H. V. Hasselt
OffRL
32
178
0
09 Aug 2019
Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology
Eugene Ie
Vihan Jain
Jing Wang
Sanmit Narvekar
Ritesh Agarwal
...
Vince Gatto
Paul Covington
Jim McFadden
Tushar Chandra
Craig Boutilier
OffRL
24
69
0
29 May 2019
Lessons from Contextual Bandit Learning in a Customer Support Bot
Nikos Karampatziakis
Sebastian Kochman
Jade Huang
Paul Mineiro
Kathy Osborne
Weizhu Chen
15
6
0
06 May 2019
Hyperbolic Discounting and Learning over Multiple Horizons
W. Fedus
Carles Gelada
Yoshua Bengio
Marc G. Bellemare
Hugo Larochelle
32
105
0
19 Feb 2019
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
45
551
0
12 Oct 2018
Previous
1
2