Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.03015
Cited By
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
7 July 2021
J. Luis
E. Crawley
B. Cameron
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research"
50 / 113 papers shown
Title
Unsupervised Control Through Non-Parametric Discriminative Rewards
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
Steven Hansen
Volodymyr Mnih
DRL
OffRL
SSL
96
178
0
28 Nov 2018
Meta-Learning for Multi-objective Reinforcement Learning
Xi Chen
Ali Ghadirzadeh
Mårten Björkman
Pablo G. Cámara
OffRL
63
55
0
08 Nov 2018
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
J. Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
Zhengxing Chen
Scott Fujimoto
85
139
0
01 Nov 2018
Applications of Deep Reinforcement Learning in Communications and Networking: A Survey
Nguyen Cong Luong
D. Hoang
Shimin Gong
Dusit Niyato
Ping Wang
Ying-Chang Liang
Dong In Kim
OffRL
88
1,442
0
18 Oct 2018
Optimizing Agent Behavior over Long Time Scales by Transporting Value
Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
M. Berk Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne
80
124
0
15 Oct 2018
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
72
755
0
05 Oct 2018
Learning Navigation Behaviors End-to-End with AutoRL
H. Chiang
Aleksandra Faust
Marek Fiser
Anthony G. Francis
126
235
0
26 Sep 2018
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Tom Zahavy
Matan Haroush
Nadav Merlis
D. Mankowitz
Shie Mannor
102
191
0
06 Sep 2018
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Jacob Buckman
Danijar Hafner
George Tucker
E. Brevdo
Honglak Lee
91
332
0
04 Jul 2018
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
119
728
0
03 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
87
221
0
20 Jun 2018
Learning Factorized Multimodal Representations
Yao-Hung Hubert Tsai
Paul Pu Liang
Amir Zadeh
Louis-Philippe Morency
Ruslan Salakhutdinov
DRL
112
409
0
16 Jun 2018
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
OffRL
139
532
0
14 Jun 2018
Deep Variational Reinforcement Learning for POMDPs
Maximilian Igl
L. Zintgraf
T. Le
Frank Wood
Shimon Whiteson
BDL
OffRL
71
262
0
06 Jun 2018
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
BDL
230
1,284
0
30 May 2018
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
86
544
0
28 May 2018
A Lyapunov-based Approach to Safe Reinforcement Learning
Yinlam Chow
Ofir Nachum
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
165
508
0
20 May 2018
Progress & Compress: A scalable framework for continual learning
Jonathan Richard Schwarz
Jelena Luketina
Wojciech M. Czarnecki
A. Grabska-Barwinska
Yee Whye Teh
Razvan Pascanu
R. Hadsell
CLL
129
889
0
16 May 2018
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
98
480
0
23 Apr 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
166
1,676
0
30 Mar 2018
Unsupervised Predictive Memory in a Goal-Directed Agent
Greg Wayne
Chia-Chun Hung
David Amos
M. Berk Mirza
Arun Ahuja
...
David Silver
Koray Kavukcuoglu
M. Botvinick
Demis Hassabis
Timothy Lillicrap
86
192
0
28 Mar 2018
Soft-Robust Actor-Critic Policy-Gradient
E. Derman
D. Mankowitz
Timothy A. Mann
Shie Mannor
61
64
0
11 Mar 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
151
741
0
02 Mar 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
90
449
0
28 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
113
1,089
0
16 Feb 2018
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
105
227
0
13 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
247
1,607
0
05 Feb 2018
Safe Exploration in Continuous Action Spaces
Gal Dalal
Krishnamurthy Dvijotham
Matej Vecerík
Todd Hester
Cosmin Paduraru
Yuval Tassa
53
443
0
26 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
Igor Adamski
R. Adamski
T. Grel
Adam Jedrych
Kamil Kaczmarek
Henryk Michalewski
OffRL
96
37
0
09 Jan 2018
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
150
1,144
0
02 Jan 2018
Deep Reinforcement Learning for De-Novo Drug Design
Mariya Popova
Olexandr Isayev
Alexander Tropsha
98
1,033
0
29 Nov 2017
Divide-and-Conquer Reinforcement Learning
Dibya Ghosh
Avi Singh
Aravind Rajeswaran
Vikash Kumar
Sergey Levine
OffRL
97
127
0
27 Nov 2017
Learning to Compare: Relation Network for Few-Shot Learning
Flood Sung
Yongxin Yang
Li Zhang
Tao Xiang
Philip Torr
Timothy M. Hospedales
314
4,054
0
16 Nov 2017
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning
Arun Mallya
Svetlana Lazebnik
CLL
111
1,309
0
15 Nov 2017
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
131
757
0
30 Oct 2017
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
Xue Bin Peng
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
117
1,368
0
18 Oct 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
145
1,963
0
19 Sep 2017
Meta-SGD: Learning to Learn Quickly for Few-Shot Learning
Zhenguo Li
Fengwei Zhou
Fei Chen
Hang Li
101
1,121
0
31 Jul 2017
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
103
1,506
0
21 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
568
19,296
0
20 Jul 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
162
4,520
0
07 Jun 2017
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
134
1,335
0
30 May 2017
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
156
2,090
0
24 May 2017
Safe Model-based Reinforcement Learning with Stability Guarantees
Felix Berkenkamp
M. Turchetta
Angela P. Schoellig
Andreas Krause
191
853
0
23 May 2017
Molecular De Novo Design through Deep Reinforcement Learning
Marcus Olivecrona
T. Blaschke
Ola Engkvist
Hongming Chen
BDL
150
1,019
0
25 Apr 2017
A Reinforcement Learning Approach to Weaning of Mechanical Ventilation in Intensive Care Units
Niranjani Prasad
Li-Fang Cheng
C. Chivers
Michael Draugelis
Barbara E. Engelhardt
OffRL
66
168
0
20 Apr 2017
One-Shot Imitation Learning
Yan Duan
Marcin Andrychowicz
Bradly C. Stadie
Jonathan Ho
Jonas Schneider
Ilya Sutskever
Pieter Abbeel
Wojciech Zaremba
OffRL
86
689
0
21 Mar 2017
Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World
Joshua Tobin
Rachel Fong
Alex Ray
Jonas Schneider
Wojciech Zaremba
Pieter Abbeel
267
2,973
0
20 Mar 2017
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability
Shayegan Omidshafiei
Jason Pazis
Chris Amato
Jonathan P. How
J. Vian
152
499
0
17 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
833
11,952
0
09 Mar 2017
Previous
1
2
3
Next