Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.06560
Cited By
Deep Reinforcement Learning that Matters
19 September 2017
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning that Matters"
50 / 379 papers shown
Title
Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting
Berend Gort
Xiao-Yang Liu
Xinghang Sun
Jiechao Gao
Shuai Chen
Chris Wang
32
13
0
12 Sep 2022
Unifying Generative Models with GFlowNets and Beyond
Dinghuai Zhang
Ricky T. Q. Chen
Nikolay Malkin
Yoshua Bengio
BDL
AI4CE
59
25
0
06 Sep 2022
Project proposal: A modular reinforcement learning based automated theorem prover
Boris Shminke
26
1
0
06 Sep 2022
Distributed Ensembles of Reinforcement Learning Agents for Electricity Control
Pierrick Pochelu
S. Petiton
B. Conche
AI4CE
31
2
0
30 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
28
3
0
23 Aug 2022
Learning to Generalize with Object-centric Agents in the Open World Survival Game Crafter
Aleksandar Stanić
Yujin Tang
David R Ha
Jürgen Schmidhuber
ELM
29
13
0
05 Aug 2022
Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows
Michael Xu
Abinash Kumar
J. Lebeau
18
7
0
04 Aug 2022
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
Han Li
Changsheng Li
Kaituo Feng
Ye Yuan
Guoren Wang
H. Zha
34
13
0
22 Jul 2022
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
31
15
0
19 Jul 2022
Neural Color Operators for Sequential Image Retouching
Yili Wang
Xin Li
K. Xu
Dongliang He
Qi Zhang
Fu Li
Errui Ding
30
14
0
17 Jul 2022
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
32
2
0
28 Jun 2022
Meta Reinforcement Learning with Finite Training Tasks -- a Density Estimation Approach
Zohar Rimon
Aviv Tamar
Gilad Adler
OOD
OffRL
36
8
0
21 Jun 2022
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Byungchan Ko
Jungseul Ok
OnRL
27
5
0
01 Jun 2022
Comparing interpretation methods in mental state decoding analyses with deep learning models
A. Thomas
Christopher Ré
R. Poldrack
AI4CE
39
2
0
31 May 2022
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning
Yinglun Xu
Qi Zeng
Gagandeep Singh
AAML
40
6
0
30 May 2022
Automated Dynamic Algorithm Configuration
Steven Adriaensen
André Biedenkapp
Gresa Shala
Noor H. Awad
Theresa Eimer
Marius Lindauer
Frank Hutter
34
37
0
27 May 2022
Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization
Marco Pleines
Matthias Pallasch
F. Zimmer
Mike Preuss
26
14
0
23 May 2022
Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language
Iou-Jen Liu
Xingdi Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
Alex Schwing
RALM
21
12
0
12 May 2022
Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach
Jiaping Xiao
Phumrapee Pisutsin
Mir Feroskhan
27
16
0
26 Apr 2022
Understanding and Preventing Capacity Loss in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
CLL
41
110
0
20 Apr 2022
deep-significance - Easy and Meaningful Statistical Significance Testing in the Age of Neural Networks
Dennis Ulmer
Christian Hardmeier
J. Frellsen
48
42
0
14 Apr 2022
A Visual Navigation Perspective for Category-Level Object Pose Estimation
Jiaxin Guo
Fangxun Zhong
R. Xiong
Yunhui Liu
Yue Wang
Yiyi Liao
OCL
19
6
0
25 Mar 2022
MetaMorph: Learning Universal Controllers with Transformers
Agrim Gupta
Linxi Fan
Surya Ganguli
Li Fei-Fei
LM&Ro
16
90
0
22 Mar 2022
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
15
15
0
15 Mar 2022
RB2: Robotic Manipulation Benchmarking with a Twist
Sudeep Dasari
Jianren Wang
Joyce Hong
Shikhar Bahl
Yixin Lin
...
David Held
Lerrel Pinto
Deepak Pathak
Vikash Kumar
Abhi Gupta
32
27
0
15 Mar 2022
Auto-FedRL: Federated Hyperparameter Optimization for Multi-institutional Medical Image Segmentation
Pengfei Guo
Dong Yang
Ali Hatamizadeh
An Xu
Ziyue Xu
...
F. Patella
Elvira Stellato
G. Carrafiello
Vishal M. Patel
H. Roth
OOD
FedML
28
32
0
12 Mar 2022
Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control
L. D. Natale
B. Svetozarevic
Philipp Heer
Colin N. Jones
OffRL
AI4CE
40
6
0
10 Mar 2022
Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation
Alex Long
Alan Blair
H. V. Hoof
26
3
0
07 Mar 2022
Addressing Randomness in Evaluation Protocols for Out-of-Distribution Detection
Konstantin Kirchheim
Tim Gonschorek
F. Ortmeier
OODD
36
2
0
01 Mar 2022
Machine Learning Empowered Intelligent Data Center Networking: A Survey
Bo-wen Li
Ting Wang
Peng Yang
Mingsong Chen
Shui Yu
Mounir Hamdi
AI4CE
21
4
0
28 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
38
9
0
23 Feb 2022
Cellular Network Capacity and Coverage Enhancement with MDT Data and Deep Reinforcement Learning
Marco Skocaj
Lorenzo Mario Amorosa
G. Ghinamo
Giuliano Muratore
Davide Micheli
F. Zabini
Roberto Verdone
21
13
0
22 Feb 2022
Myriad: a real-world testbed to bridge trajectory optimization and deep learning
Nikolaus H. R. Howe
Simon Dufort-Labbé
Nitarshan Rajkumar
Pierre-Luc Bacon
32
5
0
22 Feb 2022
Sequential Bayesian experimental designs via reinforcement learning
Hikaru Asano
OffRL
18
0
0
14 Feb 2022
Autonomous Drone Swarm Navigation and Multi-target Tracking in 3D Environments with Dynamic Obstacles
Suleman Qamar
Dr. Saddam Hussain Khan
Muhammad Arif Arshad
Maryam Qamar
Asifullah Khan
29
16
0
13 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
36
29
0
10 Feb 2022
A Ranking Game for Imitation Learning
Harshit S. Sikchi
Akanksha Saran
Wonjoon Goo
S. Niekum
OffRL
30
22
0
07 Feb 2022
Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm Configuration
André Biedenkapp
Nguyen Dang
Martin S. Krejca
Frank Hutter
Carola Doerr
34
8
0
07 Feb 2022
Towards Training Reproducible Deep Learning Models
Boyuan Chen
Mingzhi Wen
Yong Shi
Dayi Lin
Gopi Krishnan Rajbahadur
Zhen Ming
Z. Jiang
SyDa
23
37
0
04 Feb 2022
Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions
Aaron Mishkin
Arda Sahiner
Mert Pilanci
OffRL
77
30
0
02 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
David Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation
Martín Bertrán
Walter A. Talbott
Nitish Srivastava
J. Susskind
45
3
0
28 Jan 2022
Hyperparameter Tuning for Deep Reinforcement Learning Applications
M. Kiran
Melis Ozyildirim
40
22
0
26 Jan 2022
Reproducibility in Learning
R. Impagliazzo
Rex Lei
T. Pitassi
Jessica Sorrell
32
43
0
20 Jan 2022
SmartDet: Context-Aware Dynamic Control of Edge Task Offloading for Mobile Object Detection
Davide Callegaro
Francesco Restuccia
Marco Levorato
24
3
0
11 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Mirror Learning: A Unifying Framework of Policy Optimisation
J. Kuba
Christian Schroeder de Witt
Jakob N. Foerster
29
24
0
07 Jan 2022
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Vincent Mai
Kaustubh Mani
Liam Paull
38
34
0
05 Jan 2022
Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation
Mohammad Salimibeni
Arash Mohammadi
Parvin Malekzadeh
Konstantinos N. Plataniotis
18
5
0
30 Dec 2021
Parallelized and Randomized Adversarial Imitation Learning for Safety-Critical Self-Driving Vehicles
Won Joon Yun
Myungjae Shin
Soyi Jung
S. Kwon
Joongheon Kim
24
5
0
26 Dec 2021
Previous
1
2
3
4
5
6
7
8
Next