ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.06560
  4. Cited By
Deep Reinforcement Learning that Matters

Deep Reinforcement Learning that Matters

19 September 2017
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
    OffRL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning that Matters"

50 / 379 papers shown
Title
Discount Factor as a Regularizer in Reinforcement Learning
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
27
72
0
04 Jul 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement
  Learning
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulić
OffRL
30
27
0
23 Jun 2020
Reparameterized Variational Divergence Minimization for Stable Imitation
Reparameterized Variational Divergence Minimization for Stable Imitation
Dilip Arumugam
Debadeepta Dey
Alekh Agarwal
Asli Celikyilmaz
E. Nouri
W. Dolan
33
3
0
18 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using
  Deep Reinforcement Learning
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
40
74
0
16 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in
  Cooperative Tasks
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
26
220
0
14 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale
  Empirical Study
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
M. Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
31
214
0
10 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning
  Machines
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines
Callum Wilson
A. Riccardi
E. Minisci
11
4
0
04 Jun 2020
Novel Policy Seeking with Constrained Optimization
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
24
13
0
21 May 2020
Decentralized Deep Reinforcement Learning for a Distributed and Adaptive
  Locomotion Controller of a Hexapod Robot
Decentralized Deep Reinforcement Learning for a Distributed and Adaptive Locomotion Controller of a Hexapod Robot
M. Schilling
Kai Konen
F. Ohl
Timo Korthals
24
18
0
21 May 2020
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
25
83
0
20 May 2020
On the Value of Out-of-Distribution Testing: An Example of Goodhart's
  Law
On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law
Damien Teney
Kushal Kafle
Robik Shrestha
Ehsan Abbasnejad
Christopher Kanan
Anton Van Den Hengel
OODD
OOD
33
145
0
19 May 2020
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular
  Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Abdulelah S. Alshehri
R. Gani
Fengqi You
AI4CE
32
83
0
18 May 2020
Context-aware Dynamics Model for Generalization in Model-Based
  Reinforcement Learning
Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning
Kimin Lee
Younggyo Seo
Seunghyun Lee
Honglak Lee
Jinwoo Shin
45
125
0
14 May 2020
Guaranteeing Reproducibility in Deep Learning Competitions
Guaranteeing Reproducibility in Deep Learning Competitions
Brandon Houghton
Stephanie Milani
Nicholay Topin
William H. Guss
Katja Hofmann
Diego Perez-Liebana
Manuela Veloso
Ruslan Salakhutdinov
OOD
27
8
0
12 May 2020
Reinforcement Learning with Augmented Data
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
20
648
0
30 Apr 2020
CURL: Contrastive Unsupervised Representations for Reinforcement
  Learning
CURL: Contrastive Unsupervised Representations for Reinforcement Learning
A. Srinivas
Michael Laskin
Pieter Abbeel
SSL
DRL
OffRL
49
1,063
0
08 Apr 2020
Improving Reproducibility in Machine Learning Research (A Report from
  the NeurIPS 2019 Reproducibility Program)
Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)
Joelle Pineau
Philippe Vincent-Lamarre
Koustuv Sinha
V. Larivière
A. Beygelzimer
Florence dÁlché-Buc
E. Fox
Hugo Larochelle
26
358
0
27 Mar 2020
An empirical investigation of the challenges of real-world reinforcement
  learning
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
121
0
24 Mar 2020
Robust Deep Reinforcement Learning against Adversarial Perturbations on
  State Observations
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations
Huan Zhang
Hongge Chen
Chaowei Xiao
Bo-wen Li
Mingyan D. Liu
Duane S. Boning
Cho-Jui Hsieh
AAML
47
261
0
19 Mar 2020
Explore and Exploit with Heterotic Line Bundle Models
Explore and Exploit with Heterotic Line Bundle Models
Magdalena Larfors
Robin Schneider
41
38
0
10 Mar 2020
Stochastic Recursive Momentum for Policy Gradient Methods
Stochastic Recursive Momentum for Policy Gradient Methods
Huizhuo Yuan
Xiangru Lian
Ji Liu
Yuren Zhou
26
31
0
09 Mar 2020
Plannable Approximations to MDP Homomorphisms: Equivariance under
  Actions
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
Elise van der Pol
Thomas Kipf
F. Oliehoek
Max Welling
25
77
0
27 Feb 2020
Scalable Multi-Task Imitation Learning with Autonomous Improvement
Scalable Multi-Task Imitation Learning with Autonomous Improvement
Avi Singh
Eric Jang
A. Irpan
Daniel Kappler
Murtaza Dalal
Sergey Levine
Mohi Khansari
Chelsea Finn
53
35
0
25 Feb 2020
The Next Decade in AI: Four Steps Towards Robust Artificial Intelligence
The Next Decade in AI: Four Steps Towards Robust Artificial Intelligence
G. Marcus
VLM
32
355
0
14 Feb 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump
  Linear Systems
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
25
35
0
10 Feb 2020
Ready Policy One: World Building Through Active Learning
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
32
49
0
07 Feb 2020
Provably Efficient Online Hyperparameter Optimization with
  Population-Based Bandits
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits
Jack Parker-Holder
Vu Nguyen
Stephen J. Roberts
OffRL
75
83
0
06 Feb 2020
Stacked Auto Encoder Based Deep Reinforcement Learning for Online
  Resource Scheduling in Large-Scale MEC Networks
Stacked Auto Encoder Based Deep Reinforcement Learning for Online Resource Scheduling in Large-Scale MEC Networks
Feibo Jiang
Kezhi Wang
Li Dong
Cunhua Pan
Kun Yang
OffRL
28
39
0
24 Jan 2020
Lyceum: An efficient and scalable ecosystem for robot learning
Lyceum: An efficient and scalable ecosystem for robot learning
Colin Summers
Kendall Lowrey
Aravind Rajeswaran
S. Srinivasa
E. Todorov
24
18
0
21 Jan 2020
Towards GAN Benchmarks Which Require Generalization
Towards GAN Benchmarks Which Require Generalization
Ishaan Gulrajani
Colin Raffel
Luke Metz
24
57
0
10 Jan 2020
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for
  Reproducible Deep Reinforcement Learning
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning
Keng Wah Loon
L. Graesser
Milan Cvitkovic
OffRL
26
13
0
28 Dec 2019
Convolutional Neural Network-based Topology Optimization (CNN-TO) By
  Estimating Sensitivity of Compliance from Material Distribution
Convolutional Neural Network-based Topology Optimization (CNN-TO) By Estimating Sensitivity of Compliance from Material Distribution
Yusuke Takahashi
Yoshiro Suzuki
A. Todoroki
21
6
0
23 Dec 2019
Taming an autonomous surface vehicle for path following and collision
  avoidance using deep reinforcement learning
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning
Eivind Meyer
Haakon Robinson
Adil Rasheed
Omer San
33
65
0
18 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
29
26
0
13 Dec 2019
Learning to Reach Goals via Iterated Supervised Learning
Learning to Reach Goals via Iterated Supervised Learning
Dibya Ghosh
Abhishek Gupta
Ashwin Reddy
Justin Fu
Coline Devin
Benjamin Eysenbach
Sergey Levine
32
34
0
12 Dec 2019
Policy Optimization Reinforcement Learning with Entropy Regularization
Policy Optimization Reinforcement Learning with Entropy Regularization
Jingbin Liu
Xinyang Gu
Shuai Liu
28
4
0
02 Dec 2019
Empirical Study of Off-Policy Policy Evaluation for Reinforcement
  Learning
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
30
152
0
15 Nov 2019
Multi-Path Policy Optimization
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
18
2
0
11 Nov 2019
Experience Sharing Between Cooperative Reinforcement Learning Agents
Experience Sharing Between Cooperative Reinforcement Learning Agents
Lucas O. Souza
G. Ramos
C. Ralha
24
9
0
06 Nov 2019
DeepRacer: Educational Autonomous Racing Platform for Experimentation
  with Sim2Real Reinforcement Learning
DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning
Bharathan Balaji
S. Mallya
Sahika Genc
Saurabh Gupta
Leo Dirac
...
Yunzhe Tao
Brian Townsend
E. Calleja
Sunil Muralidhara
Dhanasekar Karuppasamy
23
56
0
05 Nov 2019
Paths Explored, Paths Omitted, Paths Obscured: Decision Points &
  Selective Reporting in End-to-End Data Analysis
Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis
Yang Liu
Tim Althoff
Jeffrey Heer
13
51
0
30 Oct 2019
Improving Sample Efficiency in Model-Free Reinforcement Learning from
  Images
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images
Denis Yarats
Amy Zhang
Ilya Kostrikov
Brandon Amos
Joelle Pineau
Rob Fergus
DRL
53
441
0
02 Oct 2019
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping
Cristian Bodnar
A. Li
Karol Hausman
P. Pastor
Mrinal Kalakrishnan
OffRL
28
50
0
01 Oct 2019
Meta-Q-Learning
Meta-Q-Learning
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
25
145
0
30 Sep 2019
Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with
  Contextualized Embeddings
Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings
Gregor Wiedemann
Steffen Remus
Avi Chawla
Chris Biemann
27
174
0
23 Sep 2019
Leveraging Human Guidance for Deep Reinforcement Learning Tasks
Leveraging Human Guidance for Deep Reinforcement Learning Tasks
Ruohan Zhang
F. Torabi
L. Guan
D. Ballard
Peter Stone
19
87
0
21 Sep 2019
DECoVaC: Design of Experiments with Controlled Variability Components
DECoVaC: Design of Experiments with Controlled Variability Components
Thomas Boquet
Laure Delisle
Denis Kochetkov
Nathan Schucher
Parmida Atighehchian
Boris N. Oreshkin
Julien Cornebise
30
1
0
21 Sep 2019
Wield: Systematic Reinforcement Learning With Progressive Randomization
Wield: Systematic Reinforcement Learning With Progressive Randomization
Michael Schaarschmidt
Kai Fricke
Eiko Yoneki
21
2
0
15 Sep 2019
Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label
  Classification
Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification
Jiawei Wu
Wenhan Xiong
William Yang Wang
23
69
0
09 Sep 2019
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the
  playing field
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
24
25
0
13 Aug 2019
Previous
12345678
Next