2405.04342
The Curse of Diversity in Ensemble-Based Exploration
7 May 2024
Zhixuan Lin
P. D'Oro
Evgenii Nikishin
Rameswar Panda
Papers citing
"The Curse of Diversity in Ensemble-Based Exploration"
28 / 28 papers shown
Title
BRExIt: On Opponent Modelling in Expert Iteration
Daniel Hernández
Hendrik Baier
Michael Kaisers
35
2
0
31 May 2022
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Aviral Kumar
Rishabh Agarwal
Tengyu Ma
Aaron Courville
George Tucker
Sergey Levine
OffRL
74
69
0
09 Dec 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
George Andriopoulos
OffRL
33
8
0
17 Nov 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
161
278
0
04 Oct 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
118
671
0
30 Aug 2021
Ensemble Bootstrapping for Q-Learning
Oren Peer
Chen Tessler
Nadav Merlis
Ron Meir
76
42
0
28 Feb 2021
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
88
318
0
12 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
56
203
0
09 Jul 2020
Peer Collaborative Learning for Online Knowledge Distillation
Guile Wu
S. Gong
FedML
44
129
0
07 Jun 2020
The Value-Improvement Path: Towards Better Representations for Reinforcement Learning
Will Dabney
André Barreto
Mark Rowland
Robert Dadashi
John Quan
Marc G. Bellemare
David Silver
59
67
0
03 Jun 2020
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Qingfeng Lan
Yangchen Pan
Alona Fyshe
Martha White
63
179
0
16 Feb 2020
Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing
Ge Liu
Rui Wu
Heng-Tze Cheng
Jing Wang
Jayden Ooi
Lihong Li
Ang Li
Wai Lok Sibon Li
Craig Boutilier
Ed H. Chi
OffRL
11
4
0
12 Feb 2020
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
Zhang-Wei Hong
P. Nagarajan
Guilherme J. Maeda
OffRL
43
4
0
01 Feb 2020
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
67
68
0
25 Sep 2019
Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning
Bilal Kartal
Pablo Hernandez-Leal
Matthew E. Taylor
145
29
0
24 Jul 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
132
1,060
0
03 Jun 2019
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
136
2,445
0
13 Dec 2018
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
234
1,613
0
07 Dec 2018
Model-Ensemble Trust-Region Policy Optimization
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
84
452
0
28 Feb 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
180
5,187
0
26 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
220
1,600
0
05 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
311
8,352
0
04 Jan 2018
Distributional Reinforcement Learning with Quantile Regression
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
92
760
0
27 Oct 2017
Deep Mutual Learning
Ying Zhang
Tao Xiang
Timothy M. Hospedales
Huchuan Lu
FedML
151
1,653
0
01 Jun 2017
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
106
1,228
0
16 Nov 2016
Learning to Navigate in Complex Environments
Piotr Wojciech Mirowski
Razvan Pascanu
Fabio Viola
Hubert Soyer
Andy Ballard
...
Ross Goroshin
Laurent Sifre
Koray Kavukcuoglu
D. Kumaran
R. Hadsell
107
880
0
11 Nov 2016
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
220
3,789
0
18 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
170
7,641
0
22 Sep 2015