The Difficulty of Passive Learning in Deep Reinforcement Learning


26 October 2021
Georg Ostrovski
Pablo Samuel Castro
Will Dabney
    OffRL

Papers citing "The Difficulty of Passive Learning in Deep Reinforcement Learning"

38 / 38 papers shown
Eau De Q-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning
Théo Vincent
Tim Lukas Faust
Yogesh Tripathi
Jan Peters
Carlo D'Eramo
42
0
0
03 Mar 2025
CALE: Continuous Arcade Learning Environment
Jesse Farebrother
Pablo Samuel Castro
ELM
38
0
0
31 Oct 2024
Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach
Henrique Donâncio
Antoine Barrier
Leah F. South
Florence Forbes
28
0
0
16 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
44
1
0
11 Oct 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
45
1
0
07 Sep 2024
Mixture of Experts in a Mixture of RL settings
Timon Willi
J. Obando-Ceron
Jakob Foerster
Gintare Karolina Dziugaite
Pablo Samuel Castro
MoE
49
7
0
26 Jun 2024
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Aaron Courville
Pablo Samuel Castro
48
7
0
25 Jun 2024
Aligning Agents like Large Language Models
Adam Jelley
Yuhan Cao
Dave Bignell
Sam Devlin
Tabish Rashid
LM&Ro
49
1
0
06 Jun 2024
Understanding the performance gap between online and offline alignment algorithms
Yunhao Tang
Daniel Guo
Zeyu Zheng
Daniele Calandriello
Yuan Cao
...
Rémi Munos
Bernardo Avila-Pires
Michal Valko
Yong Cheng
Will Dabney
OffRL
OnRL
27
61
0
14 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. D'Oro
Evgenii Nikishin
Aaron Courville
42
1
0
07 May 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
30
0
0
31 Mar 2024
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Aaron Courville
Pablo Samuel Castro
OffRL
48
18
0
19 Feb 2024
Mixtures of Experts Unlock Parameter Scaling for Deep RL
J. Obando-Ceron
Ghada Sokar
Timon Willi
Clare Lyle
Jesse Farebrother
Jakob N. Foerster
Gintare Karolina Dziugaite
Doina Precup
Pablo Samuel Castro
63
31
0
13 Feb 2024
Synergizing Quality-Diversity with Descriptor-Conditioned Reinforcement Learning
Maxence Faldor
Félix Chalumeau
Manon Flageat
Antoine Cully
35
2
0
10 Dec 2023
The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
92
10
0
10 Dec 2023
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates
Nicholas Corrado
Josiah P. Hanna
29
5
0
26 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
34
1
0
12 Oct 2023
Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Miguel Suau
M. Spaan
F. Oliehoek
CML
27
4
0
04 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Aaron Courville
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
85
0
30 May 2023
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Kang Xu
Chenjia Bai
Xiaoteng Ma
Dong Wang
Bingyan Zhao
Zhen Wang
Xuelong Li
Wei Li
37
15
0
28 May 2023
Passive learning of active causal strategies in agents and language models
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Ishita Dasgupta
A. Nam
Jane X. Wang
29
15
0
25 May 2023
Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning
P. Emedom-Nnamdi
A. Friesen
Bobak Shahriari
Nando de Freitas
Matthew W. Hoffman
OffRL
28
0
0
05 May 2023
Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories
Li-Cheng Lan
Huan Zhang
Cho-Jui Hsieh
OODD
26
9
0
26 Apr 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
32
31
0
20 Apr 2023
MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy
Maxence Faldor
Félix Chalumeau
Manon Flageat
Antoine Cully
35
18
0
07 Mar 2023
Conservative State Value Estimation for Offline Reinforcement Learning
Liting Chen
Jie Yan
Zhengdao Shao
Lu Wang
Qingwei Lin
Saravan Rajmohan
Thomas Moscibroda
Dongmei Zhang
OffRL
26
6
0
14 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals
Akhil Bagaria
Ray Jiang
Ramana Kumar
Tom Schaul
LRM
11
2
0
09 Feb 2023
Efficient Reinforcement Learning Through Trajectory Generation
Wenqi Cui
Linbin Huang
Weiwei Yang
Baosen Zhang
OffRL
31
0
0
30 Nov 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
18
1
0
20 Oct 2022
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
31
8
0
15 Oct 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
24
16
0
16 Sep 2022
An Empirical Study of Implicit Regularization in Deep Offline RL
Çağlar Gülçehre
Srivatsan Srinivasan
Jakub Sygnowski
Georg Ostrovski
Mehrdad Farajtabar
Matt Hoffman
Razvan Pascanu
Arnaud Doucet
OffRL
14
16
0
05 Jul 2022
Learning Dynamics and Generalization in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
Marta Z. Kwiatkowska
Y. Gal
OOD
OffRL
30
12
0
05 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
The Phenomenon of Policy Churn
Tom Schaul
André Barreto
John Quan
Georg Ostrovski
31
26
0
01 Jun 2022
The Curse of Passive Data Collection in Batch Reinforcement Learning
Chenjun Xiao
Ilbin Lee
Bo Dai
Dale Schuurmans
Csaba Szepesvári
OffRL
25
1
0
18 Jun 2021
Continuous Doubly Constrained Batch Reinforcement Learning
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
OffRL
204
27
0
18 Feb 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
343
1,963
0
04 May 2020