ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.07670
  4. Cited By
Reducing Variance in Temporal-Difference Value Estimation via Ensemble
  of Deep Networks

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

16 September 2022
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
    OOD
ArXivPDFHTML

Papers citing "Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks"

10 / 10 papers shown
Title
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
34
0
0
22 Oct 2024
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim
Joonhun Lee
Namhoon Cho
Sungjun Han
Seungeon Baek
44
0
0
05 Aug 2024
Mixture of Experts in a Mixture of RL settings
Mixture of Experts in a Mixture of RL settings
Timon Willi
J. Obando-Ceron
Jakob Foerster
Karolina Dziugaite
Pablo Samuel Castro
MoE
46
7
0
26 Jun 2024
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement
  Learning
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning
Davide Corsi
Davide Camponogara
Alessandro Farinelli
OffRL
46
1
0
30 May 2024
The Curse of Diversity in Ensemble-Based Exploration
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Rameswar Panda
42
1
0
07 May 2024
REValueD: Regularised Ensemble Value-Decomposition for Factorisable
  Markov Decision Processes
REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes
David Ireland
Giovanni Montana
43
3
0
16 Jan 2024
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement
  Learning
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
25
6
0
09 Oct 2023
On the Importance of Exploration for Generalization in Reinforcement
  Learning
On the Importance of Exploration for Generalization in Reinforcement Learning
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCV
OffRL
32
20
0
08 Jun 2023
Ensemble Value Functions for Efficient Exploration in Multi-Agent
  Reinforcement Learning
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Lukas Schafer
Oliver Slumbers
Stephen Marcus McAleer
Yali Du
Stefano V. Albrecht
D. Mguni
79
7
0
07 Feb 2023
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch
  Size
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
33
14
0
20 Nov 2022
1