ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.13703
  4. Cited By
Why So Pessimistic? Estimating Uncertainties for Offline RL through
  Ensembles, and Why Their Independence Matters

Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters

27 May 2022
Seyed Kamyar Seyed Ghasemipour
S. Gu
Ofir Nachum
    OffRL
ArXivPDFHTML

Papers citing "Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters"

20 / 20 papers shown
Title
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Zheli Xiong
49
0
0
23 Feb 2025
An Enhanced-State Reinforcement Learning Algorithm for Multi-Task Fusion in Large-Scale Recommender Systems
An Enhanced-State Reinforcement Learning Algorithm for Multi-Task Fusion in Large-Scale Recommender Systems
Peng Liu
Jiawei Zhu
Cong Xu
Ming Zhao
Bin Wang
31
1
0
18 Sep 2024
The Role of Deep Learning Regularizations on Actors in Offline RL
The Role of Deep Learning Regularizations on Actors in Offline RL
Denis Tarasov
Anja Surina
Çağlar Gülçehre
OffRL
AI4CE
53
1
0
11 Sep 2024
Is Value Functions Estimation with Classification Plug-and-play for
  Offline Reinforcement Learning?
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?
Denis Tarasov
Kirill Brilliantov
Dmitrii Kharlapenko
OffRL
32
2
0
10 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
79
2
0
07 Jun 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
56
2
0
31 May 2024
State-Constrained Offline Reinforcement Learning
State-Constrained Offline Reinforcement Learning
Charles A. Hepburn
Yue Jin
Giovanni Montana
OffRL
37
0
0
23 May 2024
Ensemble Successor Representations for Task Generalization in
  Offline-to-Online Reinforcement Learning
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
Changhong Wang
Xudong Yu
Chenjia Bai
Qiaosheng Zhang
Zhen Wang
40
1
0
12 May 2024
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
33
36
0
16 May 2023
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning
Tongzhou Wang
Antonio Torralba
Phillip Isola
Amy Zhang
OffRL
32
31
0
03 Apr 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function
  Approximation
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Thanh Nguyen-Tang
R. Arora
OffRL
46
5
0
24 Feb 2023
Anti-Exploration by Random Network Distillation
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
38
24
0
31 Jan 2023
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch
  Size
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
33
14
0
20 Nov 2022
Controlling Commercial Cooling Systems Using Reinforcement Learning
Controlling Commercial Cooling Systems Using Reinforcement Learning
Jerry Luo
Cosmin Paduraru
Octavian Voicu
Yuri Chervonyi
Scott A. Munns
...
Sims Witherspoon
D. Parish
Peter Dolan
Chenyu Zhao
D. Mankowitz
OffRL
AI4CE
28
25
0
11 Nov 2022
Offline RL Policies Should be Trained to be Adaptive
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
35
45
0
05 Jul 2022
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
843
0
12 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified
  Q-Ensemble
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
105
262
0
04 Oct 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline
  and Online RL
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
Simple and Scalable Predictive Uncertainty Estimation using Deep
  Ensembles
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
276
5,661
0
05 Dec 2016
1