SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

9 July 2020
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
    OffRL

Papers citing "SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning"

50 / 128 papers shown
Generalization in Monitored Markov Decision Processes (Mon-MDPs)
Montaser Mohammedalamen
Michael Bowling
21
0
0
13 May 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Z. Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
Ilya Kolmanovsky
Dimitar Filev
54
0
0
31 Mar 2025
Entropy-regularized Gradient Estimators for Approximate Bayesian Inference
Jasmeet Kaur
BDL
UQCV
75
0
0
15 Mar 2025
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Natinael Solomon Neggatu
Jeremie Houssineau
Giovanni Montana
OffRL
OnRL
75
0
0
15 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
153
2
0
10 Mar 2025
Constrained Intrinsic Motivation for Reinforcement Learning
Xiang Zheng
Jie Zhang
Chao Shen
Cong Wang
34
1
0
12 Jul 2024
Model-Free Active Exploration in Reinforcement Learning
Alessio Russo
Alexandre Proutière
OffRL
23
2
0
30 Jun 2024
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors
Emma Cramer
Bernd Frauenknecht
Ramil Sabirov
Sebastian Trimpe
OffRL
OnRL
52
3
0
28 Jun 2024
Boosting Soft Q-Learning by Bounding
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
Rahul V. Kulkarni
OffRL
56
2
0
26 Jun 2024
A Rate-Distortion View of Uncertainty Quantification
Ifigeneia Apostolopoulou
Benjamin Eysenbach
Frank Nielsen
Artur Dubrawski
UQCV
46
2
0
16 Jun 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
56
2
0
31 May 2024
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Marcel Hussing
Michael Kearns
Aaron Roth
S. B. Sengupta
Jessica Sorrell
43
0
0
27 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
40
3
0
25 May 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
Yiwen Zhu
Jinyi Liu
Wenya Wei
Qianyi Fu
Yujing Hu
Zhou Fang
Bo An
Jianye Hao
Tangjie Lv
Changjie Fan
34
3
0
14 May 2024
Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning
M. Khan
Syed Hammad Ahmed
G. Sukthankar
33
0
0
14 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. D'Oro
Evgenii Nikishin
Rameswar Panda
42
1
0
07 May 2024
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
Xudong Yu
Chenjia Bai
Hongyi Guo
Changhong Wang
Zhen Wang
OffRL
39
0
0
09 Apr 2024
EnQuery: Ensemble Policies for Diverse Query-Generation in Preference Alignment of Robot Navigation
Jorge de Heuvel
Florian Seiler
Maren Bennewitz
35
2
0
07 Apr 2024
Waypoint-Based Reinforcement Learning for Robot Manipulation Tasks
Shaunak A. Mehta
Soheil Habibian
Dylan P. Losey
SSL
73
2
0
20 Mar 2024
Multi-Fidelity Reinforcement Learning for Time-Optimal Quadrotor Re-planning
Gilhyun Ryou
Geoffrey Wang
S. Karaman
56
3
0
13 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
36
3
0
09 Mar 2024
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
M. Ostaszewski
Marek Cygan
34
0
0
01 Mar 2024
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
29
2
0
14 Feb 2024
Reinforcement Learning from Bagged Reward
Yuting Tang
Xin-Qiang Cai
Yao-Xiang Ding
Qiyu Wu
Guoqing Liu
Masashi Sugiyama
OffRL
36
0
0
06 Feb 2024
SLIM: Skill Learning with Multiple Critics
David Emukpere
Bingbing Wu
Julien Perez
J. Renders
20
1
0
01 Feb 2024
REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes
David Ireland
Giovanni Montana
43
3
0
16 Jan 2024
SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning
Dohyeok Lee
Seung Han
Taehyun Cho
Jungwoo Lee
OffRL
38
2
0
06 Jan 2024
A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
UD
23
2
0
05 Jan 2024
A Survey Analyzing Generalization in Deep Reinforcement Learning
Ezgi Korkmaz
OffRL
32
2
0
04 Jan 2024
Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning?
Gunshi Gupta
Tim G. J. Rudner
R. McAllister
Adrien Gaidon
Y. Gal
OffRL
56
3
0
28 Dec 2023
Active Reinforcement Learning for Robust Building Control
Doseok Jang
Larry Yan
Lucas Spangher
C. Spanos
28
1
0
16 Dec 2023
Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
OffRL
38
3
0
07 Dec 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Bernd Frauenknecht
Tobias Ehlgen
Sebastian Trimpe
42
3
0
30 Nov 2023
Fuzzy Ensembles of Reinforcement Learning Policies for Robotic Systems with Varied Parameters
A. Haddad
Mohammed B. Mohiuddin
I. Boiko
Yahya Zweiri
33
0
0
08 Nov 2023
Sample-Efficient and Safe Deep Reinforcement Learning via Reset Deep Ensemble Agents
Woojun Kim
Yongjae Shin
Jongeui Park
Young-Jin Sung
OnRL
18
6
0
31 Oct 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
35
1
0
30 Oct 2023
Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion
Laura M. Smith
Yunhao Cao
Sergey Levine
OffRL
32
19
0
26 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
38
0
0
21 Oct 2023
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
Chao Li
Chen Gong
Qiang He
Xinwen Hou
30
0
0
17 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
28
6
0
09 Oct 2023
Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency
Francisco Roldan Sanchez
Qiang Wang
David Córdova Bulens
Kevin McGuinness
Stephen J. Redmond
Noel E. O'Connor
OffRL
OnRL
26
1
0
03 Oct 2023
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
Xiaoyu Wen
Xudong Yu
Rui Yang
Chenjia Bai
Zhen Wang
OffRL
OnRL
28
10
0
29 Sep 2023
Uncertainty-driven Exploration Strategies for Online Grasp Learning
Yitian Shi
Philipp Schillinger
Miroslav Gabriel
Alexander Kuss
Zohar Feldman
Hanna Ziesche
Ngo Anh Vien
OffRL
OnRL
21
4
0
21 Sep 2023
Hybrid Control Policy for Artificial Pancreas via Ensemble Deep Reinforcement Learning
Wenzhou Lv
Tianyu Wu
Luolin Xiong
Liang Wu
Jianglei Zhou
Yang Tang
Feng Qian
30
2
0
13 Jul 2023
Thompson sampling for improved exploration in GFlowNets
Jarrid Rector-Brooks
Kanika Madan
Moksh Jain
Maksym Korablyov
Cheng-Hao Liu
Sarath Chandar
Nikolay Malkin
Yoshua Bengio
26
24
0
30 Jun 2023
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback
Hang Wang
Sen Lin
Junshan Zhang
21
19
0
20 Jun 2023
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning
Siyuan Guo
Yanchao Sun
Jifeng Hu
Sili Huang
Hechang Chen
Haiyin Piao
Lichao Sun
Yi-Ju Chang
OffRL
OnRL
33
7
0
13 Jun 2023
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Kai-Wen Zhao
Yi Ma
Jianye Hao
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
OnRL
20
12
0
12 Jun 2023
On the Importance of Exploration for Generalization in Reinforcement Learning
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCV
OffRL
32
20
0
08 Jun 2023
infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information
Jaehyung Kim
Yekyung Kim
Karin de Langis
Jinwoo Shin
Dongyeop Kang
17
1
0
30 May 2023