2408.12970
Cited By
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning
23 August 2024
Zhongjian Qiao
Jiafei Lyu
Kechen Jiao
Qi Liu
Xiu Li
OffRL
Papers citing
"SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning"
12 / 12 papers shown
Title
SEABO: A Simple Search-Based Method for Offline Imitation Learning
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
78
10
0
06 Feb 2024
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
127
822
0
12 Jun 2021
On Feature Collapse and Deep Kernel Learning for Single Forward Pass Uncertainty
Joost R. van Amersfoort
Lewis Smith
Andrew Jesson
Oscar Key
Y. Gal
UQCV
69
104
0
22 Feb 2021
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
137
1,815
0
08 Jun 2020
MOPO: Model-based Offline Policy Optimization
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
76
770
0
27 May 2020
MOReL: Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
96
672
0
12 May 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
221
1,368
0
15 Apr 2020
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
129
1,060
0
03 Jun 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
226
1,613
0
07 Dec 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
311
8,352
0
04 Jan 2018
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
257
3,723
0
28 Feb 2017
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.8K
150,115
0
22 Dec 2014