2007.09055
Hyperparameter Selection for Offline Reinforcement Learning
17 July 2020
T. Paine
Cosmin Paduraru
Andrea Michi
Çağlar Gülçehre
Konrad Zolna
Alexander Novikov
Ziyun Wang
Nando de Freitas
GP
OffRL
Papers citing
"Hyperparameter Selection for Offline Reinforcement Learning"
37 / 37 papers shown
Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems
Rakesh Nadig
Vamanan Arulchelvan
Rahul Bera
Taha Shahroodi
Gagandeep Singh
Mohammad Sadrosadati
Jisung Park
Onur Mutlu
65
0
0
26 Mar 2025
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
Pai Liu
Lingfeng Zhao
Shivangi Agarwal
Jinghan Liu
Audrey Huang
P. Amortila
Nan Jiang
OODD
OffRL
96
0
0
11 Feb 2025
AgentForge: A Flexible Low-Code Platform for Reinforcement Learning Agent Design
Francisco Erivaldo Fernandes Junior
Antti Oulasvirta
119
0
0
25 Oct 2024
Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It
Yuta Saito
Masahiro Nomura
OffRL
39
2
0
23 Apr 2024
Robustness of Algorithms for Causal Structure Learning to Hyperparameter Choice
Damian Machlanski
Spyridon Samothrakis
Paul Clarke
CML
36
1
0
27 Oct 2023
Active Policy Improvement from Multiple Black-box Oracles
Xuefeng Liu
Takuma Yoneda
Chaoqi Wang
Matthew R. Walter
Yuxin Chen
31
8
0
17 Jun 2023
Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions
Audrey Huang
Nan Jiang
OffRL
45
9
0
27 Oct 2022
Hierarchical reinforcement learning for in-hand robotic manipulation using Davenport chained rotations
Francisco Roldan Sanchez
Qiang-qiang Wang
David Córdova Bulens
Kevin McGuinness
Stephen J. Redmond
Noel E. O'Connor
18
1
0
03 Oct 2022
Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training
Gang Chen
Victoria Huang
OffRL
19
0
0
29 Sep 2022
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
24
66
0
20 Jul 2022
Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning
David Brandfonbrener
Rémi Tachet des Combes
Romain Laroche
OffRL
29
5
0
02 Jun 2022
Offline Policy Comparison with Confidence: Benchmarks and Baselines
Anurag Koul
Mariano Phielipp
Alan Fern
OffRL
20
0
0
22 May 2022
User-Interactive Offline Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
23
11
0
21 May 2022
ARLO: A Framework for Automated Reinforcement Learning
Marco Mussi
Davide Lombarda
Alberto Maria Metelli
F. Trovò
Marcello Restelli
OffRL
28
4
0
20 May 2022
Sibyl: Adaptive and Extensible Data Placement in Hybrid Storage Systems Using Online Reinforcement Learning
Gagandeep Singh
Rakesh Nadig
Jisung Park
Rahul Bera
Nastaran Hajinazar
D. Novo
Juan Gómez Luna
S. Stuijk
Henk Corporaal
O. Mutlu
52
33
0
15 May 2022
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
29
64
0
13 Feb 2022
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Ching-An Cheng
Tengyang Xie
Nan Jiang
Alekh Agarwal
OffRL
11
125
0
05 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
D. Meger
Doina Precup
Ofir Nachum
S. Gu
30
31
0
28 Jan 2022
Hyperparameter Selection Methods for Fitted Q-Evaluation with Error Guarantee
Kohei Miyaguchi
OffRL
33
1
0
07 Jan 2022
Data Augmentation through Expert-guided Symmetry Detection to Improve Performance in Offline Reinforcement Learning
Giorgio Angelotti
Nicolas Drougard
Caroline Ponzoni Carvalho Chanel
OffRL
26
2
0
18 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
16
4
0
29 Nov 2021
Measuring Data Quality for Dataset Selection in Offline Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
20
6
0
26 Nov 2021
UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning
Christopher P. Diehl
Timo Sievernich
Martin Krüger
F. Hoffmann
Torsten Bertram
OffRL
16
25
0
22 Nov 2021
Validate on Sim, Detect on Real -- Model Selection for Domain Randomization
Gal Leibovich
Guy Jacob
Shadi Endrawis
Gal Novik
Aviv Tamar
17
7
0
01 Nov 2021
Offline RL With Resource Constrained Online Deployment
Jayanth Reddy Regatti
A. Deshmukh
Frank Cheng
Young Hun Jung
Abhishek Gupta
Ürün Dogan
OffRL
11
2
0
07 Oct 2021
Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings
Shengpu Tang
Jenna Wiens
OffRL
26
78
0
23 Jul 2021
Supervised Off-Policy Ranking
Yue Jin
Yue Zhang
Tao Qin
Xudong Zhang
Jian Yuan
Houqiang Li
Tie-Yan Liu
OffRL
30
5
0
03 Jul 2021
UAV-assisted Online Machine Learning over Multi-Tiered Networks: A Hierarchical Nested Personalized Federated Learning Approach
Su Wang
Seyyedali Hosseinalipour
M. Gorlatova
Christopher G. Brinton
M. Chiang
32
36
0
29 Jun 2021
Offline RL Without Off-Policy Evaluation
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
OffRL
42
161
0
16 Jun 2021
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
20
778
0
12 Jun 2021
On Instrumental Variable Regression for Deep Offline Policy Evaluation
Yutian Chen
Liyuan Xu
Çağlar Gülçehre
T. Paine
A. Gretton
Nando de Freitas
Arnaud Doucet
OffRL
31
17
0
21 May 2021
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
33
100
0
30 Mar 2021
Instabilities of Offline RL with Pre-Trained Neural Representation
Ruosong Wang
Yifan Wu
Ruslan Salakhutdinov
Sham Kakade
OffRL
13
41
0
08 Mar 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
132
78
0
01 Feb 2021
Semi-supervised reward learning for offline reinforcement learning
Ksenia Konyushkova
Konrad Zolna
Y. Aytar
Alexander Novikov
Scott E. Reed
Serkan Cabi
Nando de Freitas
SSL
OffRL
66
23
0
12 Dec 2020
Offline Learning from Demonstrations and Unlabeled Experience
Konrad Zolna
Alexander Novikov
Ksenia Konyushkova
Çağlar Gülçehre
Ziyun Wang
Y. Aytar
Misha Denil
Nando de Freitas
Scott E. Reed
SSL
OffRL
30
66
0
27 Nov 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
337
1,955
0
04 May 2020