Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.12418
Cited By
Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret
25 May 2022
Jiawei Huang
Li Zhao
Tao Qin
Wei Chen
Nan Jiang
Tie-Yan Liu
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret"
5 / 5 papers shown
Title
Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective
Jiawei Huang
Bingcong Li
Christoph Dann
Niao He
OffRL
85
0
0
26 Feb 2025
Optimism in the Face of Ambiguity Principle for Multi-Armed Bandits
Mengmeng Li
Daniel Kuhn
Bahar Taşkesen
39
0
0
30 Sep 2024
Robust Knowledge Transfer in Tiered Reinforcement Learning
Jiawei Huang
Niao He
OffRL
26
1
0
10 Feb 2023
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
96
144
0
13 Jul 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
1