On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits

16 March 2023

Papers citing "On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits"

2 / 2 papers shown

Title
Reinforcement Learning from Human Feedback with Active Queries Kaixuan Ji Jiafan He Quanquan Gu 24 17 0 14 Feb 2024
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions Jiafan He Dongruo Zhou Tong Zhang Quanquan Gu 66 46 0 13 May 2022