ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.09390
  4. Cited By
On the Interplay Between Misspecification and Sub-optimality Gap in
  Linear Contextual Bandits

On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits

16 March 2023
Weitong Zhang
Jiafan He
Zhiyuan Fan
Q. Gu
ArXivPDFHTML

Papers citing "On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits"

2 / 2 papers shown
Title
Reinforcement Learning from Human Feedback with Active Queries
Reinforcement Learning from Human Feedback with Active Queries
Kaixuan Ji
Jiafan He
Quanquan Gu
24
17
0
14 Feb 2024
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial
  Corruptions
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions
Jiafan He
Dongruo Zhou
Tong Zhang
Quanquan Gu
66
46
0
13 May 2022
1