ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.09048
  4. Cited By
Optimal Best-Arm Identification in Bandits with Access to Offline Data

Optimal Best-Arm Identification in Bandits with Access to Offline Data

15 June 2023
Shubhada Agrawal
Sandeep Juneja
Karthikeyan Shanmugam
A. Suggala
ArXivPDFHTML

Papers citing "Optimal Best-Arm Identification in Bandits with Access to Offline Data"

4 / 4 papers shown
Title
Online Bandit Learning with Offline Preference Data for Improved RLHF
Online Bandit Learning with Offline Preference Data for Improved RLHF
Akhil Agnihotri
Rahul Jain
Deepak Ramachandran
Zheng Wen
OffRL
44
2
0
13 Jun 2024
On Best-Arm Identification with a Fixed Budget in Non-Parametric
  Multi-Armed Bandits
On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits
Antoine Barrier
Aurélien Garivier
Gilles Stoltz
44
13
0
30 Sep 2022
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Siddhartha Banerjee
Sean R. Sinclair
Milind Tambe
Lily Xu
Chao Yu
AI4TS
33
6
0
30 Sep 2022
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
343
1,968
0
04 May 2020
1