ResearchTrend.AI

Foundations of Reinforcement Learning and Interactive Decision Making
arXiv:2312.16730

27 December 2023
Dylan J. Foster
Alexander Rakhlin
    OffRL

Papers citing "Foundations of Reinforcement Learning and Interactive Decision Making"

8 / 8 papers shown
Minimax Optimal Reinforcement Learning with Quasi-Optimism
Harin Lee
Min-hwan Oh
OffRL
105
1
0
02 Mar 2025
Towards a Sharp Analysis of Offline Policy Learning for $f$-Divergence-Regularized Contextual Bandits
Qingyue Zhao
Kaixuan Ji
Heyang Zhao
Tong Zhang
Q. Gu
OffRL
113
0
0
09 Feb 2025
Hybrid Preference Optimization for Alignment: Provably Faster Convergence Rates by Combining Offline Preferences with Online Exploration
Avinandan Bose
Zhihan Xiong
Aadirupa Saha
S. Du
Maryam Fazel
123
1
0
13 Dec 2024
Enhancing Quantum Memory Lifetime with Measurement-Free Local Error Correction and Reinforcement Learning
Mincheol Park
N. Maskara
Marcin Kalinowski
Mikhail D. Lukin
53
1
0
18 Aug 2024
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Tengyang Xie
Dylan J. Foster
Akshay Krishnamurthy
Corby Rosset
Ahmed Hassan Awadallah
Alexander Rakhlin
100
45
0
31 May 2024
Data-driven Error Estimation: Upper Bounding Multiple Errors without Class Complexity as Input
Sanath Kumar Krishnamurthy
Susan Athey
Emma Brunskill
62
0
0
07 May 2024
Applied Causal Inference Powered by ML and AI
Victor Chernozhukov
Christian Hansen
Nathan Kallus
Martin Spindler
Vasilis Syrgkanis
CML
84
32
0
04 Mar 2024
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy
Shrey Modi
Tanmay Gangwani
S. Katariya
Branislav Kveton
A. Rangi
OffRL
234
0
0
01 Feb 2023