ResearchTrend.AI
KL-regularization Itself is Differentially Private in Bandits and RLHF


23 May 2025
Yizhou Zhang
Kishan Panaganti
Laixi Shi
Juba Ziani
Adam Wierman

Papers citing "KL-regularization Itself is Differentially Private in Bandits and RLHF"

3 / 3 papers shown
Towards User-level Private Reinforcement Learning with Human Feedback
Jing Zhang
Mingxi Lei
Meng Ding
Mengdi Li
Zihang Xiang
Difei Xu
Jinhui Xu
Di Wang
115
3
0
22 Feb 2025
Differentially Private Policy Gradient
Alexandre Rio
M. Barlier
Igor Colin
OffRL
71
1
0
31 Jan 2025
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
Chenlu Ye
Quanquan Gu
Tong Zhang
OffRL
234
6
0
07 Nov 2024