ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.03091
  4. Cited By
Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback

Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback

7 July 2022
Adhyyan Narang
Omid Sadeghi
Lillian J. Ratliff
Maryam Fazel
J. Bilmes
    OffRL
ArXivPDFHTML

Papers citing "Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback"

3 / 3 papers shown
Title
Initializing Services in Interactive ML Systems for Diverse Users
Initializing Services in Interactive ML Systems for Diverse Users
Avinandan Bose
Mihaela Curmei
Daniel L. Jiang
Jamie Morgenstern
Sarah Dean
Lillian J. Ratliff
Maryam Fazel
21
5
0
19 Dec 2023
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
372
12,081
0
04 Mar 2022
Matroid Bandits: Fast Combinatorial Optimization with Learning
Matroid Bandits: Fast Combinatorial Optimization with Learning
B. Kveton
Zheng Wen
Azin Ashkan
Hoda Eydgahi
Brian Eriksson
46
119
0
20 Mar 2014
1