ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.00980
  4. Cited By
Zeroth Order Non-convex optimization with Dueling-Choice Bandits

Zeroth Order Non-convex optimization with Dueling-Choice Bandits

3 November 2019
Yichong Xu
Aparna R. Joshi
Aarti Singh
A. Dubrawski
ArXivPDFHTML

Papers citing "Zeroth Order Non-convex optimization with Dueling-Choice Bandits"

1 / 1 papers shown
Title
Preference-based Reinforcement Learning with Finite-Time Guarantees
Preference-based Reinforcement Learning with Finite-Time Guarantees
Yichong Xu
Ruosong Wang
Lin F. Yang
Aarti Singh
A. Dubrawski
36
53
0
16 Jun 2020
1