Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1911.00980
Cited By
Zeroth Order Non-convex optimization with Dueling-Choice Bandits
3 November 2019
Yichong Xu
Aparna R. Joshi
Aarti Singh
A. Dubrawski
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Zeroth Order Non-convex optimization with Dueling-Choice Bandits"
1 / 1 papers shown
Title
Preference-based Reinforcement Learning with Finite-Time Guarantees
Yichong Xu
Ruosong Wang
Lin F. Yang
Aarti Singh
A. Dubrawski
36
53
0
16 Jun 2020
1