ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.01520
  4. Cited By
Contextual Bandits with Continuous Actions: Smoothing, Zooming, and
  Adapting

Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting

5 February 2019
A. Krishnamurthy
John Langford
Aleksandrs Slivkins
Chicheng Zhang
    OffRL
ArXivPDFHTML

Papers citing "Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting"

10 / 10 papers shown
Title
Contextual Bandits for Unbounded Context Distributions
Contextual Bandits for Unbounded Context Distributions
Puning Zhao
Jiafei Wu
Zhe Liu
Huiwen Wu
Q. Zhang
Zong Ke
Tianhang Zheng
68
3
0
19 Aug 2024
Batched Stochastic Bandit for Nondegenerate Functions
Batched Stochastic Bandit for Nondegenerate Functions
Yu Liu
Yunlu Shu
Tianyu Wang
44
0
0
09 May 2024
Infinite Action Contextual Bandits with Reusable Data Exhaust
Infinite Action Contextual Bandits with Reusable Data Exhaust
Mark Rucker
Yinglun Zhu
Paul Mineiro
OffRL
21
1
0
16 Feb 2023
Jump-Start Reinforcement Learning
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
35
109
0
05 Apr 2022
Coarse-Grained Smoothness for RL in Metric Spaces
Coarse-Grained Smoothness for RL in Metric Spaces
Giorgio Giannone
Kavosh Asadi
Cameron Allen
Sam Lobel
George Konidaris
Michael Littman
37
3
0
23 Oct 2021
Efficient Contextual Bandits with Continuous Actions
Efficient Contextual Bandits with Continuous Actions
Maryam Majzoubi
Chicheng Zhang
Rajan Chari
A. Krishnamurthy
John Langford
Aleksandrs Slivkins
OffRL
29
32
0
10 Jun 2020
Efficient Policy Learning from Surrogate-Loss Classification Reductions
Efficient Policy Learning from Surrogate-Loss Classification Reductions
Andrew Bennett
Nathan Kallus
OffRL
20
15
0
12 Feb 2020
Kernel Optimal Orthogonality Weighting: A Balancing Approach to
  Estimating Effects of Continuous Treatments
Kernel Optimal Orthogonality Weighting: A Balancing Approach to Estimating Effects of Continuous Treatments
Nathan Kallus
Michele Santacatterina
CML
17
20
0
26 Oct 2019
Model selection for contextual bandits
Model selection for contextual bandits
Dylan J. Foster
A. Krishnamurthy
Haipeng Luo
OffRL
16
89
0
03 Jun 2019
Polynomial Cost of Adaptation for X -Armed Bandits
Polynomial Cost of Adaptation for X -Armed Bandits
Hédi Hadiji
15
11
0
24 May 2019
1