ResearchTrend.AI
Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints

2 November 2019
Sam Daulton
Shaun Singh
Vashist Avadhanula
Drew Dimmery
E. Bakshy

Papers citing "Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints"

5 / 5 papers shown
Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems
Mohammad Kachuee
Sungjin Lee
76
4
0
17 Sep 2022
Stochastic Conservative Contextual Linear Bandits
Jiabin Lin
Xian Yeow Lee
Talukder Jubery
Shana Moothedath
Soumik Sarkar
Baskar Ganapathysubramanian
16
7
0
29 Mar 2022
Stochastic Linear Bandits with Protected Subspace
Advait Parulekar
Soumya Basu
Aditya Gopalan
Karthikeyan Shanmugam
Sanjay Shakkottai
82
2
0
02 Nov 2020
Scalable Thompson Sampling using Sparse Gaussian Process Models
Sattar Vakili
Henry B. Moss
A. Artemev
Vincent Dutordoir
Victor Picheny
13
34
0
09 Jun 2020
Resourceful Contextual Bandits
Ashwinkumar Badanidiyuru
John Langford
Aleksandrs Slivkins
45
117
0
27 Feb 2014