Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints

2 November 2019

Papers citing "Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints"

5 / 5 papers shown

Title
Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems Mohammad Kachuee Sungjin Lee 76 4 0 17 Sep 2022
Stochastic Conservative Contextual Linear Bandits Jiabin Lin Xian Yeow Lee Talukder Jubery Shana Moothedath Soumik Sarkar Baskar Ganapathysubramanian 16 7 0 29 Mar 2022
Stochastic Linear Bandits with Protected Subspace Advait Parulekar Soumya Basu Aditya Gopalan Karthikeyan Shanmugam Sanjay Shakkottai 82 2 0 02 Nov 2020
Scalable Thompson Sampling using Sparse Gaussian Process Models Sattar Vakili Henry B. Moss A. Artemev Vincent Dutordoir Victor Picheny 13 34 0 09 Jun 2020
Resourceful Contextual Bandits Ashwinkumar Badanidiyuru John Langford Aleksandrs Slivkins 45 117 0 27 Feb 2014