Incorporating Behavioral Constraints in Online AI Systems

15 September 2018

Avinash Balakrishnan

Djallel Bouneffouf

Nicholas Mattei

Papers citing "Incorporating Behavioral Constraints in Online AI Systems"

Title | Authors | Tag | Counts | Date
Building Ethically Bounded AI | F. Rossi, Nicholas Mattei | | 61 75 0 | 10 Dec 2018
Building Ethics into Artificial Intelligence | Han Yu, Zhiqi Shen, Chunyan Miao, Cyril Leung, V. Lesser, Qiang Yang | AI4TS | 41 188 0 | 07 Dec 2018
Interpretable Multi-Objective Reinforcement Learning through Policy Orchestration | Ritesh Noothigattu, Djallel Bouneffouf, Nicholas Mattei, Rachita Chandra, Piyush Madan, Kush R. Varshney, Murray Campbell, Moninder Singh, F. Rossi | AI4CE | 32 23 0 | 21 Sep 2018
AI Safety Gridworlds | Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg | | 95 250 0 | 27 Nov 2017
Context Attentive Bandits: Contextual Bandit with Restricted Context | Djallel Bouneffouf, Irina Rish, Guillermo Cecchi, Raphael Feraud | | 25 67 0 | 10 May 2017
Active Learning for Cost-Sensitive Classification | A. Krishnamurthy, Alekh Agarwal, Tzu-Kuo Huang, Hal Daumé, John Langford | | 151 79 0 | 03 Mar 2017
Conservative Bandits | Yifan Wu, R. Shariff, Tor Lattimore, Csaba Szepesvári | | 104 98 0 | 13 Feb 2016
Research Priorities for Robust and Beneficial Artificial Intelligence | Stuart J. Russell, Dan Dewey, Max Tegmark | | 56 656 0 | 10 Feb 2016
Linear Contextual Bandits with Knapsacks | Shipra Agrawal, Nikhil R. Devanur | | 110 142 0 | 24 Jul 2015
Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits | Huasen Wu, R. Srikant, Xin Liu, Chong Jiang | | 38 95 0 | 27 Apr 2015
Matroid Bandits: Fast Combinatorial Optimization with Learning | Branislav Kveton, Zheng Wen, Azin Ashkan, Hoda Eydgahi, Brian Eriksson | | 62 119 0 | 20 Mar 2014
Playing Atari with Deep Reinforcement Learning | Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller | | 103 12,163 0 | 19 Dec 2013
Thompson Sampling for Contextual Bandits with Linear Payoffs | Shipra Agrawal, Navin Goyal | | 133 993 0 | 15 Sep 2012
Thompson Sampling: An Asymptotically Optimal Finite Time Analysis | E. Kaufmann, N. Korda, Rémi Munos | | 102 585 0 | 18 May 2012
A Contextual-Bandit Approach to Personalized News Article Recommendation | Lihong Li, Wei Chu, John Langford, Robert Schapire | | 277 2,935 0 | 28 Feb 2010