Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.05720
Cited By
Incorporating Behavioral Constraints in Online AI Systems
15 September 2018
Avinash Balakrishnan
Djallel Bouneffouf
Nicholas Mattei
F. Rossi
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Incorporating Behavioral Constraints in Online AI Systems"
15 / 15 papers shown
Title
Building Ethically Bounded AI
F. Rossi
Nicholas Mattei
61
75
0
10 Dec 2018
Building Ethics into Artificial Intelligence
Han Yu
Zhiqi Shen
Chunyan Miao
Cyril Leung
V. Lesser
Qiang Yang
AI4TS
41
188
0
07 Dec 2018
Interpretable Multi-Objective Reinforcement Learning through Policy Orchestration
Ritesh Noothigattu
Djallel Bouneffouf
Nicholas Mattei
Rachita Chandra
Piyush Madan
Kush R. Varshney
Murray Campbell
Moninder Singh
F. Rossi
AI4CE
32
23
0
21 Sep 2018
AI Safety Gridworlds
Jan Leike
Miljan Martic
Victoria Krakovna
Pedro A. Ortega
Tom Everitt
Andrew Lefrancq
Laurent Orseau
Shane Legg
95
250
0
27 Nov 2017
Context Attentive Bandits: Contextual Bandit with Restricted Context
Djallel Bouneffouf
Irina Rish
Guillermo Cecchi
Raphael Feraud
25
67
0
10 May 2017
Active Learning for Cost-Sensitive Classification
A. Krishnamurthy
Alekh Agarwal
Tzu-Kuo Huang
Hal Daumé
John Langford
151
79
0
03 Mar 2017
Conservative Bandits
Yifan Wu
R. Shariff
Tor Lattimore
Csaba Szepesvári
104
98
0
13 Feb 2016
Research Priorities for Robust and Beneficial Artificial Intelligence
Stuart J. Russell
Dan Dewey
Max Tegmark
56
656
0
10 Feb 2016
Linear Contextual Bandits with Knapsacks
Shipra Agrawal
Nikhil R. Devanur
110
142
0
24 Jul 2015
Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits
Huasen Wu
R. Srikant
Xin Liu
Chong Jiang
38
95
0
27 Apr 2015
Matroid Bandits: Fast Combinatorial Optimization with Learning
Branislav Kveton
Zheng Wen
Azin Ashkan
Hoda Eydgahi
Brian Eriksson
62
119
0
20 Mar 2014
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
103
12,163
0
19 Dec 2013
Thompson Sampling for Contextual Bandits with Linear Payoffs
Shipra Agrawal
Navin Goyal
133
993
0
15 Sep 2012
Thompson Sampling: An Asymptotically Optimal Finite Time Analysis
E. Kaufmann
N. Korda
Rémi Munos
102
585
0
18 May 2012
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
277
2,935
0
28 Feb 2010
1