2208.07406
Cited By
Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
15 August 2022
Anna L. Trella
Kelly W. Zhang
Inbal Nahum-Shani
Vivek Shetty
Finale Doshi-Velez
S. Murphy
OnRL
Papers citing
"Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care"
15 / 15 papers shown
Title
Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward
Han Weng
Boyi Liu
Yuanfeng Song
Dun Zeng
Yingxiang Yang
Yi Zhan
Longjie Cui
Xiaoming Yin
Yang Sun
12
0
0
18 May 2025
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL
Mohammadreza Pourreza
Shayan Talaei
Ruoxi Sun
Xingchen Wan
Hailong Li
Azalia Mirhoseini
Amin Saberi
Sercan Ö. Arik
ReLM
AI4TS
LRM
46
4
0
29 Mar 2025
Reinforcement Learning on Dyads to Enhance Medication Adherence
Ziping Xu
Hinal Jajal
S. Choi
Inbal Nahum-Shani
Guy Shani
Alexandra M. Psihogios
Pei-Yao Hung
S. Murphy
58
0
0
06 Feb 2025
A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Anna L. Trella
Kelly W. Zhang
Hinal Jajal
Inbal Nahum-Shani
Vivek Shetty
Finale Doshi-Velez
S. Murphy
OnRL
21
1
0
03 Sep 2024
Effective Monitoring of Online Decision-Making Algorithms in Digital Intervention Implementation
Anna L. Trella
Susobhan Ghosh
Erin Bonar
Lara N. Coughlin
Finale Doshi-Velez
...
Vivek Shetty
Maureen Walton
Iris Yan
Kelly W. Zhang
S. Murphy
31
0
0
30 Aug 2024
Oralytics Reinforcement Learning Algorithm
Anna L. Trella
Kelly W. Zhang
Stephanie M Carpenter
David Elashoff
Zara M Greer
Inbal Nahum-Shani
Dennis Ruenger
Vivek Shetty
S. Murphy
20
0
0
19 Jun 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen A. Walton
Susan Murphy
41
4
0
27 Feb 2024
Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials
Anna L. Trella
Kelly W. Zhang
Inbal Nahum-Shani
Vivek Shetty
Iris Yan
Finale Doshi-Velez
Susan A. Murphy
OffRL
OnRL
33
3
0
26 Feb 2024
Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams
Brian Cho
Kyra Gan
Nathan Kallus
25
6
0
09 Feb 2024
Non-Stationary Latent Auto-Regressive Bandits
Anna L. Trella
Walter Dempsey
Asim H. Gazi
Ziping Xu
Finale Doshi-Velez
Susan A. Murphy
26
1
0
05 Feb 2024
Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study
Xueqing Liu
Nina Deliu
Tanujit Chakraborty
Lauren Bell
Bibhas Chakraborty
18
1
0
24 Nov 2023
Dyadic Reinforcement Learning
Shuangning Li
L. Niell
S. Choi
Inbal Nahum-Shani
Guy Shani
Susan Murphy
OffRL
25
1
0
15 Aug 2023
Online learning in bandits with predicted context
Yongyi Guo
Ziping Xu
Susan Murphy
26
4
0
26 Jul 2023
OPTWIN: Drift identification with optimal sub-windows
Mauro Dalle Lucca Tosi
Martin Theobald
36
1
0
19 May 2023
Semi-parametric inference based on adaptively collected data
Licong Lin
K. Khamaru
Martin J. Wainwright
OffRL
39
6
0
05 Mar 2023