ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.07406
  4. Cited By
Reward Design For An Online Reinforcement Learning Algorithm Supporting
  Oral Self-Care

Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care

15 August 2022
Anna L. Trella
Kelly W. Zhang
Inbal Nahum-Shani
Vivek Shetty
Finale Doshi-Velez
S. Murphy
    OnRL
ArXivPDFHTML

Papers citing "Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care"

15 / 15 papers shown
Title
Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward
Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward
Han Weng
Boyi Liu
Yuanfeng Song
Dun Zeng
Yingxiang Yang
Yi Zhan
Longjie Cui
Xiaoming Yin
Yang Sun
12
0
0
18 May 2025
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL
Mohammadreza Pourreza
Shayan Talaei
Ruoxi Sun
Xingchen Wan
Hailong Li
Azalia Mirhoseini
Amin Saberi
Sercan Ö. Arik
ReLM
AI4TS
LRM
46
4
0
29 Mar 2025
Reinforcement Learning on Dyads to Enhance Medication Adherence
Reinforcement Learning on Dyads to Enhance Medication Adherence
Ziping Xu
Hinal Jajal
S. Choi
Inbal Nahum-Shani
Guy Shani
Alexandra M. Psihogios
Pei-Yao Hung
S. Murphy
58
0
0
06 Feb 2025
A Deployed Online Reinforcement Learning Algorithm In An Oral Health
  Clinical Trial
A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Anna L. Trella
Kelly W. Zhang
Hinal Jajal
Inbal Nahum-Shani
Vivek Shetty
Finale Doshi-Velez
S. Murphy
OnRL
21
1
0
03 Sep 2024
Effective Monitoring of Online Decision-Making Algorithms in Digital
  Intervention Implementation
Effective Monitoring of Online Decision-Making Algorithms in Digital Intervention Implementation
Anna L. Trella
Susobhan Ghosh
Erin Bonar
Lara N. Coughlin
Finale Doshi-Velez
...
Vivek Shetty
Maureen Walton
Iris Yan
Kelly W. Zhang
S. Murphy
31
0
0
30 Aug 2024
Oralytics Reinforcement Learning Algorithm
Oralytics Reinforcement Learning Algorithm
Anna L. Trella
Kelly W. Zhang
Stephanie M Carpenter
David Elashoff
Zara M Greer
Inbal Nahum-Shani
Dennis Ruenger
Vivek Shetty
S. Murphy
20
0
0
19 Jun 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis
  Use
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen A. Walton
Susan Murphy
41
4
0
27 Feb 2024
Monitoring Fidelity of Online Reinforcement Learning Algorithms in
  Clinical Trials
Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials
Anna L. Trella
Kelly W. Zhang
Inbal Nahum-Shani
Vivek Shetty
Iris Yan
Finale Doshi-Velez
Susan A. Murphy
OffRL
OnRL
33
3
0
26 Feb 2024
Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests
  for Means of Multiple Data Streams
Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams
Brian Cho
Kyra Gan
Nathan Kallus
25
6
0
09 Feb 2024
Non-Stationary Latent Auto-Regressive Bandits
Non-Stationary Latent Auto-Regressive Bandits
Anna L. Trella
Walter Dempsey
Asim H. Gazi
Ziping Xu
Finale Doshi-Velez
Susan A. Murphy
26
1
0
05 Feb 2024
Thompson sampling for zero-inflated count outcomes with an application
  to the Drink Less mobile health study
Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study
Xueqing Liu
Nina Deliu
Tanujit Chakraborty
Lauren Bell
Bibhas Chakraborty
18
1
0
24 Nov 2023
Dyadic Reinforcement Learning
Dyadic Reinforcement Learning
Shuangning Li
L. Niell
S. Choi
Inbal Nahum-Shani
Guy Shani
Susan Murphy
OffRL
25
1
0
15 Aug 2023
Online learning in bandits with predicted context
Online learning in bandits with predicted context
Yongyi Guo
Ziping Xu
Susan Murphy
26
4
0
26 Jul 2023
OPTWIN: Drift identification with optimal sub-windows
OPTWIN: Drift identification with optimal sub-windows
Mauro Dalle Lucca Tosi
Martin Theobald
36
1
0
19 May 2023
Semi-parametric inference based on adaptively collected data
Semi-parametric inference based on adaptively collected data
Licong Lin
K. Khamaru
Martin J. Wainwright
OffRL
39
6
0
05 Mar 2023
1