Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.20250
Cited By
Optimal Policy Learning with Observational Data in Multi-Action Scenarios: Estimation, Risk Preference, and Potential Failures
29 March 2024
Giovanni Cerulli
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Optimal Policy Learning with Observational Data in Multi-Action Scenarios: Estimation, Risk Preference, and Potential Failures"
3 / 3 papers shown
Title
Reinforcement learning
Florentin Wörgötter
82
2,544
0
16 May 2024
Convergence Guarantees for Deep Epsilon Greedy Policy Learning
Michael Rawson
R. Balan
63
8
0
02 Dec 2021
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSL
OffRL
114
200
0
10 Jun 2020
1