Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.07087
Cited By
v1
v2
v3
v4 (latest)
OCMDP: Observation-Constrained Markov Decision Process
11 November 2024
Taiyi Wang
Jianheng Liu
Bryan Lee
Zhihao Wu
Yu Wu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OCMDP: Observation-Constrained Markov Decision Process"
13 / 13 papers shown
Title
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Taiyi Wang
Zhihao Wu
Jianheng Liu
Jianye Hao
Jun Wang
Kun Shao
OffRL
126
29
0
24 Feb 2025
Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond
George-Octavian Barbulescu
Taiyi Wang
Zak Singh
Eiko Yoneki
89
2
0
19 Jun 2024
IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads
Taiyi Wang
Eiko Yoneki
66
2
0
08 Apr 2024
Learning for Robot Decision Making under Distribution Shift: A Survey
Abhishek Paudel
OOD
OffRL
102
6
0
14 Mar 2022
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation
Alex J. Chan
Ioana Bica
Alihan Huyuk
Daniel Jarrett
M. Schaar
90
14
0
08 Jun 2021
Challenges for Reinforcement Learning in Healthcare
Elsa Riachi
M. Mamdani
M. Fralick
Frank Rudzicz
OffRL
55
19
0
09 Mar 2021
An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Taylor W. Killian
Haoran Zhang
Jayakumar Subramanian
Mehdi Fatemi
Marzyeh Ghassemi
OffRL
106
39
0
23 Nov 2020
Reinforcement Learning with Efficient Active Feature Acquisition
Haiyan Yin
Yingzhen Li
Sinno Jialin Pan
Cheng Zhang
Sebastian Tschiatschek
OffRL
57
14
0
02 Nov 2020
ASAC: Active Sensing using Actor-Critic models
Chang Jo Kim
James Jordon
M. Schaar
CML
59
16
0
16 Jun 2019
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
396
8,487
0
04 Jan 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
709
19,377
0
20 Jul 2017
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
439
13,348
0
09 Sep 2015
Q
D
QD
Q
D
-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations
S. Kar
José M. F. Moura
H. Vincent Poor
133
189
0
30 Apr 2012
1