ResearchTrend.AI
OCMDP: Observation-Constrained Markov Decision Process

11 November 2024
Taiyi Wang
Jianheng Liu
Bryan Lee
Zhihao Wu
Yu Wu

Papers citing "OCMDP: Observation-Constrained Markov Decision Process"

13 / 13 papers shown
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Taiyi Wang
Zhihao Wu
Jianheng Liu
Jianye Hao
Jun Wang
Kun Shao
OffRL
126
29
0
24 Feb 2025
Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond
George-Octavian Barbulescu
Taiyi Wang
Zak Singh
Eiko Yoneki
89
2
0
19 Jun 2024
IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads
Taiyi Wang
Eiko Yoneki
66
2
0
08 Apr 2024
Learning for Robot Decision Making under Distribution Shift: A Survey
Abhishek Paudel
OOD
OffRL
102
6
0
14 Mar 2022
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation
Alex J. Chan
Ioana Bica
Alihan Huyuk
Daniel Jarrett
M. Schaar
90
14
0
08 Jun 2021
Challenges for Reinforcement Learning in Healthcare
Elsa Riachi
M. Mamdani
M. Fralick
Frank Rudzicz
OffRL
55
19
0
09 Mar 2021
An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Taylor W. Killian
Haoran Zhang
Jayakumar Subramanian
Mehdi Fatemi
Marzyeh Ghassemi
OffRL
106
39
0
23 Nov 2020
Reinforcement Learning with Efficient Active Feature Acquisition
Haiyan Yin
Yingzhen Li
Sinno Jialin Pan
Cheng Zhang
Sebastian Tschiatschek
OffRL
57
14
0
02 Nov 2020
ASAC: Active Sensing using Actor-Critic models
Chang Jo Kim
James Jordon
M. Schaar
CML
59
16
0
16 Jun 2019
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
396
8,487
0
04 Jan 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
709
19,377
0
20 Jul 2017
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
439
13,348
0
09 Sep 2015
$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations
S. Kar
José M. F. Moura
H. Vincent Poor
133
189
0
30 Apr 2012