ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.07978
  4. Cited By
On Improving Deep Reinforcement Learning for POMDPs

On Improving Deep Reinforcement Learning for POMDPs

26 April 2017
Pengfei Zhu
Xin Li
Pascal Poupart
Guanghui Miao
ArXivPDFHTML

Papers citing "On Improving Deep Reinforcement Learning for POMDPs"

50 / 59 papers shown
Title
Bi-directional Recurrence Improves Transformer in Partially Observable Markov Decision Processes
Bi-directional Recurrence Improves Transformer in Partially Observable Markov Decision Processes
Ashok Arora
Neetesh Kumar
22
0
0
16 May 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
46
0
0
20 Mar 2025
Towards Cost Sensitive Decision Making
Towards Cost Sensitive Decision Making
Yang Li
Junier Oliva
OffRL
23
0
0
04 Oct 2024
Autonomous Driving at Unsignalized Intersections: A Review of
  Decision-Making Challenges and Reinforcement Learning-Based Solutions
Autonomous Driving at Unsignalized Intersections: A Review of Decision-Making Challenges and Reinforcement Learning-Based Solutions
Mohammad K. Al-Sharman
Luc Edes
Bert Sun
Vishal Jayakumar
Mohamed A. Daoud
Derek Rayside
W. Melek
29
1
0
20 Sep 2024
Online Learning of Temporal Dependencies for Sustainable Foraging
  Problem
Online Learning of Temporal Dependencies for Sustainable Foraging Problem
John Payne
Aishwaryaprajna
Peter R. Lewis
25
0
0
01 Jul 2024
An investigation of belief-free DRL and MCTS for inspection and
  maintenance planning
An investigation of belief-free DRL and MCTS for inspection and maintenance planning
Daniel Koutas
E. Bismut
Daniel Straub
19
2
0
22 Dec 2023
Provable Representation with Efficient Planning for Partial Observable
  Reinforcement Learning
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
Inverse Decision Modeling: Learning Interpretable Representations of
  Behavior
Inverse Decision Modeling: Learning Interpretable Representations of Behavior
Daniel Jarrett
Alihan Huyuk
M. Schaar
AI4CE
17
27
0
28 Oct 2023
POMDP inference and robust solution via deep reinforcement learning: An
  application to railway optimal maintenance
POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance
Giacomo Arcieri
C. Hoelzl
Oliver Schwery
D. Štraub
K. Papakonstantinou
Eleni Chatzi
30
13
0
16 Jul 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL
Informed POMDP: Leveraging Additional Information in Model-Based RL
Gaspard Lambrechts
Adrien Bolland
D. Ernst
25
7
0
20 Jun 2023
Model-free Motion Planning of Autonomous Agents for Complex Tasks in
  Partially Observable Environments
Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments
Junchao Li
Mingyu Cai
Z. Kan
Shaoping Xiao
20
1
0
30 Apr 2023
End-to-End Policy Gradient Method for POMDPs and Explainable Agents
End-to-End Policy Gradient Method for POMDPs and Explainable Agents
Soichiro Nishimori
Sotetsu Koyamada
Shin Ishii
20
0
0
19 Apr 2023
On the Challenges of using Reinforcement Learning in Precision Drug
  Dosing: Delay and Prolongedness of Action Effects
On the Challenges of using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects
Sumana Basu
M. Legault
Adriana Romero Soriano
Doina Precup
OffRL
26
3
0
02 Jan 2023
Training Robots to Evaluate Robots: Example-Based Interactive Reward
  Functions for Policy Learning
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
38
5
0
17 Dec 2022
Bridging POMDPs and Bayesian decision making for robust maintenance
  planning under model uncertainty: An application to railway systems
Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems
Giacomo Arcieri
C. Hoelzl
Oliver Schwery
D. Štraub
K. Papakonstantinou
Eleni Chatzi
22
22
0
15 Dec 2022
A Bayesian Framework for Digital Twin-Based Control, Monitoring, and
  Data Collection in Wireless Systems
A Bayesian Framework for Digital Twin-Based Control, Monitoring, and Data Collection in Wireless Systems
Clement Ruah
Osvaldo Simeone
Bashir M. Al-Hashimi
24
28
0
02 Dec 2022
Digital Twin-Based Multiple Access Optimization and Monitoring via
  Model-Driven Bayesian Learning
Digital Twin-Based Multiple Access Optimization and Monitoring via Model-Driven Bayesian Learning
Clement Ruah
Osvaldo Simeone
Bashir M. Al-Hashimi
15
6
0
11 Oct 2022
Efficient LSTM Training with Eligibility Traces
Efficient LSTM Training with Eligibility Traces
Mitchell L. Hoyer
Shahram Eivazi
S. Otte
4
1
0
30 Sep 2022
Recurrent networks, hidden states and beliefs in partially observable
  environments
Recurrent networks, hidden states and beliefs in partially observable environments
Gaspard Lambrechts
Adrien Bolland
D. Ernst
16
12
0
06 Aug 2022
Planning and Learning: Path-Planning for Autonomous Vehicles, a Review
  of the Literature
Planning and Learning: Path-Planning for Autonomous Vehicles, a Review of the Literature
Kevin Osanlou
Christophe Guettier
Tristan Cazenave
Éric Jacopin
31
3
0
26 Jul 2022
Generalized Beliefs for Cooperative AI
Generalized Beliefs for Cooperative AI
Darius Muglich
L. Zintgraf
Christian Schroeder de Witt
Shimon Whiteson
Jakob N. Foerster
15
7
0
26 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement
  Learning
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert W. Platt
Chris Amato
OffRL
32
35
0
02 Jun 2022
Automatically Learning Fallback Strategies with Model-Free Reinforcement
  Learning in Safety-Critical Driving Scenarios
Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios
Ugo Lecerf
Christelle Yemdji Tchassi
S. Aubert
Pietro Michiardi
19
0
0
11 Apr 2022
Reinforcement Learning in Presence of Discrete Markovian Context
  Evolution
Reinforcement Learning in Presence of Discrete Markovian Context Evolution
Hang Ren
Aivar Sootla
Taher Jafferjee
Junxiao Shen
Jun Wang
Haitham Bou-Ammar
BDL
OffRL
32
9
0
14 Feb 2022
Provable Reinforcement Learning with a Short-Term Memory
Provable Reinforcement Learning with a Short-Term Memory
Yonathan Efroni
Chi Jin
A. Krishnamurthy
Sobhan Miryoosefi
OffRL
13
37
0
08 Feb 2022
Discovering Exfiltration Paths Using Reinforcement Learning with Attack
  Graphs
Discovering Exfiltration Paths Using Reinforcement Learning with Attack Graphs
Tyler Cody
Abdul Rahman
Christopher Redino
Lanxiao Huang
Ryan Clark
Akshay Kakkar
Deepak Kushwaha
Paul Park
Peter A. Beling
Edward Bowen
29
14
0
28 Jan 2022
Learning robust perceptive locomotion for quadrupedal robots in the wild
Learning robust perceptive locomotion for quadrupedal robots in the wild
Takahiro Miki
Joonho Lee
Jemin Hwangbo
Lorenz Wellhausen
V. Koltun
Marco Hutter
15
684
0
20 Jan 2022
Scientific Discovery and the Cost of Measurement -- Balancing
  Information and Cost in Reinforcement Learning
Scientific Discovery and the Cost of Measurement -- Balancing Information and Cost in Reinforcement Learning
C. Bellinger
Andriy Drozdyuk
Mark Crowley
Isaac Tamblyn
OffRL
21
7
0
14 Dec 2021
Blockwise Sequential Model Learning for Partially Observable
  Reinforcement Learning
Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning
Giseung Park
Sungho Choi
Y. Sung
OffRL
28
3
0
10 Dec 2021
Towards Personalization of User Preferences in Partially Observable
  Smart Home Environments
Towards Personalization of User Preferences in Partially Observable Smart Home Environments
Shashi Suman
F. Rivest
Ali Etemad
19
4
0
02 Dec 2021
Sparsely Changing Latent States for Prediction and Planning in Partially
  Observable Domains
Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains
Christian Gumbsch
Martin Volker Butz
Georg Martius
AI4CE
26
21
0
29 Oct 2021
Learning What to Memorize: Using Intrinsic Motivation to Form Useful
  Memory in Partially Observable Reinforcement Learning
Learning What to Memorize: Using Intrinsic Motivation to Form Useful Memory in Partially Observable Reinforcement Learning
Alper Demir
24
3
0
25 Oct 2021
Recurrent Off-policy Baselines for Memory-based Continuous Control
Recurrent Off-policy Baselines for Memory-based Continuous Control
Zhihan Yang
Hai V. Nguyen
CLL
OffRL
13
23
0
25 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative
  Survey
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
60
54
0
28 Sep 2021
Federated Reinforcement Learning: Techniques, Applications, and Open
  Challenges
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
31
145
0
26 Aug 2021
Structured World Belief for Reinforcement Learning in POMDP
Structured World Belief for Reinforcement Learning in POMDP
Gautam Singh
Skand Peri
Junghyun Kim
Hyunseok Kim
Sungjin Ahn
OCL
27
27
0
19 Jul 2021
Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team
  Composition
Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition
Bo Liu
Qiang Liu
Peter Stone
Animesh Garg
Yuke Zhu
Anima Anandkumar
33
49
0
18 May 2021
SIDE: State Inference for Partially Observable Cooperative Multi-Agent
  Reinforcement Learning
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Yunru Bai
Dapeng Li
Bin Zhang
Guoliang Fan
22
9
0
13 May 2021
How Are Learned Perception-Based Controllers Impacted by the Limits of
  Robust Control?
How Are Learned Perception-Based Controllers Impacted by the Limits of Robust Control?
Jingxi Xu
Bruce D. Lee
Nikolai Matni
Dinesh Jayaraman
97
6
0
02 Apr 2021
Memory-based Deep Reinforcement Learning for POMDPs
Memory-based Deep Reinforcement Learning for POMDPs
Lingheng Meng
R. Gorbet
Dana Kulić
13
89
0
24 Feb 2021
Uncertainty Maximization in Partially Observable Domains: A Cognitive
  Perspective
Uncertainty Maximization in Partially Observable Domains: A Cognitive Perspective
Mirza Ramicic
Andrea Bonarini
6
3
0
22 Feb 2021
Partially Observable Mean Field Reinforcement Learning
Partially Observable Mean Field Reinforcement Learning
Sriram Ganapathi Subramanian
Matthew E. Taylor
Mark Crowley
Pascal Poupart
OOD
16
26
0
31 Dec 2020
Online Service Migration in Mobile Edge with Incomplete System
  Information: A Deep Recurrent Actor-Critic Learning Approach
Online Service Migration in Mobile Edge with Incomplete System Information: A Deep Recurrent Actor-Critic Learning Approach
Jin Wang
Jia Hu
Geyong Min
Qiang Ni
Tarek A. El-Ghazawi
26
28
0
16 Dec 2020
Approximate information state for approximate planning and reinforcement
  learning in partially observed systems
Approximate information state for approximate planning and reinforcement learning in partially observed systems
Jayakumar Subramanian
Amit Sinha
Raihan Seraj
Aditya Mahajan
6
78
0
17 Oct 2020
Learning to Infer User Hidden States for Online Sequential Advertising
Learning to Infer User Hidden States for Online Sequential Advertising
Zhaoqing Peng
Junqi Jin
Lan Luo
Yaodong Yang
Rui Luo
...
Chuan Yu
Tiejian Luo
Han Li
Jian Xu
Kun Gai
OffRL
29
4
0
03 Sep 2020
EasyRL: A Simple and Extensible Reinforcement Learning Framework
EasyRL: A Simple and Extensible Reinforcement Learning Framework
Neil Hulbert
S. Spillers
Brandon Francis
James Haines-Temons
Ken Gil Romero
Benjamin De Jager
Sam Wong
Kevin Flora
Bowei Huang
Athirai Aravazhi Irissappane
OffRL
OnRL
SyDa
11
1
0
04 Aug 2020
Discriminative Particle Filter Reinforcement Learning for Complex
  Partial Observations
Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations
Xiao Ma
Peter Karkus
David Hsu
W. Lee
N. Ye
OffRL
17
42
0
23 Feb 2020
Real-time calibration of coherent-state receivers: learning by trial and
  error
Real-time calibration of coherent-state receivers: learning by trial and error
M. Bilkis
M. Rosati
R. M. Yepes
J. Calsamiglia
31
14
0
28 Jan 2020
DualSMC: Tunneling Differentiable Filtering and Planning under
  Continuous POMDPs
DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs
Yunbo Wang
Bo Liu
Jiajun Wu
Yuke Zhu
Simon S. Du
Fei-Fei Li
Joshua B. Tenenbaum
9
6
0
28 Sep 2019
Deep Reinforcement Learning with Modulated Hebbian plus Q Network
  Architecture
Deep Reinforcement Learning with Modulated Hebbian plus Q Network Architecture
Pawel Ladosz
Eseoghene Ben-Iwhiwhu
Jeffery Dick
Yang Hu
Nicholas A. Ketz
Soheil Kolouri
J. Krichmar
Praveen K. Pilly
Andrea Soltoggio
18
19
0
21 Sep 2019
12
Next