On Improving Deep Reinforcement Learning for POMDPs

26 April 2017

Papers citing "On Improving Deep Reinforcement Learning for POMDPs"

50 / 59 papers shown

Title
Bi-directional Recurrence Improves Transformer in Partially Observable Markov Decision Processes Ashok Arora Neetesh Kumar 22 0 0 16 May 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations Maria Makarova Qian Liu Dzmitry Tsetserukou OffRL 46 0 0 20 Mar 2025
Towards Cost Sensitive Decision Making Yang Li Junier Oliva OffRL 23 0 0 04 Oct 2024
Autonomous Driving at Unsignalized Intersections: A Review of Decision-Making Challenges and Reinforcement Learning-Based Solutions Mohammad K. Al-Sharman Luc Edes Bert Sun Vishal Jayakumar Mohamed A. Daoud Derek Rayside W. Melek 29 1 0 20 Sep 2024
Online Learning of Temporal Dependencies for Sustainable Foraging Problem John Payne Aishwaryaprajna Peter R. Lewis 25 0 0 01 Jul 2024
An investigation of belief-free DRL and MCTS for inspection and maintenance planning Daniel Koutas E. Bismut Daniel Straub 19 2 0 22 Dec 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning Hongming Zhang Tongzheng Ren Chenjun Xiao Dale Schuurmans Bo Dai 45 3 0 20 Nov 2023
Inverse Decision Modeling: Learning Interpretable Representations of Behavior Daniel Jarrett Alihan Huyuk M. Schaar AI4CE 17 27 0 28 Oct 2023
POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance Giacomo Arcieri C. Hoelzl Oliver Schwery D. Štraub K. Papakonstantinou Eleni Chatzi 30 13 0 16 Jul 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL Gaspard Lambrechts Adrien Bolland D. Ernst 25 7 0 20 Jun 2023
Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments Junchao Li Mingyu Cai Z. Kan Shaoping Xiao 20 1 0 30 Apr 2023
End-to-End Policy Gradient Method for POMDPs and Explainable Agents Soichiro Nishimori Sotetsu Koyamada Shin Ishii 20 0 0 19 Apr 2023
On the Challenges of using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects Sumana Basu M. Legault Adriana Romero Soriano Doina Precup OffRL 26 3 0 02 Jan 2023
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning Kun-Yen Huang E. Hu Dinesh Jayaraman OffRL 38 5 0 17 Dec 2022
Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems Giacomo Arcieri C. Hoelzl Oliver Schwery D. Štraub K. Papakonstantinou Eleni Chatzi 22 22 0 15 Dec 2022
A Bayesian Framework for Digital Twin-Based Control, Monitoring, and Data Collection in Wireless Systems Clement Ruah Osvaldo Simeone Bashir M. Al-Hashimi 24 28 0 02 Dec 2022
Digital Twin-Based Multiple Access Optimization and Monitoring via Model-Driven Bayesian Learning Clement Ruah Osvaldo Simeone Bashir M. Al-Hashimi 15 6 0 11 Oct 2022
Efficient LSTM Training with Eligibility Traces Mitchell L. Hoyer Shahram Eivazi S. Otte 4 1 0 30 Sep 2022
Recurrent networks, hidden states and beliefs in partially observable environments Gaspard Lambrechts Adrien Bolland D. Ernst 16 12 0 06 Aug 2022
Planning and Learning: Path-Planning for Autonomous Vehicles, a Review of the Literature Kevin Osanlou Christophe Guettier Tristan Cazenave Éric Jacopin 31 3 0 26 Jul 2022
Generalized Beliefs for Cooperative AI Darius Muglich L. Zintgraf Christian Schroeder de Witt Shimon Whiteson Jakob N. Foerster 15 7 0 26 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning Kevin Esslinger Robert W. Platt Chris Amato OffRL 32 35 0 02 Jun 2022
Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios Ugo Lecerf Christelle Yemdji Tchassi S. Aubert Pietro Michiardi 19 0 0 11 Apr 2022
Reinforcement Learning in Presence of Discrete Markovian Context Evolution Hang Ren Aivar Sootla Taher Jafferjee Junxiao Shen Jun Wang Haitham Bou-Ammar BDL OffRL 32 9 0 14 Feb 2022
Provable Reinforcement Learning with a Short-Term Memory Yonathan Efroni Chi Jin A. Krishnamurthy Sobhan Miryoosefi OffRL 13 37 0 08 Feb 2022
Discovering Exfiltration Paths Using Reinforcement Learning with Attack Graphs Tyler Cody Abdul Rahman Christopher Redino Lanxiao Huang Ryan Clark Akshay Kakkar Deepak Kushwaha Paul Park Peter A. Beling Edward Bowen 29 14 0 28 Jan 2022
Learning robust perceptive locomotion for quadrupedal robots in the wild Takahiro Miki Joonho Lee Jemin Hwangbo Lorenz Wellhausen V. Koltun Marco Hutter 15 684 0 20 Jan 2022
Scientific Discovery and the Cost of Measurement -- Balancing Information and Cost in Reinforcement Learning C. Bellinger Andriy Drozdyuk Mark Crowley Isaac Tamblyn OffRL 21 7 0 14 Dec 2021
Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning Giseung Park Sungho Choi Y. Sung OffRL 28 3 0 10 Dec 2021
Towards Personalization of User Preferences in Partially Observable Smart Home Environments Shashi Suman F. Rivest Ali Etemad 19 4 0 02 Dec 2021
Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains Christian Gumbsch Martin Volker Butz Georg Martius AI4CE 26 21 0 29 Oct 2021
Learning What to Memorize: Using Intrinsic Motivation to Form Useful Memory in Partially Observable Reinforcement Learning Alper Demir 24 3 0 25 Oct 2021
Recurrent Off-policy Baselines for Memory-based Continuous Control Zhihan Yang Hai V. Nguyen CLL OffRL 13 23 0 25 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey Amjad Yousef Majid Serge Saaybi Tomas van Rietbergen Vincent François-Lavet R. V. Prasad Chris Verhoeven OffRL 60 54 0 28 Sep 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges Jiaju Qi Qihao Zhou Lei Lei Kan Zheng FedML 31 145 0 26 Aug 2021
Structured World Belief for Reinforcement Learning in POMDP Gautam Singh Skand Peri Junghyun Kim Hyunseok Kim Sungjin Ahn OCL 27 27 0 19 Jul 2021
Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition Bo Liu Qiang Liu Peter Stone Animesh Garg Yuke Zhu Anima Anandkumar 33 49 0 18 May 2021
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning Zhiwei Xu Yunru Bai Dapeng Li Bin Zhang Guoliang Fan 22 9 0 13 May 2021
How Are Learned Perception-Based Controllers Impacted by the Limits of Robust Control? Jingxi Xu Bruce D. Lee Nikolai Matni Dinesh Jayaraman 97 6 0 02 Apr 2021
Memory-based Deep Reinforcement Learning for POMDPs Lingheng Meng R. Gorbet Dana Kulić 13 89 0 24 Feb 2021
Uncertainty Maximization in Partially Observable Domains: A Cognitive Perspective Mirza Ramicic Andrea Bonarini 6 3 0 22 Feb 2021
Partially Observable Mean Field Reinforcement Learning Sriram Ganapathi Subramanian Matthew E. Taylor Mark Crowley Pascal Poupart OOD 16 26 0 31 Dec 2020
Online Service Migration in Mobile Edge with Incomplete System Information: A Deep Recurrent Actor-Critic Learning Approach Jin Wang Jia Hu Geyong Min Qiang Ni Tarek A. El-Ghazawi 26 28 0 16 Dec 2020
Approximate information state for approximate planning and reinforcement learning in partially observed systems Jayakumar Subramanian Amit Sinha Raihan Seraj Aditya Mahajan 6 78 0 17 Oct 2020
Learning to Infer User Hidden States for Online Sequential Advertising Zhaoqing Peng Junqi Jin Lan Luo Yaodong Yang Rui Luo ... Chuan Yu Tiejian Luo Han Li Jian Xu Kun Gai OffRL 29 4 0 03 Sep 2020
EasyRL: A Simple and Extensible Reinforcement Learning Framework Neil Hulbert S. Spillers Brandon Francis James Haines-Temons Ken Gil Romero Benjamin De Jager Sam Wong Kevin Flora Bowei Huang Athirai Aravazhi Irissappane OffRL OnRL SyDa 11 1 0 04 Aug 2020
Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations Xiao Ma Peter Karkus David Hsu W. Lee N. Ye OffRL 17 42 0 23 Feb 2020
Real-time calibration of coherent-state receivers: learning by trial and error M. Bilkis M. Rosati R. M. Yepes J. Calsamiglia 31 14 0 28 Jan 2020
DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs Yunbo Wang Bo Liu Jiajun Wu Yuke Zhu Simon S. Du Fei-Fei Li Joshua B. Tenenbaum 9 6 0 28 Sep 2019
Deep Reinforcement Learning with Modulated Hebbian plus Q Network Architecture Pawel Ladosz Eseoghene Ben-Iwhiwhu Jeffery Dick Yang Hu Nicholas A. Ketz Soheil Kolouri J. Krichmar Praveen K. Pilly Andrea Soltoggio 18 19 0 21 Sep 2019