ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.06527
  4. Cited By
Deep Recurrent Q-Learning for Partially Observable MDPs

Deep Recurrent Q-Learning for Partially Observable MDPs

23 July 2015
Matthew J. Hausknecht
Peter Stone
ArXivPDFHTML

Papers citing "Deep Recurrent Q-Learning for Partially Observable MDPs"

50 / 634 papers shown
Title
Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting
Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting
Emlyn Williams
Athanasios Polydoros
OffRL
34
0
0
13 May 2025
Depth-Constrained ASV Navigation with Deep RL and Limited Sensing
Depth-Constrained ASV Navigation with Deep RL and Limited Sensing
Amirhossein Zhalehmehrabi
Daniele Meli
Francesco Dal Santo
Francesco Trotti
Alessandro Farinelli
24
0
0
25 Apr 2025
Do We Need Transformers to Play FPS Video Games?
Do We Need Transformers to Play FPS Video Games?
Karmanbir Batth
Krish Sethi
Aly Shariff
Leo Shi
Hetul Patel
OffRL
AI4CE
29
0
0
24 Apr 2025
Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL
Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL
Songyuan Zhang
Oswin So
Mitchell Black
Zachary Serlin
Chuchu Fan
28
0
0
21 Apr 2025
LERO: LLM-driven Evolutionary framework with Hybrid Rewards and Enhanced Observation for Multi-Agent Reinforcement Learning
LERO: LLM-driven Evolutionary framework with Hybrid Rewards and Enhanced Observation for Multi-Agent Reinforcement Learning
Yuan Wei
Xiaohan Shan
Jianmin Li
36
0
0
25 Mar 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
41
0
0
20 Mar 2025
Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems
Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems
George Stamatelis
Angelos-Nikolaos Kanatas
G. C. Alexandropoulos
48
0
0
19 Mar 2025
A Generalist Hanabi Agent
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
157
0
0
17 Mar 2025
Timing the Match: A Deep Reinforcement Learning Approach for Ride-Hailing and Ride-Pooling Services
Timing the Match: A Deep Reinforcement Learning Approach for Ride-Hailing and Ride-Pooling Services
Yiman Bao
Jie Gao
Jinke He
F. Oliehoek
Oded Cats
42
0
0
17 Mar 2025
Real-Time Risky Fault-Chain Search using Time-Varying Graph RNNs
Anmol Dwivedi
A. Tajer
AI4CE
53
0
0
12 Mar 2025
RESTRAIN: Reinforcement Learning-Based Secure Framework for Trigger-Action IoT Environment
Md Morshed Alam
Lokesh Chandra Das
Sandip Roy
Sachin Shetty
Weichao Wang
AAML
OffRL
61
0
0
12 Mar 2025
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach
Steeven Janny
Hervé Poirier
L. Antsfeld
G. Bono
G. Monaci
Boris Chidlovskii
Francesco Giuliari
Alessio Del Bue
Christian Wolf
LM&Ro
53
0
0
11 Mar 2025
LTL Verification of Memoryful Neural Agents
Mehran Hosseini
A. Lomuscio
Nicola Paoletti
LLMAG
70
0
0
04 Mar 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
78
1
0
20 Feb 2025
Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System
Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System
Ziyou Jiang
Mingyang Li
Guowei Yang
Junjie Wang
Yuekai Huang
Zhiyuan Chang
Qing Wang
AAML
52
1
0
17 Feb 2025
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Egor Cherepanov
Nikita Kachaev
A. Kovalev
Aleksandr I. Panov
OffRL
41
0
0
14 Feb 2025
Inverse Reinforcement Learning with Switching Rewards and History Dependency for Characterizing Animal Behaviors
Inverse Reinforcement Learning with Switching Rewards and History Dependency for Characterizing Animal Behaviors
Jingyang Ke
Feiyang Wu
Jiyi Wang
Jeffrey Markowitz
Anqi Wu
82
0
0
22 Jan 2025
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
S. Hazra
P. Dasgupta
Soumyajit Dey
34
0
0
21 Jan 2025
A Survey on LLM-based Multi-Agent System: Recent Advances and New Frontiers in Application
A Survey on LLM-based Multi-Agent System: Recent Advances and New Frontiers in Application
Shuaihang Chen
Yuanxing Liu
Wei Han
Weinan Zhang
Ting Liu
LLMAG
AI4CE
48
2
0
08 Jan 2025
Explainable Reinforcement Learning for Formula One Race Strategy
Explainable Reinforcement Learning for Formula One Race Strategy
Devin Thomas
Junqi Jiang
Avinash Kori
Aaron Russo
Steffen Winkler
Stuart Sale
Joseph McMillan
Francesco Belardinelli
Antonio Rago
LRM
35
0
0
07 Jan 2025
Embodied CoT Distillation From LLM To Off-the-shelf Agents
Embodied CoT Distillation From LLM To Off-the-shelf Agents
Wonje Choi
Woo Kyung Kim
Minjong Yoo
Honguk Woo
OffRL
LM&Ro
113
2
0
16 Dec 2024
Robot Policy Learning with Temporal Optimal Transport Reward
Robot Policy Learning with Temporal Optimal Transport Reward
Yuwei Fu
Haichao Zhang
Di Wu
Wei-ping Xu
Benoit Boulet
OffRL
37
1
0
29 Oct 2024
POMDP-Driven Cognitive Massive MIMO Radar: Joint Target Detection-Tracking In Unknown Disturbances
POMDP-Driven Cognitive Massive MIMO Radar: Joint Target Detection-Tracking In Unknown Disturbances
Imad Bouhou
Stefano Fortunati
Leila Gharsalli
Alexandre Renaux
23
0
0
23 Oct 2024
Delay-Constrained Grant-Free Random Access in MIMO Systems: Distributed
  Pilot Allocation and Power Control
Delay-Constrained Grant-Free Random Access in MIMO Systems: Distributed Pilot Allocation and Power Control
Jianan Bai
Zheng Chen
Erik G. Larsson
22
0
0
22 Oct 2024
MarineFormer: A Transformer-based Navigation Policy Model for Collision
  Avoidance in Marine Environment
MarineFormer: A Transformer-based Navigation Policy Model for Collision Avoidance in Marine Environment
Ehsan Kazemi
Iman Soltani
96
1
0
17 Oct 2024
On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow
On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow
Tonghan Wang
Heng Dong
Yanchen Jiang
David C. Parkes
Milind Tambe
DiffM
44
2
0
17 Oct 2024
Towards Cost Sensitive Decision Making
Towards Cost Sensitive Decision Making
Yang Li
Junier Oliva
OffRL
23
0
0
04 Oct 2024
A Spatiotemporal Stealthy Backdoor Attack against Cooperative
  Multi-Agent Deep Reinforcement Learning
A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning
Yinbo Yu
Saihao Yan
Jiajia Liu
AAML
23
1
0
12 Sep 2024
An Introduction to Centralized Training for Decentralized Execution in
  Cooperative Multi-Agent Reinforcement Learning
An Introduction to Centralized Training for Decentralized Execution in Cooperative Multi-Agent Reinforcement Learning
Christopher Amato
OffRL
31
9
0
04 Sep 2024
Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Esraa Elelimy
Adam White
Michael H. Bowling
Martha White
OffRL
36
2
0
02 Sep 2024
Equivariant Reinforcement Learning under Partial Observability
Equivariant Reinforcement Learning under Partial Observability
Hai Nguyen
Andrea Baisero
David M. Klee
Dian Wang
Robert Platt
Christopher Amato
39
14
0
26 Aug 2024
Beyond Local Views: Global State Inference with Diffusion Models for
  Cooperative Multi-Agent Reinforcement Learning
Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Hangyu Mao
Nianmin Zhang
Xin Xin
Pengjie Ren
...
Bin Zhang
Guoliang Fan
Zhumin Chen
Changwei Wang
Jiangjin Yin
DiffM
22
1
0
18 Aug 2024
Pessimistic Iterative Planning for Robust POMDPs
Pessimistic Iterative Planning for Robust POMDPs
Maris F. L. Galesloot
Marnix Suilen
T. D. Simão
Steven Carr
M. Spaan
Ufuk Topcu
Nils Jansen
36
2
0
16 Aug 2024
State-of-the-art in Robot Learning for Multi-Robot Collaboration: A
  Comprehensive Survey
State-of-the-art in Robot Learning for Multi-Robot Collaboration: A Comprehensive Survey
Bin Wu
C. S. Suh
3DV
39
2
0
03 Aug 2024
How to Choose a Reinforcement-Learning Algorithm
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
34
1
0
30 Jul 2024
Integrated Communications and Security: RIS-Assisted Simultaneous
  Transmission and Generation of Secret Keys
Integrated Communications and Security: RIS-Assisted Simultaneous Transmission and Generation of Secret Keys
Ning Gao
Yuze Yao
Shi Jin
Cen Li
M. Matthaiou
24
0
0
29 Jul 2024
Reinforcement Learning for Sustainable Energy: A Survey
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
40
1
0
26 Jul 2024
Mitigating Partial Observability in Sequential Decision Processes via
  the Lambda Discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Cameron Allen
Aaron Kirtland
Ruo Yu Tao
Sam Lobel
Daniel Scott
Nicholas Petrocelli
Omer Gottesman
Ronald E. Parr
M. L. Littman
G. Konidaris
28
1
0
10 Jul 2024
Periodic agent-state based Q-learning for POMDPs
Periodic agent-state based Q-learning for POMDPs
Amit Sinha
Mathieu Geist
Aditya Mahajan
26
0
0
08 Jul 2024
Online Learning of Temporal Dependencies for Sustainable Foraging
  Problem
Online Learning of Temporal Dependencies for Sustainable Foraging Problem
John Payne
Aishwaryaprajna
Peter R. Lewis
20
0
0
01 Jul 2024
Generalisation to unseen topologies: Towards control of biological
  neural network activity
Generalisation to unseen topologies: Towards control of biological neural network activity
Laurens Engwegen
Daan Brinks
Wendelin Bohmer
MedIm
AI4CE
32
0
0
17 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
43
1
0
15 Jun 2024
Dispelling the Mirage of Progress in Offline MARL through Standardised
  Baselines and Evaluation
Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation
Claude Formanek
C. Tilbury
Louise Beyers
Jonathan P. Shock
Arnu Pretorius
OffRL
34
1
0
13 Jun 2024
Multi-agent Reinforcement Learning with Deep Networks for Diverse
  Q-Vectors
Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors
Zhenglong Luo
Zhiyong Chen
James Welsh
20
1
0
12 Jun 2024
Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance Optimization
Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance Optimization
Jesse van Remmerden
Maurice Kenter
D. Roijers
Charalampos Andriotis
Yingqian Zhang
Z. Bukhsh
23
0
0
10 Jun 2024
On Limitation of Transformer for Learning HMMs
On Limitation of Transformer for Learning HMMs
Jiachen Hu
Qinghua Liu
Chi Jin
47
3
0
06 Jun 2024
Reward Machines for Deep RL in Noisy and Uncertain Environments
Reward Machines for Deep RL in Noisy and Uncertain Environments
Andrew C. Li
Zizhao Chen
Toryn Q. Klassen
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
48
6
0
31 May 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
41
1
0
30 May 2024
Generalizing Multi-Step Inverse Models for Representation Learning to
  Finite-Memory POMDPs
Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs
Lili Wu
Ben Evans
Riashat Islam
Raihan Seraj
Yonathan Efroni
Alex Lamb
52
1
0
22 Apr 2024
Distributed Autonomous Swarm Formation for Dynamic Network Bridging
Distributed Autonomous Swarm Formation for Dynamic Network Bridging
Raffaele Galliera
Thies Möhlenhof
Alessandro Amato
Daniel Duran
K. Venable
N. Suri
25
3
0
02 Apr 2024
1234...111213
Next