ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.06527
  4. Cited By
Deep Recurrent Q-Learning for Partially Observable MDPs

Deep Recurrent Q-Learning for Partially Observable MDPs

23 July 2015
Matthew J. Hausknecht
Peter Stone
ArXivPDFHTML

Papers citing "Deep Recurrent Q-Learning for Partially Observable MDPs"

50 / 634 papers shown
Title
Safety-Aware Multi-Agent Learning for Dynamic Network Bridging
Safety-Aware Multi-Agent Learning for Dynamic Network Bridging
Raffaele Galliera
Konstantinos Mitsopoulos
N. Suri
Raffaele Romagnoli
36
0
0
02 Apr 2024
Why Online Reinforcement Learning is Causal
Why Online Reinforcement Learning is Causal
Oliver Schulte
Pascal Poupart
CML
OffRL
30
1
0
07 Mar 2024
Efficient Episodic Memory Utilization of Cooperative Multi-Agent
  Reinforcement Learning
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na
Yunkyeong Seo
IL-Chul Moon
24
7
0
02 Mar 2024
Imagine, Initialize, and Explore: An Effective Exploration Method in
  Multi-Agent Reinforcement Learning
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning
Zeyang Liu
Lipeng Wan
Xinrui Yang
Zhuoran Chen
Xingyu Chen
Xuguang Lan
21
3
0
28 Feb 2024
Do Transformer World Models Give Better Policy Gradients?
Do Transformer World Models Give Better Policy Gradients?
Michel Ma
Tianwei Ni
Clement Gehring
P. DÓro
Pierre-Luc Bacon
34
4
0
07 Feb 2024
CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting
CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting
Jiezhi Yang
Khushi Desai
Charles Packer
Harshil Bhatia
Nicholas Rhinehart
R. McAllister
Joseph E. Gonzalez
AI4CE
27
2
0
31 Jan 2024
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via
  large language models
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models
Xiao Shao
Weifu Jiang
Fei Zuo
Mengqing Liu
LLMAG
31
7
0
31 Jan 2024
Learning Online Belief Prediction for Efficient POMDP Planning in
  Autonomous Driving
Learning Online Belief Prediction for Efficient POMDP Planning in Autonomous Driving
Zhiyu Huang
Chen Tang
Chen Lv
Masayoshi Tomizuka
Wei Zhan
40
5
0
27 Jan 2024
Fully Independent Communication in Multi-Agent Reinforcement Learning
Fully Independent Communication in Multi-Agent Reinforcement Learning
Rafael Pina
V. D. Silva
Corentin Artaud
Xiaolan Liu
27
4
0
26 Jan 2024
Learning When to See for Long-term Traffic Data Collection on
  Power-constrained Devices
Learning When to See for Long-term Traffic Data Collection on Power-constrained Devices
Ruixuan Zhang
Wenyu Han
Zilin Bian
K. Ozbay
Chen Feng
AI4TS
13
0
0
25 Jan 2024
Traffic Learning and Proactive UAV Trajectory Planning for Data Uplink
  in Markovian IoT Models
Traffic Learning and Proactive UAV Trajectory Planning for Data Uplink in Markovian IoT Models
Eslam Eldeeb
M. Shehab
Hirley Alves
16
7
0
24 Jan 2024
Detecting Hidden Triggers: Mapping Non-Markov Reward Functions to Markov
Detecting Hidden Triggers: Mapping Non-Markov Reward Functions to Markov
Gregory Hyde
Eugene Santos
24
0
0
20 Jan 2024
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
22
20
0
17 Jan 2024
PDiT: Interleaving Perception and Decision-making Transformers for Deep
  Reinforcement Learning
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Ziyue Li
Zhiwei Xu
Hao Chen
Yiqun Chen
Bin Zhang
Zhen Xiao
Junge Zhang
Jiangjin Yin
OffRL
11
8
0
26 Dec 2023
An investigation of belief-free DRL and MCTS for inspection and
  maintenance planning
An investigation of belief-free DRL and MCTS for inspection and maintenance planning
Daniel Koutas
E. Bismut
Daniel Straub
19
2
0
22 Dec 2023
Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent
  Reinforcement Learning
Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning
Yanwen Ba
Xuan Liu
Xinning Chen
Hao Wang
Yang Xu
Kenli Li
Shigeng Zhang
20
2
0
19 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
30
8
0
15 Dec 2023
On The Expressivity of Recurrent Neural Cascades
On The Expressivity of Recurrent Neural Cascades
Nadezda A. Knorozova
Alessandro Ronca
18
1
0
14 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
78
5
0
13 Dec 2023
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and
  Skills
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills
Hongcai He
Anjie Zhu
Shuang Liang
Feiyu Chen
Jie Shao
OffRL
38
4
0
11 Dec 2023
TrustFed: A Reliable Federated Learning Framework with Malicious-Attack
  Resistance
TrustFed: A Reliable Federated Learning Framework with Malicious-Attack Resistance
Hangn Su
Jianhong Zhou
Xianhua Niu
Gang Feng
AAML
21
4
0
06 Dec 2023
GVFs in the Real World: Making Predictions Online for Water Treatment
GVFs in the Real World: Making Predictions Online for Water Treatment
Muhammad Kamran Janjua
Haseeb Shah
Martha White
Erfan Miahi
Marlos C. Machado
Adam White
AI4CE
35
8
0
04 Dec 2023
Generalisable Agents for Neural Network Optimisation
Generalisable Agents for Neural Network Optimisation
Kale-ab Tessera
C. Tilbury
Sasha Abramowitz
Ruan de Kock
Omayma Mahjoub
Benjamin Rosman
Sara Hooker
Arnu Pretorius
AI4CE
20
0
0
30 Nov 2023
Provable Representation with Efficient Planning for Partial Observable
  Reinforcement Learning
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
RigLSTM: Recurrent Independent Grid LSTM for Generalizable Sequence
  Learning
RigLSTM: Recurrent Independent Grid LSTM for Generalizable Sequence Learning
Ziyu Wang
Wenhao Jiang
Zixuan Zhang
Wei Tang
Junchi Yan
16
0
0
03 Nov 2023
Spacecraft Autonomous Decision-Planning for Collision Avoidance: a
  Reinforcement Learning Approach
Spacecraft Autonomous Decision-Planning for Collision Avoidance: a Reinforcement Learning Approach
Nicolas Bourriez
Adrien Loizeau
Adam F. Abdin
11
2
0
29 Oct 2023
Inverse Decision Modeling: Learning Interpretable Representations of
  Behavior
Inverse Decision Modeling: Learning Interpretable Representations of Behavior
Daniel Jarrett
Alihan Huyuk
M. Schaar
AI4CE
17
27
0
28 Oct 2023
Absolute Policy Optimization
Absolute Policy Optimization
Weiye Zhao
Feihan Li
Yifan Sun
Rui Chen
Tianhao Wei
Changliu Liu
24
4
0
20 Oct 2023
Leveraging Knowledge Distillation for Efficient Deep Reinforcement
  Learning in Resource-Constrained Environments
Leveraging Knowledge Distillation for Efficient Deep Reinforcement Learning in Resource-Constrained Environments
Guanlin Meng
23
1
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
38
10
0
15 Oct 2023
Dealing with uncertainty: balancing exploration and exploitation in deep
  recurrent reinforcement learning
Dealing with uncertainty: balancing exploration and exploitation in deep recurrent reinforcement learning
Valentina Zangirolami
Matteo Borrotti
18
6
0
12 Oct 2023
DeePref: Deep Reinforcement Learning For Video Prefetching In Content
  Delivery Networks
DeePref: Deep Reinforcement Learning For Video Prefetching In Content Delivery Networks
Nawras Alkassab
Chin-Tser Huang
Tania Lorido-Botran
11
1
0
11 Oct 2023
Lifelong Learning for Fog Load Balancing: A Transfer Learning Approach
Lifelong Learning for Fog Load Balancing: A Transfer Learning Approach
Maad Ebrahim
A. Hafid
Mohamed Riduan Abid
21
1
0
08 Oct 2023
Reinforcement Learning with Fast and Forgetful Memory
Reinforcement Learning with Fast and Forgetful Memory
Steven D. Morad
Ryan Kortvelesy
Stephan Liwicki
Amanda Prorok
OffRL
16
4
0
06 Oct 2023
Multi-Agent Reinforcement Learning Based on Representational
  Communication for Large-Scale Traffic Signal Control
Multi-Agent Reinforcement Learning Based on Representational Communication for Large-Scale Traffic Signal Control
Rohit Bokade
Xiaoning Jin
Chris Amato
35
10
0
03 Oct 2023
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of
  Agents
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents
Marco Pleines
Matthias Pallasch
Frank Zimmer
Mike Preuss
OffRL
29
0
0
29 Sep 2023
ODE-based Recurrent Model-free Reinforcement Learning for POMDPs
ODE-based Recurrent Model-free Reinforcement Learning for POMDPs
Xu Zhao
Duzhen Zhang
Liyuan Han
Tielin Zhang
Bo Xu
37
7
0
25 Sep 2023
Ten Years of Generative Adversarial Nets (GANs): A survey of the
  state-of-the-art
Ten Years of Generative Adversarial Nets (GANs): A survey of the state-of-the-art
Tanujit Chakraborty
Ujjwal Reddy K S
Shraddha M. Naik
Madhurima Panja
B. Manvitha
27
61
0
30 Aug 2023
Deep Reinforcement Learning for Uplink Scheduling in NOMA-URLLC Networks
Deep Reinforcement Learning for Uplink Scheduling in NOMA-URLLC Networks
Benoît-Marie Robaglia
M. Coupechoux
D. Tsilimantos
AI4TS
8
0
0
28 Aug 2023
Collaborative Information Dissemination with Graph-based Multi-Agent
  Reinforcement Learning
Collaborative Information Dissemination with Graph-based Multi-Agent Reinforcement Learning
Raffaele Galliera
K. Venable
Matteo Bassani
N. Suri
14
1
0
25 Aug 2023
Bayesian Exploration Networks
Bayesian Exploration Networks
Matt Fellows
Brandon Kaplowitz
Christian Schroeder de Witt
Shimon Whiteson
BDL
31
3
0
24 Aug 2023
DPMAC: Differentially Private Communication for Cooperative Multi-Agent
  Reinforcement Learning
DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning
Canzhe Zhao
Yanjie Ze
Jing Dong
Baoxiang Wang
Shuai Li
22
2
0
19 Aug 2023
A Deep Recurrent-Reinforcement Learning Method for Intelligent
  AutoScaling of Serverless Functions
A Deep Recurrent-Reinforcement Learning Method for Intelligent AutoScaling of Serverless Functions
Siddharth Agarwal
M. A. Rodriguez
Rajkumar Buyya
17
8
0
11 Aug 2023
Commodities Trading through Deep Policy Gradient Methods
Commodities Trading through Deep Policy Gradient Methods
Jonas Hanetho
11
2
0
10 Aug 2023
An In-Depth Analysis of Discretization Methods for Communication
  Learning using Backpropagation with Multi-Agent Reinforcement Learning
An In-Depth Analysis of Discretization Methods for Communication Learning using Backpropagation with Multi-Agent Reinforcement Learning
Astrid Vanneste
Simon Vanneste
Kevin Mets
Tom De Schepper
Siegfried Mercelis
P. Hellinckx
19
0
0
09 Aug 2023
Intelligence-Endogenous Management Platform for Computing and Network
  Convergence
Intelligence-Endogenous Management Platform for Computing and Network Convergence
Zicong Hong
Xiaoyu Qiu
Jiangnnan Lin
Wuhui Chen
Yue Yu
Hui Wang
Songxue Guo
Wen Gao
10
4
0
07 Aug 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed
  Markov Decision Processes
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
19
3
0
29 Jul 2023
Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning
Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning
Aqeel Labash
Florian Fletzer
Daniel Majoral
Raul Vicente
25
1
0
22 Jul 2023
On-Robot Bayesian Reinforcement Learning for POMDPs
On-Robot Bayesian Reinforcement Learning for POMDPs
Hai V. Nguyen
Sammie Katt
Yuchen Xiao
Chris Amato
OffRL
21
1
0
22 Jul 2023
Deep Attention Q-Network for Personalized Treatment Recommendation
Deep Attention Q-Network for Personalized Treatment Recommendation
Simin Ma
Junghwan Lee
N. Serban
Shihao Yang
OffRL
35
5
0
04 Jul 2023
Previous
12345...111213
Next