1507.06527
Deep Recurrent Q-Learning for Partially Observable MDPs
23 July 2015
Matthew J. Hausknecht
Peter Stone
Papers citing "Deep Recurrent Q-Learning for Partially Observable MDPs"
50 / 634 papers shown
Title
Safety-Aware Multi-Agent Learning for Dynamic Network Bridging
Raffaele Galliera
Konstantinos Mitsopoulos
N. Suri
Raffaele Romagnoli
36
0
0
02 Apr 2024
Why Online Reinforcement Learning is Causal
Oliver Schulte
Pascal Poupart
CML
OffRL
30
1
0
07 Mar 2024
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na
Yunkyeong Seo
IL-Chul Moon
24
7
0
02 Mar 2024
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning
Zeyang Liu
Lipeng Wan
Xinrui Yang
Zhuoran Chen
Xingyu Chen
Xuguang Lan
21
3
0
28 Feb 2024
Do Transformer World Models Give Better Policy Gradients?
Michel Ma
Tianwei Ni
Clement Gehring
P. D'Oro
Pierre-Luc Bacon
34
4
0
07 Feb 2024
CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting
Jiezhi Yang
Khushi Desai
Charles Packer
Harshil Bhatia
Nicholas Rhinehart
R. McAllister
Joseph E. Gonzalez
AI4CE
27
2
0
31 Jan 2024
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models
Xiao Shao
Weifu Jiang
Fei Zuo
Mengqing Liu
LLMAG
31
7
0
31 Jan 2024
Learning Online Belief Prediction for Efficient POMDP Planning in Autonomous Driving
Zhiyu Huang
Chen Tang
Chen Lv
Masayoshi Tomizuka
Wei Zhan
40
5
0
27 Jan 2024
Fully Independent Communication in Multi-Agent Reinforcement Learning
Rafael Pina
V. D. Silva
Corentin Artaud
Xiaolan Liu
27
4
0
26 Jan 2024
Learning When to See for Long-term Traffic Data Collection on Power-constrained Devices
Ruixuan Zhang
Wenyu Han
Zilin Bian
K. Ozbay
Chen Feng
AI4TS
13
0
0
25 Jan 2024
Traffic Learning and Proactive UAV Trajectory Planning for Data Uplink in Markovian IoT Models
Eslam Eldeeb
M. Shehab
Hirley Alves
16
7
0
24 Jan 2024
Detecting Hidden Triggers: Mapping Non-Markov Reward Functions to Markov
Gregory Hyde
Eugene Santos
24
0
0
20 Jan 2024
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
22
20
0
17 Jan 2024
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Ziyue Li
Zhiwei Xu
Hao Chen
Yiqun Chen
Bin Zhang
Zhen Xiao
Junge Zhang
Jiangjin Yin
OffRL
11
8
0
26 Dec 2023
An investigation of belief-free DRL and MCTS for inspection and maintenance planning
Daniel Koutas
E. Bismut
Daniel Straub
19
2
0
22 Dec 2023
Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning
Yanwen Ba
Xuan Liu
Xinning Chen
Hao Wang
Yang Xu
Kenli Li
Shigeng Zhang
20
2
0
19 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
30
8
0
15 Dec 2023
On The Expressivity of Recurrent Neural Cascades
Nadezda A. Knorozova
Alessandro Ronca
18
1
0
14 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
78
5
0
13 Dec 2023
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills
Hongcai He
Anjie Zhu
Shuang Liang
Feiyu Chen
Jie Shao
OffRL
38
4
0
11 Dec 2023
TrustFed: A Reliable Federated Learning Framework with Malicious-Attack Resistance
Hangn Su
Jianhong Zhou
Xianhua Niu
Gang Feng
AAML
21
4
0
06 Dec 2023
GVFs in the Real World: Making Predictions Online for Water Treatment
Muhammad Kamran Janjua
Haseeb Shah
Martha White
Erfan Miahi
Marlos C. Machado
Adam White
AI4CE
35
8
0
04 Dec 2023
Generalisable Agents for Neural Network Optimisation
Kale-ab Tessera
C. Tilbury
Sasha Abramowitz
Ruan de Kock
Omayma Mahjoub
Benjamin Rosman
Sara Hooker
Arnu Pretorius
AI4CE
20
0
0
30 Nov 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
RigLSTM: Recurrent Independent Grid LSTM for Generalizable Sequence Learning
Ziyu Wang
Wenhao Jiang
Zixuan Zhang
Wei Tang
Junchi Yan
16
0
0
03 Nov 2023
Spacecraft Autonomous Decision-Planning for Collision Avoidance: a Reinforcement Learning Approach
Nicolas Bourriez
Adrien Loizeau
Adam F. Abdin
11
2
0
29 Oct 2023
Inverse Decision Modeling: Learning Interpretable Representations of Behavior
Daniel Jarrett
Alihan Huyuk
M. Schaar
AI4CE
17
27
0
28 Oct 2023
Absolute Policy Optimization
Weiye Zhao
Feihan Li
Yifan Sun
Rui Chen
Tianhao Wei
Changliu Liu
24
4
0
20 Oct 2023
Leveraging Knowledge Distillation for Efficient Deep Reinforcement Learning in Resource-Constrained Environments
Guanlin Meng
23
1
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
38
10
0
15 Oct 2023
Dealing with uncertainty: balancing exploration and exploitation in deep recurrent reinforcement learning
Valentina Zangirolami
Matteo Borrotti
18
6
0
12 Oct 2023
DeePref: Deep Reinforcement Learning For Video Prefetching In Content Delivery Networks
Nawras Alkassab
Chin-Tser Huang
Tania Lorido-Botran
11
1
0
11 Oct 2023
Lifelong Learning for Fog Load Balancing: A Transfer Learning Approach
Maad Ebrahim
A. Hafid
Mohamed Riduan Abid
21
1
0
08 Oct 2023
Reinforcement Learning with Fast and Forgetful Memory
Steven D. Morad
Ryan Kortvelesy
Stephan Liwicki
Amanda Prorok
OffRL
16
4
0
06 Oct 2023
Multi-Agent Reinforcement Learning Based on Representational Communication for Large-Scale Traffic Signal Control
Rohit Bokade
Xiaoning Jin
Chris Amato
35
10
0
03 Oct 2023
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents
Marco Pleines
Matthias Pallasch
Frank Zimmer
Mike Preuss
OffRL
29
0
0
29 Sep 2023
ODE-based Recurrent Model-free Reinforcement Learning for POMDPs
Xu Zhao
Duzhen Zhang
Liyuan Han
Tielin Zhang
Bo Xu
37
7
0
25 Sep 2023
Ten Years of Generative Adversarial Nets (GANs): A survey of the state-of-the-art
Tanujit Chakraborty
Ujjwal Reddy K S
Shraddha M. Naik
Madhurima Panja
B. Manvitha
27
61
0
30 Aug 2023
Deep Reinforcement Learning for Uplink Scheduling in NOMA-URLLC Networks
Benoît-Marie Robaglia
M. Coupechoux
D. Tsilimantos
AI4TS
8
0
0
28 Aug 2023
Collaborative Information Dissemination with Graph-based Multi-Agent Reinforcement Learning
Raffaele Galliera
K. Venable
Matteo Bassani
N. Suri
14
1
0
25 Aug 2023
Bayesian Exploration Networks
Matt Fellows
Brandon Kaplowitz
Christian Schroeder de Witt
Shimon Whiteson
BDL
31
3
0
24 Aug 2023
DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning
Canzhe Zhao
Yanjie Ze
Jing Dong
Baoxiang Wang
Shuai Li
22
2
0
19 Aug 2023
A Deep Recurrent-Reinforcement Learning Method for Intelligent AutoScaling of Serverless Functions
Siddharth Agarwal
M. A. Rodriguez
Rajkumar Buyya
17
8
0
11 Aug 2023
Commodities Trading through Deep Policy Gradient Methods
Jonas Hanetho
11
2
0
10 Aug 2023
An In-Depth Analysis of Discretization Methods for Communication Learning using Backpropagation with Multi-Agent Reinforcement Learning
Astrid Vanneste
Simon Vanneste
Kevin Mets
Tom De Schepper
Siegfried Mercelis
P. Hellinckx
19
0
0
09 Aug 2023
Intelligence-Endogenous Management Platform for Computing and Network Convergence
Zicong Hong
Xiaoyu Qiu
Jiangnnan Lin
Wuhui Chen
Yue Yu
Hui Wang
Songxue Guo
Wen Gao
10
4
0
07 Aug 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
19
3
0
29 Jul 2023
Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning
Aqeel Labash
Florian Fletzer
Daniel Majoral
Raul Vicente
25
1
0
22 Jul 2023
On-Robot Bayesian Reinforcement Learning for POMDPs
Hai V. Nguyen
Sammie Katt
Yuchen Xiao
Chris Amato
OffRL
21
1
0
22 Jul 2023
Deep Attention Q-Network for Personalized Treatment Recommendation
Simin Ma
Junghwan Lee
N. Serban
Shihao Yang
OffRL
35
5
0
04 Jul 2023