Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1507.06527
Cited By
Deep Recurrent Q-Learning for Partially Observable MDPs
23 July 2015
Matthew J. Hausknecht
Peter Stone
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Recurrent Q-Learning for Partially Observable MDPs"
50 / 634 papers shown
Title
Deep Reinforcement Learning and Transportation Research: A Comprehensive Review
Nahid Parvez Farazi
T. Ahamed
Limon Barua
Bo Zou
AI4TS
27
18
0
13 Oct 2020
Deep Echo State Q-Network (DEQN) and Its Application in Dynamic Spectrum Sharing for 5G and Beyond
Hao-Hsuan Chang
Lingjia Liu
Yuhao Yi
8
46
0
12 Oct 2020
A novel control mode of bionic morphing tail based on deep reinforcement learning
Liming Zheng
Zhou Zhou
Peng Sun
Zhilin Zhang
Rui Wang
AI4CE
16
1
0
08 Oct 2020
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Mohit Shridhar
Xingdi Yuan
Marc-Alexandre Côté
Yonatan Bisk
Adam Trischler
Matthew J. Hausknecht
LM&Ro
LLMAG
27
397
0
08 Oct 2020
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Tarun Gupta
Anuj Mahajan
Bei Peng
Wendelin Bohmer
Shimon Whiteson
OffRL
14
49
0
06 Oct 2020
Latent World Models For Intrinsically Motivated Exploration
Aleksandr Ermolov
N. Sebe
25
25
0
05 Oct 2020
The act of remembering: a study in partially observable reinforcement learning
Rodrigo Toro Icarte
Richard Valenzano
Toryn Q. Klassen
Phillip J. K. Christoffersen
Amir-massoud Farahmand
Sheila A. McIlraith
OffRL
6
10
0
05 Oct 2020
Correcting Experience Replay for Multi-Agent Communication
S. Ahilan
Peter Dayan
16
10
0
02 Oct 2020
Reinforcement Learning Approaches in Social Robotics
Neziha Akalin
Amy Loutfi
OffRL
6
100
0
21 Sep 2020
QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning
Jian Hu
Seth Austin Harding
Haibin Wu
Siyue Hu
Shih-Wei Liao
29
9
0
09 Sep 2020
Deep Active Inference for Partially Observable MDPs
Otto van der Himst
Pablo Lanillos
BDL
OffRL
AI4CE
12
26
0
08 Sep 2020
Deep Learning and Reinforcement Learning for Autonomous Unmanned Aerial Systems: Roadmap for Theory to Deployment
Jithin Jagannath
Anu Jagannath
Sean Furman
Tyler Gwin
14
25
0
07 Sep 2020
BGC: Multi-Agent Group Belief with Graph Clustering
Tianze Zhou
Fubiao Zhang
Pan Tang
Chenfei Wang
6
1
0
20 Aug 2020
A Survey of Knowledge-based Sequential Decision Making under Uncertainty
Shiqi Zhang
Mohan Sridharan
21
16
0
19 Aug 2020
Toward Smart Security Enhancement of Federated Learning Networks
Junjie Tan
Ying-Chang Liang
Nguyen Cong Luong
Dusit Niyato
AAML
30
38
0
19 Aug 2020
Control for Multifunctionality: Bioinspired Control Based on Feeding in Aplysia californica
Victoria A. Webster-Wood
Jeffrey P. Gill
P. Thomas
H. Chiel
24
16
0
11 Aug 2020
Deep Q-Network Based Multi-agent Reinforcement Learning with Binary Action Agents
A. M. Hafiz
G. M. Bhat
AI4CE
23
2
0
06 Aug 2020
EasyRL: A Simple and Extensible Reinforcement Learning Framework
Neil Hulbert
S. Spillers
Brandon Francis
James Haines-Temons
Ken Gil Romero
Benjamin De Jager
Sam Wong
Kevin Flora
Bowei Huang
Athirai Aravazhi Irissappane
OffRL
OnRL
SyDa
11
1
0
04 Aug 2020
Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang
Wenjie Lu
H. Yu
Dikai Liu
41
1
0
29 Jul 2020
Value-Decomposition Multi-Agent Actor-Critics
Jianyu Su
Stephen C. Adams
Peter A. Beling
68
101
0
24 Jul 2020
Attend and Segment: Attention Guided Active Semantic Segmentation
Soroush Seifi
Tinne Tuytelaars
31
13
0
22 Jul 2020
UAV Target Tracking in Urban Environments Using Deep Reinforcement Learning
Sarthak Bhagat
Sujit PB
34
47
0
21 Jul 2020
Heterogeneous Task Offloading and Resource Allocations via Deep Recurrent Reinforcement Learning in Partial Observable Multi-Fog Networks
Jungyeon Baek
Georges Kaddoum
8
83
0
21 Jul 2020
Revisiting Fundamentals of Experience Replay
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELM
OffRL
30
232
0
13 Jul 2020
Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning
Matthias Hutsebaut-Buysse
Kevin Mets
Steven Latré
LM&Ro
OffRL
OnRL
25
6
0
10 Jul 2020
Learning "What-if" Explanations for Sequential Decision-Making
Ioana Bica
Daniel Jarrett
Alihan Huyuk
M. Schaar
OffRL
16
2
0
02 Jul 2020
Human-centered collaborative robots with deep reinforcement learning
Ali Ghadirzadeh
Xi Chen
Wenjie Yin
Zhengrong Yi
Mårten Björkman
Danica Kragic
15
66
0
02 Jul 2020
Enforcing Almost-Sure Reachability in POMDPs
Sebastian Junges
N. Jansen
S. Seshia
22
26
0
30 Jun 2020
Deep Reinforcement Learning Control for Radar Detection and Tracking in Congested Spectral Environments
C. Thornton
Mark A. Kozy
R. M. Buehrer
A. Martone
K. Sherbondy
11
82
0
23 Jun 2020
QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son
Sungsoo Ahn
Roben Delos Reyes
Jinwoo Shin
Yung Yi
OffRL
28
2
0
22 Jun 2020
Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis
Weihang Yuan
Hector Muñoz-Avila
8
1
0
21 Jun 2020
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
Qiang He
Xinwen Hou
OffRL
8
28
0
18 Jun 2020
Multi-Agent Reinforcement Learning for Adaptive User Association in Dynamic mmWave Networks
Mohamed Sana
A. De Domenico
Wei Yu
Y. Lostanlen
Emilio Calvanese Strinati
18
54
0
16 Jun 2020
When is Particle Filtering Efficient for Planning in Partially Observed Linear Dynamical Systems?
S. Du
Wei Hu
Zhiyuan Li
Ruoqi Shen
Zhao-quan Song
Jiajun Wu
36
1
0
10 Jun 2020
Fitted Q-Learning for Relational Domains
Srijita Das
S. Natarajan
Kaushik Roy
Ronald E. Parr
Kristian Kersting
11
15
0
10 Jun 2020
Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning
Shariq Iqbal
Christian Schroeder de Witt
Bei Peng
Wendelin Bohmer
Shimon Whiteson
Fei Sha
29
64
0
07 Jun 2020
Re-understanding Finite-State Representations of Recurrent Policy Networks
Mohamad H. Danesh
Anurag Koul
Alan Fern
Saeed Khorram
31
21
0
06 Jun 2020
Logical Team Q-learning: An approach towards factored policies in cooperative MARL
Lucas Cassano
Ali H. Sayed
OffRL
25
2
0
05 Jun 2020
Learning Memory-Based Control for Human-Scale Bipedal Locomotion
J. Siekmann
S. Valluri
Jeremy Dao
Lorenzo Bermillo
Helei Duan
Alan Fern
J. Hurst
AI4CE
7
67
0
03 Jun 2020
Privileged Information Dropout in Reinforcement Learning
Pierre-Alexandre Kamienny
Kai Arulkumaran
Feryal M. P. Behbahani
Wendelin Boehmer
Shimon Whiteson
19
10
0
19 May 2020
A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions
Luis Miralles-Pechuán
Fernando Jiménez
Hiram Ponce
Lourdes Martínez-Villaseñor
8
13
0
15 May 2020
Maximizing Information Gain in Partially Observable Environments via Prediction Reward
Yash Satsangi
Sungsu Lim
Shimon Whiteson
F. Oliehoek
Martha White
27
15
0
11 May 2020
Intelligent Querying for Target Tracking in Camera Networks using Deep Q-Learning with n-Step Bootstrapping
Anil Sharma
Saket Anand
S. Kaul
9
6
0
20 Apr 2020
Modeling Survival in model-based Reinforcement Learning
Saeed Moazami
P. Doerschuk
OffRL
17
1
0
18 Apr 2020
Macro-Action-Based Deep Multi-Agent Reinforcement Learning
Yuchen Xiao
Joshua Hoffman
Chris Amato
AI4CE
11
29
0
18 Apr 2020
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning
Wenhao Li
Bo Jin
Xiangfeng Wang
Junchi Yan
H. Zha
25
21
0
17 Apr 2020
Reinforcement Learning in a Physics-Inspired Semi-Markov Environment
C. Bellinger
Rory Coles
Mark Crowley
Isaac Tamblyn
OOD
9
2
0
15 Apr 2020
Learning from Learners: Adapting Reinforcement Learning Agents to be Competitive in a Card Game
Pablo V. A. Barros
Ana Tanevska
A. Sciutti
17
21
0
08 Apr 2020
An Application of Deep Reinforcement Learning to Algorithmic Trading
Thibaut Théate
D. Ernst
AIFin
19
162
0
07 Apr 2020
Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
Weichao Mao
Kaipeng Zhang
Erik Miehling
Tamer Basar
6
23
0
02 Apr 2020
Previous
1
2
3
...
7
8
9
...
11
12
13
Next