Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1507.06527
Cited By
Deep Recurrent Q-Learning for Partially Observable MDPs
23 July 2015
Matthew J. Hausknecht
Peter Stone
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Recurrent Q-Learning for Partially Observable MDPs"
50 / 634 papers shown
Title
Deep Reinforcement Learning for Adaptive Exploration of Unknown Environments
Ashley Peake
Joe McCalmon
Yixin Zhang
Daniel Myers
Sarra M. Alqahtani
V. P. Pauca
11
4
0
04 May 2021
Formula RL: Deep Reinforcement Learning for Autonomous Racing using Telemetry Data
Adrian Remonda
Sarah Krebs
Eduardo E. Veas
Granit Luzhnica
Roman Kern
OffRL
32
23
0
22 Apr 2021
Reinforcement Learning using Guided Observability
Stephan Weigand
Pascal Klink
Jan Peters
Joni Pajarinen
OffRL
15
4
0
22 Apr 2021
A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings
Eltayeb Ahmed
L. Zintgraf
Christian Schroeder de Witt
Nicolas Usunier
SSL
24
0
0
17 Apr 2021
Two-stage training algorithm for AI robot soccer
Taeyoung Kim
L. Vecchietti
Kyujin Choi
Sanem Sariel
Dongsoo Har
21
6
0
13 Apr 2021
Learning to Coordinate via Multiple Graph Neural Networks
Zhiwei Xu
Bin Zhang
Yunpeng Bai
Dapeng Li
Guoliang Fan
GNN
AI4CE
27
8
0
08 Apr 2021
Data-Driven Simulation of Ride-Hailing Services using Imitation and Reinforcement Learning
H. Jayasinghe
Tarindu Jayatilaka
Ravin Gunawardena
Uthayasanker Thayasivam
30
0
0
06 Apr 2021
Distributed Deep Reinforcement Learning for Collaborative Spectrum Sharing
P. Pawar
Amir Leshem
11
3
0
06 Apr 2021
NQMIX: Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning
Quanlin Chen
OffRL
33
0
0
05 Apr 2021
How Are Learned Perception-Based Controllers Impacted by the Limits of Robust Control?
Jingxi Xu
Bruce D. Lee
Nikolai Matni
Dinesh Jayaraman
97
6
0
02 Apr 2021
Strengthening the Training of Convolutional Neural Networks By Using Walsh Matrix
T. Ölmez
Z. Dokur
11
12
0
31 Mar 2021
Simultaneous Navigation and Construction Benchmarking Environments
Wenyu Han
Chen Feng
Haoran Wu
Alexander Gao
Armand Jordana
Dong Liu
Lerrel Pinto
Ludovic Righetti
14
0
0
31 Mar 2021
Hard Attention Control By Mutual Information Maximization
Himanshu Sahni
Charles Isbell
14
0
0
10 Mar 2021
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning
Samarth Sinha
Ajay Mandlekar
Animesh Garg
OffRL
26
104
0
10 Mar 2021
WFA-IRL: Inverse Reinforcement Learning of Autonomous Behaviors Encoded as Weighted Finite Automata
Tianyu Wang
Nikolay Atanasov
24
0
0
10 Mar 2021
ELLA: Exploration through Learned Language Abstraction
Suvir Mirchandani
Siddharth Karamcheti
Dorsa Sadigh
LLMAG
24
57
0
10 Mar 2021
Memory-based Deep Reinforcement Learning for POMDPs
Lingheng Meng
R. Gorbet
Dana Kulić
15
89
0
24 Feb 2021
Uncertainty Maximization in Partially Observable Domains: A Cognitive Perspective
Mirza Ramicic
Andrea Bonarini
8
3
0
22 Feb 2021
Learning Memory-Dependent Continuous Control from Demonstrations
Siqing Hou
Dongqi Han
Jun Tani
11
0
0
18 Feb 2021
Multi-Agent Coordination in Adversarial Environments through Signal Mediated Strategies
Federico Cacciamani
A. Celli
Marco Ciccone
N. Gatti
21
8
0
09 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
25
12
0
09 Feb 2021
Adversarially Guided Actor-Critic
Yannis Flet-Berliac
Johan Ferret
Olivier Pietquin
Philippe Preux
M. Geist
29
70
0
08 Feb 2021
Robust Reinforcement Learning on State Observations with Learned Optimal Adversary
Huan Zhang
Hongge Chen
Duane S. Boning
Cho-Jui Hsieh
67
162
0
21 Jan 2021
UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers
Siyi Hu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
OffRL
37
71
0
20 Jan 2021
Solving Common-Payoff Games with Approximate Policy Iteration
Samuel Sokota
Edward Lockhart
Finbarr Timbers
Elnaz Davoodi
Ryan DÓrazio
Neil Burch
Martin Schmid
Michael Bowling
Marc Lanctot
42
22
0
11 Jan 2021
Adaptive Synthetic Characters for Military Training
Volkan Ustun
Rajay Kumar
Adam Reilly
Seyed Sajjadi
Andrew Miller
AI4CE
18
10
0
06 Jan 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Erik Cambria
OffRL
47
73
0
01 Jan 2021
Partially Observable Mean Field Reinforcement Learning
Sriram Ganapathi Subramanian
Matthew E. Taylor
Mark Crowley
Pascal Poupart
OOD
20
26
0
31 Dec 2020
Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Coordination by Multi-Critic Policy Gradient Optimization
Yoav Alon
Huiyu Zhou
46
9
0
31 Dec 2020
QVMix and QVMix-Max: Extending the Deep Quality-Value Family of Algorithms to Cooperative Multi-Agent Reinforcement Learning
Pascal Leroy
D. Ernst
Pierre Geurts
Gilles Louppe
J. Pisane
M. Sabatelli
23
6
0
22 Dec 2020
Online Service Migration in Mobile Edge with Incomplete System Information: A Deep Recurrent Actor-Critic Learning Approach
Jin Wang
Jia Hu
Geyong Min
Qiang Ni
Tarek A. El-Ghazawi
26
28
0
16 Dec 2020
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z. Leibo
Kate Larson
T. Graepel
34
199
0
15 Dec 2020
Specializing Inter-Agent Communication in Heterogeneous Multi-Agent Reinforcement Learning using Agent Class Information
Douglas De Rizzo Meneghetti
Reinaldo A. C. Bianchi
AI4CE
24
0
0
14 Dec 2020
Unsupervised Object Keypoint Learning using Local Spatial Predictability
Anand Gopalakrishnan
Sjoerd van Steenkiste
Jürgen Schmidhuber
SSL
31
21
0
25 Nov 2020
PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control
Dong Chen
Kaian Chen
Tianshu Chu
Rui Yao
F. Qiu
Kaixiang Lin
41
64
0
24 Nov 2020
An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Taylor W. Killian
Haoran Zhang
Jayakumar Subramanian
Mehdi Fatemi
Marzyeh Ghassemi
OffRL
8
36
0
23 Nov 2020
Multi-Step Recurrent Q-Learning for Robotic Velcro Peeling
Jiachen Yuan
Nicolai Häni
Volkan Isler
14
5
0
16 Nov 2020
Reinforcement Learning Control of a Biomechanical Model of the Upper Extremity
F. Fischer
Miroslav Bachinski
Markus Klar
A. Fleig
Jorg Muller
18
46
0
13 Nov 2020
Trajectory Planning for Autonomous Vehicles Using Hierarchical Reinforcement Learning
Kaleb Ben Naveed
Zhiqian Qiao
John M. Dolan
9
55
0
09 Nov 2020
Combining Propositional Logic Based Decision Diagrams with Decision Making in Urban Systems
Jiajing Ling
Kushagra Chandak
Akshat Kumar
AI4CE
12
0
0
09 Nov 2020
Hybrid Supervised Reinforced Model for Dialogue Systems
Carlos Miranda
Y. Kessaci
BDL
OffRL
6
0
0
04 Nov 2020
A Helmholtz equation solver using unsupervised learning: Application to transcranial ultrasound
A. Stanziola
Simon Arridge
B. Cox
B. Treeby
16
32
0
29 Oct 2020
Fighting Copycat Agents in Behavioral Cloning from Observation Histories
Chuan Wen
Jierui Lin
Trevor Darrell
Dinesh Jayaraman
Yang Gao
12
55
0
28 Oct 2020
Succinct and Robust Multi-Agent Communication With Temporal Message Control
S. Zhang
Jieyu Lin
Qi Zhang
21
57
0
27 Oct 2020
MELD: Meta-Reinforcement Learning from Images via Latent State Models
Tony Zhao
Anusha Nagabandi
Kate Rakelly
Chelsea Finn
Sergey Levine
OffRL
32
36
0
26 Oct 2020
Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network
Niranjan Deshpande
Dominique Vaufreydaz
A. Spalanzani
19
17
0
26 Oct 2020
Stabilizing Transformer-Based Action Sequence Generation For Q-Learning
Gideon Stein
Andrey Filchenkov
Arip Asadulaev
OffRL
29
2
0
23 Oct 2020
Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
Hai V. Nguyen
Brett Daley
Xinchao Song
Chris Amato
Robert W. Platt
48
14
0
19 Oct 2020
Approximate information state for approximate planning and reinforcement learning in partially observed systems
Jayakumar Subramanian
Amit Sinha
Raihan Seraj
Aditya Mahajan
8
78
0
17 Oct 2020
A Learning Approach to Robot-Agnostic Force-Guided High Precision Assembly
Jieliang Luo
Hui Li
12
18
0
15 Oct 2020
Previous
1
2
3
...
6
7
8
...
11
12
13
Next