Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1507.06527
Cited By
Deep Recurrent Q-Learning for Partially Observable MDPs
23 July 2015
Matthew J. Hausknecht
Peter Stone
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Recurrent Q-Learning for Partially Observable MDPs"
50 / 634 papers shown
Title
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
21
3
0
29 Jun 2023
Large Sequence Models for Sequential Decision-Making: A Survey
Muning Wen
Runji Lin
Hanjing Wang
Yaodong Yang
Ying Wen
Luo Mai
Jun Wang
Haifeng Zhang
Weinan Zhang
LM&Ro
LRM
37
35
0
24 Jun 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL
Gaspard Lambrechts
Adrien Bolland
D. Ernst
23
7
0
20 Jun 2023
Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification
Dong Xing
Pengjie Gu
Qian Zheng
Xinrun Wang
Shanqi Liu
Longtao Zheng
Bo An
Gang Pan
18
2
0
19 Jun 2023
Automatic Deduction Path Learning via Reinforcement Learning with Environmental Correction
Shuai Xiao
Chen Pan
Min Wang
Xinxin Zhu
Siqiao Xue
Jing Wang
Yun Hu
James Y. Zhang
Jinghua Feng
OffRL
36
2
0
16 Jun 2023
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Fabian Paischer
Thomas Adler
M. Hofmarcher
Sepp Hochreiter
21
9
0
15 Jun 2023
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
Quentin Delfosse
Jannis Blüml
Bjarne Gregori
Sebastian Sztwiertnia
Kristian Kersting
40
17
0
14 Jun 2023
Approximate information state based convergence analysis of recurrent Q-learning
Erfan Seyedsalehi
N. Akbarzadeh
Amit Sinha
Aditya Mahajan
22
6
0
09 Jun 2023
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Bo Liu
Yifeng Zhu
Chongkai Gao
Yihao Feng
Qian Liu
Yuke Zhu
Peter Stone
CLL
35
115
0
05 Jun 2023
Model-aided Federated Reinforcement Learning for Multi-UAV Trajectory Planning in IoT Networks
Jichao Chen
Omid Esrafilian
Harald Bayerlein
David Gesbert
Marco Caccamo
16
4
0
03 Jun 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
16
6
0
01 Jun 2023
Active Vision Reinforcement Learning under Limited Visual Observability
Jinghuan Shang
Michael S. Ryoo
32
0
0
01 Jun 2023
Exploring the Promise and Limits of Real-Time Recurrent Learning
Kazuki Irie
Anand Gopalakrishnan
Jürgen Schmidhuber
24
15
0
30 May 2023
ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry
Chris Beeler
Sriram Ganapathi Subramanian
Kyle Sprague
Nouha Chatti
C. Bellinger
...
Amanuel Dawit
Zihan Yang
Xinkai Li
Mark Crowley
Isaac Tamblyn
OffRL
13
6
0
23 May 2023
RLocator: Reinforcement Learning for Bug Localization
Partha Chakraborty
Mahmoud Alfadel
M. Nagappan
18
8
0
09 May 2023
Goal-oriented inference of environment from redundant observations
Kazuki Takahashi
T. Fukai
Y. Sakai
T. Takekawa
9
0
0
08 May 2023
Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments
Junchao Li
Mingyu Cai
Z. Kan
Shaoping Xiao
17
0
0
30 Apr 2023
Optimal Scheduling in IoT-Driven Smart Isolated Microgrids Based on Deep Reinforcement Learning
Jiaju Qi
Lei Lei
Kan Zheng
Simon X. Yang
Xuemin
X. Shen
16
11
0
28 Apr 2023
N
A
2
\text{A}^\text{2}
A
2
Q: Neural Attention Additive Model for Interpretable Multi-Agent Q-Learning
Zichuan Liu
Yuanyang Zhu
Chunlin Chen
45
10
0
26 Apr 2023
A optimization framework for herbal prescription planning based on deep reinforcement learning
Kuo Yang
Zecong Yu
X. Su
Xiong He
Ning Wang
Qiguang Zheng
Fei-yun Yu
Zhuang Liu
Tiancai Wen
Xuezhong Zhou
LM&MA
14
0
0
25 Apr 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núnez-Molina
Pablo Mesejo
Juan Fernández-Olivares
30
3
0
20 Apr 2023
Observer-Feedback-Feedforward Controller Structures in Reinforcement Learning
Ruoqing Zhang
Per Mattsson
T. Wigren
17
0
0
20 Apr 2023
End-to-End Policy Gradient Method for POMDPs and Explainable Agents
Soichiro Nishimori
Sotetsu Koyamada
Shin Ishii
15
0
0
19 Apr 2023
Sample-efficient Model-based Reinforcement Learning for Quantum Control
Irtaza Khalid
C. Weidner
E. Jonckheere
Sophie G. Shermer
F. Langbein
11
10
0
19 Apr 2023
Mastering Pair Trading with Risk-Aware Recurrent Reinforcement Learning
Weiguang Han
Jimin Huang
Qianqian Xie
Boyi Zhang
Yanzhao Lai
Min Peng
27
4
0
01 Apr 2023
Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning
Claude Formanek
C. Tilbury
Jonathan P. Shock
Kale-ab Tessera
Arnu Pretorius
24
3
0
31 Mar 2023
Decision Making for Autonomous Driving in Interactive Merge Scenarios via Learning-based Prediction
Salar Arbabi
D. Tavernini
Saber Fallah
Richard Bowden
17
1
0
29 Mar 2023
The challenge of redundancy on multi-agent value factorisation
Siddarth S. Singh
Benjamin Rosman
36
1
0
28 Mar 2023
Active hypothesis testing in unknown environments using recurrent neural networks and model free reinforcement learning
George Stamatelis
N. Kalouptsidis
19
2
0
19 Mar 2023
SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Qiang-qiang Wang
Jia-jia Zhang
Jing Xiao
Xihuai Wang
31
0
0
16 Mar 2023
Decision-Making Under Uncertainty: Beyond Probabilities
Thom S. Badings
T. D. Simão
Marnix Suilen
N. Jansen
UD
PER
31
12
0
10 Mar 2023
The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models
Raphael Avalos
Florent Delgrange
Ann Nowé
Guillermo A. Pérez
D. Roijers
36
2
0
06 Mar 2023
Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation
Wenyan Yang
A. Angleraud
R. Pieters
Joni Pajarinen
Joni-Kristian Kämäräinen
32
6
0
05 Mar 2023
Demonstration-guided Deep Reinforcement Learning for Coordinated Ramp Metering and Perimeter Control in Large Scale Networks
Zijian Hu
Wei-Ying Ma
19
5
0
04 Mar 2023
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning
Cong Guan
F. Chen
Lei Yuan
Zongzhang Zhang
Yang Yu
OffRL
37
4
0
19 Feb 2023
Shared Information-Based Safe And Efficient Behavior Planning For Connected Autonomous Vehicles
Songyang Han
Shangli Zhou
Lynn Pepin
Jiangwei Wang
Caiwen Ding
Fei Miao
13
1
0
08 Feb 2023
Incorporating Recurrent Reinforcement Learning into Model Predictive Control for Adaptive Control in Autonomous Driving
Yehui Zhang
Joschka Boedecker
Chuxuan Li
Guyue Zhou
7
0
0
30 Jan 2023
Contrastive Meta-Learning for Partially Observable Few-Shot Learning
Adam Jelley
Amos Storkey
Antreas Antoniou
Sam Devlin
25
6
0
30 Jan 2023
Zero-Shot Transfer of Haptics-Based Object Insertion Policies
Samarth Brahmbhatt
A. Deka
Andrew Spielberg
M. Muller
9
5
0
29 Jan 2023
Double Deep Reinforcement Learning Techniques for Low Dimensional Sensing Mapless Navigation of Terrestrial Mobile Robots
Linda Dotto de Moraes
V. A. Kich
A. H. Kolling
J. A. Bottega
Raul Steinmetz
E. Silva
Ricardo B. Grando
Anselmo Rafael Cuckla
D. T. Gamarra
18
0
0
26 Jan 2023
DIFFER: Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning
Xu Hu
Jian Zhao
Wen-gang Zhou
Ruili Feng
Houqiang Li
29
1
0
25 Jan 2023
The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning
Andrea Soltoggio
Eseoghene Ben-Iwhiwhu
Christos Peridis
Pawel Ladosz
Jeffery Dick
Praveen K. Pilly
Soheil Kolouri
OffRL
32
3
0
21 Jan 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav
Ram Ramrakhya
Dhruv Batra
Erik Wijmans
Abhishek Das
OffRL
20
53
0
18 Jan 2023
SoK: Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning
Maxwell Standen
Junae Kim
Claudia Szabo
AAML
29
5
0
11 Jan 2023
A Survey on Transformers in Reinforcement Learning
Wenzhe Li
Hao Luo
Zichuan Lin
Chongjie Zhang
Zongqing Lu
Deheng Ye
OffRL
MU
AI4CE
37
55
0
08 Jan 2023
Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability
Thomy Phan
Fabian Ritz
Philipp Altmann
Maximilian Zorn
Jonas Nusslein
Michael Kolle
Thomas Gabor
Claudia Linnhoff-Popien
22
12
0
04 Jan 2023
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Hao Chen
Jianye Hao
Yiqun Chen
Dong Li
Junge Zhang
Zhen Xiao
OffRL
31
8
0
30 Dec 2022
On Deep Recurrent Reinforcement Learning for Active Visual Tracking of Space Noncooperative Objects
D. Zhou
Guanghui Sun
Zhao-jie Zhang
Ligang Wu
17
8
0
29 Dec 2022
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
27
25
0
29 Dec 2022
Previous
1
2
3
4
5
6
...
11
12
13
Next