Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1507.06527
Cited By
Deep Recurrent Q-Learning for Partially Observable MDPs
23 July 2015
Matthew J. Hausknecht
Peter Stone
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Recurrent Q-Learning for Partially Observable MDPs"
50 / 634 papers shown
Title
RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning
Haoxing Chen
Guang Yang
Junge Zhang
Qiyue Yin
Kaiqi Huang
20
2
0
02 Jun 2022
Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Joseph Early
Tom Bewley
C. Evers
Sarvapali Ramchurn
OffRL
16
15
0
30 May 2022
Improving Bidding and Playing Strategies in the Trick-Taking game Wizard using Deep Q-Networks
Jonas Schumacher
Marco Pleines
24
0
0
27 May 2022
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
48
17
0
26 May 2022
History Compression via Language Models in Reinforcement Learning
Fabian Paischer
Thomas Adler
Vihang Patil
Angela Bitto-Nemling
Markus Holzleitner
Sebastian Lehner
Hamid Eghbalzadeh
Sepp Hochreiter
OffRL
AI4TS
18
42
0
24 May 2022
Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization
Marco Pleines
Matthias Pallasch
F. Zimmer
Mike Preuss
23
13
0
23 May 2022
Flow-based Recurrent Belief State Learning for POMDPs
Xiaoyu Chen
Yao Mu
Ping Luo
Sheng Li
Jianyu Chen
43
18
0
23 May 2022
A Temporal-Pattern Backdoor Attack to Deep Reinforcement Learning
Yinbo Yu
Jiajia Liu
Shouqing Li
Ke Huang
Xudong Feng
AAML
36
11
0
05 May 2022
Resilient robot teams: a review integrating decentralised control, change-detection, and learning
David M. Bossens
Sarvapali Ramchurn
Danesh Tarapore
25
5
0
21 Apr 2022
Towards Comprehensive Testing on the Robustness of Cooperative Multi-agent Reinforcement Learning
Jun Guo
Yonghong Chen
Yihang Hao
Zixin Yin
Yin Yu
Simin Li
AAML
32
32
0
17 Apr 2022
Reinforcement learning on graphs: A survey
Mingshuo Nie
Dongming Chen
Dongqi Wang
31
45
0
13 Apr 2022
Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios
Ugo Lecerf
Christelle Yemdji Tchassi
S. Aubert
Pietro Michiardi
19
0
0
11 Apr 2022
Temporal Alignment for History Representation in Reinforcement Learning
Aleksandr Ermolov
E. Sangineto
N. Sebe
AI4TS
16
2
0
07 Apr 2022
Distributed Reinforcement Learning for Robot Teams: A Review
Yutong Wang
Mehul Damani
Pamela Wang
Yuhong Cao
Guillaume Sartoretti
39
22
0
07 Apr 2022
Safe Reinforcement Learning via Shielding under Partial Observability
Steven Carr
N. Jansen
Sebastian Junges
Ufuk Topcu
13
45
0
02 Apr 2022
Mask Atari for Deep Reinforcement Learning as POMDP Benchmarks
Yang Shao
Quan Kong
Tadayuki Matsumura
Taiki Fuji
Kiyoto Ito
Hiroyuki Mizuno
17
6
0
31 Mar 2022
Platform Behavior under Market Shocks: A Simulation Framework and Reinforcement-Learning Based Study
Xintong Wang
Gary Qiurui Ma
Alon Eden
Clara Li
Alexander R. Trott
Stephan Zheng
David C. Parkes
32
8
0
25 Mar 2022
Remember and Forget Experience Replay for Multi-Agent Reinforcement Learning
Pascal Weber
Daniel Wälchli
Mustafa Zeqiri
Petros Koumoutsakos
CLL
OffRL
10
7
0
24 Mar 2022
Tactile Pose Estimation and Policy Learning for Unknown Object Manipulation
Tarik Kelestemur
Robert W. Platt
T. Padır
27
32
0
21 Mar 2022
Explicit User Manipulation in Reinforcement Learning Based Recommender Systems
Matthew Sparr
OffRL
17
0
0
20 Mar 2022
Strategic Maneuver and Disruption with Reinforcement Learning Approaches for Multi-Agent Coordination
Derrik E. Asher
Anjon Basak
Rolando Fernandez
P. Sharma
Erin G. Zaroukian
...
Thomas Mahre
Gerardo Galindo
Luke Frerichs
J. Rogers
J. Fossaceca
AI4CE
6
5
0
17 Mar 2022
Backpropagation through Time and Space: Learning Numerical Methods with Multi-Agent Reinforcement Learning
E. Way
Dheeraj Kapilavai
Yiwei Fu
Lei Yu
AI4CE
12
2
0
16 Mar 2022
One-Shot Learning from a Demonstration with Hierarchical Latent Language
Nathaniel Weir
Xingdi Yuan
Marc-Alexandre Côté
Matthew J. Hausknecht
Romain Laroche
Ida Momennejad
H. V. Seijen
Benjamin Van Durme
BDL
19
6
0
09 Mar 2022
Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping
Ningyuan Zhang
Wenliang Liu
C. Belta
17
2
0
08 Mar 2022
Targeted Data Poisoning Attack on News Recommendation System by Content Perturbation
Xudong Zhang
Zan Wang
Jingke Zhao
Lanjun Wang
AAML
13
10
0
04 Mar 2022
Deep Q-network using reservoir computing with multi-layered readout
Toshitaka Matsuki
OffRL
18
2
0
03 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
17
9
0
24 Feb 2022
Multi-Agent Reinforcement Learning for Network Selection and Resource Allocation in Heterogeneous multi-RAT Networks
Mhd Saria Allahham
A. Abdellatif
N. Mhaisen
Amr M. Mohamed
A. Erbad
M. Guizani
15
31
0
21 Feb 2022
BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs
Sammie Katt
Hai V. Nguyen
F. Oliehoek
Chris Amato
BDL
OffRL
13
1
0
17 Feb 2022
Reinforcement Learning in Presence of Discrete Markovian Context Evolution
Hang Ren
Aivar Sootla
Taher Jafferjee
Junxiao Shen
Jun Wang
Haitham Bou-Ammar
BDL
OffRL
29
9
0
14 Feb 2022
Deep Reinforcement Learning and Convex Mean-Variance Optimisation for Portfolio Management
Ruan Pretorius
Terence L van Zyl
AI4TS
11
3
0
13 Feb 2022
Provable Reinforcement Learning with a Short-Term Memory
Yonathan Efroni
Chi Jin
A. Krishnamurthy
Sobhan Miryoosefi
OffRL
8
37
0
08 Feb 2022
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Miguel Suau
Jinke He
M. Spaan
F. Oliehoek
22
4
0
03 Feb 2022
FCMNet: Full Communication Memory Net for Team-Level Cooperation in Multi-Agent Systems
Yutong Wang
Guillaume Sartoretti
25
8
0
28 Jan 2022
Planning in Observable POMDPs in Quasipolynomial Time
Noah Golowich
Ankur Moitra
Dhruv Rohatgi
24
27
0
12 Jan 2022
Learning Reward Machines: A Study in Partially Observable Reinforcement Learning
Rodrigo Toro Icarte
Ethan Waldie
Toryn Q. Klassen
Richard Valenzano
Margarita P. Castro
Sheila A. McIlraith
11
13
0
17 Dec 2021
Learning to Share in Multi-Agent Reinforcement Learning
Yuxuan Yi
G. Li
Yaowei Wang
Zongqing Lu
20
13
0
16 Dec 2021
Scientific Discovery and the Cost of Measurement -- Balancing Information and Cost in Reinforcement Learning
C. Bellinger
Andriy Drozdyuk
Mark Crowley
Isaac Tamblyn
OffRL
16
7
0
14 Dec 2021
Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning
Giseung Park
Sungho Choi
Y. Sung
OffRL
26
3
0
10 Dec 2021
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution
Yunru Bai
Chen Gong
Bin Zhang
Guoliang Fan
Xinwen Hou
Yu Liu
18
6
0
09 Dec 2021
Reinforcement Learning for Navigation of Mobile Robot with LiDAR
Inhwan Kim
S. Nengroo
Dongsoo Har
21
13
0
06 Dec 2021
Towards Personalization of User Preferences in Partially Observable Smart Home Environments
Shashi Suman
F. Rivest
Ali Etemad
16
4
0
02 Dec 2021
MAMRL: Exploiting Multi-agent Meta Reinforcement Learning in WAN Traffic Engineering
Shan Sun
M. Kiran
Wei Ren
22
2
0
30 Nov 2021
Inducing Functions through Reinforcement Learning without Task Specification
Junmo Cho
Dong-hwan Lee
Young-Gyu Yoon
15
2
0
23 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
11
1
0
12 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
31
1
0
11 Nov 2021
HARPO: Learning to Subvert Online Behavioral Advertising
Jiang Zhang
Konstantinos Psounis
Muhammad Haroon
Zubair Shafiq
PICV
25
8
0
09 Nov 2021
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Seohong Park
Jaekyeom Kim
Gunhee Kim
28
23
0
06 Nov 2021
Cross Modality 3D Navigation Using Reinforcement Learning and Neural Style Transfer
Cesare Magnetti
Hadrien Reynaud
Bernhard Kainz
MedIm
8
0
0
05 Nov 2021
Autonomous Attack Mitigation for Industrial Control Systems
John Mern
Kyle Hatch
Ryan Silva
Cameron Hickert
Tamim I. Sookoor
Mykel J. Kochenderfer
AAML
11
7
0
03 Nov 2021
Previous
1
2
3
4
5
6
...
11
12
13
Next