ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.06527
  4. Cited By
Deep Recurrent Q-Learning for Partially Observable MDPs

Deep Recurrent Q-Learning for Partially Observable MDPs

23 July 2015
Matthew J. Hausknecht
Peter Stone
ArXivPDFHTML

Papers citing "Deep Recurrent Q-Learning for Partially Observable MDPs"

50 / 634 papers shown
Title
RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in
  Multi-Agent Deep Reinforcement Learning
RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning
Haoxing Chen
Guang Yang
Junge Zhang
Qiyue Yin
Kaiqi Huang
20
2
0
02 Jun 2022
Non-Markovian Reward Modelling from Trajectory Labels via Interpretable
  Multiple Instance Learning
Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Joseph Early
Tom Bewley
C. Evers
Sarvapali Ramchurn
OffRL
16
15
0
30 May 2022
Improving Bidding and Playing Strategies in the Trick-Taking game Wizard
  using Deep Q-Networks
Improving Bidding and Playing Strategies in the Trick-Taking game Wizard using Deep Q-Networks
Jonas Schumacher
Marco Pleines
24
0
0
27 May 2022
Embed to Control Partially Observed Systems: Representation Learning
  with Provable Sample Efficiency
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
48
17
0
26 May 2022
History Compression via Language Models in Reinforcement Learning
History Compression via Language Models in Reinforcement Learning
Fabian Paischer
Thomas Adler
Vihang Patil
Angela Bitto-Nemling
Markus Holzleitner
Sebastian Lehner
Hamid Eghbalzadeh
Sepp Hochreiter
OffRL
AI4TS
18
42
0
24 May 2022
Generalization, Mayhems and Limits in Recurrent Proximal Policy
  Optimization
Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization
Marco Pleines
Matthias Pallasch
F. Zimmer
Mike Preuss
23
13
0
23 May 2022
Flow-based Recurrent Belief State Learning for POMDPs
Flow-based Recurrent Belief State Learning for POMDPs
Xiaoyu Chen
Yao Mu
Ping Luo
Sheng Li
Jianyu Chen
43
18
0
23 May 2022
A Temporal-Pattern Backdoor Attack to Deep Reinforcement Learning
A Temporal-Pattern Backdoor Attack to Deep Reinforcement Learning
Yinbo Yu
Jiajia Liu
Shouqing Li
Ke Huang
Xudong Feng
AAML
36
11
0
05 May 2022
Resilient robot teams: a review integrating decentralised control,
  change-detection, and learning
Resilient robot teams: a review integrating decentralised control, change-detection, and learning
David M. Bossens
Sarvapali Ramchurn
Danesh Tarapore
25
5
0
21 Apr 2022
Towards Comprehensive Testing on the Robustness of Cooperative
  Multi-agent Reinforcement Learning
Towards Comprehensive Testing on the Robustness of Cooperative Multi-agent Reinforcement Learning
Jun Guo
Yonghong Chen
Yihang Hao
Zixin Yin
Yin Yu
Simin Li
AAML
32
32
0
17 Apr 2022
Reinforcement learning on graphs: A survey
Reinforcement learning on graphs: A survey
Mingshuo Nie
Dongming Chen
Dongqi Wang
31
45
0
13 Apr 2022
Automatically Learning Fallback Strategies with Model-Free Reinforcement
  Learning in Safety-Critical Driving Scenarios
Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios
Ugo Lecerf
Christelle Yemdji Tchassi
S. Aubert
Pietro Michiardi
19
0
0
11 Apr 2022
Temporal Alignment for History Representation in Reinforcement Learning
Temporal Alignment for History Representation in Reinforcement Learning
Aleksandr Ermolov
E. Sangineto
N. Sebe
AI4TS
16
2
0
07 Apr 2022
Distributed Reinforcement Learning for Robot Teams: A Review
Distributed Reinforcement Learning for Robot Teams: A Review
Yutong Wang
Mehul Damani
Pamela Wang
Yuhong Cao
Guillaume Sartoretti
39
22
0
07 Apr 2022
Safe Reinforcement Learning via Shielding under Partial Observability
Safe Reinforcement Learning via Shielding under Partial Observability
Steven Carr
N. Jansen
Sebastian Junges
Ufuk Topcu
13
45
0
02 Apr 2022
Mask Atari for Deep Reinforcement Learning as POMDP Benchmarks
Mask Atari for Deep Reinforcement Learning as POMDP Benchmarks
Yang Shao
Quan Kong
Tadayuki Matsumura
Taiki Fuji
Kiyoto Ito
Hiroyuki Mizuno
17
6
0
31 Mar 2022
Platform Behavior under Market Shocks: A Simulation Framework and
  Reinforcement-Learning Based Study
Platform Behavior under Market Shocks: A Simulation Framework and Reinforcement-Learning Based Study
Xintong Wang
Gary Qiurui Ma
Alon Eden
Clara Li
Alexander R. Trott
Stephan Zheng
David C. Parkes
32
8
0
25 Mar 2022
Remember and Forget Experience Replay for Multi-Agent Reinforcement
  Learning
Remember and Forget Experience Replay for Multi-Agent Reinforcement Learning
Pascal Weber
Daniel Wälchli
Mustafa Zeqiri
Petros Koumoutsakos
CLL
OffRL
10
7
0
24 Mar 2022
Tactile Pose Estimation and Policy Learning for Unknown Object
  Manipulation
Tactile Pose Estimation and Policy Learning for Unknown Object Manipulation
Tarik Kelestemur
Robert W. Platt
T. Padır
27
32
0
21 Mar 2022
Explicit User Manipulation in Reinforcement Learning Based Recommender
  Systems
Explicit User Manipulation in Reinforcement Learning Based Recommender Systems
Matthew Sparr
OffRL
17
0
0
20 Mar 2022
Strategic Maneuver and Disruption with Reinforcement Learning Approaches
  for Multi-Agent Coordination
Strategic Maneuver and Disruption with Reinforcement Learning Approaches for Multi-Agent Coordination
Derrik E. Asher
Anjon Basak
Rolando Fernandez
P. Sharma
Erin G. Zaroukian
...
Thomas Mahre
Gerardo Galindo
Luke Frerichs
J. Rogers
J. Fossaceca
AI4CE
6
5
0
17 Mar 2022
Backpropagation through Time and Space: Learning Numerical Methods with
  Multi-Agent Reinforcement Learning
Backpropagation through Time and Space: Learning Numerical Methods with Multi-Agent Reinforcement Learning
E. Way
Dheeraj Kapilavai
Yiwei Fu
Lei Yu
AI4CE
12
2
0
16 Mar 2022
One-Shot Learning from a Demonstration with Hierarchical Latent Language
One-Shot Learning from a Demonstration with Hierarchical Latent Language
Nathaniel Weir
Xingdi Yuan
Marc-Alexandre Côté
Matthew J. Hausknecht
Romain Laroche
Ida Momennejad
H. V. Seijen
Benjamin Van Durme
BDL
19
6
0
09 Mar 2022
Distributed Control using Reinforcement Learning with
  Temporal-Logic-Based Reward Shaping
Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping
Ningyuan Zhang
Wenliang Liu
C. Belta
17
2
0
08 Mar 2022
Targeted Data Poisoning Attack on News Recommendation System by Content
  Perturbation
Targeted Data Poisoning Attack on News Recommendation System by Content Perturbation
Xudong Zhang
Zan Wang
Jingke Zhao
Lanjun Wang
AAML
13
10
0
04 Mar 2022
Deep Q-network using reservoir computing with multi-layered readout
Deep Q-network using reservoir computing with multi-layered readout
Toshitaka Matsuki
OffRL
18
2
0
03 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
17
9
0
24 Feb 2022
Multi-Agent Reinforcement Learning for Network Selection and Resource
  Allocation in Heterogeneous multi-RAT Networks
Multi-Agent Reinforcement Learning for Network Selection and Resource Allocation in Heterogeneous multi-RAT Networks
Mhd Saria Allahham
A. Abdellatif
N. Mhaisen
Amr M. Mohamed
A. Erbad
M. Guizani
15
31
0
21 Feb 2022
BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs
BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs
Sammie Katt
Hai V. Nguyen
F. Oliehoek
Chris Amato
BDL
OffRL
13
1
0
17 Feb 2022
Reinforcement Learning in Presence of Discrete Markovian Context
  Evolution
Reinforcement Learning in Presence of Discrete Markovian Context Evolution
Hang Ren
Aivar Sootla
Taher Jafferjee
Junxiao Shen
Jun Wang
Haitham Bou-Ammar
BDL
OffRL
29
9
0
14 Feb 2022
Deep Reinforcement Learning and Convex Mean-Variance Optimisation for
  Portfolio Management
Deep Reinforcement Learning and Convex Mean-Variance Optimisation for Portfolio Management
Ruan Pretorius
Terence L van Zyl
AI4TS
11
3
0
13 Feb 2022
Provable Reinforcement Learning with a Short-Term Memory
Provable Reinforcement Learning with a Short-Term Memory
Yonathan Efroni
Chi Jin
A. Krishnamurthy
Sobhan Miryoosefi
OffRL
8
37
0
08 Feb 2022
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep
  RL in Large Networked Systems
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Miguel Suau
Jinke He
M. Spaan
F. Oliehoek
22
4
0
03 Feb 2022
FCMNet: Full Communication Memory Net for Team-Level Cooperation in
  Multi-Agent Systems
FCMNet: Full Communication Memory Net for Team-Level Cooperation in Multi-Agent Systems
Yutong Wang
Guillaume Sartoretti
25
8
0
28 Jan 2022
Planning in Observable POMDPs in Quasipolynomial Time
Planning in Observable POMDPs in Quasipolynomial Time
Noah Golowich
Ankur Moitra
Dhruv Rohatgi
24
27
0
12 Jan 2022
Learning Reward Machines: A Study in Partially Observable Reinforcement
  Learning
Learning Reward Machines: A Study in Partially Observable Reinforcement Learning
Rodrigo Toro Icarte
Ethan Waldie
Toryn Q. Klassen
Richard Valenzano
Margarita P. Castro
Sheila A. McIlraith
11
13
0
17 Dec 2021
Learning to Share in Multi-Agent Reinforcement Learning
Learning to Share in Multi-Agent Reinforcement Learning
Yuxuan Yi
G. Li
Yaowei Wang
Zongqing Lu
20
13
0
16 Dec 2021
Scientific Discovery and the Cost of Measurement -- Balancing
  Information and Cost in Reinforcement Learning
Scientific Discovery and the Cost of Measurement -- Balancing Information and Cost in Reinforcement Learning
C. Bellinger
Andriy Drozdyuk
Mark Crowley
Isaac Tamblyn
OffRL
16
7
0
14 Dec 2021
Blockwise Sequential Model Learning for Partially Observable
  Reinforcement Learning
Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning
Giseung Park
Sungho Choi
Y. Sung
OffRL
26
3
0
10 Dec 2021
Cooperative Multi-Agent Reinforcement Learning with Hypergraph
  Convolution
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution
Yunru Bai
Chen Gong
Bin Zhang
Guoliang Fan
Xinwen Hou
Yu Liu
18
6
0
09 Dec 2021
Reinforcement Learning for Navigation of Mobile Robot with LiDAR
Reinforcement Learning for Navigation of Mobile Robot with LiDAR
Inhwan Kim
S. Nengroo
Dongsoo Har
21
13
0
06 Dec 2021
Towards Personalization of User Preferences in Partially Observable
  Smart Home Environments
Towards Personalization of User Preferences in Partially Observable Smart Home Environments
Shashi Suman
F. Rivest
Ali Etemad
16
4
0
02 Dec 2021
MAMRL: Exploiting Multi-agent Meta Reinforcement Learning in WAN Traffic
  Engineering
MAMRL: Exploiting Multi-agent Meta Reinforcement Learning in WAN Traffic Engineering
Shan Sun
M. Kiran
Wei Ren
22
2
0
30 Nov 2021
Inducing Functions through Reinforcement Learning without Task
  Specification
Inducing Functions through Reinforcement Learning without Task Specification
Junmo Cho
Dong-hwan Lee
Young-Gyu Yoon
15
2
0
23 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions'
  Sets
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
11
1
0
12 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
31
1
0
11 Nov 2021
HARPO: Learning to Subvert Online Behavioral Advertising
HARPO: Learning to Subvert Online Behavioral Advertising
Jiang Zhang
Konstantinos Psounis
Muhammad Haroon
Zubair Shafiq
PICV
25
8
0
09 Nov 2021
Time Discretization-Invariant Safe Action Repetition for Policy Gradient
  Methods
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Seohong Park
Jaekyeom Kim
Gunhee Kim
28
23
0
06 Nov 2021
Cross Modality 3D Navigation Using Reinforcement Learning and Neural
  Style Transfer
Cross Modality 3D Navigation Using Reinforcement Learning and Neural Style Transfer
Cesare Magnetti
Hadrien Reynaud
Bernhard Kainz
MedIm
8
0
0
05 Nov 2021
Autonomous Attack Mitigation for Industrial Control Systems
Autonomous Attack Mitigation for Industrial Control Systems
John Mern
Kyle Hatch
Ryan Silva
Cameron Hickert
Tamim I. Sookoor
Mykel J. Kochenderfer
AAML
11
7
0
03 Nov 2021
Previous
123456...111213
Next