arXiv: 1910.13406

Generalization of Reinforcement Learners with Working and Episodic Memory

29 October 2019
Meire Fortunato
Melissa Tan
Ryan Faulkner
Steven Hansen
Adria Puigdomenech Badia
Gavin Buttimore
Charlie Deck
Joel Z Leibo
Charles Blundell

Papers citing "Generalization of Reinforcement Learners with Working and Episodic Memory"

44 papers shown
Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments
Dolton Fernandes
Pramod Kaushik
Harsh Shukla
Bapi Raju Surampudi
21
0
0
08 Apr 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
78
1
0
20 Feb 2025
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Egor Cherepanov
Nikita Kachaev
A. Kovalev
Aleksandr I. Panov
OffRL
41
0
0
14 Feb 2025
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
Linfeng Zhao
Lawson L. S. Wong
82
1
0
16 Dec 2024
Treating Brain-inspired Memories as Priors for Diffusion Model to Forecast Multivariate Time Series
Muyao Wang
Wenchao Chen
Zhibin Duan
Bo Chen
AI4TS
DiffM
39
0
0
27 Sep 2024
SplAgger: Split Aggregation for Meta-Reinforcement Learning
Jacob Beck
Matthew Jackson
Risto Vuorio
Zheng Xiong
Shimon Whiteson
OffRL
29
2
0
05 Mar 2024
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents
Marco Pleines
Matthias Pallasch
Frank Zimmer
Mike Preuss
OffRL
29
0
0
29 Sep 2023
Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Jacob Beck
Risto Vuorio
Zheng Xiong
Shimon Whiteson
45
9
0
26 Sep 2023
Grid Cell-Inspired Fragmentation and Recall for Efficient Map Building
Jaedong Hwang
Zhang-Wei Hong
Eric Chen
Akhilan Boopathy
Pulkit Agrawal
Ila Fiete
22
1
0
11 Jul 2023
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Tianwei Ni
Michel Ma
Benjamin Eysenbach
Pierre-Luc Bacon
OffRL
26
34
0
07 Jul 2023
Facing Off World Model Backbones: RNNs, Transformers, and S4
Fei Deng
Junyeong Park
Sungjin Ahn
32
24
0
05 Jul 2023
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Fabian Paischer
Thomas Adler
M. Hofmarcher
Sepp Hochreiter
29
10
0
15 Jun 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
124
0
19 Jan 2023
Evaluating Long-Term Memory in 3D Mazes
J. Pašukonis
Timothy Lillicrap
Danijar Hafner
3DV
21
21
0
24 Oct 2022
DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body
A. Chiappa
Alessandro Marin Vargas
Alexander Mathis
34
7
0
28 Sep 2022
Memory-Augmented Graph Neural Networks: A Brain-Inspired Review
Guixiang Ma
Vy A. Vo
Ted Willke
Nesreen Ahmed
35
1
0
22 Sep 2022
Extended Intelligence
D. Barack
Andrew Jaegle
33
5
0
15 Sep 2022
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Shunyu Yao
Howard Chen
John Yang
Karthik R. Narasimhan
LLMAG
LM&Ro
43
445
0
04 Jul 2022
Transformers are Meta-Reinforcement Learners
Luckeciano C. Melo
OffRL
41
50
0
14 Jun 2022
On Neural Architecture Inductive Biases for Relational Tasks
Giancarlo Kerg
Sarthak Mittal
David Rolnick
Yoshua Bengio
Blake A. Richards
Guillaume Lajoie
OOD
23
25
0
09 Jun 2022
Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization
Marco Pleines
Matthias Pallasch
F. Zimmer
Mike Preuss
26
13
0
23 May 2022
Modeling Human Behavior Part I -- Learning and Belief Approaches
Andrew Fuchs
A. Passarella
M. Conti
38
7
0
13 May 2022
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
32
53
0
17 Feb 2022
Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning
Wenjie Shi
Gao Huang
Shiji Song
Cheng Wu
34
9
0
06 Dec 2021
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
Robert Kirk
Amy Zhang
Edward Grefenstette
Tim Rocktäschel
OffRL
17
157
0
18 Nov 2021
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Hung Le
Thommen George Karimpanal
Majid Abdolshah
T. Tran
Svetha Venkatesh
30
19
0
03 Nov 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duéñez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
58
103
0
14 Jul 2021
CoBERL: Contrastive BERT for Reinforcement Learning
Andrea Banino
Adria Puigdomenech Badia
Jacob Walker
Tim Scholtes
Jovana Mitrović
Charles Blundell
OffRL
32
36
0
12 Jul 2021
Differentiable Architecture Search for Reinforcement Learning
Yingjie Miao
Xingyou Song
John D. Co-Reyes
Daiyi Peng
Summer Yue
E. Brevdo
Aleksandra Faust
20
4
0
04 Jun 2021
Towards mental time travel: a hierarchical memory for reinforcement learning agents
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Andrea Banino
Felix Hill
24
47
0
28 May 2021
Synthetic Returns for Long-Term Credit Assignment
David Raposo
Samuel Ritter
Adam Santoro
Greg Wayne
T. Weber
M. Botvinick
H. V. Hasselt
Francis Song
AI4TS
21
34
0
24 Feb 2021
Emergent Symbols through Binding in External Memory
Taylor Webb
I. Sinha
Jonathan Cohen
67
65
0
29 Dec 2020
Using Unity to Help Solve Intelligence
Tom Ward
Andrew Bolt
Nik Hemmings
Simon Carter
Manuel Sanchez
...
Jay Lemmon
J. Coe
Piotr Trochim
T. Handley
Adrian Bolton
19
18
0
18 Nov 2020
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online Representation Learning
Banafsheh Rafiee
Zaheer Abbas
Sina Ghiassian
Raksha Kumaraswamy
R. Sutton
Elliot A. Ludvig
Adam White
OffRL
19
17
0
09 Nov 2020
Learning to Learn Variational Semantic Memory
Xiantong Zhen
Yingjun Du
Huan Xiong
Qiang Qiu
Cees G. M. Snoek
Ling Shao
SSL
BDL
VLM
DRL
15
34
0
20 Oct 2020
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban
Peter Karkus
M. Berk Mirza
A. Guez
Andrew Jaegle
Timothy Lillicrap
Lars Buesing
N. Heess
T. Weber
OffRL
20
8
0
03 Oct 2020
Grounded Language Learning Fast and Slow
Felix Hill
O. Tieleman
Tamara von Glehn
Nathaniel Wong
Hamza Merzic
S. Clark
LM&Ro
29
77
0
03 Sep 2020
Online Spatio-Temporal Learning in Deep Neural Networks
Thomas Bohnstingl
Stanislaw Woźniak
Wolfgang Maass
A. Pantazi
E. Eleftheriou
29
43
0
24 Jul 2020
Do Transformers Need Deep Long-Range Memory
Jack W. Rae
Ali Razavi
RALM
17
38
0
07 Jul 2020
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
33
47
0
30 Jun 2020
Rapid Task-Solving in Novel Environments
Samuel Ritter
Ryan Faulkner
Laurent Sartran
Adam Santoro
M. Botvinick
David Raposo
16
29
0
05 Jun 2020
Obstacle Tower Without Human Demonstrations: How Far a Deep Feed-Forward Network Goes with Reinforcement Learning
Marco Pleines
J. Jitsev
Mike Preuss
Frank Zimmer
25
2
0
01 Apr 2020
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
29
510
0
30 Mar 2020
Product Kanerva Machines: Factorized Bayesian Memory
Adam H. Marblestone
Yunsheng Wu
Greg Wayne
19
9
0
06 Feb 2020