ResearchTrend.AI
Memory-based control with recurrent neural networks
arXiv:1512.04455

14 December 2015
N. Heess
Jonathan J. Hunt
Timothy Lillicrap
David Silver

Papers citing "Memory-based control with recurrent neural networks"

50 / 125 papers shown
LineFlow: A Framework to Learn Active Control of Production Lines
Kai Müller
Martin Wenzel
Tobias Windisch
AI4CE
26
0
0
10 May 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
78
1
0
20 Feb 2025
Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning
Kaixi Bao
Chenhao Li
Yarden As
Andreas Krause
Marco Hutter
OffRL
CLL
119
1
0
03 Feb 2025
Equivariant Reinforcement Learning under Partial Observability
Hai Nguyen
Andrea Baisero
David M. Klee
Dian Wang
Robert Platt
Christopher Amato
42
14
0
26 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
92
3
0
20 Aug 2024
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Cameron Allen
Aaron Kirtland
Ruo Yu Tao
Sam Lobel
Daniel Scott
Nicholas Petrocelli
Omer Gottesman
Ronald E. Parr
M. L. Littman
George Konidaris
28
1
0
10 Jul 2024
Analysis of Off-Policy Multi-Step TD-Learning with Linear Function Approximation
Donghwan Lee
45
0
0
24 Feb 2024
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills
Hongcai He
Anjie Zhu
Shuang Liang
Feiyu Chen
Jie Shao
OffRL
51
4
0
11 Dec 2023
Two-step dynamic obstacle avoidance
Fabian Hart
Martin Waltz
Ostap Okhrin
30
3
0
28 Nov 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity
Joey Hong
Anca Dragan
Sergey Levine
OffRL
24
5
0
31 Oct 2023
Towards Open-World Co-Salient Object Detection with Generative Uncertainty-aware Group Selective Exchange-Masking
Yang Wu
Shenglong Hu
Huihui Song
Kaihua Zhang
Bo Liu
Dong Liu
28
0
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
38
10
0
15 Oct 2023
AdaptNet: Policy Adaptation for Physics-Based Character Control
Pei Xu
Kaixiang Xie
Sheldon Andrews
P. Kry
Michael Neff
Morgan McGuire
Ioannis Karamouzas
Victor Zordan
TTA
37
17
0
30 Sep 2023
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents
Marco Pleines
Matthias Pallasch
Frank Zimmer
Mike Preuss
OffRL
29
0
0
29 Sep 2023
ODE-based Recurrent Model-free Reinforcement Learning for POMDPs
Xu Zhao
Duzhen Zhang
Liyuan Han
Tielin Zhang
Bo Xu
37
7
0
25 Sep 2023
Learning Computational Efficient Bots with Costly Features
Anthony Kobanda
Valliappan C. A.
Joshua Romoff
Ludovic Denoyer
OffRL
27
1
0
18 Aug 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
27
3
0
29 Jul 2023
PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks
I. Char
J. Schneider
26
4
0
12 Jul 2023
RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$
Abhinav Bhatia
Samer B. Nashed
S. Zilberstein
OffRL
25
0
0
28 Jun 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL
Gaspard Lambrechts
Adrien Bolland
D. Ernst
25
7
0
20 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
34
1
0
15 Jun 2023
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach
Dong-hwan Lee
21
2
0
09 Jun 2023
Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments
Junchao Li
Mingyu Cai
Z. Kan
Shaoping Xiao
23
1
0
30 Apr 2023
Optimal Scheduling in IoT-Driven Smart Isolated Microgrids Based on Deep Reinforcement Learning
Jiaju Qi
Lei Lei
Kan Zheng
Simon X. Yang
Xuemin
X. Shen
16
11
0
28 Apr 2023
Observer-Feedback-Feedforward Controller Structures in Reinforcement Learning
Ruoqing Zhang
Per Mattsson
T. Wigren
27
0
0
20 Apr 2023
The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning
Andrea Soltoggio
Eseoghene Ben-Iwhiwhu
Christos Peridis
Pawel Ladosz
Jeffery Dick
Praveen K. Pilly
Soheil Kolouri
OffRL
32
3
0
21 Jan 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Spatial-temporal recurrent reinforcement learning for autonomous ships
Martin Waltz
Ostap Okhrin
24
9
0
02 Nov 2022
Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning
Pihe Hu
L. Pan
Yu Chen
Zhixuan Fang
Longbo Huang
8
4
0
30 Aug 2022
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review
Fadi AlMahamid
Katarina Grolinger
21
73
0
25 Aug 2022
Recurrent networks, hidden states and beliefs in partially observable environments
Gaspard Lambrechts
Adrien Bolland
D. Ernst
19
12
0
06 Aug 2022
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL
AI4CE
30
31
0
13 Jul 2022
Task-Agnostic Continual Reinforcement Learning: Gaining Insights and Overcoming Challenges
Massimo Caccia
Jonas W. Mueller
Taesup Kim
Laurent Charlin
Rasool Fakoor
CLL
32
8
0
28 May 2022
Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization
Marco Pleines
Matthias Pallasch
F. Zimmer
Mike Preuss
26
13
0
23 May 2022
Hierarchical Reinforcement Learning under Mixed Observability
Hai V. Nguyen
Zhihan Yang
Andrea Baisero
Xiao Ma
Robert W. Platt
Chris Amato
25
4
0
02 Apr 2022
Platform Behavior under Market Shocks: A Simulation Framework and Reinforcement-Learning Based Study
Xintong Wang
Gary Qiurui Ma
Alon Eden
Clara Li
Alexander R. Trott
Stephan Zheng
David C. Parkes
40
8
0
25 Mar 2022
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes
M. Mowbray
Dongda Zhang
Ehecatl Antonio del Rio Chanona
OffRL
25
6
0
01 Mar 2022
RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization
Nikolaos Kourtzanidis
Sajad Saeedi
34
2
0
26 Feb 2022
Recursive Least Squares Advantage Actor-Critic Algorithms
Yuan Wang
Chunyuan Zhang
Tianzong Yu
Meng-tao Ma
14
0
0
15 Jan 2022
Missing Velocity in Dynamic Obstacle Avoidance based on Deep Reinforcement Learning
Fabian Hart
Martin Waltz
Ostap Okhrin
13
0
0
23 Dec 2021
Inducing Functions through Reinforcement Learning without Task Specification
Junmo Cho
Dong-hwan Lee
Young-Gyu Yoon
20
2
0
23 Nov 2021
Recurrent Off-policy Baselines for Memory-based Continuous Control
Zhihan Yang
Hai V. Nguyen
CLL
OffRL
18
23
0
25 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
20
103
0
11 Oct 2021
Recurrent Neural Network Controllers Synthesis with Stability Guarantees for Partially Observed Systems
Fangda Gu
He Yin
L. Ghaoui
Murat Arcak
Peter M. Seiler
Ming Jin
9
25
0
08 Sep 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
31
145
0
26 Aug 2021
Self-optimizing adaptive optics control with Reinforcement Learning for high-contrast imaging
Rico Landman
S. Haffert
V. M. Radhakrishnan
C. Keller
12
28
0
24 Aug 2021
Graph Convolutional Memory using Topological Priors
Steven D. Morad
Stephan Liwicki
Ryan Kortvelesy
R. Mecca
Amanda Prorok
20
0
0
27 Jun 2021
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Anas Barakat
Pascal Bianchi
Julien Lehmann
21
9
0
14 Jun 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
66
645
0
03 Jun 2021