Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

2 September 2024
Esraa Elelimy
Adam White
Michael Bowling
Martha White
    OffRL

Papers citing "Real-Time Recurrent Learning using Trace Units in Reinforcement Learning"

33 / 33 papers shown
Towards Scalable and Stable Parallelization of Nonlinear RNNs
Xavier Gonzalez
Andrew Warrington
Jimmy T.H. Smith
Scott W. Linderman
240
11
0
17 Jan 2025
Rethinking Transformers in Solving POMDPs
Chenhao Lu
Ruizhe Shi
Yuyao Liu
Kaizhe Hu
Simon S. Du
Huazhe Xu
AI4CE
84
3
0
27 May 2024
Time-Efficient Reinforcement Learning with Stochastic Stateful Policies
Firas Al-Hafez
Guoping Zhao
Jan Peters
Davide Tateo
OffRL
17
3
0
07 Nov 2023
Parallelizing non-linear sequential models over the sequence length
Yi Heng Lim
Qi Zhu
Joshua Selfridge
M. F. Kasim
72
18
0
21 Sep 2023
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Tianwei Ni
Michel Ma
Benjamin Eysenbach
Pierre-Luc Bacon
OffRL
88
41
0
07 Jul 2023
Exploring the Promise and Limits of Real-Time Recurrent Learning
Kazuki Irie
Anand Gopalakrishnan
Jürgen Schmidhuber
57
16
0
30 May 2023
Online learning of long-range dependencies
Nicolas Zucchet
Robert Meier
Simon Schug
Asier Mujika
João Sacramento
CLL
64
21
0
25 May 2023
Resurrecting Recurrent Neural Networks for Long Sequences
Antonio Orvieto
Samuel L. Smith
Albert Gu
Anushan Fernando
Çağlar Gülçehre
Razvan Pascanu
Soham De
326
297
0
11 Mar 2023
POPGym: Benchmarking Partially Observable Reinforcement Learning
Steven D. Morad
Ryan Kortvelesy
Matteo Bettini
Stephan Liwicki
Amanda Prorok
OffRL
62
40
0
03 Mar 2023
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
77
612
0
10 Jan 2023
Discovered Policy Optimisation
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
72
79
0
11 Oct 2022
Are Transformers Effective for Time Series Forecasting?
Ailing Zeng
Mu-Hwa Chen
L. Zhang
Qiang Xu
AI4TS
154
1,772
0
26 May 2022
A Modern Self-Referential Weight Matrix That Learns to Modify Itself
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
48
28
0
11 Feb 2022
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
67
110
0
11 Oct 2021
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation
C. Freeman
Erik Frey
Anton Raichuk
Sertan Girgin
Igor Mordatch
Olivier Bachem
108
380
0
24 Jun 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
82
62
0
11 Jun 2021
Memory-based Deep Reinforcement Learning for POMDPs
Lingheng Meng
R. Gorbet
Dana Kulic
87
99
0
24 Feb 2021
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online Representation Learning
Banafsheh Rafiee
Zaheer Abbas
Sina Ghiassian
Raksha Kumaraswamy
R. Sutton
Elliot A. Ludvig
Adam White
OffRL
40
17
0
09 Nov 2020
Variational Recurrent Models for Solving Partially Observable Control Tasks
Dongqi Han
Kenji Doya
Jun Tani
DRL, OffRL
60
63
0
23 Dec 2019
Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
91
366
0
13 Oct 2019
A Unified Framework of Online Learning Algorithms for Training Recurrent Neural Networks
O. Marschall
Kyunghyun Cho
Cristina Savin
FedML
73
73
0
05 Jul 2019
On the Variance of Unbiased Online Recurrent Optimization
Tim Cooijmans
James Martens
46
14
0
06 Feb 2019
Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN
Shuai Li
W. Li
Chris Cook
Ce Zhu
Yanbo Gao
86
731
0
13 Mar 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
237
1,605
0
05 Feb 2018
Parallelizing Linear Recurrent Neural Nets Over Sequence Length
Eric Martin
Chris Cundy
76
103
0
12 Sep 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
541
19,265
0
20 Jul 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
786
132,363
0
12 Jun 2017
Unbiased Online Recurrent Optimization
Corentin Tallec
Yann Ollivier
83
99
0
16 Feb 2017
Recurrent Reinforcement Learning: A Hybrid Approach
Xiujun Li
Lihong Li
Jianfeng Gao
Xiaodong He
Jianshu Chen
Li Deng
Ji He
OffRL
64
77
0
10 Sep 2015
Training recurrent networks online without backtracking
Yann Ollivier
Corentin Tallec
Guillaume Charpiat
73
39
0
28 Jul 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
109
1,685
0
23 Jul 2015
Automatic differentiation in machine learning: a survey
A. G. Baydin
Barak A. Pearlmutter
Alexey Radul
J. Siskind
PINN, AI4CE, ODL
172
2,816
0
20 Feb 2015
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE, AIMat
259
6,786
0
03 Sep 2014