Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.21064
Cited By
Recurrent neural networks: vanishing and exploding gradients are not the end of the story
31 May 2024
Nicolas Zucchet
Antonio Orvieto
ODL
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Recurrent neural networks: vanishing and exploding gradients are not the end of the story"
3 / 3 papers shown
Title
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Soham De
Samuel L. Smith
Anushan Fernando
Aleksandar Botev
George-Christian Muraru
...
David Budden
Yee Whye Teh
Razvan Pascanu
Nando de Freitas
Çağlar Gülçehre
Mamba
61
117
0
29 Feb 2024
Theoretical Foundations of Deep Selective State-Space Models
Nicola Muca Cirone
Antonio Orvieto
Benjamin Walker
C. Salvi
Terry Lyons
Mamba
59
25
0
29 Feb 2024
Resurrecting Recurrent Neural Networks for Long Sequences
Antonio Orvieto
Samuel L. Smith
Albert Gu
Anushan Fernando
Çağlar Gülçehre
Razvan Pascanu
Soham De
88
266
0
11 Mar 2023
1