Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1503.01007
Cited By
Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets
3 March 2015
Armand Joulin
Tomas Mikolov
TPM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets"
50 / 232 papers shown
Title
Emergent Stack Representations in Modeling Counter Languages Using Transformers
Utkarsh Tiwari
Aviral Gupta
Michael Hahn
244
0
0
03 Feb 2025
Compositional Generalization Across Distributional Shifts with Sparse Tree Operations
Paul Soulos
Henry Conklin
Mattia Opper
P. Smolensky
Jianfeng Gao
Roland Fernandez
81
4
0
18 Dec 2024
Exploring Learnability in Memory-Augmented Recurrent Neural Networks: Precision, Stability, and Empirical Insights
Shrabon Das
Ankur Mali
27
0
0
04 Oct 2024
Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models
Yida Zhao
Chao Lou
Kewei Tu
56
0
0
24 Jul 2024
Learning Program Behavioral Models from Synthesized Input-Output Pairs
Tural Mammadov
Dietrich Klakow
Alexander Koller
Andreas Zeller
45
3
0
11 Jul 2024
Understanding Transformer Reasoning Capabilities via Graph Algorithms
Clayton Sanford
Bahare Fatemi
Ethan Hall
Anton Tsitsulin
Seyed Mehran Kazemi
Jonathan J. Halcrow
Bryan Perozzi
Vahab Mirrokni
46
31
0
28 May 2024
Thinking Tokens for Language Modeling
David Herel
Tomas Mikolov
LRM
35
2
0
14 May 2024
Memory Mosaics
Jianyu Zhang
Niklas Nolte
Ranajoy Sadhukhan
Beidi Chen
Léon Bottou
VLM
73
3
0
10 May 2024
A Transformer with Stack Attention
Jiaoda Li
Jennifer C. White
Mrinmaya Sachan
Ryan Cotterell
30
2
0
07 May 2024
On the Markov Property of Neural Algorithmic Reasoning: Analyses and Methods
Montgomery Bohde
Meng Liu
Alexandra Saxton
Shuiwang Ji
OOD
40
8
0
07 Mar 2024
MemoryPrompt: A Light Wrapper to Improve Context Tracking in Pre-trained Language Models
Nathanaël Carraz Rakotonirina
Marco Baroni
VLM
KELM
35
0
0
23 Feb 2024
Neuro-mimetic Task-free Unsupervised Online Learning with Continual Self-Organizing Maps
Hitesh U. Vaidya
Travis J. Desell
A. Mali
Alexander Ororbia
CLL
39
2
0
19 Feb 2024
Bridging the Empirical-Theoretical Gap in Neural Network Formal Language Learning Using Minimum Description Length
Nur Lan
Emmanuel Chemla
Roni Katzir
20
3
0
15 Feb 2024
Learning Universal Predictors
Jordi Grau-Moya
Tim Genewein
Marcus Hutter
Laurent Orseau
Grégoire Delétang
...
Anian Ruoss
Wenliang Kevin Li
Christopher Mattern
Matthew Aitchison
J. Veness
34
12
0
26 Jan 2024
Style Locality for Controllable Generation with kNN Language Models
Gilles Nawezi
Lucie Flek
Charles F Welch
RALM
24
0
0
01 Nov 2023
Pushdown Layers: Encoding Recursive Structure in Transformer Language Models
Shikhar Murty
Pratyusha Sharma
Jacob Andreas
Christopher D. Manning
AI4CE
49
14
0
29 Oct 2023
Practical Computational Power of Linear Transformers and Their Recurrent and Self-Referential Extensions
Kazuki Irie
Róbert Csordás
Jürgen Schmidhuber
36
11
0
24 Oct 2023
Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns
Brian DuSell
David Chiang
28
12
0
03 Oct 2023
A Framework for Inference Inspired by Human Memory Mechanisms
Xiangyu Zeng
Jie Lin
Piao Hu
Ruizheng Huang
Zhicheng Zhang
30
2
0
01 Oct 2023
Language models in molecular discovery
Chaoqi Wang
Yibo Jiang
Chenghao Yang
Han Liu
Yuxin Chen
32
7
0
28 Sep 2023
On the Computational Complexity and Formal Hierarchy of Second Order Recurrent Neural Networks
A. Mali
Alexander Ororbia
Daniel Kifer
L. Giles
18
8
0
26 Sep 2023
Benchmarking Neural Network Generalization for Grammar Induction
Nur Lan
Emmanuel Chemla
Roni Katzir
ELM
10
4
0
16 Aug 2023
Neural Priority Queues for Graph Neural Networks
Rishabh Jain
Petar Velivcković
Pietro Lio
GNN
32
5
0
18 Jul 2023
Recursive Algorithmic Reasoning
Jonas Jürß
Dulhan Jayalath
Petar Velickovic
38
7
0
01 Jul 2023
Length Generalization in Arithmetic Transformers
Samy Jelassi
Stéphane dÁscoli
Carles Domingo-Enrich
Yuhuai Wu
Yuan-Fang Li
Franccois Charton
30
38
0
27 Jun 2023
A Comprehensive Review of State-of-The-Art Methods for Java Code Generation from Natural Language Text
Jessica Nayeli López Espejel
Mahaman Sanoussi Yahaya Alassan
El Mehdi Chouham
Walid Dahhane
E. Ettifouri
28
13
0
10 Jun 2023
Birth of a Transformer: A Memory Viewpoint
A. Bietti
Vivien A. Cabannes
Diane Bouchacourt
Hervé Jégou
Léon Bottou
35
85
0
01 Jun 2023
Differentiable Tree Operations Promote Compositional Generalization
Paul Soulos
J. E. Hu
Kate McCurdy
Yunmo Chen
Roland Fernandez
P. Smolensky
Jianfeng Gao
AI4CE
27
7
0
01 Jun 2023
Scaling Transformer to 1M tokens and beyond with RMT
Aydar Bulatov
Yuri Kuratov
Yermek Kapushev
Andrey Kravchenko
LRM
27
87
0
19 Apr 2023
Theoretical Conditions and Empirical Failure of Bracket Counting on Long Sequences with Linear Recurrent Networks
Nadine El-Naggar
Pranava Madhyastha
Tillman Weyde
22
1
0
07 Apr 2023
Neural Attention Memory
Hyoungwook Nam
S. Seo
HAI
27
0
0
18 Feb 2023
Memory-Based Meta-Learning on Non-Stationary Distributions
Tim Genewein
Grégoire Delétang
Anian Ruoss
L. Wenliang
Elliot Catt
Vincent Dutordoir
Jordi Grau-Moya
Laurent Orseau
Marcus Hutter
J. Veness
BDL
24
11
0
06 Feb 2023
Scalable Adaptive Computation for Iterative Generation
Allan Jabri
David Fleet
Ting-Li Chen
DiffM
35
107
0
22 Dec 2022
The Surprising Computational Power of Nondeterministic Stack RNNs
Brian DuSell
David Chiang
LRM
39
4
0
04 Oct 2022
Benchmarking Learning Efficiency in Deep Reservoir Computing
Hugo Cisneros
Josef Sivic
Tomas Mikolov
14
2
0
29 Sep 2022
Induced Natural Language Rationales and Interleaved Markup Tokens Enable Extrapolation in Large Language Models
M. Bueno
Carlos Gemmel
Jeffrey Stephen Dalton
R. Lotufo
Rodrigo Nogueira
LRM
47
12
0
24 Aug 2022
Recurrent Memory Transformer
Aydar Bulatov
Yuri Kuratov
Andrey Kravchenko
CLL
13
103
0
14 Jul 2022
Neural Networks and the Chomsky Hierarchy
Grégoire Delétang
Anian Ruoss
Jordi Grau-Moya
Tim Genewein
L. Wenliang
...
Chris Cundy
Marcus Hutter
Shane Legg
Joel Veness
Pedro A. Ortega
UQCV
109
131
0
05 Jul 2022
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
Guosheng Lin
SyDa
ALM
135
243
0
05 Jul 2022
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis
Yushi Cao
Zhiming Li
Tianpei Yang
Hao Zhang
Yan Zheng
Yi Li
Jianye Hao
Yang Liu
NAI
38
16
0
27 May 2022
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
Denny Zhou
Nathanael Scharli
Le Hou
Jason W. Wei
Nathan Scales
...
Dale Schuurmans
Claire Cui
Olivier Bousquet
Quoc Le
Ed H. Chi
RALM
LRM
AI4CE
27
1,057
0
21 May 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
28
6
0
11 Apr 2022
Fine-tuning Image Transformers using Learnable Memory
Mark Sandler
A. Zhmoginov
Max Vladymyrov
Andrew Jackson
ViT
34
47
0
29 Mar 2022
UnweaveNet: Unweaving Activity Stories
Will Price
Carl Vondrick
Dima Damen
EgoV
29
13
0
19 Dec 2021
Learning Bounded Context-Free-Grammar via LSTM and the Transformer:Difference and Explanations
Hui Shi
Sicun Gao
Yuandong Tian
Xinyun Chen
Jishen Zhao
13
21
0
16 Dec 2021
A molecular generative model with genetic algorithm and tree search for cancer samples
Sejin Park
Hyunju Lee
21
1
0
16 Dec 2021
Personalized Federated Learning through Local Memorization
Othmane Marfoq
Giovanni Neglia
Laetitia Kameni
Richard Vidal
FedML
43
88
0
17 Nov 2021
Minimum Description Length Recurrent Neural Networks
Nur Lan
Michal Geyer
Emmanuel Chemla
Roni Katzir
21
13
0
31 Oct 2021
State-Space Constraints Improve the Generalization of the Differentiable Neural Computer in some Algorithmic Tasks
P. Ofner
Roman Kern
30
1
0
18 Oct 2021
ABC: Attention with Bounded-memory Control
Hao Peng
Jungo Kasai
Nikolaos Pappas
Dani Yogatama
Zhaofeng Wu
Lingpeng Kong
Roy Schwartz
Noah A. Smith
76
22
0
06 Oct 2021
1
2
3
4
5
Next