ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.08228
  4. Cited By
Neural GPUs Learn Algorithms

Neural GPUs Learn Algorithms

25 November 2015
Lukasz Kaiser
Ilya Sutskever
ArXivPDFHTML

Papers citing "Neural GPUs Learn Algorithms"

50 / 80 papers shown
Title
Distributional Scaling Laws for Emergent Capabilities
Distributional Scaling Laws for Emergent Capabilities
Rosie Zhao
Tian Qin
David Alvarez-Melis
Sham Kakade
Naomi Saphra
LRM
39
1
0
24 Feb 2025
TCNet: Continuous Sign Language Recognition from Trajectories and
  Correlated Regions
TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions
Hui Lu
A. A. Salah
Ronald Poppe
SLR
32
5
0
18 Mar 2024
The Expected Loss of Preconditioned Langevin Dynamics Reveals the
  Hessian Rank
The Expected Loss of Preconditioned Langevin Dynamics Reveals the Hessian Rank
Amitay Bar
Rotem Mulayoff
T. Michaeli
Ronen Talmon
64
0
0
21 Feb 2024
Investigating Recurrent Transformers with Dynamic Halt
Investigating Recurrent Transformers with Dynamic Halt
Jishnu Ray Chowdhury
Cornelia Caragea
41
1
0
01 Feb 2024
Neural Algorithmic Reasoning Without Intermediate Supervision
Neural Algorithmic Reasoning Without Intermediate Supervision
Gleb Rodionov
Liudmila Prokhorenkova
OffRL
LRM
OOD
31
10
0
23 Jun 2023
SALSA VERDE: a machine learning attack on Learning With Errors with
  sparse small secrets
SALSA VERDE: a machine learning attack on Learning With Errors with sparse small secrets
Cathy Li
Emily Wenger
Zeyuan Allen-Zhu
François Charton
Kristin E. Lauter
AAML
30
10
0
20 Jun 2023
Neural Machine Translation for Code Generation
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
32
4
0
22 May 2023
Learning to solve arithmetic problems with a virtual abacus
Learning to solve arithmetic problems with a virtual abacus
Flavio Petruzzellis
Ling-Hao Chen
Alberto Testolin
34
1
0
17 Jan 2023
Logical Tasks for Measuring Extrapolation and Rule Comprehension
Logical Tasks for Measuring Extrapolation and Rule Comprehension
Ippei Fujisawa
Ryota Kanai
ELM
LRM
28
4
0
14 Nov 2022
Recurrent Convolutional Neural Networks Learn Succinct Learning
  Algorithms
Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Surbhi Goel
Sham Kakade
Adam Tauman Kalai
Cyril Zhang
34
1
0
01 Sep 2022
Exploring Length Generalization in Large Language Models
Exploring Length Generalization in Large Language Models
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLM
LRM
33
158
0
11 Jul 2022
Transformers discover an elementary calculation system exploiting local
  attention and grid-like problem representation
Transformers discover an elementary calculation system exploiting local attention and grid-like problem representation
Samuel Cognolato
Alberto Testolin
42
7
0
06 Jul 2022
Neural Networks and the Chomsky Hierarchy
Neural Networks and the Chomsky Hierarchy
Grégoire Delétang
Anian Ruoss
Jordi Grau-Moya
Tim Genewein
L. Wenliang
...
Chris Cundy
Marcus Hutter
Shane Legg
Joel Veness
Pedro A. Ortega
UQCV
107
131
0
05 Jul 2022
The CLRS Algorithmic Reasoning Benchmark
The CLRS Algorithmic Reasoning Benchmark
Petar Velivcković
Adria Puigdomenech Badia
David Budden
Razvan Pascanu
Andrea Banino
Mikhail Dashevskiy
R. Hadsell
Charles Blundell
163
88
0
31 May 2022
Highly Accurate FMRI ADHD Classification using time distributed multi
  modal 3D CNNs
Highly Accurate FMRI ADHD Classification using time distributed multi modal 3D CNNs
Christopher Sims
MedIm
19
3
0
24 May 2022
A Probabilistic Interpretation of Transformers
A Probabilistic Interpretation of Transformers
Alexander Shim
41
1
0
28 Apr 2022
HyperNCA: Growing Developmental Networks with Neural Cellular Automata
HyperNCA: Growing Developmental Networks with Neural Cellular Automata
Elias Najarro
Shyam Sudhakaran
Claire Glanois
S. Risi
34
14
0
25 Apr 2022
Static Prediction of Runtime Errors by Learning to Execute Programs with
  External Resource Descriptions
Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions
David Bieber
Rishab Goel
Daniel Zheng
Hugo Larochelle
Daniel Tarlow
28
15
0
07 Mar 2022
End-to-end Algorithm Synthesis with Recurrent Networks: Logical
  Extrapolation Without Overthinking
End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking
Arpit Bansal
Avi Schwarzschild
Eitan Borgnia
Z. Emam
Furong Huang
Micah Goldblum
Tom Goldstein
LRM
11
24
0
11 Feb 2022
Deep Symbolic Regression for Recurrent Sequences
Deep Symbolic Regression for Recurrent Sequences
Stéphane dÁscoli
Pierre-Alexandre Kamienny
Guillaume Lample
Franccois Charton
47
54
0
12 Jan 2022
Linear algebra with transformers
Linear algebra with transformers
Franccois Charton
AIMat
29
56
0
03 Dec 2021
Show Your Work: Scratchpads for Intermediate Computation with Language
  Models
Show Your Work: Scratchpads for Intermediate Computation with Language Models
Maxwell Nye
Anders Andreassen
Guy Gur-Ari
Henryk Michalewski
Jacob Austin
...
Aitor Lewkowycz
Maarten Bosma
D. Luan
Charles Sutton
Augustus Odena
ReLM
LRM
62
705
0
30 Nov 2021
Gradients are Not All You Need
Gradients are Not All You Need
Luke Metz
C. Freeman
S. Schoenholz
Tal Kachman
30
93
0
10 Nov 2021
State-Space Constraints Improve the Generalization of the Differentiable
  Neural Computer in some Algorithmic Tasks
State-Space Constraints Improve the Generalization of the Differentiable Neural Computer in some Algorithmic Tasks
P. Ofner
Roman Kern
30
1
0
18 Oct 2021
Learning to Synthesize Programs as Interpretable and Generalizable
  Policies
Learning to Synthesize Programs as Interpretable and Generalizable Policies
Dweep Trivedi
Jesse Zhang
Shao-Hua Sun
Joseph J. Lim
NAI
24
72
0
31 Aug 2021
The Devil is in the Detail: Simple Tricks Improve Systematic
  Generalization of Transformers
The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers
Róbert Csordás
Kazuki Irie
Jürgen Schmidhuber
ViT
30
128
0
26 Aug 2021
Evaluating Large Language Models Trained on Code
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM
ALM
83
5,082
0
07 Jul 2021
Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with
  Recurrent Networks
Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks
Avi Schwarzschild
Eitan Borgnia
Arjun Gupta
Furong Huang
U. Vishkin
Micah Goldblum
Tom Goldstein
24
73
0
08 Jun 2021
Neural Algorithmic Reasoning
Neural Algorithmic Reasoning
Petar Velickovic
Charles Blundell
NAI
OOD
25
99
0
06 May 2021
CLVSA: A Convolutional LSTM Based Variational Sequence-to-Sequence Model
  with Attention for Predicting Trends of Financial Markets
CLVSA: A Convolutional LSTM Based Variational Sequence-to-Sequence Model with Attention for Predicting Trends of Financial Markets
Jia Wang
Tong Sun
Benyuan Liu
Yu Cao
Hongwei Zhu
AI4TS
25
63
0
08 Apr 2021
Investigating the Limitations of Transformers with Simple Arithmetic
  Tasks
Investigating the Limitations of Transformers with Simple Arithmetic Tasks
Rodrigo Nogueira
Zhiying Jiang
Jimmy J. Li
LRM
24
123
0
25 Feb 2021
Combinatorial optimization and reasoning with graph neural networks
Combinatorial optimization and reasoning with graph neural networks
Quentin Cappart
Didier Chételat
Elias Boutros Khalil
Andrea Lodi
Christopher Morris
Petar Velickovic
AI4CE
32
347
0
18 Feb 2021
Neural Sequence-to-grid Module for Learning Symbolic Rules
Neural Sequence-to-grid Module for Learning Symbolic Rules
Segwang Kim
Hyoungwook Nam
Joonyoung Kim
Kyomin Jung
NAI
72
11
0
13 Jan 2021
Learning to Execute Programs with Instruction Pointer Attention Graph
  Neural Networks
Learning to Execute Programs with Instruction Pointer Attention Graph Neural Networks
David Bieber
Charles Sutton
Hugo Larochelle
Daniel Tarlow
GNN
24
43
0
23 Oct 2020
It's Hard for Neural Networks To Learn the Game of Life
It's Hard for Neural Networks To Learn the Game of Life
Jacob Mitchell Springer
Garrett Kenyon
24
21
0
03 Sep 2020
Compositional Generalization in Semantic Parsing: Pre-training vs.
  Specialized Architectures
Compositional Generalization in Semantic Parsing: Pre-training vs. Specialized Architectures
Daniel Furrer
Marc van Zee
Nathan Scales
Nathanael Scharli
CoGe
26
113
0
17 Jul 2020
Learning Reasoning Strategies in End-to-End Differentiable Proving
Learning Reasoning Strategies in End-to-End Differentiable Proving
Pasquale Minervini
Sebastian Riedel
Pontus Stenetorp
Edward Grefenstette
Tim Rocktaschel
LRM
45
96
0
13 Jul 2020
Hierarchically Compositional Tasks and Deep Convolutional Networks
Hierarchically Compositional Tasks and Deep Convolutional Networks
Arturo Deza
Q. Liao
Andrzej Banburski
T. Poggio
BDL
OOD
27
2
0
24 Jun 2020
Neural Power Units
Neural Power Units
Niklas Heim
Tomás Pevný
Václav Smídl
29
9
0
02 Jun 2020
Progress Extrapolating Algorithmic Learning to Arbitrary Sequence
  Lengths
Progress Extrapolating Algorithmic Learning to Arbitrary Sequence Lengths
Andreas Robinson
42
0
0
18 Mar 2020
It's Not What Machines Can Learn, It's What We Cannot Teach
It's Not What Machines Can Learn, It's What We Cannot Teach
Gal Yehuda
Moshe Gabel
Assaf Schuster
FaML
14
37
0
21 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
22
138
0
18 Feb 2020
Grammar Filtering For Syntax-Guided Synthesis
Grammar Filtering For Syntax-Guided Synthesis
K. Morton
William T. Hallahan
Elven Shum
R. Piskac
Mark Santolucito
31
10
0
07 Feb 2020
Differentiable Reasoning on Large Knowledge Bases and Natural Language
Differentiable Reasoning on Large Knowledge Bases and Natural Language
Pasquale Minervini
Matko Bovsnjak
Tim Rocktaschel
Sebastian Riedel
Edward Grefenstette
LRM
18
88
0
17 Dec 2019
Synthetic Data for Deep Learning
Synthetic Data for Deep Learning
Sergey I. Nikolenko
46
348
0
25 Sep 2019
On Defending Against Label Flipping Attacks on Malware Detection Systems
On Defending Against Label Flipping Attacks on Malware Detection Systems
R. Taheri
R. Javidan
Mohammad Shojafar
Zahra Pooranian
A. Miri
Mauro Conti
AAML
21
88
0
13 Aug 2019
Towards Finding Longer Proofs
Towards Finding Longer Proofs
Zsolt Zombori
Adrián Csiszárik
Henryk Michalewski
C. Kaliszyk
Josef Urban
OffRL
LRM
29
15
0
30 May 2019
Learning Compositional Neural Programs with Recursive Tree Search and
  Planning
Learning Compositional Neural Programs with Recursive Tree Search and Planning
Thomas Pierrot
Guillaume Ligner
Scott E. Reed
Olivier Sigaud
Nicolas Perrin
Alexandre Laterre
David Kas
Karim Beguir
Nando de Freitas
41
41
0
30 May 2019
A Learned Representation for Scalable Vector Graphics
A Learned Representation for Scalable Vector Graphics
Raphael Gontijo-Lopes
David R Ha
Douglas Eck
Jonathon Shlens
GAN
OCL
30
113
0
04 Apr 2019
Analysing Mathematical Reasoning Abilities of Neural Models
Analysing Mathematical Reasoning Abilities of Neural Models
D. Saxton
Edward Grefenstette
Felix Hill
Pushmeet Kohli
LRM
24
414
0
02 Apr 2019
12
Next