Neural GPUs Learn Algorithms

25 November 2015

Papers citing "Neural GPUs Learn Algorithms"

50 / 80 papers shown

Title
Distributional Scaling Laws for Emergent Capabilities Rosie Zhao Tian Qin David Alvarez-Melis Sham Kakade Naomi Saphra LRM 39 1 0 24 Feb 2025
TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions Hui Lu A. A. Salah Ronald Poppe SLR 32 5 0 18 Mar 2024
The Expected Loss of Preconditioned Langevin Dynamics Reveals the Hessian Rank Amitay Bar Rotem Mulayoff T. Michaeli Ronen Talmon 64 0 0 21 Feb 2024
Investigating Recurrent Transformers with Dynamic Halt Jishnu Ray Chowdhury Cornelia Caragea 41 1 0 01 Feb 2024
Neural Algorithmic Reasoning Without Intermediate Supervision Gleb Rodionov Liudmila Prokhorenkova OffRL LRM OOD 31 10 0 23 Jun 2023
SALSA VERDE: a machine learning attack on Learning With Errors with sparse small secrets Cathy Li Emily Wenger Zeyuan Allen-Zhu François Charton Kristin E. Lauter AAML 30 10 0 20 Jun 2023
Neural Machine Translation for Code Generation K. Dharma Clayton T. Morrison 32 4 0 22 May 2023
Learning to solve arithmetic problems with a virtual abacus Flavio Petruzzellis Ling-Hao Chen Alberto Testolin 34 1 0 17 Jan 2023
Logical Tasks for Measuring Extrapolation and Rule Comprehension Ippei Fujisawa Ryota Kanai ELM LRM 28 4 0 14 Nov 2022
Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms Surbhi Goel Sham Kakade Adam Tauman Kalai Cyril Zhang 34 1 0 01 Sep 2022
Exploring Length Generalization in Large Language Models Cem Anil Yuhuai Wu Anders Andreassen Aitor Lewkowycz Vedant Misra V. Ramasesh Ambrose Slone Guy Gur-Ari Ethan Dyer Behnam Neyshabur ReLM LRM 33 158 0 11 Jul 2022
Transformers discover an elementary calculation system exploiting local attention and grid-like problem representation Samuel Cognolato Alberto Testolin 42 7 0 06 Jul 2022
Neural Networks and the Chomsky Hierarchy Grégoire Delétang Anian Ruoss Jordi Grau-Moya Tim Genewein L. Wenliang ... Chris Cundy Marcus Hutter Shane Legg Joel Veness Pedro A. Ortega UQCV 107 131 0 05 Jul 2022
The CLRS Algorithmic Reasoning Benchmark Petar Velivcković Adria Puigdomenech Badia David Budden Razvan Pascanu Andrea Banino Mikhail Dashevskiy R. Hadsell Charles Blundell 163 88 0 31 May 2022
Highly Accurate FMRI ADHD Classification using time distributed multi modal 3D CNNs Christopher Sims MedIm 19 3 0 24 May 2022
A Probabilistic Interpretation of Transformers Alexander Shim 41 1 0 28 Apr 2022
HyperNCA: Growing Developmental Networks with Neural Cellular Automata Elias Najarro Shyam Sudhakaran Claire Glanois S. Risi 34 14 0 25 Apr 2022
Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions David Bieber Rishab Goel Daniel Zheng Hugo Larochelle Daniel Tarlow 28 15 0 07 Mar 2022
End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking Arpit Bansal Avi Schwarzschild Eitan Borgnia Z. Emam Furong Huang Micah Goldblum Tom Goldstein LRM 11 24 0 11 Feb 2022
Deep Symbolic Regression for Recurrent Sequences Stéphane dÁscoli Pierre-Alexandre Kamienny Guillaume Lample Franccois Charton 47 54 0 12 Jan 2022
Linear algebra with transformers Franccois Charton AIMat 29 56 0 03 Dec 2021
Show Your Work: Scratchpads for Intermediate Computation with Language Models Maxwell Nye Anders Andreassen Guy Gur-Ari Henryk Michalewski Jacob Austin ... Aitor Lewkowycz Maarten Bosma D. Luan Charles Sutton Augustus Odena ReLM LRM 62 705 0 30 Nov 2021
Gradients are Not All You Need Luke Metz C. Freeman S. Schoenholz Tal Kachman 30 93 0 10 Nov 2021
State-Space Constraints Improve the Generalization of the Differentiable Neural Computer in some Algorithmic Tasks P. Ofner Roman Kern 30 1 0 18 Oct 2021
Learning to Synthesize Programs as Interpretable and Generalizable Policies Dweep Trivedi Jesse Zhang Shao-Hua Sun Joseph J. Lim NAI 24 72 0 31 Aug 2021
The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers Róbert Csordás Kazuki Irie Jürgen Schmidhuber ViT 30 128 0 26 Aug 2021
Evaluating Large Language Models Trained on Code Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique Pondé ... Bob McGrew Dario Amodei Sam McCandlish Ilya Sutskever Wojciech Zaremba ELM ALM 83 5,082 0 07 Jul 2021
Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks Avi Schwarzschild Eitan Borgnia Arjun Gupta Furong Huang U. Vishkin Micah Goldblum Tom Goldstein 24 73 0 08 Jun 2021
Neural Algorithmic Reasoning Petar Velickovic Charles Blundell NAI OOD 25 99 0 06 May 2021
CLVSA: A Convolutional LSTM Based Variational Sequence-to-Sequence Model with Attention for Predicting Trends of Financial Markets Jia Wang Tong Sun Benyuan Liu Yu Cao Hongwei Zhu AI4TS 25 63 0 08 Apr 2021
Investigating the Limitations of Transformers with Simple Arithmetic Tasks Rodrigo Nogueira Zhiying Jiang Jimmy J. Li LRM 24 123 0 25 Feb 2021
Combinatorial optimization and reasoning with graph neural networks Quentin Cappart Didier Chételat Elias Boutros Khalil Andrea Lodi Christopher Morris Petar Velickovic AI4CE 32 347 0 18 Feb 2021
Neural Sequence-to-grid Module for Learning Symbolic Rules Segwang Kim Hyoungwook Nam Joonyoung Kim Kyomin Jung NAI 72 11 0 13 Jan 2021
Learning to Execute Programs with Instruction Pointer Attention Graph Neural Networks David Bieber Charles Sutton Hugo Larochelle Daniel Tarlow GNN 24 43 0 23 Oct 2020
It's Hard for Neural Networks To Learn the Game of Life Jacob Mitchell Springer Garrett Kenyon 24 21 0 03 Sep 2020
Compositional Generalization in Semantic Parsing: Pre-training vs. Specialized Architectures Daniel Furrer Marc van Zee Nathan Scales Nathanael Scharli CoGe 26 113 0 17 Jul 2020
Learning Reasoning Strategies in End-to-End Differentiable Proving Pasquale Minervini Sebastian Riedel Pontus Stenetorp Edward Grefenstette Tim Rocktaschel LRM 45 96 0 13 Jul 2020
Hierarchically Compositional Tasks and Deep Convolutional Networks Arturo Deza Q. Liao Andrzej Banburski T. Poggio BDL OOD 27 2 0 24 Jun 2020
Neural Power Units Niklas Heim Tomás Pevný Václav Smídl 29 9 0 02 Jun 2020
Progress Extrapolating Algorithmic Learning to Arbitrary Sequence Lengths Andreas Robinson 42 0 0 18 Mar 2020
It's Not What Machines Can Learn, It's What We Cannot Teach Gal Yehuda Moshe Gabel Assaf Schuster FaML 14 37 0 21 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation Shu Yang Yuxin Wang Xiaowen Chu VLM AI4TS AI4CE 22 138 0 18 Feb 2020
Grammar Filtering For Syntax-Guided Synthesis K. Morton William T. Hallahan Elven Shum R. Piskac Mark Santolucito 31 10 0 07 Feb 2020
Differentiable Reasoning on Large Knowledge Bases and Natural Language Pasquale Minervini Matko Bovsnjak Tim Rocktaschel Sebastian Riedel Edward Grefenstette LRM 18 88 0 17 Dec 2019
Synthetic Data for Deep Learning Sergey I. Nikolenko 46 348 0 25 Sep 2019
On Defending Against Label Flipping Attacks on Malware Detection Systems R. Taheri R. Javidan Mohammad Shojafar Zahra Pooranian A. Miri Mauro Conti AAML 21 88 0 13 Aug 2019
Towards Finding Longer Proofs Zsolt Zombori Adrián Csiszárik Henryk Michalewski C. Kaliszyk Josef Urban OffRL LRM 29 15 0 30 May 2019
Learning Compositional Neural Programs with Recursive Tree Search and Planning Thomas Pierrot Guillaume Ligner Scott E. Reed Olivier Sigaud Nicolas Perrin Alexandre Laterre David Kas Karim Beguir Nando de Freitas 41 41 0 30 May 2019
A Learned Representation for Scalable Vector Graphics Raphael Gontijo-Lopes David R Ha Douglas Eck Jonathon Shlens GAN OCL 30 113 0 04 Apr 2019
Analysing Mathematical Reasoning Abilities of Neural Models D. Saxton Edward Grefenstette Felix Hill Pushmeet Kohli LRM 24 414 0 02 Apr 2019