ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.03218
  4. Cited By
Learning Efficient Algorithms with Hierarchical Attentive Memory

Learning Efficient Algorithms with Hierarchical Attentive Memory

9 February 2016
Marcin Andrychowicz
Karol Kurach
ArXivPDFHTML

Papers citing "Learning Efficient Algorithms with Hierarchical Attentive Memory"

10 / 10 papers shown
Title
SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers
SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers
A. Deshpande
Md Arafat Sultan
Anthony Ferritto
A. Kalyan
Karthik Narasimhan
Avirup Sil
MoE
33
1
0
29 Nov 2022
Progress Extrapolating Algorithmic Learning to Arbitrary Sequence
  Lengths
Progress Extrapolating Algorithmic Learning to Arbitrary Sequence Lengths
Andreas Robinson
34
0
0
18 Mar 2020
Grammar Filtering For Syntax-Guided Synthesis
Grammar Filtering For Syntax-Guided Synthesis
K. Morton
William T. Hallahan
Elven Shum
R. Piskac
Mark Santolucito
21
10
0
07 Feb 2020
Towards Neural Theorem Proving at Scale
Towards Neural Theorem Proving at Scale
Pasquale Minervini
Matko Bosnjak
Tim Rocktaschel
Sebastian Riedel
LRM
NAI
10
38
0
21 Jul 2018
Learning Explanatory Rules from Noisy Data
Learning Explanatory Rules from Noisy Data
Richard Evans
Edward Grefenstette
45
478
0
13 Nov 2017
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
27
255
0
03 Apr 2017
Improving the Neural GPU Architecture for Algorithm Learning
Improving the Neural GPU Architecture for Algorithm Learning
Kārlis Freivalds
Renars Liepins
16
43
0
28 Feb 2017
Divide and Conquer Networks
Divide and Conquer Networks
Alex W. Nowak
David Folqué
Joan Bruna
24
20
0
08 Nov 2016
Deep Multi-Task Learning with Shared Memory
Deep Multi-Task Learning with Shared Memory
Pengfei Liu
Xipeng Qiu
Xuanjing Huang
26
48
0
23 Sep 2016
TerpreT: A Probabilistic Programming Language for Program Induction
TerpreT: A Probabilistic Programming Language for Program Induction
Alexander L. Gaunt
Marc Brockschmidt
Rishabh Singh
Nate Kushman
Pushmeet Kohli
Jonathan Taylor
Daniel Tarlow
17
123
0
15 Aug 2016
1