ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.08983
  4. Cited By
Adaptive Computation Time for Recurrent Neural Networks

Adaptive Computation Time for Recurrent Neural Networks

29 March 2016
Alex Graves
ArXivPDFHTML

Papers citing "Adaptive Computation Time for Recurrent Neural Networks"

27 / 27 papers shown
Title
Do Language Models Use Their Depth Efficiently?
Do Language Models Use Their Depth Efficiently?
Róbert Csordás
Christopher D. Manning
Christopher Potts
84
0
0
20 May 2025
Int2Int: a framework for mathematics with transformers
Int2Int: a framework for mathematics with transformers
François Charton
ViT
103
0
0
22 Feb 2025
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Sangmin Bae
Adam Fisch
Hrayr Harutyunyan
Ziwei Ji
Seungyeon Kim
Tal Schuster
KELM
93
6
0
28 Oct 2024
Investigating Recurrent Transformers with Dynamic Halt
Investigating Recurrent Transformers with Dynamic Halt
Jishnu Ray Chowdhury
Cornelia Caragea
63
1
0
01 Feb 2024
Fast-Slow Recurrent Neural Networks
Fast-Slow Recurrent Neural Networks
Asier Mujika
Florian Meier
Angelika Steger
65
76
0
24 May 2017
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models
S. M. Ali Eslami
N. Heess
T. Weber
Yuval Tassa
David Szepesvari
Koray Kavukcuoglu
Geoffrey E. Hinton
3DV
BDL
OCL
97
550
0
28 Mar 2016
Order Matters: Sequence to sequence for sets
Order Matters: Sequence to sequence for sets
Oriol Vinyals
Samy Bengio
M. Kudlur
BDL
119
950
0
19 Nov 2015
Conditional Computation in Neural Networks for faster models
Conditional Computation in Neural Networks for faster models
Emmanuel Bengio
Pierre-Luc Bacon
Joelle Pineau
Doina Precup
AI4CE
94
320
0
19 Nov 2015
Neural Programmer-Interpreters
Neural Programmer-Interpreters
Scott E. Reed
Nando de Freitas
70
408
0
19 Nov 2015
Training Very Deep Networks
Training Very Deep Networks
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
96
1,675
0
22 Jul 2015
Grid Long Short-Term Memory
Grid Long Short-Term Memory
Nal Kalchbrenner
Ivo Danihelka
Alex Graves
AI4TS
62
362
0
06 Jul 2015
Pointer Networks
Pointer Networks
Oriol Vinyals
Meire Fortunato
Navdeep Jaitly
92
3,036
0
09 Jun 2015
Learning to Transduce with Unbounded Memory
Learning to Transduce with Unbounded Memory
Edward Grefenstette
Karl Moritz Hermann
Mustafa Suleyman
Phil Blunsom
65
297
0
08 Jun 2015
DRAW: A Recurrent Neural Network For Image Generation
DRAW: A Recurrent Neural Network For Image Generation
Karol Gregor
Ivo Danihelka
Alex Graves
Danilo Jimenez Rezende
Daan Wierstra
GAN
DRL
142
1,959
0
16 Feb 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
806
149,474
0
22 Dec 2014
Neural Turing Machines
Neural Turing Machines
Alex Graves
Greg Wayne
Ivo Danihelka
70
2,318
0
20 Oct 2014
Deep Sequential Neural Network
Deep Sequential Neural Network
Ludovic Denoyer
Patrick Gallinari
BDL
33
62
0
02 Oct 2014
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
280
20,491
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
388
27,205
0
01 Sep 2014
Distributed Representations of Sentences and Documents
Distributed Representations of Sentences and Documents
Quoc V. Le
Tomas Mikolov
FaML
180
9,231
0
16 May 2014
Distributed Representations of Words and Phrases and their
  Compositionality
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
293
33,445
0
16 Oct 2013
Generating Sequences With Recurrent Neural Networks
Generating Sequences With Recurrent Neural Networks
Alex Graves
GAN
104
4,025
0
04 Aug 2013
Speech Recognition with Deep Recurrent Neural Networks
Speech Recognition with Deep Recurrent Neural Networks
Alex Graves
Abdel-rahman Mohamed
Geoffrey E. Hinton
152
8,503
0
22 Mar 2013
First Experiments with PowerPlay
First Experiments with PowerPlay
R. Srivastava
Bas R. Steunebrink
Jürgen Schmidhuber
62
51
0
31 Oct 2012
Self-Delimiting Neural Networks
Self-Delimiting Neural Networks
Jürgen Schmidhuber
45
37
0
29 Sep 2012
Multi-column Deep Neural Networks for Image Classification
Multi-column Deep Neural Networks for Image Classification
D. Ciresan
U. Meier
Jürgen Schmidhuber
108
3,935
0
13 Feb 2012
HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient
  Descent
HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent
Feng Niu
Benjamin Recht
Christopher Ré
Stephen J. Wright
135
2,272
0
28 Jun 2011
1