1603.08983
Adaptive Computation Time for Recurrent Neural Networks
29 March 2016
Alex Graves
Papers citing
"Adaptive Computation Time for Recurrent Neural Networks"
27 / 27 papers shown
Title
Do Language Models Use Their Depth Efficiently?
Róbert Csordás
Christopher D. Manning
Christopher Potts
84
0
0
20 May 2025
Int2Int: a framework for mathematics with transformers
François Charton
ViT
103
0
0
22 Feb 2025
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Sangmin Bae
Adam Fisch
Hrayr Harutyunyan
Ziwei Ji
Seungyeon Kim
Tal Schuster
KELM
93
6
0
28 Oct 2024
Investigating Recurrent Transformers with Dynamic Halt
Jishnu Ray Chowdhury
Cornelia Caragea
63
1
0
01 Feb 2024
Fast-Slow Recurrent Neural Networks
Asier Mujika
Florian Meier
Angelika Steger
65
76
0
24 May 2017
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models
S. M. Ali Eslami
N. Heess
T. Weber
Yuval Tassa
David Szepesvari
Koray Kavukcuoglu
Geoffrey E. Hinton
3DV
BDL
OCL
97
550
0
28 Mar 2016
Order Matters: Sequence to sequence for sets
Oriol Vinyals
Samy Bengio
M. Kudlur
BDL
119
950
0
19 Nov 2015
Conditional Computation in Neural Networks for faster models
Emmanuel Bengio
Pierre-Luc Bacon
Joelle Pineau
Doina Precup
AI4CE
94
320
0
19 Nov 2015
Neural Programmer-Interpreters
Scott E. Reed
Nando de Freitas
70
408
0
19 Nov 2015
Training Very Deep Networks
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
96
1,675
0
22 Jul 2015
Grid Long Short-Term Memory
Nal Kalchbrenner
Ivo Danihelka
Alex Graves
AI4TS
62
362
0
06 Jul 2015
Pointer Networks
Oriol Vinyals
Meire Fortunato
Navdeep Jaitly
92
3,036
0
09 Jun 2015
Learning to Transduce with Unbounded Memory
Edward Grefenstette
Karl Moritz Hermann
Mustafa Suleyman
Phil Blunsom
65
297
0
08 Jun 2015
DRAW: A Recurrent Neural Network For Image Generation
Karol Gregor
Ivo Danihelka
Alex Graves
Danilo Jimenez Rezende
Daan Wierstra
GAN
DRL
142
1,959
0
16 Feb 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
806
149,474
0
22 Dec 2014
Neural Turing Machines
Alex Graves
Greg Wayne
Ivo Danihelka
70
2,318
0
20 Oct 2014
Deep Sequential Neural Network
Ludovic Denoyer
Patrick Gallinari
BDL
33
62
0
02 Oct 2014
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
280
20,491
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
388
27,205
0
01 Sep 2014
Distributed Representations of Sentences and Documents
Quoc V. Le
Tomas Mikolov
FaML
180
9,231
0
16 May 2014
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
293
33,445
0
16 Oct 2013
Generating Sequences With Recurrent Neural Networks
Alex Graves
GAN
104
4,025
0
04 Aug 2013
Speech Recognition with Deep Recurrent Neural Networks
Alex Graves
Abdel-rahman Mohamed
Geoffrey E. Hinton
152
8,503
0
22 Mar 2013
First Experiments with PowerPlay
R. Srivastava
Bas R. Steunebrink
Jürgen Schmidhuber
62
51
0
31 Oct 2012
Self-Delimiting Neural Networks
Jürgen Schmidhuber
45
37
0
29 Sep 2012
Multi-column Deep Neural Networks for Image Classification
D. Ciresan
U. Meier
Jürgen Schmidhuber
108
3,935
0
13 Feb 2012
HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent
Feng Niu
Benjamin Recht
Christopher Ré
Stephen J. Wright
135
2,272
0
28 Jun 2011