1308.3432
Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation
15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
Papers citing
"Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"
50 / 1,511 papers shown
Title
End-to-End Supervised Product Quantization for Image Search and Retrieval
Benjamin Klein
Lior Wolf
MQ
74
65
0
23 Nov 2017
ACtuAL: Actor-Critic Under Adversarial Learning
Anirudh Goyal
Nan Rosemary Ke
Alex Lamb
R. Devon Hjelm
C. Pal
Joelle Pineau
Yoshua Bengio
GAN
49
9
0
13 Nov 2017
Tangent: Automatic Differentiation Using Source Code Transformation in Python
B. V. Merrienboer
Alexander B. Wiltschko
D. Moldovan
73
29
0
07 Nov 2017
Neural Speed Reading via Skim-RNN
Minjoon Seo
Sewon Min
Ali Farhadi
Hannaneh Hajishirzi
92
79
0
06 Nov 2017
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
259
5,093
0
02 Nov 2017
Attacking Binarized Neural Networks
A. Galloway
Graham W. Taylor
M. Moussa
MQ
AAML
93
106
0
01 Nov 2017
Minimum Energy Quantized Neural Networks
Bert Moons
Koen Goetschalckx
Nick Van Berckelaer
Marian Verhelst
MQ
84
124
0
01 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
100
234
0
01 Nov 2017
Algorithm and Hardware Design of Discrete-Time Spiking Neural Networks Based on Back Propagation with Binary Activations
Shihui Yin
S. Venkataramanaiah
Gregory K. Chen
R. Krishnamurthy
Yu Cao
C. Chakrabarti
Jae-sun Seo
68
59
0
19 Sep 2017
WRPN: Wide Reduced-Precision Networks
Asit K. Mishra
Eriko Nurvitadhi
Jeffrey J. Cook
Debbie Marr
MQ
100
267
0
04 Sep 2017
Hierarchical Multi-scale Attention Networks for Action Recognition
Shiyang Yan
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
91
37
0
25 Aug 2017
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Victor Campos
Brendan Jou
Xavier Giró-i-Nieto
Jordi Torres
Shih-Fu Chang
93
220
0
22 Aug 2017
BitNet: Bit-Regularized Deep Neural Networks
Aswin Raghavan
Mohamed R. Amer
S. Chai
Graham Taylor
MQ
68
10
0
16 Aug 2017
Scalable Full Flow with Learned Binary Descriptors
Gottfried Munda
Alexander Shekhovtsov
Patrick Knöbelreiter
Thomas Pock
78
4
0
20 Jul 2017
Learning to Compose Task-Specific Tree Structures
Jihun Choi
Kang Min Yoo
Sang-goo Lee
98
189
0
10 Jul 2017
Neural Machine Translation with Gumbel-Greedy Decoding
Jiatao Gu
Daniel Jiwoong Im
Victor O.K. Li
101
36
0
22 Jun 2017
Balanced Quantization: An Effective and Efficient Approach to Quantized Neural Networks
Shuchang Zhou
Yuzhi Wang
He Wen
Qinyao He
Yuheng Zou
MQ
102
111
0
22 Jun 2017
Plan, Attend, Generate: Character-level Neural Machine Translation with Planning in the Decoder
Çağlar Gülçehre
Francis Dutil
Adam Trischler
Yoshua Bengio
48
7
0
13 Jun 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
Jiasen Lu
A. Kannan
Jianwei Yang
Devi Parikh
Dhruv Batra
BDL
102
137
0
05 Jun 2017
Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols
Serhii Havrylov
Ivan Titov
LLMAG
103
288
0
31 May 2017
SuperSpike: Supervised learning in multi-layer spiking neural networks
Friedemann Zenke
Surya Ganguli
113
570
0
31 May 2017
Adversarial Generation of Natural Language
Sai Rajeswar
Sandeep Subramanian
Francis Dutil
C. Pal
Aaron Courville
GAN
79
205
0
31 May 2017
Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs
Jean Maillard
S. Clark
Dani Yogatama
82
89
0
25 May 2017
A Regularized Framework for Sparse and Structured Neural Attention
Vlad Niculae
Mathieu Blondel
94
100
0
22 May 2017
The High-Dimensional Geometry of Binary Neural Networks
Alexander G. Anderson
C. P. Berg
MQ
91
76
0
19 May 2017
Espresso: Efficient Forward Propagation for BCNNs
Fabrizio Pedersoli
George Tzanetakis
Andrea Tagliasacchi
MQ
40
13
0
19 May 2017
Simplified Stochastic Feedforward Neural Networks
Kimin Lee
Jaehyung Kim
S. Chong
Jinwoo Shin
43
3
0
11 Apr 2017
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
114
261
0
03 Apr 2017
Boundary-Seeking Generative Adversarial Networks
R. Devon Hjelm
Athul Paul Jacob
Tong Che
Adam Trischler
Kyunghyun Cho
Yoshua Bengio
GAN
112
170
0
27 Feb 2017
Memory Augmented Neural Networks with Wormhole Connections
Çağlar Gülçehre
A. Chandar
Yoshua Bengio
102
63
0
30 Jan 2017
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
255
2,708
0
23 Jan 2017
Stochastic Generative Hashing
Bo Dai
Ruiqi Guo
Sanjiv Kumar
Niao He
Le Song
TPM
112
107
0
11 Jan 2017
Dynamic Deep Neural Networks: Optimizing Accuracy-Efficiency Trade-offs by Selective Execution
Lanlan Liu
Jia Deng
119
206
0
02 Jan 2017
Effective Quantization Methods for Recurrent Neural Networks
Qinyao He
He Wen
Shuchang Zhou
Yuxin Wu
Cong Yao
Xinyu Zhou
Yuheng Zou
MQ
83
76
0
30 Nov 2016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Lorenzo Baraldi
C. Grana
Rita Cucchiara
82
192
0
28 Nov 2016
Generalized Dropout
Suraj Srinivas
R. Venkatesh Babu
BDL
63
48
0
21 Nov 2016
Training Sparse Neural Networks
Suraj Srinivas
Akshayvarun Subramanya
R. Venkatesh Babu
165
208
0
21 Nov 2016
Categorical Reparameterization with Gumbel-Softmax
Eric Jang
S. Gu
Ben Poole
BDL
382
5,402
0
03 Nov 2016
The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables
Chris J. Maddison
A. Mnih
Yee Whye Teh
BDL
230
2,544
0
02 Nov 2016
Professor Forcing: A New Algorithm for Training Recurrent Networks
Alex Lamb
Anirudh Goyal
Ying Zhang
Saizheng Zhang
Aaron Courville
Yoshua Bengio
GAN
145
598
0
27 Oct 2016
Discrete Variational Autoencoders
J. Rolfe
BDL
DRL
219
261
0
07 Sep 2016
Hierarchical Multiscale Recurrent Neural Networks
Junyoung Chung
Sungjin Ahn
Yoshua Bengio
BDL
132
538
0
06 Sep 2016
Minimalist Regression Network with Reinforced Gradients and Weighted Estimates: a Case Study on Parameters Estimation in Automated Welding
Soheil Keshmiri
36
0
0
05 Jul 2016
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients
Shuchang Zhou
Yuxin Wu
Zekun Ni
Xinyu Zhou
He Wen
Yuheng Zou
MQ
198
2,093
0
20 Jun 2016
Strategic Attentive Writer for Learning Macro-Actions
Alexander A. Vezhnevets
Volodymyr Mnih
J. Agapiou
Simon Osindero
Alex Graves
Oriol Vinyals
Koray Kavukcuoglu
67
171
0
15 Jun 2016
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations
David M. Krueger
Tegan Maharaj
János Kramár
Mohammad Pezeshki
Nicolas Ballas
Nan Rosemary Ke
Anirudh Goyal
Yoshua Bengio
Aaron Courville
C. Pal
115
318
0
03 Jun 2016
Adversarially Learned Inference
Vincent Dumoulin
Ishmael Belghazi
Ben Poole
Olivier Mastropietro
Alex Lamb
Martín Arjovsky
Aaron Courville
GAN
244
1,316
0
02 Jun 2016
Noisy Activation Functions
Çağlar Gülçehre
Marcin Moczulski
Misha Denil
Yoshua Bengio
54
284
0
01 Mar 2016
Natural Language Understanding with Distributed Representation
Kyunghyun Cho
GNN
BDL
88
55
0
24 Nov 2015
Dynamic Capacity Networks
Amjad Almahairi
Nicolas Ballas
Tim Cooijmans
Yin Zheng
Hugo Larochelle
Aaron Courville
120
96
0
24 Nov 2015