Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1409.2329
Cited By
Recurrent Neural Network Regularization
8 September 2014
Wojciech Zaremba
Ilya Sutskever
Oriol Vinyals
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Recurrent Neural Network Regularization"
50 / 289 papers shown
Title
Precision Highway for Ultra Low-Precision Quantization
Eunhyeok Park
Dongyoung Kim
S. Yoo
Peter Vajda
MQ
AI4TS
21
12
0
24 Dec 2018
Learning Private Neural Language Modeling with Attentive Aggregation
Shaoxiong Ji
Shirui Pan
Guodong Long
Xue Li
Jing Jiang
Zi Huang
FedML
MoMe
16
136
0
17 Dec 2018
Parameter Re-Initialization through Cyclical Batch Size Schedules
Norman Mu
Z. Yao
A. Gholami
Kurt Keutzer
Michael W. Mahoney
ODL
30
8
0
04 Dec 2018
Quantifying Uncertainties in Natural Language Processing Tasks
Yijun Xiao
William Yang Wang
UQCV
BDL
32
142
0
18 Nov 2018
Spatio-temporal Stacked LSTM for Temperature Prediction in Weather Forecasting
Zahra Karevan
Johan A. K. Suykens
16
39
0
15 Nov 2018
Learning to Skip Ineffectual Recurrent Computations in LSTMs
A. Ardakani
Zhengyun Ji
W. Gross
13
16
0
09 Nov 2018
You May Not Need Attention
Ofir Press
Noah A. Smith
14
27
0
31 Oct 2018
Language Modeling for Code-Switching: Evaluation, Integration of Monolingual Data, and Discriminative Training
Hila Gonen
Yoav Goldberg
16
31
0
28 Oct 2018
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Songlin Yang
Shawn Tan
Alessandro Sordoni
Aaron Courville
32
323
0
22 Oct 2018
Constituent Parsing as Sequence Labeling
Carlos Gómez-Rodríguez
David Vilares
30
60
0
21 Oct 2018
An ETF view of Dropout regularization
Dor Bank
Raja Giryes
8
4
0
14 Oct 2018
A System for Massively Parallel Hyperparameter Tuning
Liam Li
Kevin G. Jamieson
Afshin Rostamizadeh
Ekaterina Gonina
Moritz Hardt
Benjamin Recht
Ameet Talwalkar
24
372
0
13 Oct 2018
SNIP: Single-shot Network Pruning based on Connection Sensitivity
Namhoon Lee
Thalaiyasingam Ajanthan
Philip Torr
VLM
21
1,172
0
04 Oct 2018
Long-Term Occupancy Grid Prediction Using Recurrent Neural Networks
M. Schreiber
S. Hörmann
Klaus C. J. Dietmayer
24
63
0
11 Sep 2018
A Neural Temporal Model for Human Motion Prediction
Anand Gopalakrishnan
A. Mali
Daniel Kifer
C. Lee Giles
Alexander Ororbia
3DH
25
173
0
09 Sep 2018
Noise Contrastive Estimation and Negative Sampling for Conditional Models: Consistency and Statistical Efficiency
Zhuang Ma
Michael Collins
16
143
0
06 Sep 2018
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation
Zhiting Hu
Haoran Shi
Bowen Tan
Wentao Wang
Zichao Yang
...
Zhengzhong Liu
Xiaodan Liang
Wangrong Zhu
Devendra Singh Sachan
Eric Xing
VLM
19
56
0
04 Sep 2018
Direct Output Connection for a High-Rank Language Model
Sho Takase
Jun Suzuki
Masaaki Nagata
18
36
0
30 Aug 2018
Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network
A. Sherstinsky
39
3,603
0
09 Aug 2018
Distinctive-attribute Extraction for Image Captioning
Boeun Kim
Young Han Lee
Hyedong Jung
Choongsang Cho
22
6
0
25 Jul 2018
Hierarchical Multitask Learning for CTC-based Speech Recognition
Kalpesh Krishna
Shubham Toshniwal
Karen Livescu
19
44
0
17 Jul 2018
Scheduling Computation Graphs of Deep Learning Models on Manycore CPUs
Linpeng Tang
Yida Wang
Theodore L. Willke
Kai Li
GNN
21
22
0
16 Jul 2018
Beyond Data and Model Parallelism for Deep Neural Networks
Zhihao Jia
Matei A. Zaharia
A. Aiken
GNN
AI4CE
38
497
0
14 Jul 2018
Measuring abstract reasoning in neural networks
David Barrett
Felix Hill
Adam Santoro
Ari S. Morcos
Timothy Lillicrap
OOD
25
356
0
11 Jul 2018
Recurrent Auto-Encoder Model for Large-Scale Industrial Sensor Signal Analysis
Timothy Wong
Zhiyuan Luo
11
12
0
10 Jul 2018
Convolutional Recurrent Neural Networks for Glucose Prediction
Kezhi Li
J. Daniels
Chengyuan Liu
P. Herrero
Pantelis Georgiou
BDL
21
215
0
09 Jul 2018
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
Yerbolat Khassanov
Chng Eng Siong
KELM
24
5
0
27 Jun 2018
Understanding Dropout as an Optimization Trick
Sangchul Hahn
Heeyoul Choi
ODL
13
34
0
26 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
30
212
0
20 Jun 2018
StructVAE: Tree-structured Latent Variable Models for Semi-supervised Semantic Parsing
Pengcheng Yin
Chunting Zhou
Junxian He
Graham Neubig
GNN
33
102
0
20 Jun 2018
Learning to Update for Object Tracking with Recurrent Meta-learner
Bi Li
Wenxuan Xie
Wenjun Zeng
Wenyu Liu
27
25
0
19 Jun 2018
Semantic Variation in Online Communities of Practice
Marco Del Tredici
Raquel Fernández
21
39
0
15 Jun 2018
Extracting Parallel Sentences with Bidirectional Recurrent Neural Networks to Improve Machine Translation
Francis Grégoire
Philippe Langlais
15
43
0
13 Jun 2018
Are All Languages Equally Hard to Language-Model?
Ryan Cotterell
Sabrina J. Mielke
Jason Eisner
Brian Roark
14
94
0
10 Jun 2018
Training LSTM Networks with Resistive Cross-Point Devices
Tayfun Gokmen
Malte J. Rasch
W. Haensch
8
45
0
01 Jun 2018
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning
Kelvin Xu
Ellis Ratner
Anca Dragan
Sergey Levine
Chelsea Finn
27
66
0
31 May 2018
Grow and Prune Compact, Fast, and Accurate LSTMs
Xiaoliang Dai
Hongxu Yin
N. Jha
VLM
SyDa
31
90
0
30 May 2018
Retraining-Based Iterative Weight Quantization for Deep Neural Networks
Dongsoo Lee
Byeongwook Kim
MQ
36
16
0
29 May 2018
Sigsoftmax: Reanalysis of the Softmax Bottleneck
Sekitoshi Kanai
Yasuhiro Fujiwara
Yuki Yamanaka
S. Adachi
13
68
0
28 May 2018
cpSGD: Communication-efficient and differentially-private distributed SGD
Naman Agarwal
A. Suresh
Felix X. Yu
Sanjiv Kumar
H. B. McMahan
FedML
28
486
0
27 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
27
44
0
22 May 2018
Sparse Binary Compression: Towards Distributed Deep Learning with minimal Communication
Felix Sattler
Simon Wiedemann
K. Müller
Wojciech Samek
MQ
36
211
0
22 May 2018
Faster Neural Network Training with Approximate Tensor Operations
Menachem Adelman
Kfir Y. Levy
Ido Hakimi
M. Silberstein
29
26
0
21 May 2018
Zero-Shot Dialog Generation with Cross-Domain Latent Actions
Tiancheng Zhao
M. Eskénazi
VLM
27
76
0
13 May 2018
Born Again Neural Networks
Tommaso Furlanello
Zachary Chase Lipton
Michael Tschannen
Laurent Itti
Anima Anandkumar
36
1,024
0
12 May 2018
Noisin: Unbiased Regularization for Recurrent Neural Networks
Adji Bousso Dieng
Rajesh Ranganath
Jaan Altosaar
David M. Blei
22
22
0
03 May 2018
Object Counts! Bringing Explicit Detections Back into Image Captioning
Josiah Wang
Pranava Madhyastha
Lucia Specia
ObjD
19
37
0
23 Apr 2018
Unsupervised Discrete Sentence Representation Learning for Interpretable Neural Dialog Generation
Tiancheng Zhao
Kyusong Lee
M. Eskénazi
DRL
24
141
0
22 Apr 2018
Value-aware Quantization for Training and Inference of Neural Networks
Eunhyeok Park
S. Yoo
Peter Vajda
MQ
14
158
0
20 Apr 2018
Sentence Simplification with Memory-Augmented Neural Networks
Tu Vu
Baotian Hu
Tsendsuren Munkhdalai
Hong-ye Yu
22
57
0
20 Apr 2018
Previous
1
2
3
4
5
6
Next