Recurrent Neural Network Regularization

8 September 2014

Papers citing "Recurrent Neural Network Regularization"

50 / 289 papers shown

Title | Authors | Tags | Counts | Date
Precision Highway for Ultra Low-Precision Quantization | Eunhyeok Park, Dongyoung Kim, S. Yoo, Peter Vajda | MQ, AI4TS | 21 12 0 | 24 Dec 2018
Learning Private Neural Language Modeling with Attentive Aggregation | Shaoxiong Ji, Shirui Pan, Guodong Long, Xue Li, Jing Jiang, Zi Huang | FedML, MoMe | 16 136 0 | 17 Dec 2018
Parameter Re-Initialization through Cyclical Batch Size Schedules | Norman Mu, Z. Yao, A. Gholami, Kurt Keutzer, Michael W. Mahoney | ODL | 30 8 0 | 04 Dec 2018
Quantifying Uncertainties in Natural Language Processing Tasks | Yijun Xiao, William Yang Wang | UQCV, BDL | 32 142 0 | 18 Nov 2018
Spatio-temporal Stacked LSTM for Temperature Prediction in Weather Forecasting | Zahra Karevan, Johan A. K. Suykens | | 16 39 0 | 15 Nov 2018
Learning to Skip Ineffectual Recurrent Computations in LSTMs | A. Ardakani, Zhengyun Ji, W. Gross | | 13 16 0 | 09 Nov 2018
You May Not Need Attention | Ofir Press, Noah A. Smith | | 14 27 0 | 31 Oct 2018
Language Modeling for Code-Switching: Evaluation, Integration of Monolingual Data, and Discriminative Training | Hila Gonen, Yoav Goldberg | | 16 31 0 | 28 Oct 2018
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks | Songlin Yang, Shawn Tan, Alessandro Sordoni, Aaron Courville | | 32 323 0 | 22 Oct 2018
Constituent Parsing as Sequence Labeling | Carlos Gómez-Rodríguez, David Vilares | | 30 60 0 | 21 Oct 2018
An ETF view of Dropout regularization | Dor Bank, Raja Giryes | | 8 4 0 | 14 Oct 2018
A System for Massively Parallel Hyperparameter Tuning | Liam Li, Kevin G. Jamieson, Afshin Rostamizadeh, Ekaterina Gonina, Moritz Hardt, Benjamin Recht, Ameet Talwalkar | | 24 372 0 | 13 Oct 2018
SNIP: Single-shot Network Pruning based on Connection Sensitivity | Namhoon Lee, Thalaiyasingam Ajanthan, Philip Torr | VLM | 21 1,172 0 | 04 Oct 2018
Long-Term Occupancy Grid Prediction Using Recurrent Neural Networks | M. Schreiber, S. Hörmann, Klaus C. J. Dietmayer | | 24 63 0 | 11 Sep 2018
A Neural Temporal Model for Human Motion Prediction | Anand Gopalakrishnan, A. Mali, Daniel Kifer, C. Lee Giles, Alexander Ororbia | 3DH | 25 173 0 | 09 Sep 2018
Noise Contrastive Estimation and Negative Sampling for Conditional Models: Consistency and Statistical Efficiency | Zhuang Ma, Michael Collins | | 16 143 0 | 06 Sep 2018
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation | Zhiting Hu, Haoran Shi, Bowen Tan, Wentao Wang, Zichao Yang, ..., Zhengzhong Liu, Xiaodan Liang, Wangrong Zhu, Devendra Singh Sachan, Eric Xing | VLM | 19 56 0 | 04 Sep 2018
Direct Output Connection for a High-Rank Language Model | Sho Takase, Jun Suzuki, Masaaki Nagata | | 18 36 0 | 30 Aug 2018
Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network | A. Sherstinsky | | 39 3,603 0 | 09 Aug 2018
Distinctive-attribute Extraction for Image Captioning | Boeun Kim, Young Han Lee, Hyedong Jung, Choongsang Cho | | 22 6 0 | 25 Jul 2018
Hierarchical Multitask Learning for CTC-based Speech Recognition | Kalpesh Krishna, Shubham Toshniwal, Karen Livescu | | 19 44 0 | 17 Jul 2018
Scheduling Computation Graphs of Deep Learning Models on Manycore CPUs | Linpeng Tang, Yida Wang, Theodore L. Willke, Kai Li | GNN | 21 22 0 | 16 Jul 2018
Beyond Data and Model Parallelism for Deep Neural Networks | Zhihao Jia, Matei A. Zaharia, A. Aiken | GNN, AI4CE | 38 497 0 | 14 Jul 2018
Measuring abstract reasoning in neural networks | David Barrett, Felix Hill, Adam Santoro, Ari S. Morcos, Timothy Lillicrap | OOD | 25 356 0 | 11 Jul 2018
Recurrent Auto-Encoder Model for Large-Scale Industrial Sensor Signal Analysis | Timothy Wong, Zhiyuan Luo | | 11 12 0 | 10 Jul 2018
Convolutional Recurrent Neural Networks for Glucose Prediction | Kezhi Li, J. Daniels, Chengyuan Liu, P. Herrero, Pantelis Georgiou | BDL | 21 215 0 | 09 Jul 2018
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR | Yerbolat Khassanov, Chng Eng Siong | KELM | 24 5 0 | 27 Jun 2018
Understanding Dropout as an Optimization Trick | Sangchul Hahn, Heeyoul Choi | ODL | 13 34 0 | 26 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards | Jose A. Arjona-Medina, Michael Gillhofer, Michael Widrich, Thomas Unterthiner, Johannes Brandstetter, Sepp Hochreiter | | 30 212 0 | 20 Jun 2018
StructVAE: Tree-structured Latent Variable Models for Semi-supervised Semantic Parsing | Pengcheng Yin, Chunting Zhou, Junxian He, Graham Neubig | GNN | 33 102 0 | 20 Jun 2018
Learning to Update for Object Tracking with Recurrent Meta-learner | Bi Li, Wenxuan Xie, Wenjun Zeng, Wenyu Liu | | 27 25 0 | 19 Jun 2018
Semantic Variation in Online Communities of Practice | Marco Del Tredici, Raquel Fernández | | 21 39 0 | 15 Jun 2018
Extracting Parallel Sentences with Bidirectional Recurrent Neural Networks to Improve Machine Translation | Francis Grégoire, Philippe Langlais | | 15 43 0 | 13 Jun 2018
Are All Languages Equally Hard to Language-Model? | Ryan Cotterell, Sabrina J. Mielke, Jason Eisner, Brian Roark | | 14 94 0 | 10 Jun 2018
Training LSTM Networks with Resistive Cross-Point Devices | Tayfun Gokmen, Malte J. Rasch, W. Haensch | | 8 45 0 | 01 Jun 2018
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning | Kelvin Xu, Ellis Ratner, Anca Dragan, Sergey Levine, Chelsea Finn | | 27 66 0 | 31 May 2018
Grow and Prune Compact, Fast, and Accurate LSTMs | Xiaoliang Dai, Hongxu Yin, N. Jha | VLM, SyDa | 31 90 0 | 30 May 2018
Retraining-Based Iterative Weight Quantization for Deep Neural Networks | Dongsoo Lee, Byeongwook Kim | MQ | 36 16 0 | 29 May 2018
Sigsoftmax: Reanalysis of the Softmax Bottleneck | Sekitoshi Kanai, Yasuhiro Fujiwara, Yuki Yamanaka, S. Adachi | | 13 68 0 | 28 May 2018
cpSGD: Communication-efficient and differentially-private distributed SGD | Naman Agarwal, A. Suresh, Felix X. Yu, Sanjiv Kumar, H. B. McMahan | FedML | 28 486 0 | 27 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training | Bojian Zheng, Abhishek Tiwari, Nandita Vijaykumar, Gennady Pekhimenko | | 27 44 0 | 22 May 2018
Sparse Binary Compression: Towards Distributed Deep Learning with minimal Communication | Felix Sattler, Simon Wiedemann, K. Müller, Wojciech Samek | MQ | 36 211 0 | 22 May 2018
Faster Neural Network Training with Approximate Tensor Operations | Menachem Adelman, Kfir Y. Levy, Ido Hakimi, M. Silberstein | | 29 26 0 | 21 May 2018
Zero-Shot Dialog Generation with Cross-Domain Latent Actions | Tiancheng Zhao, M. Eskénazi | VLM | 27 76 0 | 13 May 2018
Born Again Neural Networks | Tommaso Furlanello, Zachary Chase Lipton, Michael Tschannen, Laurent Itti, Anima Anandkumar | | 36 1,024 0 | 12 May 2018
Noisin: Unbiased Regularization for Recurrent Neural Networks | Adji Bousso Dieng, Rajesh Ranganath, Jaan Altosaar, David M. Blei | | 22 22 0 | 03 May 2018
Object Counts! Bringing Explicit Detections Back into Image Captioning | Josiah Wang, Pranava Madhyastha, Lucia Specia | ObjD | 19 37 0 | 23 Apr 2018
Unsupervised Discrete Sentence Representation Learning for Interpretable Neural Dialog Generation | Tiancheng Zhao, Kyusong Lee, M. Eskénazi | DRL | 24 141 0 | 22 Apr 2018
Value-aware Quantization for Training and Inference of Neural Networks | Eunhyeok Park, S. Yoo, Peter Vajda | MQ | 14 158 0 | 20 Apr 2018
Sentence Simplification with Memory-Augmented Neural Networks | Tu Vu, Baotian Hu, Tsendsuren Munkhdalai, Hong-ye Yu | | 22 57 0 | 20 Apr 2018