Direct Output Connection for a High-Rank Language Model

30 August 2018

Papers citing "Direct Output Connection for a High-Rank Language Model"

30 / 30 papers shown

Title
Delving Deeper Into Astromorphic Transformers Md. Zesun Ahmed Mia Malyaban Bal Abhronil Sengupta 85 1 0 18 Dec 2023
Constituency Parsing with a Self-Attentive Encoder Nikita Kitaev Dan Klein 51 537 0 02 May 2018
Deep contextualized word representations Matthew E. Peters Mark Neumann Mohit Iyyer Matt Gardner Christopher Clark Kenton Lee Luke Zettlemoyer NAI 111 11,520 0 15 Feb 2018
Source-side Prediction for Neural Headline Generation Shun Kiyono Sho Takase Jun Suzuki Naoaki Okazaki Kentaro Inui Masaaki Nagata 36 9 0 22 Dec 2017
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model Zhilin Yang Zihang Dai Ruslan Salakhutdinov William W. Cohen BDL 45 367 0 10 Nov 2017
Fraternal Dropout Konrad Zolna Devansh Arpit Dendi Suhubdy Yoshua Bengio 45 53 0 31 Oct 2017
Input-to-Output Gate to Improve RNN Language Models Sho Takase Jun Suzuki Masaaki Nagata AI4CE 31 6 0 26 Sep 2017
Dynamic Evaluation of Neural Sequence Models Ben Krause Emmanuel Kahembwe Iain Murray Steve Renals 53 133 0 21 Sep 2017
Regularizing and Optimizing LSTM Language Models Stephen Merity N. Keskar R. Socher 142 1,093 0 07 Aug 2017
On the State of the Art of Evaluation in Neural Language Models Gábor Melis Chris Dyer Phil Blunsom 45 532 0 18 Jul 2017
Improving Neural Parsing by Disentangling Model Combination and Reranking Effects Daniel Fried Mitchell Stern Dan Klein AIMat 50 39 0 10 Jul 2017
Selective Encoding for Abstractive Sentence Summarization Qingyu Zhou Nan Yang Furu Wei M. Zhou CVBM 56 259 0 24 Apr 2017
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer Noam M. Shazeer Azalia Mirhoseini Krzysztof Maziarz Andy Davis Quoc V. Le Geoffrey E. Hinton J. Dean MoE 160 2,614 0 23 Jan 2017
Improving Neural Language Models with a Continuous Cache Edouard Grave Armand Joulin Nicolas Usunier KELM 41 300 0 13 Dec 2016
Neural Architecture Search with Reinforcement Learning Barret Zoph Quoc V. Le 380 5,362 0 05 Nov 2016
Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling Hakan Inan Khashayar Khosravi R. Socher 94 384 0 04 Nov 2016
Pointer Sentinel Mixture Models Stephen Merity Caiming Xiong James Bradbury R. Socher RALM 168 2,814 0 26 Sep 2016
Using the Output Embedding to Improve Language Models Ofir Press Lior Wolf 51 733 0 20 Aug 2016
Recurrent Highway Networks J. Zilly R. Srivastava Jan Koutník Jürgen Schmidhuber 63 414 0 12 Jul 2016
Recurrent Neural Network Grammars Chris Dyer A. Kuncoro Miguel Ballesteros Noah A. Smith GNN 71 524 0 25 Feb 2016
A Theoretically Grounded Application of Dropout in Recurrent Neural Networks Y. Gal Zoubin Ghahramani UQCV DRL BDL 116 1,644 0 16 Dec 2015
Improving Neural Machine Translation Models with Monolingual Data Rico Sennrich Barry Haddow Alexandra Birch 206 2,710 0 20 Nov 2015
A Neural Attention Model for Abstractive Sentence Summarization Alexander M. Rush S. Chopra Jason Weston CVBM 110 2,695 0 02 Sep 2015
Effective Approaches to Attention-based Neural Machine Translation Thang Luong Hieu H. Pham Christopher D. Manning 316 7,951 0 17 Aug 2015
Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems Tsung-Hsien Wen Milica Gasic N. Mrksic Pei-hao Su David Vandyke S. Young 80 948 0 07 Aug 2015
Highway Networks R. Srivastava Klaus Greff Jürgen Schmidhuber 118 1,765 0 03 May 2015
Going Deeper with Convolutions Christian Szegedy Wei Liu Yangqing Jia P. Sermanet Scott E. Reed Dragomir Anguelov D. Erhan Vincent Vanhoucke Andrew Rabinovich 333 43,511 0 17 Sep 2014
Sequence to Sequence Learning with Neural Networks Ilya Sutskever Oriol Vinyals Quoc V. Le AIMat 287 20,491 0 10 Sep 2014
Recurrent Neural Network Regularization Wojciech Zaremba Ilya Sutskever Oriol Vinyals ODL 104 2,768 0 08 Sep 2014
Distributed Representations of Words and Phrases and their Compositionality Tomas Mikolov Ilya Sutskever Kai Chen G. Corrado J. Dean NAI OCL 300 33,445 0 16 Oct 2013