ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.10143
  4. Cited By
Direct Output Connection for a High-Rank Language Model

Direct Output Connection for a High-Rank Language Model

30 August 2018
Sho Takase
Jun Suzuki
Masaaki Nagata
ArXivPDFHTML

Papers citing "Direct Output Connection for a High-Rank Language Model"

30 / 30 papers shown
Title
Delving Deeper Into Astromorphic Transformers
Delving Deeper Into Astromorphic Transformers
Md. Zesun Ahmed Mia
Malyaban Bal
Abhronil Sengupta
85
1
0
18 Dec 2023
Constituency Parsing with a Self-Attentive Encoder
Constituency Parsing with a Self-Attentive Encoder
Nikita Kitaev
Dan Klein
51
537
0
02 May 2018
Deep contextualized word representations
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
111
11,520
0
15 Feb 2018
Source-side Prediction for Neural Headline Generation
Source-side Prediction for Neural Headline Generation
Shun Kiyono
Sho Takase
Jun Suzuki
Naoaki Okazaki
Kentaro Inui
Masaaki Nagata
36
9
0
22 Dec 2017
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model
Zhilin Yang
Zihang Dai
Ruslan Salakhutdinov
William W. Cohen
BDL
45
367
0
10 Nov 2017
Fraternal Dropout
Fraternal Dropout
Konrad Zolna
Devansh Arpit
Dendi Suhubdy
Yoshua Bengio
45
53
0
31 Oct 2017
Input-to-Output Gate to Improve RNN Language Models
Input-to-Output Gate to Improve RNN Language Models
Sho Takase
Jun Suzuki
Masaaki Nagata
AI4CE
31
6
0
26 Sep 2017
Dynamic Evaluation of Neural Sequence Models
Dynamic Evaluation of Neural Sequence Models
Ben Krause
Emmanuel Kahembwe
Iain Murray
Steve Renals
53
133
0
21 Sep 2017
Regularizing and Optimizing LSTM Language Models
Regularizing and Optimizing LSTM Language Models
Stephen Merity
N. Keskar
R. Socher
142
1,093
0
07 Aug 2017
On the State of the Art of Evaluation in Neural Language Models
On the State of the Art of Evaluation in Neural Language Models
Gábor Melis
Chris Dyer
Phil Blunsom
45
532
0
18 Jul 2017
Improving Neural Parsing by Disentangling Model Combination and
  Reranking Effects
Improving Neural Parsing by Disentangling Model Combination and Reranking Effects
Daniel Fried
Mitchell Stern
Dan Klein
AIMat
50
39
0
10 Jul 2017
Selective Encoding for Abstractive Sentence Summarization
Selective Encoding for Abstractive Sentence Summarization
Qingyu Zhou
Nan Yang
Furu Wei
M. Zhou
CVBM
56
259
0
24 Apr 2017
Outrageously Large Neural Networks: The Sparsely-Gated
  Mixture-of-Experts Layer
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
160
2,614
0
23 Jan 2017
Improving Neural Language Models with a Continuous Cache
Improving Neural Language Models with a Continuous Cache
Edouard Grave
Armand Joulin
Nicolas Usunier
KELM
41
300
0
13 Dec 2016
Neural Architecture Search with Reinforcement Learning
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
380
5,362
0
05 Nov 2016
Tying Word Vectors and Word Classifiers: A Loss Framework for Language
  Modeling
Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling
Hakan Inan
Khashayar Khosravi
R. Socher
94
384
0
04 Nov 2016
Pointer Sentinel Mixture Models
Pointer Sentinel Mixture Models
Stephen Merity
Caiming Xiong
James Bradbury
R. Socher
RALM
168
2,814
0
26 Sep 2016
Using the Output Embedding to Improve Language Models
Using the Output Embedding to Improve Language Models
Ofir Press
Lior Wolf
51
733
0
20 Aug 2016
Recurrent Highway Networks
Recurrent Highway Networks
J. Zilly
R. Srivastava
Jan Koutník
Jürgen Schmidhuber
63
414
0
12 Jul 2016
Recurrent Neural Network Grammars
Recurrent Neural Network Grammars
Chris Dyer
A. Kuncoro
Miguel Ballesteros
Noah A. Smith
GNN
71
524
0
25 Feb 2016
A Theoretically Grounded Application of Dropout in Recurrent Neural
  Networks
A Theoretically Grounded Application of Dropout in Recurrent Neural Networks
Y. Gal
Zoubin Ghahramani
UQCV
DRL
BDL
116
1,644
0
16 Dec 2015
Improving Neural Machine Translation Models with Monolingual Data
Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich
Barry Haddow
Alexandra Birch
206
2,710
0
20 Nov 2015
A Neural Attention Model for Abstractive Sentence Summarization
A Neural Attention Model for Abstractive Sentence Summarization
Alexander M. Rush
S. Chopra
Jason Weston
CVBM
110
2,695
0
02 Sep 2015
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
316
7,951
0
17 Aug 2015
Semantically Conditioned LSTM-based Natural Language Generation for
  Spoken Dialogue Systems
Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems
Tsung-Hsien Wen
Milica Gasic
N. Mrksic
Pei-hao Su
David Vandyke
S. Young
80
948
0
07 Aug 2015
Highway Networks
Highway Networks
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
118
1,765
0
03 May 2015
Going Deeper with Convolutions
Going Deeper with Convolutions
Christian Szegedy
Wei Liu
Yangqing Jia
P. Sermanet
Scott E. Reed
Dragomir Anguelov
D. Erhan
Vincent Vanhoucke
Andrew Rabinovich
333
43,511
0
17 Sep 2014
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
287
20,491
0
10 Sep 2014
Recurrent Neural Network Regularization
Recurrent Neural Network Regularization
Wojciech Zaremba
Ilya Sutskever
Oriol Vinyals
ODL
104
2,768
0
08 Sep 2014
Distributed Representations of Words and Phrases and their
  Compositionality
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
300
33,445
0
16 Oct 2013
1