1608.05859
Cited By
Using the Output Embedding to Improve Language Models
20 August 2016
Ofir Press
Lior Wolf
Papers citing
"Using the Output Embedding to Improve Language Models"
50 / 156 papers shown
Improving Pre-Trained Multilingual Models with Vocabulary Expansion
Hai Wang
Dian Yu
Kai Sun
Jianshu Chen
Dong Yu
30
41
0
26 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Yiming Wang
Tongfei Chen
Hainan Xu
Shuoyang Ding
Hang Lv
Yiwen Shao
Nanyun Peng
Lei Xie
Shinji Watanabe
Sanjeev Khudanpur
VLM
27
73
0
18 Sep 2019
Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences
Genta Indra Winata
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
SyDa
135
92
0
18 Sep 2019
CTRL: A Conditional Transformer Language Model for Controllable Generation
N. Keskar
Bryan McCann
L. Varshney
Caiming Xiong
R. Socher
AI4CE
57
1,236
0
11 Sep 2019
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
KELM
24
38
0
13 Jul 2019
Federated Learning for Emoji Prediction in a Mobile Keyboard
Swaroop Indra Ramaswamy
Rajiv Mathews
Kanishka Rao
Françoise Beaufays
FedML
21
309
0
11 Jun 2019
Shared-Private Bilingual Word Embeddings for Neural Machine Translation
Xuebo Liu
Derek F. Wong
Yang Liu
Lidia S. Chao
Tong Xiao
Jingbo Zhu
35
37
0
07 Jun 2019
Unbabel's Submission to the WMT2019 APE Shared Task: BERT-based Encoder-Decoder for Automatic Post-Editing
António Vilarinho Lopes
M. Amin Farajian
Gonçalo M. Correia
Jonay Trénous
André F. T. Martins
33
35
0
30 May 2019
MATCHA: Speeding Up Decentralized SGD via Matching Decomposition Sampling
Jianyu Wang
Anit Kumar Sahu
Zhouyi Yang
Gauri Joshi
S. Kar
29
159
0
23 May 2019
Model Slicing for Supporting Complex Analytics with Elastic Inference Cost and Resource Constraints
Shaofeng Cai
Gang Chen
Beng Chin Ooi
Jinyang Gao
25
19
0
03 Apr 2019
Prospection: Interpretable Plans From Language By Predicting the Future
Chris Paxton
Yonatan Bisk
Jesse Thomason
Arunkumar Byravan
Dieter Fox
LM&Ro
26
47
0
20 Mar 2019
Context Vectors are Reflections of Word Vectors in Half the Dimensions
Z. Assylbekov
Rustem Takhanov
16
10
0
26 Feb 2019
Improving Robustness of Machine Translation with Synthetic Noise
Vaibhav
Sumeet Singh
Craig Alan Stewart
Graham Neubig
16
83
0
25 Feb 2019
Latent Normalizing Flows for Discrete Sequences
Zachary M. Ziegler
Alexander M. Rush
BDL
DRL
24
122
0
29 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
38
3,674
0
09 Jan 2019
Learning Private Neural Language Modeling with Attentive Aggregation
Shaoxiong Ji
Shirui Pan
Guodong Long
Xue Li
Jing Jiang
Zi Huang
FedML
MoMe
16
136
0
17 Dec 2018
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs
Sachin Kumar
Yulia Tsvetkov
22
70
0
10 Dec 2018
Input Combination Strategies for Multi-Source Transformer Decoder
Jindřich Libovický
Jindřich Helcl
David Mareček
27
73
0
12 Nov 2018
CUNI System for the WMT18 Multimodal Translation Task
Jindřich Helcl
Jindřich Libovický
Dušan Variš
16
57
0
12 Nov 2018
Federated Learning for Mobile Keyboard Prediction
Andrew Straiton Hard
Kanishka Rao
Zhifeng Lin
Swaroop Indra Ramaswamy
Youjie Li
S. Augenstein
A. Schwing
M. Annavaram
A. Avestimehr
FedML
53
1,511
0
08 Nov 2018
Bi-Directional Differentiable Input Reconstruction for Low-Resource Neural Machine Translation
Xing Niu
Weijia Xu
Marine Carpuat
19
17
0
02 Nov 2018
Language-Independent Representor for Neural Machine Translation
Long Zhou
Yuchen Liu
Jiajun Zhang
Chengqing Zong
Guoping Huang
21
1
0
01 Nov 2018
You May Not Need Attention
Ofir Press
Noah A. Smith
14
27
0
31 Oct 2018
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Yikang Shen
Shawn Tan
Alessandro Sordoni
Aaron Courville
32
323
0
22 Oct 2018
Real-time Neural-based Input Method
Jiali Yao
Raphael Shu
Xinjian Li
K. Ohtsuki
Hideki Nakayama
6
4
0
19 Oct 2018
Adaptive Input Representations for Neural Language Modeling
Alexei Baevski
Michael Auli
26
387
0
28 Sep 2018
Direct Output Connection for a High-Rank Language Model
Sho Takase
Jun Suzuki
Masaaki Nagata
18
36
0
30 Aug 2018
GILE: A Generalized Input-Label Embedding for Text Classification
Nikolaos Pappas
James Henderson
AI4TS
AILaw
VLM
27
79
0
16 Jun 2018
Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning
Genta Indra Winata
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
22
39
0
30 May 2018
Like a Baby: Visually Situated Neural Language Acquisition
Alexander Ororbia
A. Mali
Mary Alexandria Kelly
David Reitter
20
4
0
29 May 2018
Born Again Neural Networks
Tommaso Furlanello
Zachary Chase Lipton
Michael Tschannen
Laurent Itti
Anima Anandkumar
36
1,020
0
12 May 2018
Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context
Urvashi Khandelwal
He He
Peng Qi
Dan Jurafsky
RALM
16
293
0
12 May 2018
Extreme Adaptation for Personalized Neural Machine Translation
Paul Michel
Graham Neubig
19
103
0
04 May 2018
Value-aware Quantization for Training and Inference of Neural Networks
Eunhyeok Park
S. Yoo
Peter Vajda
MQ
14
158
0
20 Apr 2018
Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task
Marcin Junczys-Dowmunt
Roman Grundkiewicz
Shubha Guha
Kenneth Heafield
33
192
0
16 Apr 2018
An Analysis of Neural Language Modeling at Multiple Scales
Stephen Merity
N. Keskar
R. Socher
24
170
0
22 Mar 2018
From Nodes to Networks: Evolving Recurrent Neural Networks
Aditya Rawal
Risto Miikkulainen
13
53
0
12 Mar 2018
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
DRL
42
4,724
0
04 Mar 2018
MaskGAN: Better Text Generation via Filling in the ______
W. Fedus
Ian Goodfellow
Andrew M. Dai
24
468
0
23 Jan 2018
Fix your classifier: the marginal value of training the last weight layer
Elad Hoffer
Itay Hubara
Daniel Soudry
35
101
0
14 Jan 2018
Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Yujun Lin
Song Han
Huizi Mao
Yu Wang
W. Dally
44
1,388
0
05 Dec 2017
Modeling Past and Future for Neural Machine Translation
Zaixiang Zheng
Hao Zhou
Shujian Huang
Lili Mou
Xinyu Dai
Jiajun Chen
Zhaopeng Tu
32
48
0
27 Nov 2017
Neural Language Modeling by Jointly Learning Syntax and Lexicon
Yikang Shen
Zhouhan Lin
Chin-Wei Huang
Aaron Courville
40
178
0
02 Nov 2017
Neural Optimizer Search with Reinforcement Learning
Irwan Bello
Barret Zoph
Vijay Vasudevan
Quoc V. Le
ODL
29
383
0
21 Sep 2017
Gradual Learning of Recurrent Neural Networks
Ziv Aharoni
Gal Rattner
Haim Permuter
AI4CE
30
4
0
29 Aug 2017
Regularizing and Optimizing LSTM Language Models
Stephen Merity
N. Keskar
R. Socher
83
1,091
0
07 Aug 2017
Revisiting Activation Regularization for Language RNNs
Stephen Merity
Bryan McCann
R. Socher
33
44
0
03 Aug 2017
YellowFin and the Art of Momentum Tuning
Jian Zhang
Ioannis Mitliagkas
ODL
23
108
0
12 Jun 2017
Deriving Neural Architectures from Sequence and Graph Kernels
Tao Lei
Wengong Jin
Regina Barzilay
Tommi Jaakkola
GNN
45
137
0
25 May 2017
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
41
1,548
0
11 May 2017