Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.01038
Cited By
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
1 April 2019
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLM
FaML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"fairseq: A Fast, Extensible Toolkit for Sequence Modeling"
37 / 87 papers shown
Title
Scaling Neural Machine Translation
Myle Ott
Sergey Edunov
David Grangier
Michael Auli
AIMat
192
615
0
01 Jun 2018
Hierarchical Neural Story Generation
Angela Fan
M. Lewis
Yann N. Dauphin
DiffM
183
1,628
0
13 May 2018
A Call for Clarity in Reporting BLEU Scores
Matt Post
181
2,998
0
23 Apr 2018
A Stable and Effective Learning Strategy for Trainable Greedy Decoding
Yun Chen
Victor O.K. Li
Kyunghyun Cho
Samuel R. Bowman
57
28
0
21 Apr 2018
Phrase-Based & Neural Unsupervised Machine Translation
Guillaume Lample
Myle Ott
Alexis Conneau
Ludovic Denoyer
MarcÁurelio Ranzato
91
682
0
20 Apr 2018
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Noam M. Shazeer
Mitchell Stern
ODL
84
1,052
0
11 Apr 2018
Marian: Fast Neural Machine Translation in C++
Marcin Junczys-Dowmunt
Roman Grundkiewicz
Tomasz Dwojak
Hieu T. Hoang
Kenneth Heafield
...
Ulrich Germann
Alham Fikri Aji
Nikolay Bogoychev
André F. T. Martins
Alexandra Birch
98
718
0
01 Apr 2018
Fast Parametric Learning with Activation Memorization
Jack W. Rae
Chris Dyer
Peter Dayan
Timothy Lillicrap
KELM
146
46
0
27 Mar 2018
An Analysis of Neural Language Modeling at Multiple Scales
Stephen Merity
N. Keskar
R. Socher
59
171
0
22 Mar 2018
Tensor2Tensor for Neural Machine Translation
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan Gomez
...
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
98
530
0
16 Mar 2018
Self-Attention with Relative Position Representations
Peter Shaw
Jakob Uszkoreit
Ashish Vaswani
182
2,299
0
06 Mar 2018
Analyzing Uncertainty in Neural Machine Translation
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
UQLM
113
275
0
28 Feb 2018
A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction
Shamil Chollampatt
Hwee Tou Ng
KELM
58
222
0
26 Jan 2018
Sockeye: A Toolkit for Neural Machine Translation
Felix Hieber
Tobias Domhan
Michael J. Denkowski
David Vilar
Artem Sokolov
Ann Clifton
Matt Post
62
215
0
15 Dec 2017
Controllable Abstractive Summarization
Angela Fan
David Grangier
Michael Auli
87
312
0
14 Nov 2017
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
115
186
0
14 Nov 2017
Weighted Transformer Network for Machine Translation
Karim Ahmed
N. Keskar
R. Socher
73
133
0
06 Nov 2017
Mixed Precision Training
Paulius Micikevicius
Sharan Narang
Jonah Alben
G. Diamos
Erich Elsen
...
Boris Ginsburg
Michael Houston
Oleksii Kuchaiev
Ganesh Venkatesh
Hao Wu
176
1,805
0
10 Oct 2017
An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation
Makoto Morishita
Yusuke Oda
Graham Neubig
Koichiro Yoshino
Katsuhito Sudoh
Satoshi Nakamura
MoE
54
26
0
19 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
798
132,454
0
12 Jun 2017
ParlAI: A Dialog Research Software Platform
Alexander H. Miller
Will Feng
Adam Fisch
Jiasen Lu
Dhruv Batra
Antoine Bordes
Devi Parikh
Jason Weston
91
376
0
18 May 2017
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
208
1,559
0
11 May 2017
Convolutional Sequence to Sequence Learning
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
171
3,290
0
08 May 2017
Get To The Point: Summarization with Pointer-Generator Networks
A. See
Peter J. Liu
Christopher D. Manning
3DPC
311
4,029
0
14 Apr 2017
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
253
2,692
0
23 Jan 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
330
1,900
0
10 Jan 2017
Language Modeling with Gated Convolutional Networks
Yann N. Dauphin
Angela Fan
Michael Auli
David Grangier
245
2,408
0
23 Dec 2016
Improving Neural Language Models with a Continuous Cache
Edouard Grave
Armand Joulin
Nicolas Usunier
KELM
66
302
0
13 Dec 2016
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models
Ashwin K. Vijayakumar
Michael Cogswell
Ramprasaath R. Selvaraju
Q. Sun
Stefan Lee
David J. Crandall
Dhruv Batra
91
555
0
07 Oct 2016
Efficient softmax approximation for GPUs
Edouard Grave
Armand Joulin
Moustapha Cissé
David Grangier
Hervé Jégou
100
272
0
14 Sep 2016
SGDR: Stochastic Gradient Descent with Warm Restarts
I. Loshchilov
Frank Hutter
ODL
350
8,179
0
13 Aug 2016
Abstractive Text Summarization Using Sequence-to-Sequence RNNs and Beyond
Ramesh Nallapati
Bowen Zhou
Cicero Nogueira dos Santos
Çağlar Gülçehre
Bing Xiang
AIMat
281
2,567
0
19 Feb 2016
Exploring the Limits of Language Modeling
Rafal Jozefowicz
Oriol Vinyals
M. Schuster
Noam M. Shazeer
Yonghui Wu
201
1,145
0
07 Feb 2016
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
235
7,760
0
31 Aug 2015
Character-Aware Neural Language Models
Yoon Kim
Yacine Jernite
David Sontag
Alexander M. Rush
107
1,670
0
26 Aug 2015
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
413
7,971
0
17 Aug 2015
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
355
3,553
0
10 Jun 2015
Previous
1
2