Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.14528
Cited By
Fast Nearest Neighbor Machine Translation
30 May 2021
Yuxian Meng
Xiaoya Li
Xiayu Zheng
Fei Wu
Xiaofei Sun
Tianwei Zhang
Jiwei Li
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fast Nearest Neighbor Machine Translation"
43 / 43 papers shown
Title
Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks
Yichao Du
Zhirui Zhang
Linan Yue
Xu Huang
Yuqing Zhang
Tong Xu
Linli Xu
Enhong Chen
FedML
90
5
0
18 Jan 2024
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
70
15
0
15 Oct 2023
Finetuning Pretrained Transformers into RNNs
Jungo Kasai
Hao Peng
Yizhe Zhang
Dani Yogatama
Gabriel Ilharco
Nikolaos Pappas
Yi Mao
Weizhu Chen
Noah A. Smith
72
64
0
24 Mar 2021
Random Feature Attention
Hao Peng
Nikolaos Pappas
Dani Yogatama
Roy Schwartz
Noah A. Smith
Lingpeng Kong
58
353
0
03 Mar 2021
Efficient Retrieval Augmented Generation from Unstructured Knowledge for Task-Oriented Dialog
David Thulke
Nico Daheim
Christian Dugast
Hermann Ney
RALM
44
49
0
09 Feb 2021
Incorporating BERT into Parallel Sequence Decoding with Adapters
Junliang Guo
Zhirui Zhang
Linli Xu
Hao-Ran Wei
Boxing Chen
Enhong Chen
63
69
0
13 Oct 2020
Nearest Neighbor Machine Translation
Urvashi Khandelwal
Angela Fan
Dan Jurafsky
Luke Zettlemoyer
M. Lewis
RALM
48
283
0
01 Oct 2020
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
Lee Xiong
Chenyan Xiong
Ye Li
Kwok-Fung Tang
Jialin Liu
Paul N. Bennett
Junaid Ahmed
Arnold Overwijk
72
1,207
0
01 Jul 2020
Pre-training via Paraphrasing
M. Lewis
Marjan Ghazvininejad
Gargi Ghosh
Armen Aghajanyan
Sida I. Wang
Luke Zettlemoyer
AIMat
72
160
0
26 Jun 2020
Synthesizer: Rethinking Self-Attention in Transformer Models
Yi Tay
Dara Bahri
Donald Metzler
Da-Cheng Juan
Zhe Zhao
Che Zheng
34
334
0
02 May 2020
Augmenting Transformers with KNN-Based Composite Memory for Dialogue
Angela Fan
Claire Gardent
Chloé Braud
Antoine Bordes
RALM
108
76
0
27 Apr 2020
Understanding the Difficulty of Training Transformers
Liyuan Liu
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
Jiawei Han
AI4CE
31
251
0
17 Apr 2020
Unsupervised Domain Clusters in Pretrained Language Models
Roee Aharoni
Yoav Goldberg
56
247
0
05 Apr 2020
SAC: Accelerating and Structuring Self-Attention via Sparse Adaptive Connection
Xiaoya Li
Yuxian Meng
Mingxin Zhou
Qinghong Han
Fei Wu
Jiwei Li
50
20
0
22 Mar 2020
Incorporating BERT into Neural Machine Translation
Jinhua Zhu
Yingce Xia
Lijun Wu
Di He
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
FedML
AIMat
32
357
0
17 Feb 2020
On Layer Normalization in the Transformer Architecture
Ruibin Xiong
Yunchang Yang
Di He
Kai Zheng
Shuxin Zheng
Chen Xing
Huishuai Zhang
Yanyan Lan
Liwei Wang
Tie-Yan Liu
AI4CE
74
973
0
12 Feb 2020
REALM: Retrieval-Augmented Language Model Pre-Training
Kelvin Guu
Kenton Lee
Zora Tung
Panupong Pasupat
Ming-Wei Chang
RALM
69
2,050
0
10 Feb 2020
Time-aware Large Kernel Convolutions
Vasileios Lioutas
Yuhong Guo
AI4TS
36
29
0
08 Feb 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CE
AIMat
92
1,786
0
22 Jan 2020
Generalization through Memorization: Nearest Neighbor Language Models
Urvashi Khandelwal
Omer Levy
Dan Jurafsky
Luke Zettlemoyer
M. Lewis
RALM
112
831
0
01 Nov 2019
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
M. Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdel-rahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
AIMat
VLM
90
10,720
0
29 Oct 2019
Transformers without Tears: Improving the Normalization of Self-Attention
Toan Q. Nguyen
Julian Salazar
64
228
0
14 Oct 2019
Large-scale Pretraining for Neural Machine Translation with Tens of Billions of Sentence Pairs
Yuxian Meng
Xiangyuan Ren
Zijun Sun
Xiaoya Li
Arianna Yuan
Fei Wu
Jiwei Li
AIMat
AI4CE
17
8
0
26 Sep 2019
Facebook FAIR's WMT19 News Translation Task Submission
Nathan Ng
Kyra Yee
Alexei Baevski
Myle Ott
Michael Auli
Sergey Edunov
VLM
42
394
0
15 Jul 2019
Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges
N. Arivazhagan
Ankur Bapna
Orhan Firat
Dmitry Lepikhin
Melvin Johnson
...
George F. Foster
Colin Cherry
Wolfgang Macherey
Zhiwen Chen
Yonghui Wu
45
424
0
11 Jul 2019
Learning Deep Transformer Models for Machine Translation
Qiang Wang
Bei Li
Tong Xiao
Jingbo Zhu
Changliang Li
Derek F. Wong
Lidia S. Chao
50
666
0
05 Jun 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLM
FaML
61
3,141
0
01 Apr 2019
Massively Multilingual Neural Machine Translation
Roee Aharoni
Melvin Johnson
Orhan Firat
LRM
AI4CE
51
485
0
28 Feb 2019
Non-Parametric Adaptation for Neural Machine Translation
Ankur Bapna
Orhan Firat
41
73
0
28 Feb 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
751
93,936
0
11 Oct 2018
Retrieve and Refine: Improved Sequence Generation Models For Dialogue
Jason Weston
Emily Dinan
Alexander H. Miller
RALM
46
204
0
14 Aug 2018
A Call for Clarity in Reporting BLEU Scores
Matt Post
73
2,941
0
23 Apr 2018
Guiding Neural Machine Translation with Retrieved Translation Pieces
Jingyi Zhang
Masao Utiyama
Eiichiro Sumita
Graham Neubig
Satoshi Nakamura
AAML
55
137
0
07 Apr 2018
Learning to Remember Translation History with a Continuous Cache
Zhaopeng Tu
Yang Liu
Shuming Shi
Tong Zhang
CLL
49
180
0
26 Nov 2017
Six Challenges for Neural Machine Translation
Philipp Koehn
Rebecca Knowles
AAML
AIMat
290
1,215
0
12 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
278
129,831
0
12 Jun 2017
Convolutional Sequence to Sequence Learning
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
111
3,279
0
08 May 2017
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
123
3,682
0
28 Feb 2017
One Sentence One Model for Neural Machine Translation
Xiaoqing Li
Jiajun Zhang
Chengqing Zong
AI4CE
140
62
0
21 Sep 2016
Mutual Information and Diverse Decoding Improve Neural Machine Translation
Jiwei Li
Dan Jurafsky
35
120
0
04 Jan 2016
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
278
7,942
0
17 Aug 2015
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
235
20,467
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
308
27,205
0
01 Sep 2014
1