ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.07416
  4. Cited By
Tensor2Tensor for Neural Machine Translation

Tensor2Tensor for Neural Machine Translation

16 March 2018
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan Gomez
Stephan Gouws
Llion Jones
Lukasz Kaiser
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
ArXivPDFHTML

Papers citing "Tensor2Tensor for Neural Machine Translation"

50 / 261 papers shown
Title
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire
Jinglin Liu
Yi Ren
Zhou Zhao
Chen Zhang
Baoxing Huai
Jing Yuan
14
11
0
06 Aug 2020
DeLighT: Deep and Light-weight Transformer
DeLighT: Deep and Light-weight Transformer
Sachin Mehta
Marjan Ghazvininejad
Srini Iyer
Luke Zettlemoyer
Hannaneh Hajishirzi
VLM
33
32
0
03 Aug 2020
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine
  Translation
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation
Xin Liu
Yi Ren
Jiefu Ou
Chen Zhang
Yangqiu Song
Zhou Zhao
Tie-Yan Liu
22
2
0
17 Jul 2020
Neural Composition: Learning to Generate from Multiple Models
Neural Composition: Learning to Generate from Multiple Models
Denis Filimonov
R. Gadde
Ariya Rastrow
18
3
0
10 Jul 2020
Learning Graph Structure With A Finite-State Automaton Layer
Learning Graph Structure With A Finite-State Automaton Layer
Daniel D. Johnson
Hugo Larochelle
Daniel Tarlow
GNN
AI4CE
19
17
0
09 Jul 2020
Best-First Beam Search
Best-First Beam Search
Clara Meister
Tim Vieira
Ryan Cotterell
16
71
0
08 Jul 2020
Announcing CzEng 2.0 Parallel Corpus with over 2 Gigawords
Announcing CzEng 2.0 Parallel Corpus with over 2 Gigawords
Tom Kocmi
Martin Popel
Ondrej Bojar
6
38
0
06 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML
  Models: A Survey and Insights
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
59
82
0
02 Jul 2020
UWSpeech: Speech to Speech Translation for Unwritten Languages
UWSpeech: Speech to Speech Translation for Unwritten Languages
Chen Zhang
Xu Tan
Yi Ren
Tao Qin
Ke-jun Zhang
Tie-Yan Liu
9
52
0
14 Jun 2020
FinEst BERT and CroSloEngual BERT: less is more in multilingual models
FinEst BERT and CroSloEngual BERT: less is more in multilingual models
Matej Ulvcar
Marko Robnik-Šikonja
19
48
0
14 Jun 2020
Wat zei je? Detecting Out-of-Distribution Translations with Variational
  Transformers
Wat zei je? Detecting Out-of-Distribution Translations with Variational Transformers
Tim Z. Xiao
Aidan Gomez
Y. Gal
UQLM
15
33
0
08 Jun 2020
$O(n)$ Connections are Expressive Enough: Universal Approximability of
  Sparse Transformers
O(n)O(n)O(n) Connections are Expressive Enough: Universal Approximability of Sparse Transformers
Chulhee Yun
Yin-Wen Chang
Srinadh Bhojanapalli
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
6
78
0
08 Jun 2020
ELITR Non-Native Speech Translation at IWSLT 2020
ELITR Non-Native Speech Translation at IWSLT 2020
Dominik Machávcek
Jonávs Kratochvíl
Sangeet Sagar
Matúvs vZilinec
Ondrej Bojar
T. Nguyen
Felix Schneider
P. Williams
Yuekun Yao
11
11
0
05 Jun 2020
Applying the Transformer to Character-level Transduction
Applying the Transformer to Character-level Transduction
Shijie Wu
Ryan Cotterell
Mans Hulden
AI4CE
17
102
0
20 May 2020
It's Easier to Translate out of English than into it: Measuring Neural
  Translation Difficulty by Cross-Mutual Information
It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
Emanuele Bugliarello
Sabrina J. Mielke
Antonios Anastasopoulos
Ryan Cotterell
Naoaki Okazaki
36
23
0
05 May 2020
Successfully Applying the Stabilized Lottery Ticket Hypothesis to the
  Transformer Architecture
Successfully Applying the Stabilized Lottery Ticket Hypothesis to the Transformer Architecture
Christopher Brix
Parnia Bahar
Hermann Ney
8
38
0
04 May 2020
Using Context in Neural Machine Translation Training Objectives
Using Context in Neural Machine Translation Training Objectives
Danielle Saunders
Felix Stahlberg
Bill Byrne
19
20
0
04 May 2020
Monitoring COVID-19 social distancing with person detection and tracking
  via fine-tuned YOLO v3 and Deepsort techniques
Monitoring COVID-19 social distancing with person detection and tracking via fine-tuned YOLO v3 and Deepsort techniques
Narinder Singh Punn
S. K. Sonbhadra
Sonali Agarwal
Gaurav Rai
31
240
0
04 May 2020
Generalized Entropy Regularization or: There's Nothing Special about
  Label Smoothing
Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing
Clara Meister
Elizabeth Salesky
Ryan Cotterell
UQCV
8
61
0
02 May 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
ESPnet-ST: All-in-One Speech Translation Toolkit
Hirofumi Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
33
161
0
21 Apr 2020
Fast and Accurate Deep Bidirectional Language Representations for
  Unsupervised Learning
Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning
Joongbo Shin
Yoonhyung Lee
Seunghyun Yoon
Kyomin Jung
OOD
25
12
0
17 Apr 2020
Non-Autoregressive Machine Translation with Latent Alignments
Non-Autoregressive Machine Translation with Latent Alignments
Chitwan Saharia
William Chan
Saurabh Saxena
Mohammad Norouzi
19
157
0
16 Apr 2020
Reducing Gender Bias in Neural Machine Translation as a Domain
  Adaptation Problem
Reducing Gender Bias in Neural Machine Translation as a Domain Adaptation Problem
Danielle Saunders
Bill Byrne
AI4CE
24
137
0
09 Apr 2020
Characterizing and Modeling Distributed Training with Transient Cloud
  GPU Servers
Characterizing and Modeling Distributed Training with Transient Cloud GPU Servers
Shijian Li
R. Walls
Tian Guo
31
23
0
07 Apr 2020
AR: Auto-Repair the Synthetic Data for Neural Machine Translation
AR: Auto-Repair the Synthetic Data for Neural Machine Translation
Shanbo Cheng
Shaohui Kuang
Rongxiang Weng
Heng Yu
Changfeng Zhu
Weihua Luo
SyDa
25
3
0
05 Apr 2020
On-the-Fly Adaptation of Source Code Models using Meta-Learning
Disha Shrivastava
Hugo Larochelle
Daniel Tarlow
TTA
12
6
0
26 Mar 2020
VisuoSpatial Foresight for Multi-Step, Multi-Task Fabric Manipulation
VisuoSpatial Foresight for Multi-Step, Multi-Task Fabric Manipulation
Ryan Hoque
Daniel Seita
Ashwin Balakrishna
Aditya Ganapathi
A. Tanwani
Nawid Jamali
K. Yamane
Soshi Iba
Ken Goldberg
61
99
0
19 Mar 2020
Capturing document context inside sentence-level neural machine
  translation models with self-training
Capturing document context inside sentence-level neural machine translation models with self-training
Elman Mansimov
Gábor Melis
Lei Yu
41
13
0
11 Mar 2020
Teaching Temporal Logics to Neural Networks
Teaching Temporal Logics to Neural Networks
Christopher Hahn
Frederik Schmitt
Jens U. Kreber
M. Rabe
Bernd Finkbeiner
NAI
29
66
0
06 Mar 2020
Train Large, Then Compress: Rethinking Model Size for Efficient Training
  and Inference of Transformers
Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
Zhuohan Li
Eric Wallace
Sheng Shen
Kevin Lin
Kurt Keutzer
Dan Klein
Joseph E. Gonzalez
22
148
0
26 Feb 2020
Sparse Sinkhorn Attention
Sparse Sinkhorn Attention
Yi Tay
Dara Bahri
Liu Yang
Donald Metzler
Da-Cheng Juan
23
330
0
26 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
22
138
0
18 Feb 2020
Controlling Computation versus Quality for Neural Sequence Models
Controlling Computation versus Quality for Neural Sequence Models
Ankur Bapna
N. Arivazhagan
Orhan Firat
24
30
0
17 Feb 2020
Low-Rank Bottleneck in Multi-head Attention Models
Low-Rank Bottleneck in Multi-head Attention Models
Srinadh Bhojanapalli
Chulhee Yun
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
24
94
0
17 Feb 2020
Stress Test Evaluation of Transformer-based Models in Natural Language
  Understanding Tasks
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks
Carlos Aspillaga
Andrés Carvallo
Vladimir Araujo
ELM
44
31
0
14 Feb 2020
On Layer Normalization in the Transformer Architecture
On Layer Normalization in the Transformer Architecture
Ruibin Xiong
Yunchang Yang
Di He
Kai Zheng
Shuxin Zheng
Chen Xing
Huishuai Zhang
Yanyan Lan
Liwei Wang
Tie-Yan Liu
AI4CE
19
949
0
12 Feb 2020
Towards a Human-like Open-Domain Chatbot
Towards a Human-like Open-Domain Chatbot
Daniel De Freitas
Minh-Thang Luong
David R. So
Jamie Hall
Noah Fiedel
...
Zi Yang
Apoorv Kulshreshtha
Gaurav Nemade
Yifeng Lu
Quoc V. Le
42
924
0
27 Jan 2020
Pre-training via Leveraging Assisting Languages and Data Selection for
  Neural Machine Translation
Pre-training via Leveraging Assisting Languages and Data Selection for Neural Machine Translation
Haiyue Song
Raj Dabre
Zhuoyuan Mao
Fei Cheng
Sadao Kurohashi
Eiichiro Sumita
11
2
0
23 Jan 2020
Shifted and Squeezed 8-bit Floating Point format for Low-Precision
  Training of Deep Neural Networks
Shifted and Squeezed 8-bit Floating Point format for Low-Precision Training of Deep Neural Networks
Léopold Cambier
Anahita Bhiwandiwalla
Ting Gong
M. Nekuii
Oguz H. Elibol
Hanlin Tang
MQ
21
48
0
16 Jan 2020
Learning Accurate Integer Transformer Machine-Translation Models
Learning Accurate Integer Transformer Machine-Translation Models
Ephrem Wu
11
4
0
03 Jan 2020
Learning from Learning Machines: Optimisation, Rules, and Social Norms
Learning from Learning Machines: Optimisation, Rules, and Social Norms
Travis LaCroix
Yoshua Bengio
28
7
0
29 Dec 2019
Synthetic Datasets for Neural Program Synthesis
Synthetic Datasets for Neural Program Synthesis
Richard Shin
Neel Kant
Kavi Gupta
Christopher M. Bender
Brandon Trabucco
Rishabh Singh
D. Song
NAI
16
45
0
27 Dec 2019
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures
  Translation
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
Haiyue Song
Raj Dabre
Atsushi Fujita
Sadao Kurohashi
30
4
0
26 Dec 2019
Learning and Evaluating Contextual Embedding of Source Code
Learning and Evaluating Contextual Embedding of Source Code
Aditya Kanade
Petros Maniatis
Gogul Balakrishnan
Kensen Shi
ELM
19
76
0
21 Dec 2019
Measuring Compositional Generalization: A Comprehensive Method on
  Realistic Data
Measuring Compositional Generalization: A Comprehensive Method on Realistic Data
Daniel Keysers
Nathanael Scharli
Nathan Scales
Hylke Buisman
Daniel Furrer
...
Tibor Tihon
Dmitry Tsarkov
Tianlin Li
Marc van Zee
Olivier Bousquet
CoGe
21
347
0
20 Dec 2019
A Survey on Document-level Neural Machine Translation: Methods and
  Evaluation
A Survey on Document-level Neural Machine Translation: Methods and Evaluation
Sameen Maruf
Fahimeh Saleh
Gholamreza Haffari
AI4TS
30
23
0
18 Dec 2019
In Nomine Function: Naming Functions in Stripped Binaries with Neural
  Networks
In Nomine Function: Naming Functions in Stripped Binaries with Neural Networks
Fiorella Artuso
Giuseppe Antonio Di Luna
Luca Massarelli
Leonardo Querzoni
10
5
0
17 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
49
395
0
11 Dec 2019
Neural Machine Translation: A Review and Survey
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
20
312
0
04 Dec 2019
Neural Academic Paper Generation
Neural Academic Paper Generation
Samet Demir
Uras Mutlu
Özgür Özdemir
21
3
0
02 Dec 2019
Previous
123456
Next