ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.10959
  4. Cited By
Subword Regularization: Improving Neural Network Translation Models with
  Multiple Subword Candidates

Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates

29 April 2018
Taku Kudo
ArXivPDFHTML

Papers citing "Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates"

50 / 617 papers shown
Title
Aligning the Pretraining and Finetuning Objectives of Language Models
Aligning the Pretraining and Finetuning Objectives of Language Models
Nuo Wang Pierse
Jing Lu
AI4CE
27
2
0
05 Feb 2020
Scaling Up Online Speech Recognition Using ConvNets
Scaling Up Online Speech Recognition Using ConvNets
Vineel Pratap
Qiantong Xu
Jacob Kahn
Gilad Avidov
Tatiana Likhomanenko
Awni Y. Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
154
38
0
27 Jan 2020
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive
  Summarization
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
45
2,018
0
18 Dec 2019
Cross-Lingual Ability of Multilingual BERT: An Empirical Study
Cross-Lingual Ability of Multilingual BERT: An Empirical Study
Karthikeyan K
Zihan Wang
Stephen D. Mayhew
Dan Roth
LRM
36
334
0
17 Dec 2019
Neural Machine Translation: A Review and Survey
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
20
312
0
04 Dec 2019
A Subword Level Language Model for Bangla Language
A Subword Level Language Model for Bangla Language
Aisha Khatun
Anisur Rahman
Hemayet Ahmed Chowdhury
Md. Saiful Islam
A. Tasnim
15
4
0
15 Nov 2019
CamemBERT: a Tasty French Language Model
CamemBERT: a Tasty French Language Model
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
42
956
0
10 Nov 2019
Domain Robustness in Neural Machine Translation
Domain Robustness in Neural Machine Translation
Mathias Müller
Annette Rios Gonzales
Rico Sennrich
33
95
0
08 Nov 2019
Unsupervised Cross-lingual Representation Learning at Scale
Unsupervised Cross-lingual Representation Learning at Scale
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov
20
6,385
0
05 Nov 2019
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data
Guillaume Wenzek
Marie-Anne Lachaux
Alexis Conneau
Vishrav Chaudhary
Francisco Guzmán
Armand Joulin
Edouard Grave
13
638
0
01 Nov 2019
BPE-Dropout: Simple and Effective Subword Regularization
BPE-Dropout: Simple and Effective Subword Regularization
Ivan Provilkov
Dmitrii Emelianenko
Elena Voita
38
276
0
29 Oct 2019
Multitask Learning For Different Subword Segmentations In Neural Machine
  Translation
Multitask Learning For Different Subword Segmentations In Neural Machine Translation
Tejas Srinivasan
Ramon Sanabria
Florian Metze
12
5
0
27 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
129
19,493
0
23 Oct 2019
Deja-vu: Double Feature Presentation and Iterated Loss in Deep
  Transformer Networks
Deja-vu: Double Feature Presentation and Iterated Loss in Deep Transformer Networks
Andros Tjandra
Chunxi Liu
Frank Zhang
Xiaohui Zhang
Yongqiang Wang
Gabriel Synnaeve
Satoshi Nakamura
Geoffrey Zweig
ViT
25
44
0
23 Oct 2019
End-to-End Speech Recognition: A review for the French Language
End-to-End Speech Recognition: A review for the French Language
Florian Boyer
Jean-Luc Rouas
AI4TS
22
10
0
18 Oct 2019
Controlling Utterance Length in NMT-based Word Segmentation with
  Attention
Controlling Utterance Length in NMT-based Word Segmentation with Attention
Pierre Godard
Laurent Besacier
François Yvon
24
2
0
18 Oct 2019
Learning Invariant Representations of Social Media Users
Learning Invariant Representations of Social Media Users
Nicholas Andrews
M. Bishop
28
35
0
11 Oct 2019
Federated Learning of N-gram Language Models
Federated Learning of N-gram Language Models
Mingqing Chen
A. Suresh
Rajiv Mathews
Adeline Wong
Cyril Allauzen
F. Beaufays
Michael Riley
FedML
18
74
0
08 Oct 2019
Modeling Color Terminology Across Thousands of Languages
Modeling Color Terminology Across Thousands of Languages
Arya D. McCarthy
Winston Wu
S. Cascianelli
Bill Watson
Rita Cucchiara
9
11
0
03 Oct 2019
Regressing Word and Sentence Embeddings for Regularization of Neural
  Machine Translation
Regressing Word and Sentence Embeddings for Regularization of Neural Machine Translation
Inigo Jauregi Unanue
E. Z. Borzeshi
Massimo Piccardi
AI4TS
35
0
0
30 Sep 2019
On the Importance of Subword Information for Morphological Tasks in
  Truly Low-Resource Languages
On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages
Yi Zhu
Benjamin Heinzerling
Ivan Vulić
Michael Strube
Roi Reichart
Anna Korhonen
21
19
0
26 Sep 2019
Self-Training for End-to-End Speech Recognition
Self-Training for End-to-End Speech Recognition
Jacob Kahn
Ann Lee
Awni Y. Hannun
SSL
27
231
0
19 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Yiming Wang
Tongfei Chen
Hainan Xu
Shuoyang Ding
Hang Lv
Yiwen Shao
Nanyun Peng
Lei Xie
Shinji Watanabe
Sanjeev Khudanpur
VLM
27
73
0
18 Sep 2019
Subword ELMo
Subword ELMo
Jiangtong Li
Hai Zhao
Z. Li
Wei Bi
Xiaojiang Liu
8
1
0
18 Sep 2019
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End
  Speech Translation
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation
Chengyi Wang
Yu-Huan Wu
Shujie Liu
Zhenglu Yang
M. Zhou
10
83
0
17 Sep 2019
MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
Julian Martin Eisenschlos
Sebastian Ruder
Piotr Czapla
Marcin Kardas
Sylvain Gugger
Jeremy Howard
14
99
0
10 Sep 2019
Neural Machine Translation with Byte-Level Subwords
Neural Machine Translation with Byte-Level Subwords
Changhan Wang
Kyunghyun Cho
Jiatao Gu
26
173
0
07 Sep 2019
Subword Language Model for Query Auto-Completion
Subword Language Model for Query Auto-Completion
Gyuwan Kim
17
14
0
02 Sep 2019
Learning a Multi-Domain Curriculum for Neural Machine Translation
Learning a Multi-Domain Curriculum for Neural Machine Translation
Wei Wang
Ye Tian
Jiquan Ngiam
Yinfei Yang
Isaac Caswell
Zarana Parekh
25
39
0
28 Aug 2019
Parsimonious Morpheme Segmentation with an Application to Enriching Word
  Embeddings
Parsimonious Morpheme Segmentation with an Application to Enriching Word Embeddings
Ahmed El-Kishky
Frank F. Xu
Aston Zhang
Jiawei Han
14
4
0
18 Aug 2019
Transformer-based Automatic Post-Editing with a Context-Aware Encoding
  Approach for Multi-Source Inputs
Transformer-based Automatic Post-Editing with a Context-Aware Encoding Approach for Multi-Source Inputs
WonKee Lee
Junsuk Park
Byung-Hyun Go
Jong-Hyeok Lee
KELM
8
3
0
15 Aug 2019
IMS-Speech: A Speech to Text Tool
IMS-Speech: A Speech to Text Tool
Pavel Denisov
Ngoc Thang Vu
19
11
0
13 Aug 2019
A Baseline Neural Machine Translation System for Indian Languages
A Baseline Neural Machine Translation System for Indian Languages
Jerin Philip
Vinay P. Namboodiri
C. V. Jawahar
63
17
0
29 Jul 2019
Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness
  Task
Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task
Alexandre Berard
Ioan Calapodescu
Claude Roux
VLM
9
59
0
15 Jul 2019
The University of Edinburgh's Submissions to the WMT19 News Translation
  Task
The University of Edinburgh's Submissions to the WMT19 News Translation Task
Rachel Bawden
Nikolay Bogoychev
Ulrich Germann
Roman Grundkiewicz
Faheem Kirefu
Antonio Valerio Miceli Barone
Alexandra Birch
22
32
0
12 Jul 2019
NTT's Machine Translation Systems for WMT19 Robustness Task
NTT's Machine Translation Systems for WMT19 Robustness Task
Soichiro Murakami
Makoto Morishita
Tsutomu Hirao
Masaaki Nagata
VLM
10
9
0
09 Jul 2019
Lattice Transformer for Speech Translation
Lattice Transformer for Speech Translation
Pei Zhang
Boxing Chen
Niyu Ge
Kai Fan
37
48
0
13 Jun 2019
What Kind of Language Is Hard to Language-Model?
What Kind of Language Is Hard to Language-Model?
Sabrina J. Mielke
Ryan Cotterell
Kyle Gorman
Brian Roark
Jason Eisner
19
75
0
11 Jun 2019
Lattice-Based Transformer Encoder for Neural Machine Translation
Lattice-Based Transformer Encoder for Neural Machine Translation
Fengshun Xiao
Jiangtong Li
Zhao Hai
Rui Wang
Kehai Chen
29
42
0
04 Jun 2019
Dynamically Composing Domain-Data Selection with Clean-Data Selection by
  "Co-Curricular Learning" for Neural Machine Translation
Dynamically Composing Domain-Data Selection with Clean-Data Selection by "Co-Curricular Learning" for Neural Machine Translation
Wei Wang
Isaac Caswell
Ciprian Chelba
28
57
0
03 Jun 2019
Choosing Transfer Languages for Cross-Lingual Learning
Choosing Transfer Languages for Cross-Lingual Learning
Yu-Hsiang Lin
Chian-Yu Chen
Jean Lee
Zirui Li
Yuyan Zhang
...
Zhisong Zhang
Xuezhe Ma
Antonios Anastasopoulos
Patrick Littell
Graham Neubig
21
231
0
29 May 2019
A Call for Prudent Choice of Subword Merge Operations in Neural Machine
  Translation
A Call for Prudent Choice of Subword Merge Operations in Neural Machine Translation
Shuoyang Ding
Adithya Renduchintala
Kevin Duh
11
63
0
24 May 2019
A Systematic Study of Leveraging Subword Information for Learning Word
  Representations
A Systematic Study of Leveraging Subword Information for Learning Word Representations
Yi Zhu
Ivan Vulić
Anna Korhonen
27
30
0
16 Apr 2019
Positional Encoding to Control Output Sequence Length
Positional Encoding to Control Output Sequence Length
Sho Takase
Naoaki Okazaki
9
109
0
16 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable
  Convolutions
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
28
95
0
04 Apr 2019
ReWE: Regressing Word Embeddings for Regularization of Neural Machine
  Translation Systems
ReWE: Regressing Word Embeddings for Regularization of Neural Machine Translation Systems
Inigo Jauregi Unanue
E. Z. Borzeshi
Nazanin Esmaili
Massimo Piccardi
27
8
0
04 Apr 2019
CVIT-MT Systems for WAT-2018
CVIT-MT Systems for WAT-2018
Jerin Philip
Vinay P. Namboodiri
C. V. Jawahar
13
10
0
19 Mar 2019
The ARIEL-CMU Systems for LoReHLT18
The ARIEL-CMU Systems for LoReHLT18
Aditi Chaudhary
Siddharth Dalmia
Junjie Hu
Xinjian Li
Austin Matthews
...
Shabnam Tafreshi
Mona T. Diab
Efsun Sarioglu Kayi
N. Farra
Kathleen McKeown
VLM
27
5
0
24 Feb 2019
Multilingual Neural Machine Translation With Soft Decoupled Encoding
Multilingual Neural Machine Translation With Soft Decoupled Encoding
Xinyi Wang
Hieu H. Pham
Philip Arthur
Graham Neubig
13
59
0
09 Feb 2019
How Much Does Tokenization Affect Neural Machine Translation?
How Much Does Tokenization Affect Neural Machine Translation?
Miguel Domingo
Mercedes García-Martínez
A. Helle
F. Casacuberta
Manuel Herranz
17
55
0
20 Dec 2018
Previous
123...111213
Next