1804.10959
Cited By
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
29 April 2018
Taku Kudo
Papers citing
"Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates"
50 / 628 papers shown
Title
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Xie Chen
Yu-Huan Wu
Zhenghao Wang
Shujie Liu
Jinyu Li
144
177
0
22 Oct 2020
Revisiting Modularized Multilingual NMT to Meet Industrial Demands
Sungwon Lyu
Bokyung Son
Kichang Yang
Jaekyoung Bae
MoE
64
21
0
19 Oct 2020
Multi-Task Learning for Cross-Lingual Abstractive Summarization
Sho Takase
Naoaki Okazaki
100
19
0
15 Oct 2020
End to End Binarized Neural Networks for Text Classification
Harshil Jain
Akshat Agarwal
Kumar Shridhar
Denis Kleyko
MQ
79
27
0
11 Oct 2020
Multichannel Generative Language Model: Learning All Possible Factorizations Within and Across Channels
Harris Chan
J. Kiros
William Chan
LRM
23
0
0
09 Oct 2020
Differentiable Weighted Finite-State Transducers
Awni Y. Hannun
Vineel Pratap
Jacob Kahn
Wei-Ning Hsu
118
29
0
02 Oct 2020
Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation
Tahmid Hasan
Abhik Bhattacharjee
Kazi Samin Mubasshir
Masum Hasan
Madhusudan Basak
M. Rahman
Rifat Shahriyar
VLM
82
77
0
20 Sep 2020
Computer Assisted Translation with Neural Quality Estimation and Automatic Post-Editing
Jiayi Wang
Ke Min Wang
Niyu Ge
Yangbin Shi
Yu Zhao
Kai Fan
49
13
0
19 Sep 2020
Will it Unblend?
Yuval Pinter
Cassandra L. Jacobs
Jacob Eisenstein
66
14
0
18 Sep 2020
NABU - Multilingual Graph-based Neural RDF Verbalizer
Diego Moussallem
Dwaraknath Gnaneshwar
Thiago Castro Ferreira
A. N. Ngomo
57
16
0
16 Sep 2020
Neural Machine Translation without Embeddings
Uri Shaham
Omer Levy
83
16
0
21 Aug 2020
PTT5: Pretraining and validating the T5 model on Brazilian Portuguese data
Diedre Carmo
Marcos Piau
Israel Campiotti
Rodrigo Nogueira
R. Lotufo
LM&MA
79
52
0
20 Aug 2020
Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces
Milind Rao
A. Raju
Pranav Dheram
Bach Bui
Ariya Rastrow
58
43
0
14 Aug 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Naoyuki Kanda
Xuankai Chang
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
74
49
0
11 Aug 2020
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition
Egor Lakomkin
Jahn Heymann
Ilya Sklyar
Simon Wiesler
51
8
0
10 Aug 2020
KR-BERT: A Small-Scale Korean-Specific Language Model
Sangah Lee
Hansol Jang
Yunmee Baik
Suzi Park
Hyopil Shin
93
52
0
10 Aug 2020
A Survey of Orthographic Information in Machine Translation
Bharathi Raja Chakravarthi
P. Rani
Mihael Arcan
John P. McCrae
57
34
0
04 Aug 2020
Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech Recognition
Ludwig Kurzinger
Edgar Ricardo Chavez Rosas
Lujun Li
Tobias Watzel
Gerhard Rigoll
AAML
50
4
0
21 Jul 2020
Drinking from a Firehose: Continual Learning with Web-scale Natural Language
Hexiang Hu
Ozan Sener
Fei Sha
V. Koltun
CLL
62
27
0
18 Jul 2020
Contrastive Code Representation Learning
Paras Jain
Ajay Jain
Tianjun Zhang
Pieter Abbeel
Joseph E. Gonzalez
Ion Stoica
SSL
DRL
134
151
0
09 Jul 2020
Alleviating the Burden of Labeling: Sentence Generation by Attention Branch Encoder-Decoder Network
Tadashi Ogura
A. Magassouba
K. Sugiura
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
Hisashi Kawai
50
11
0
09 Jul 2020
Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters
Vineel Pratap
Anuroop Sriram
Paden Tomasello
Awni Y. Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
89
143
0
06 Jul 2020
LMVE at SemEval-2020 Task 4: Commonsense Validation and Explanation using Pretraining Language Model
Shilei Liu
Yu Guo
Bochao Li
Feiliang Ren
LRM
81
4
0
06 Jul 2020
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Pavel Denisov
Ngoc Thang Vu
77
30
0
03 Jul 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Tianyan Zhou
Takuya Yoshioka
76
78
0
19 Jun 2020
On the Multi-Property Extraction and Beyond
Tomasz Dwojak
Michal Pietruszka
Łukasz Borchmann
Filip Graliński
Jakub Chłędowski
27
0
0
15 Jun 2020
ParsBERT: Transformer-based Model for Persian Language Understanding
Mehrdad Farahani
Mohammad Gharachorloo
Marzieh Farahani
Mohammad Manthouri
91
210
0
26 May 2020
A systematic comparison of grapheme-based vs. phoneme-based label units for encoder-decoder-attention models
Mohammad Zeineldeen
Albert Zeyer
Wei Zhou
T. Ng
Ralf Schluter
Hermann Ney
69
2
0
19 May 2020
Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
Frank Zhang
Yongqiang Wang
Xiaohui Zhang
Chunxi Liu
Yatharth Saraf
Geoffrey Zweig
66
20
0
19 May 2020
Are All Languages Created Equal in Multilingual BERT?
Shijie Wu
Mark Dredze
84
326
0
18 May 2020
T-VSE: Transformer-Based Visual Semantic Embedding
M. Bastan
Arnau Ramisa
Mehmet Tek
ViT
28
7
0
17 May 2020
An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition
Gizem Aras
Didem Makaroglu
Seniz Demir
Altan Cakir
34
30
0
14 May 2020
The Unstoppable Rise of Computational Linguistics in Deep Learning
James Henderson
AI4CE
69
28
0
13 May 2020
On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation
Chaojun Wang
Rico Sennrich
85
161
0
07 May 2020
2kenize: Tying Subword Sequences for Chinese Script Conversion
Pranav A
Isabelle Augenstein
66
1
0
07 May 2020
A Multi-Perspective Architecture for Semantic Code Search
Rajarshi Haldar
Lingfei Wu
Jinjun Xiong
Julia Hockenmaier
61
57
0
06 May 2020
Dynamic Programming Encoding for Subword Segmentation in Neural Machine Translation
Xuanli He
Gholamreza Haffari
Mohammad Norouzi
65
46
0
03 May 2020
Evaluating Robustness to Input Perturbations for Neural Machine Translation
Xing Niu
Prashant Mathur
Georgiana Dinu
Yaser Al-Onaizan
AAML
80
64
0
01 May 2020
A Study in Improving BLEU Reference Coverage with Diverse Automatic Paraphrasing
Rachel Bawden
Biao Zhang
Lisa Yankovskaya
Andre Tattar
Matt Post
30
1
0
30 Apr 2020
Data and Representation for Turkish Natural Language Inference
Emrah Budur
Rıza Özçelik
Tunga Güngör
Christopher Potts
43
1
0
30 Apr 2020
Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding
Samson Tan
Shafiq Joty
Lav Varshney
Min-Yen Kan
133
35
0
30 Apr 2020
Vocabulary Adaptation for Distant Domain Adaptation in Neural Machine Translation
Shoetsu Sato
Jin Sakuma
Naoki Yoshinaga
Masashi Toyoda
M. Kitsuregawa
75
3
0
30 Apr 2020
WT5?! Training Text-to-Text Models to Explain their Predictions
Sharan Narang
Colin Raffel
Katherine Lee
Adam Roberts
Noah Fiedel
Karishma Malkan
82
201
0
30 Apr 2020
AxCell: Automatic Extraction of Results from Machine Learning Papers
Marcin Kardas
Piotr Czapla
Pontus Stenetorp
Sebastian Ruder
Sebastian Riedel
Ross Taylor
Robert Stojnic
49
76
0
29 Apr 2020
Adversarial Subword Regularization for Robust Neural Machine Translation
Jungsoo Park
Mujeen Sung
Jinhyuk Lee
Jaewoo Kang
64
8
0
29 Apr 2020
All Word Embeddings from One Embedding
Sho Takase
Sosuke Kobayashi
96
10
0
25 Apr 2020
Multiple Segmentations of Thai Sentences for Neural Machine Translation
Alberto Poncelas
Wichaya Pidchamook
Chao-Hong Liu
J. Hadley
Andy Way
31
7
0
23 Apr 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
Hirofumi Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
118
166
0
21 Apr 2020
Transfer learning and subword sampling for asymmetric-resource one-to-many neural translation
Stig-Arne Gronroos
Sami Virpioja
M. Kurimo
66
6
0
08 Apr 2020
Byte Pair Encoding is Suboptimal for Language Model Pretraining
Kaj Bostrom
Greg Durrett
71
214
0
07 Apr 2020