Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.10959
Cited By
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
29 April 2018
Taku Kudo
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates"
50 / 628 papers shown
Title
Impact of Tokenization on Language Models: An Analysis for Turkish
Cagri Toraman
E. Yilmaz
Furkan Şahinuç
Oguzhan Ozcelik
104
81
0
19 Apr 2022
Improving Tokenisation by Alternative Treatment of Spaces
Edward Gow-Smith
Harish Tayyar Madabushi
Carolina Scarton
Aline Villavicencio
89
21
0
08 Apr 2022
Deliberation Model for On-Device Spoken Language Understanding
Duc Le
Akshat Shrivastava
Paden Tomasello
Suyoun Kim
Aleksandr Livshits
Ozlem Kalinli
M. Seltzer
AuLLM
70
12
0
04 Apr 2022
Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding
Xuandi Fu
Feng-Ju Chang
Martin H. Radfar
Kailin Wei
Jing Liu
Grant P. Strimel
Kanthashree Mysore Sathyendra
48
4
0
01 Apr 2022
Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation
Sho Takase
Tatsuya Hiraoka
Naoaki Okazaki
44
5
0
25 Mar 2022
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
Ade Romadhony
...
David Moeljadi
Radityo Eko Prasojo
Timothy Baldwin
Jey Han Lau
Sebastian Ruder
107
106
0
24 Mar 2022
Small Batch Sizes Improve Training of Low-Resource Neural MT
Àlex R. Atrio
Andrei Popescu-Belis
64
6
0
20 Mar 2022
ScienceWorld: Is your Agent Smarter than a 5th Grader?
Ruoyao Wang
Peter Alexander Jansen
Marc-Alexandre Côté
Prithviraj Ammanabrolu
LLMAG
ReLM
LRM
134
129
0
14 Mar 2022
IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
Gabriele Sarti
Malvina Nissim
AILaw
101
42
0
07 Mar 2022
Extracting linguistic speech patterns of Japanese fictional characters using subword units
Mika Kishino
Kanako Komiya
26
0
0
05 Mar 2022
Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages
Vaidehi Patil
Partha P. Talukdar
Sunita Sarawagi
122
22
0
03 Mar 2022
Mukayese: Turkish NLP Strikes Back
Ali Safaya
Emirhan Kurtulucs
Arda Goktougan
Deniz Yuret
77
23
0
02 Mar 2022
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale
Laurent Sartran
Samuel Barrett
A. Kuncoro
Milovs Stanojević
Phil Blunsom
Chris Dyer
98
50
0
01 Mar 2022
LCP-dropout: Compression-based Multiple Subword Segmentation for Neural Machine Translation
Keita Nonaka
Kazutaka Yamanouchi
Tomohiro I
Tsuyoshi Okita
Kazutaka Shimada
Hiroshi Sakamoto
53
8
0
28 Feb 2022
Morphology Without Borders: Clause-Level Morphology
Omer Goldman
Reut Tsarfaty
AILaw
73
3
0
25 Feb 2022
Screening Gender Transfer in Neural Machine Translation
Guillaume Wisniewski
Lichao Zhu
Nicolas Bailler
François Yvon
103
6
0
25 Feb 2022
Refining the state-of-the-art in Machine Translation, optimizing NMT for the JA <-> EN language pair by leveraging personal domain expertise
Matthew Bieda
50
1
0
23 Feb 2022
Evaluating Persian Tokenizers
Danial Kamali
Behrooz Janfada
Mohammad Ebrahim Shenasa
B. Minaei-Bidgoli
28
1
0
22 Feb 2022
Korean Tokenization for Beam Search Rescoring in Speech Recognition
Kyuhong Shim
Hyewon Bae
Wonyong Sung
47
0
0
22 Feb 2022
Non-Autoregressive ASR with Self-Conditioned Folded Encoders
Tatsuya Komatsu
115
8
0
17 Feb 2022
USTED: Improving ASR with a Unified Speech and Text Encoder-Decoder
Bolaji Yusuf
Ankur Gandhe
Alex Sokolov
117
9
0
12 Feb 2022
Neural-FST Class Language Model for End-to-End Speech Recognition
A. Bruguier
Duc Le
Rohit Prabhavalkar
Dangna Li
Zhe Liu
Bo Wang
Eun Chang
Fuchun Peng
Ozlem Kalinli
M. Seltzer
85
6
0
28 Jan 2022
Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset
Tiezheng Yu
Rita Frieske
Peng Xu
Samuel Cahyawijaya
Cheuk Tung Shadow Yiu
...
Elham J. Barezi
Qifeng Chen
Xiaojuan Ma
Bertram E. Shi
Pascale Fung
RALM
89
10
0
07 Jan 2022
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction
Bowen Shi
Wei-Ning Hsu
Kushal Lakhotia
Abdel-rahman Mohamed
SSL
130
321
0
05 Jan 2022
Fine-Tuning Transformers: Vocabulary Transfer
Vladislav D. Mosin
Igor Samenko
Alexey Tikhonov
Borislav M. Kozlovskii
Ivan P. Yamshchikov
81
20
0
29 Dec 2021
LaTr: Layout-Aware Transformer for Scene-Text VQA
Ali Furkan Biten
Ron Litman
Yusheng Xie
Srikar Appalaraju
R. Manmatha
ViT
125
102
0
23 Dec 2021
Few-shot Learning with Multilingual Language Models
Xi Lin
Todor Mihaylov
Mikel Artetxe
Tianlu Wang
Shuohui Chen
...
Luke Zettlemoyer
Zornitsa Kozareva
Mona T. Diab
Ves Stoyanov
Xian Li
BDL
ELM
LRM
153
308
0
20 Dec 2021
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
109
151
0
20 Dec 2021
Textless Speech-to-Speech Translation on Real Data
Ann Lee
Hongyu Gong
Paul-Ambroise Duquenne
Holger Schwenk
Peng-Jen Chen
...
Sravya Popuri
Yossi Adi
J. Pino
Jiatao Gu
Wei-Ning Hsu
99
150
0
15 Dec 2021
Improving Both Domain Robustness and Domain Adaptability in Machine Translation
Wen Lai
Jindrich Libovický
Alexander Fraser
AI4CE
96
14
0
15 Dec 2021
Attentive Contextual Carryover for Multi-Turn End-to-End Spoken Language Understanding
Kai Wei
Thanh-Binh Tran
Feng-Ju Chang
Kanthashree Mysore Sathyendra
Thejaswi Muniyappa
...
A. Raju
Ross McGowan
Nathan Susanj
Ariya Rastrow
Grant P. Strimel
29
10
0
13 Dec 2021
AtteSTNet -- An attention and subword tokenization based approach for code-switched text hate speech detection
Geet Shingi
Vedangi Wagh
144
0
0
10 Dec 2021
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Pengcheng He
Jianfeng Gao
Weizhu Chen
239
1,213
0
18 Nov 2021
Character-level HyperNetworks for Hate Speech Detection
Tomer Wullach
A. Adler
Einat Minkov
61
14
0
11 Nov 2021
Context-Aware Transformer Transducer for Speech Recognition
Feng-Ju Chang
Jing Liu
Martin H. Radfar
Athanasios Mouchtaris
M. Omologo
Ariya Rastrow
Siegfried Kunzmann
66
85
0
05 Nov 2021
Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
Arij Riabi
Benoît Sagot
Djamé Seddah
85
15
0
26 Oct 2021
Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding
Wei Wang
Shuo Ren
Yao Qian
Shujie Liu
Yu Shi
Y. Qian
Michael Zeng
87
18
0
23 Oct 2021
Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation
Guanhua Chen
Shuming Ma
Yun-Nung Chen
Dongdong Zhang
Jia Pan
Wenping Wang
Furu Wei
LRM
79
15
0
16 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
162
202
0
14 Oct 2021
Automated Essay Scoring Using Transformer Models
Sabrina Ludwig
Christian W. F. Mayer
Christopher Hansen
Kerstin Eilers
Steffen Brandt
85
40
0
13 Oct 2021
Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems
Mohd Abbas Zaidi
Beomseok Lee
Sangha Kim
Chanwoo Kim
66
5
0
13 Oct 2021
Balancing Average and Worst-case Accuracy in Multitask Learning
Paul Michel
Sebastian Ruder
Dani Yogatama
72
12
0
12 Oct 2021
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Yosuke Higuchi
Nanxin Chen
Yuya Fujita
Hirofumi Inaguma
Tatsuya Komatsu
Jaesong Lee
Jumon Nozaki
Tianzi Wang
Shinji Watanabe
49
43
0
11 Oct 2021
Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy
Yosuke Higuchi
Niko Moritz
Jonathan Le Roux
Takaaki Hori
83
12
0
11 Oct 2021
Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Guoli Ye
V. Mazalov
Jinyu Li
Jiawei Liu
70
9
0
10 Oct 2021
Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units
Yosuke Higuchi
Keita Karube
Tetsuji Ogawa
Tetsunori Kobayashi
51
24
0
08 Oct 2021
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models
Robert Wolfe
Aylin Caliskan
125
51
0
01 Oct 2021
BERTweetFR : Domain Adaptation of Pre-Trained Language Models for French Tweets
Yanzhu Guo
Virgile Rennard
Christos Xypolopoulos
Michalis Vazirgiannis
VLM
AI4CE
89
19
0
21 Sep 2021
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training
Bo Zheng
Li Dong
Shaohan Huang
Saksham Singhal
Wanxiang Che
Ting Liu
Xia Song
Furu Wei
VLM
82
22
0
15 Sep 2021
Wine is Not v i n. -- On the Compatibility of Tokenizations Across Languages
Antonis Maronikolakis
Philipp Dufter
Hinrich Schütze
84
17
0
13 Sep 2021
Previous
1
2
3
...
7
8
9
...
11
12
13
Next