Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates

29 April 2018

Papers citing "Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates"

50 / 628 papers shown

Title
Deep Learning Based Text Classification: A Comprehensive Review Shervin Minaee Nal Kalchbrenner Min Zhang Narjes Nikzad M. Asgari-Chenaghlu Jianfeng Gao AILaw VLM AI4TS 116 1,116 0 06 Apr 2020
Finding the Optimal Vocabulary Size for Neural Machine Translation Thamme Gowda Jonathan May 33 3 0 05 Apr 2020
Give your Text Representation Models some Love: the Case for Basque Rodrigo Agerri Iñaki San Vicente Jon Ander Campos Ander Barrena X. Saralegi Aitor Soroa Etxabe Eneko Agirre 59 63 0 31 Mar 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition Naoyuki Kanda Yashesh Gaur Xiaofei Wang Zhong Meng Takuya Yoshioka 85 122 0 28 Mar 2020
Morfessor EM+Prune: Improved Subword Segmentation with Expectation Maximization and Pruning Stig-Arne Gronroos Sami Virpioja M. Kurimo VLM 84 21 0 06 Mar 2020
AraBERT: Transformer-based Model for Arabic Language Understanding Wissam Antoun Fady Baly Hazem M. Hajj 166 975 0 28 Feb 2020
Language-Independent Tokenisation Rivals Language-Specific Tokenisation for Word Similarity Prediction Danushka Bollegala Ryuichi Kiryo K. Tsujino Haruki Yukawa 23 7 0 25 Feb 2020
Semi-Supervised Speech Recognition via Local Prior Matching Wei-Ning Hsu Ann Lee Gabriel Synnaeve Awni Y. Hannun SSL 138 31 0 24 Feb 2020
Fixed Encoder Self-Attention Patterns in Transformer-Based Machine Translation Alessandro Raganato Yves Scherrer Jörg Tiedemann 100 92 0 24 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation Shu Yang Yuxin Wang Xiaowen Chu VLM AI4TS AI4CE 106 140 0 18 Feb 2020
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language Kohei Matsuura Sei Ueno Masato Mimura S. Sakai Tatsuya Kawahara CVBM 41 13 0 16 Feb 2020
CBAG: Conditional Biomedical Abstract Generation Justin Sybrandt Ilya Safro MedIm AI4CE 53 8 0 13 Feb 2020
fastai: A Layered API for Deep Learning Jeremy Howard Sylvain Gugger AI4CE 135 873 0 11 Feb 2020
Aligning the Pretraining and Finetuning Objectives of Language Models Nuo Wang Pierse Jing Lu AI4CE 37 2 0 05 Feb 2020
Scaling Up Online Speech Recognition Using ConvNets Vineel Pratap Qiantong Xu Jacob Kahn Gilad Avidov Tatiana Likhomanenko Awni Y. Hannun Vitaliy Liptchinsky Gabriel Synnaeve R. Collobert 242 39 0 27 Jan 2020
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization Jingqing Zhang Yao-Min Zhao Mohammad Saleh Peter J. Liu RALM 3DGS 347 2,058 0 18 Dec 2019
Cross-Lingual Ability of Multilingual BERT: An Empirical Study Karthikeyan K Zihan Wang Stephen D. Mayhew Dan Roth LRM 98 340 0 17 Dec 2019
Neural Machine Translation: A Review and Survey Felix Stahlberg 3DV AI4TS MedIm 140 332 0 04 Dec 2019
A Subword Level Language Model for Bangla Language Aisha Khatun Anisur Rahman Hemayet Ahmed Chowdhury Md. Saiful Islam A. Tasnim 34 4 0 15 Nov 2019
CamemBERT: a Tasty French Language Model Louis Martin Benjamin Muller Pedro Ortiz Suarez Yoann Dupont Laurent Romary Eric Villemonte de la Clergerie Djamé Seddah Benoît Sagot 145 981 0 10 Nov 2019
Domain Robustness in Neural Machine Translation Mathias Müller Annette Rios Gonzales Rico Sennrich 109 95 0 08 Nov 2019
Unsupervised Cross-lingual Representation Learning at Scale Alexis Conneau Kartikay Khandelwal Naman Goyal Vishrav Chaudhary Guillaume Wenzek Francisco Guzmán Edouard Grave Myle Ott Luke Zettlemoyer Veselin Stoyanov 239 6,618 0 05 Nov 2019
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data Guillaume Wenzek Marie-Anne Lachaux Alexis Conneau Vishrav Chaudhary Francisco Guzmán Armand Joulin Edouard Grave 124 658 0 01 Nov 2019
BPE-Dropout: Simple and Effective Subword Regularization Ivan Provilkov Dmitrii Emelianenko Elena Voita 101 289 0 29 Oct 2019
Multitask Learning For Different Subword Segmentations In Neural Machine Translation Tejas Srinivasan Ramon Sanabria Florian Metze 37 5 0 27 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Colin Raffel Noam M. Shazeer Adam Roberts Katherine Lee Sharan Narang Michael Matena Yanqi Zhou Wei Li Peter J. Liu AIMat 822 20,447 0 23 Oct 2019
Deja-vu: Double Feature Presentation and Iterated Loss in Deep Transformer Networks Andros Tjandra Chunxi Liu Frank Zhang Xiaohui Zhang Yongqiang Wang Gabriel Synnaeve Satoshi Nakamura Geoffrey Zweig ViT 89 46 0 23 Oct 2019
End-to-End Speech Recognition: A review for the French Language Florian Boyer Jean-Luc Rouas AI4TS 66 10 0 18 Oct 2019
Controlling Utterance Length in NMT-based Word Segmentation with Attention Pierre Godard Laurent Besacier François Yvon 44 2 0 18 Oct 2019
Learning Invariant Representations of Social Media Users Nicholas Andrews M. Bishop 79 37 0 11 Oct 2019
Federated Learning of N-gram Language Models Mingqing Chen A. Suresh Rajiv Mathews Adeline Wong Cyril Allauzen F. Beaufays Michael Riley FedML 117 75 0 08 Oct 2019
Modeling Color Terminology Across Thousands of Languages Arya D. McCarthy Winston Wu S. Cascianelli Bill Watson Rita Cucchiara 50 11 0 03 Oct 2019
Regressing Word and Sentence Embeddings for Regularization of Neural Machine Translation Inigo Jauregi Unanue E. Z. Borzeshi Massimo Piccardi AI4TS 44 0 0 30 Sep 2019
On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages Yi Zhu Benjamin Heinzerling Ivan Vulić Michael Strube Roi Reichart Anna Korhonen 65 20 0 26 Sep 2019
Self-Training for End-to-End Speech Recognition Jacob Kahn Ann Lee Awni Y. Hannun SSL 69 236 0 19 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit Yiming Wang Tongfei Chen Hainan Xu Shuoyang Ding Hang Lv Yiwen Shao Nanyun Peng Lei Xie Shinji Watanabe Sanjeev Khudanpur VLM 96 73 0 18 Sep 2019
Subword ELMo Jiangtong Li Hai Zhao Z. Li Wei Bi Xiaojiang Liu 20 1 0 18 Sep 2019
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation Chengyi Wang Yu-Huan Wu Shujie Liu Zhenglu Yang M. Zhou 89 84 0 17 Sep 2019
MultiFiT: Efficient Multi-lingual Language Model Fine-tuning Julian Martin Eisenschlos Sebastian Ruder Piotr Czapla Marcin Kardas Sylvain Gugger Jeremy Howard 69 99 0 10 Sep 2019
Neural Machine Translation with Byte-Level Subwords Changhan Wang Kyunghyun Cho Jiatao Gu 96 178 0 07 Sep 2019
Subword Language Model for Query Auto-Completion Gyuwan Kim 36 15 0 02 Sep 2019
Learning a Multi-Domain Curriculum for Neural Machine Translation Wei Wang Ye Tian Jiquan Ngiam Yinfei Yang Isaac Caswell Zarana Parekh 82 39 0 28 Aug 2019
Parsimonious Morpheme Segmentation with an Application to Enriching Word Embeddings Ahmed El-Kishky Frank F. Xu Aston Zhang Jiawei Han 49 4 0 18 Aug 2019
Transformer-based Automatic Post-Editing with a Context-Aware Encoding Approach for Multi-Source Inputs WonKee Lee Junsuk Park Byung-Hyun Go Jong-Hyeok Lee KELM 30 3 0 15 Aug 2019
IMS-Speech: A Speech to Text Tool Pavel Denisov Ngoc Thang Vu 80 11 0 13 Aug 2019
A Baseline Neural Machine Translation System for Indian Languages Jerin Philip Vinay P. Namboodiri C. V. Jawahar 107 17 0 29 Jul 2019
Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task Alexandre Berard Ioan Calapodescu Claude Roux VLM 80 59 0 15 Jul 2019
The University of Edinburgh's Submissions to the WMT19 News Translation Task Rachel Bawden Nikolay Bogoychev Ulrich Germann Roman Grundkiewicz Faheem Kirefu Antonio Valerio Miceli Barone Alexandra Birch 59 32 0 12 Jul 2019
NTT's Machine Translation Systems for WMT19 Robustness Task Soichiro Murakami Makoto Morishita Tsutomu Hirao Masaaki Nagata VLM 49 9 0 09 Jul 2019
Lattice Transformer for Speech Translation Pei Zhang Boxing Chen Niyu Ge Kai Fan 80 50 0 13 Jun 2019