Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1908.09716
Cited By
uniblock: Scoring and Filtering Corpus with Unicode Block Information
26 August 2019
Yingbo Gao
Weiyue Wang
Hermann Ney
Re-assign community
ArXiv
PDF
HTML
Papers citing
"uniblock: Scoring and Filtering Corpus with Unicode Block Information"
5 / 5 papers shown
Title
Dual Conditional Cross-Entropy Filtering of Noisy Parallel Corpora
Marcin Junczys-Dowmunt
51
135
0
01 Sep 2018
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
Taku Kudo
John Richardson
196
3,520
0
19 Aug 2018
RETURNN as a Generic Flexible Neural Toolkit with Application to Translation and Speech Recognition
Albert Zeyer
Tamer Alkhouli
Hermann Ney
65
91
0
14 May 2018
A Call for Clarity in Reporting BLEU Scores
Matt Post
151
2,988
0
23 Apr 2018
Bag of Tricks for Efficient Text Classification
Armand Joulin
Edouard Grave
Piotr Bojanowski
Tomas Mikolov
VLM
170
4,622
0
06 Jul 2016
1