ResearchTrend.AI

uniblock: Scoring and Filtering Corpus with Unicode Block Information

26 August 2019
Yingbo Gao
Weiyue Wang
Hermann Ney

Papers citing "uniblock: Scoring and Filtering Corpus with Unicode Block Information"

5 / 5 papers shown
Title
Dual Conditional Cross-Entropy Filtering of Noisy Parallel Corpora
Marcin Junczys-Dowmunt
51
135
0
01 Sep 2018
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
Taku Kudo
John Richardson
196
3,520
0
19 Aug 2018
RETURNN as a Generic Flexible Neural Toolkit with Application to Translation and Speech Recognition
Albert Zeyer
Tamer Alkhouli
Hermann Ney
65
91
0
14 May 2018
A Call for Clarity in Reporting BLEU Scores
Matt Post
150
2,988
0
23 Apr 2018
Bag of Tricks for Efficient Text Classification
Armand Joulin
Edouard Grave
Piotr Bojanowski
Tomas Mikolov
VLM
170
4,622
0
06 Jul 2016