ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.12893
  4. Cited By
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings
  and Grammatical Errors

GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors

28 November 2019
Masato Hagiwara
Masato Mita
ArXivPDFHTML

Papers citing "GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors"

15 / 15 papers shown
Title
Multi-teacher Distillation for Multilingual Spelling Correction
Multi-teacher Distillation for Multilingual Spelling Correction
Jingfen Zhang
Xuan Guo
S. Bodapati
Christopher Potts
KELM
29
3
0
20 Nov 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing
  Noisy Input
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input
Minh Nguyen
Nancy F. Chen
30
0
0
21 Oct 2023
A Methodology for Generative Spelling Correction via Natural Spelling
  Errors Emulation across Multiple Domains and Languages
A Methodology for Generative Spelling Correction via Natural Spelling Errors Emulation across Multiple Domains and Languages
Nikita Martynov
Mark Baushenko
Anastasia Kozlova
Katerina Kolomeytseva
Aleksandr Abramov
Alena Fenogenova
40
2
0
18 Aug 2023
Domain specificity and data efficiency in typo tolerant spell checkers:
  the case of search in online marketplaces
Domain specificity and data efficiency in typo tolerant spell checkers: the case of search in online marketplaces
Dayanand Ubrangala
Juhi Sharma
R. Kondapalli
Kiran Rama
Amit Agarwala
Laurent Boué
23
0
0
03 Aug 2023
Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal
  Data
Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data
Xinzhe Li
Ming Liu
Shang Gao
MU
53
8
0
02 Jul 2023
Two-in-One: A Model Hijacking Attack Against Text Generation Models
Two-in-One: A Model Hijacking Attack Against Text Generation Models
Waiman Si
Michael Backes
Yang Zhang
A. Salem
SILM
32
22
0
12 May 2023
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input
  Noises
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises
Chenglei Si
Zhengyan Zhang
Yingfa Chen
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
AAML
28
1
0
14 Feb 2023
Grammatical Error Correction: A Survey of the State of the Art
Grammatical Error Correction: A Survey of the State of the Art
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
31
79
0
09 Nov 2022
Data Augmentation for Intent Classification
Data Augmentation for Intent Classification
Derek Chen
Claire Yin
6
3
0
12 Jun 2022
Correcting diacritics and typos with a ByT5 transformer model
Correcting diacritics and typos with a ByT5 transformer model
Lukas Stankevicius
M. Lukoševičius
J. Kapočiūtė-Dzikienė
Monika Briediene
Tomas Krilavičius
36
20
0
31 Jan 2022
VarCLR: Variable Semantic Representation Pre-training via Contrastive
  Learning
VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning
Qibin Chen
Jeremy Lacomis
Edward J. Schwartz
Graham Neubig
Bogdan Vasilescu
Claire Le Goues
VLM
24
35
0
05 Dec 2021
Neural Quality Estimation with Multiple Hypotheses for Grammatical Error
  Correction
Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
Zhenghao Liu
Xiaoyuan Yi
Maosong Sun
Liner Yang
Tat-Seng Chua
36
24
0
10 May 2021
Do We Need Online NLU Tools?
Do We Need Online NLU Tools?
Petr Lorenc
Petro Marek
Jan Pichl
Jakub Konrád
Jan Sedivý
21
6
0
19 Nov 2020
Tokenization Repair in the Presence of Spelling Errors
Tokenization Repair in the Presence of Spelling Errors
Hannah Bast
Matthias Hertel
M. Mohamed
9
5
0
15 Oct 2020
Approaching Neural Grammatical Error Correction as a Low-Resource
  Machine Translation Task
Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task
Marcin Junczys-Dowmunt
Roman Grundkiewicz
Shubha Guha
Kenneth Heafield
33
192
0
16 Apr 2018
1