1911.12893
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
28 November 2019
Masato Hagiwara
Masato Mita
Papers citing
"GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors"
15 / 15 papers shown
Multi-teacher Distillation for Multilingual Spelling Correction
Jingfen Zhang
Xuan Guo
S. Bodapati
Christopher Potts
KELM
29
3
0
20 Nov 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input
Minh Nguyen
Nancy F. Chen
30
0
0
21 Oct 2023
A Methodology for Generative Spelling Correction via Natural Spelling Errors Emulation across Multiple Domains and Languages
Nikita Martynov
Mark Baushenko
Anastasia Kozlova
Katerina Kolomeytseva
Aleksandr Abramov
Alena Fenogenova
40
2
0
18 Aug 2023
Domain specificity and data efficiency in typo tolerant spell checkers: the case of search in online marketplaces
Dayanand Ubrangala
Juhi Sharma
R. Kondapalli
Kiran Rama
Amit Agarwala
Laurent Boué
23
0
0
03 Aug 2023
Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data
Xinzhe Li
Ming Liu
Shang Gao
MU
53
8
0
02 Jul 2023
Two-in-One: A Model Hijacking Attack Against Text Generation Models
Waiman Si
Michael Backes
Yang Zhang
A. Salem
SILM
32
22
0
12 May 2023
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises
Chenglei Si
Zhengyan Zhang
Yingfa Chen
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
AAML
28
1
0
14 Feb 2023
Grammatical Error Correction: A Survey of the State of the Art
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
34
79
0
09 Nov 2022
Data Augmentation for Intent Classification
Derek Chen
Claire Yin
8
3
0
12 Jun 2022
Correcting diacritics and typos with a ByT5 transformer model
Lukas Stankevicius
M. Lukoševičius
J. Kapočiūtė-Dzikienė
Monika Briediene
Tomas Krilavičius
36
20
0
31 Jan 2022
VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning
Qibin Chen
Jeremy Lacomis
Edward J. Schwartz
Graham Neubig
Bogdan Vasilescu
Claire Le Goues
VLM
24
35
0
05 Dec 2021
Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
Zhenghao Liu
Xiaoyuan Yi
Maosong Sun
Liner Yang
Tat-Seng Chua
36
24
0
10 May 2021
Do We Need Online NLU Tools?
Petr Lorenc
Petr Marek
Jan Pichl
Jakub Konrád
Jan Sedivý
21
6
0
19 Nov 2020
Tokenization Repair in the Presence of Spelling Errors
Hannah Bast
Matthias Hertel
M. Mohamed
14
5
0
15 Oct 2020
Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task
Marcin Junczys-Dowmunt
Roman Grundkiewicz
Shubha Guha
Kenneth Heafield
33
192
0
16 Apr 2018