CharBERT: Character-aware Pre-trained Language Model

3 November 2020

ArXiv (abs)PDF HTML Github (121★)

Papers citing "CharBERT: Character-aware Pre-trained Language Model"

50 / 59 papers shown

Using External knowledge to Enhanced PLM for Semantic MatchingInternational Conference on Intelligent Computing (ICIC), 2025

Min Li

Chun Yuan

310

10 May 2025

KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications

M. Bommarito

Daniel Martin Katz

Jillian Bommarito

225

21 Mar 2025

Comateformer: Combined Attention Transformer for Semantic Sentence MatchingEuropean Conference on Artificial Intelligence (ECAI), 2024

Bo Li

Di Liang

Zixin Zhang

304

10 Dec 2024

TempCharBERT: Keystroke Dynamics for Continuous Access Control Based on Pre-trained Language ModelsInternational Workshop on Information Forensics and Security (WIFS), 2024

141

11 Nov 2024

From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes

Zébulon Goriely

Richard Diehl Martinez

334

30 Oct 2024

MrT5: Dynamic Token Merging for Efficient Byte-level Language ModelsInternational Conference on Learning Representations (ICLR), 2024

Julie Kallini

Shikhar Murty

Christopher D. Manning

Christopher Potts

Róbert Csordás

495

28 Oct 2024

LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems

Nan Xu

Xuezhe Ma

LRM

445

18 Oct 2024

Advancing Post-OCR Correction: A Comparative Study of Synthetic DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Shuhao Guan

Derek Greene

383

05 Aug 2024

Beyond Binary Gender Labels: Revealing Gender Biases in LLMs through Gender-Neutral Name Predictions

270

07 Jul 2024

KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning

Hui Xue

240

24 Jun 2024

Large Language Models for Cyber Security: A Systematic Literature Review

773

161

08 May 2024

Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question AnsweringIEEE International Conference on Multimedia and Expo (ICME), 2024

346

14 Mar 2024

Knowledge of Pretrained Language Models on Surface Information of Tokens

Tatsuya Hiraoka

Naoaki Okazaki

344

15 Feb 2024

MambaByte: Token-free Selective State Space Model

437

24 Jan 2024

TransURL: Improving malicious URL detection with multi-layer Transformer encoding and multi-scale pyramid features

Yanbin Wang

226

01 Dec 2023

Learning Mutually Informed Representations for Characters and Subwords

Yilin Wang

Xinyi Hu

Matthew R. Gormley

244

14 Nov 2023

Text Rendering Strategies for Pixel Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

423

01 Nov 2023

Optimized Tokenization for Transcribed Error CorrectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Tomer Wullach

Shlomo E. Chazan

235

16 Oct 2023

Enhancing OCR Performance through Post-OCR Models: Adopting Glyph Embedding for Improved Correction

Yung-Hsin Chen

Yuli Zhou

212

29 Aug 2023

SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification

J. Wu

Dit-Yan Yeung

SILM

383

04 Jul 2023

People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval TextsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

259

26 May 2023

From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Kayhan Batmanghelich

239

23 May 2023

IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining

229

06 Mar 2023

Elementwise Language Representation

Du-Yeong Kim

Jeeeun Kim

240

27 Feb 2023

READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input NoisesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Chenglei Si

Zhengyan Zhang

Yingfa Chen

Xiaozhi Wang

Zhiyuan Liu

Maosong Sun

AAML

319

14 Feb 2023

MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling

Nathan Godey

Roman Castagné

Eric Villemonte de la Clergerie

Benoît Sagot

187

14 Dec 2022

Word-Level Representation From Bytes For Language Modeling

Chul Lee

Qipeng Guo

Xipeng Qiu

238

23 Nov 2022

CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog EvaluationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

364

21 Nov 2022

Continuous Prompt Tuning Based Textual Entailment Model for E-commerce Entity Typing

Yibo Wang

Congying Xia

Guan Wang

Philip Yu

203

04 Nov 2022

Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT

B. Bhavya

Jinjun Xiong

Chengxiang Zhai

LRM

212

09 Oct 2022

MockingBERT: A Method for Retroactively Adding Resilience to NLP ModelsInternational Conference on Computational Linguistics (COLING), 2022

Jan Jezabek

A. Singh

SILM KELM

162

21 Aug 2022

Language Modelling with PixelsInternational Conference on Learning Representations (ICLR), 2022

423

14 Jul 2022

Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation

Guido Zuccon

405

21 Jun 2022

Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research ManifoldFindings (Findings), 2022

Sebastian Ruder

Ivan Vulić

Anders Søgaard

210

20 Jun 2022

Local Byte Fusion for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Makesh Narsimhan Sreedhar

Xiangpeng Wan

Yu-Jie Cheng

Junjie Hu

613

23 May 2022

Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding

...

Xin Jiang

Qun Liu

Philippe Langlais

230

21 May 2022

Down and Across: Introducing Crossword-Solving as a New NLP BenchmarkAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

410

20 May 2022

Signal in Noise: Exploring Meaning Encoded in Random Character Sequences with Character-Aware Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Mark Chu

Bhargav Srinivasa Desikan

251

15 Mar 2022

Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little CostAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Lihu Chen

Gaël Varoquaux

Fabian M. Suchanek

307

15 Mar 2022

Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Vaidehi Patil

Partha P. Talukdar

Sunita Sarawagi

452

03 Mar 2022

Artificial Intelligence for the Metaverse: A SurveyEngineering applications of artificial intelligence (EAAI), 2022

490

510

15 Feb 2022

An Assessment of the Impact of OCR Noise on Language ModelsInternational Conference on Agents and Artificial Intelligence (ICAART), 2022

Konstantin Todorov

Giovanni Colavizza

418

26 Jan 2022

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP

...

371

207

20 Dec 2021

Using Distributional Principles for the Semantic Study of Contextual Language Models

Olivier Ferret

167

23 Nov 2021

Character-level HyperNetworks for Hate Speech DetectionExpert systems with applications (ESWA), 2021

Tomer Wullach

A. Adler

Einat Minkov

199

11 Nov 2021

Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?

Arij Riabi

Benoît Sagot

Djamé Seddah

307

26 Oct 2021

BERT Cannot Align Characters

Antonis Maronikolakis

Philipp Dufter

Hinrich Schütze

202

20 Sep 2021

Integrating Approaches to Word Representation

Yuval Pinter

NAI

270

10 Sep 2021

AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing

Katikapalli Subramanyam Kalyan

A. Rajasekharan

S. Sangeetha

VLM LM&MA

437

322

12 Aug 2021

LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence Semantic MatchingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021

Kun Zhang

Guangyi Lv

Le Wu

Enhong Chen

Qi Liu

Meng Wang

336

06 Aug 2021