ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.01513
  4. Cited By
CharBERT: Character-aware Pre-trained Language Model

CharBERT: Character-aware Pre-trained Language Model

3 November 2020
Wentao Ma
Yiming Cui
Chenglei Si
Ting Liu
Shijin Wang
Guoping Hu
ArXivPDFHTML

Papers citing "CharBERT: Character-aware Pre-trained Language Model"

50 / 62 papers shown
Title
Using External knowledge to Enhanced PLM for Semantic Matching
Using External knowledge to Enhanced PLM for Semantic Matching
Min Li
Chun Yuan
34
0
0
10 May 2025
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications
M. Bommarito
Daniel Martin Katz
Jillian Bommarito
42
1
0
21 Mar 2025
Comateformer: Combined Attention Transformer for Semantic Sentence
  Matching
Comateformer: Combined Attention Transformer for Semantic Sentence Matching
Bo Li
Di Liang
Zixin Zhang
67
2
0
10 Dec 2024
TempCharBERT: Keystroke Dynamics for Continuous Access Control Based on
  Pre-trained Language Models
TempCharBERT: Keystroke Dynamics for Continuous Access Control Based on Pre-trained Language Models
Matheus Simão
Fabiano Prado
Omar Abdul Wahab
Anderson Avila
26
0
0
11 Nov 2024
From Babble to Words: Pre-Training Language Models on Continuous Streams
  of Phonemes
From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes
Zébulon Goriely
Richard Diehl Martinez
Andrew Caines
Lisa Beinborn
P. Buttery
CLL
50
5
0
30 Oct 2024
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
40
2
0
28 Oct 2024
LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems
LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems
Nan Xu
Xuezhe Ma
LRM
59
3
0
18 Oct 2024
Advancing Post-OCR Correction: A Comparative Study of Synthetic Data
Advancing Post-OCR Correction: A Comparative Study of Synthetic Data
Shuhao Guan
Derek Greene
34
6
0
05 Aug 2024
Beyond Binary Gender Labels: Revealing Gender Biases in LLMs through
  Gender-Neutral Name Predictions
Beyond Binary Gender Labels: Revealing Gender Biases in LLMs through Gender-Neutral Name Predictions
Zhiwen You
Haejin Lee
Shubhanshu Mishra
Sullam Jeoung
Apratim Mishra
Jinseok Kim
Jana Diesner
34
9
0
07 Jul 2024
KEHRL: Learning Knowledge-Enhanced Language Representations with
  Hierarchical Reinforcement Learning
KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning
Dongyang Li
Taolin Zhang
Longtao Huang
Chengyu Wang
Xiaofeng He
Hui Xue
KELM
OffRL
33
0
0
24 Jun 2024
Large Language Models for Cyber Security: A Systematic Literature Review
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
Kaidi Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Liu
Haoyu Wang
37
23
0
08 May 2024
Adversarial Training with OCR Modality Perturbation for Scene-Text
  Visual Question Answering
Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering
Zhixuan Shen
Haonan Luo
Sijia Li
Tianrui Li
26
0
0
14 Mar 2024
Knowledge of Pretrained Language Models on Surface Information of Tokens
Knowledge of Pretrained Language Models on Surface Information of Tokens
Tatsuya Hiraoka
Naoaki Okazaki
32
1
0
15 Feb 2024
MambaByte: Token-free Selective State Space Model
MambaByte: Token-free Selective State Space Model
Junxiong Wang
Tushaar Gangavarapu
Jing Nathan Yan
Alexander M. Rush
Mamba
44
35
0
24 Jan 2024
TransURL: Improving malicious URL detection with multi-layer Transformer encoding and multi-scale pyramid features
TransURL: Improving malicious URL detection with multi-layer Transformer encoding and multi-scale pyramid features
Ruitong Liu
Yanbin Wang
Zhenhao Guo
Haitao Xu
Zhan Qin
Wenrui Ma
Fan Zhang
AI4TS
ViT
19
7
0
01 Dec 2023
Learning Mutually Informed Representations for Characters and Subwords
Learning Mutually Informed Representations for Characters and Subwords
Yilin Wang
Xinyi Hu
Matthew R. Gormley
36
0
0
14 Nov 2023
Text Rendering Strategies for Pixel Language Models
Text Rendering Strategies for Pixel Language Models
Jonas F. Lotz
Elizabeth Salesky
Phillip Rust
Desmond Elliott
VLM
29
11
0
01 Nov 2023
Optimized Tokenization for Transcribed Error Correction
Optimized Tokenization for Transcribed Error Correction
Tomer Wullach
Shlomo E. Chazan
32
0
0
16 Oct 2023
Enhancing OCR Performance through Post-OCR Models: Adopting Glyph
  Embedding for Improved Correction
Enhancing OCR Performance through Post-OCR Models: Adopting Glyph Embedding for Improved Correction
Yung-Hsin Chen
Yuli Zhou
24
2
0
29 Aug 2023
SCAT: Robust Self-supervised Contrastive Learning via Adversarial
  Training for Text Classification
SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification
J. Wu
Dit-Yan Yeung
SILM
25
0
0
04 Jul 2023
People and Places of Historical Europe: Bootstrapping Annotation
  Pipeline and a New Corpus of Named Entities in Late Medieval Texts
People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts
Vít Novotný
Kristýna Luger
Michal Štefánik
Tereza Vrabcová
Ales Horak
46
1
0
26 May 2023
From Characters to Words: Hierarchical Pre-trained Language Model for
  Open-vocabulary Language Understanding
From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding
Li Sun
F. Luisier
Kayhan Batmanghelich
D. Florêncio
Changrong Zhang
VLM
23
6
0
23 May 2023
IPA-CLIP: Integrating Phonetic Priors into Vision and Language
  Pretraining
IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining
Chihaya Matsuhira
Marc A. Kastner
Takahiro Komamizu
Takatsugu Hirayama
Keisuke Doman
Yasutomo Kawanishi
Ichiro Ide
37
6
0
06 Mar 2023
Elementwise Language Representation
Elementwise Language Representation
Du-Yeong Kim
Jeeeun Kim
33
0
0
27 Feb 2023
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input
  Noises
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises
Chenglei Si
Zhengyan Zhang
Yingfa Chen
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
AAML
26
1
0
14 Feb 2023
MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End
  Language Modeling
MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling
Nathan Godey
Roman Castagné
Eric Villemonte de la Clergerie
Benoît Sagot
21
3
0
14 Dec 2022
Word-Level Representation From Bytes For Language Modeling
Word-Level Representation From Bytes For Language Modeling
Chul Lee
Qipeng Guo
Xipeng Qiu
17
1
0
23 Nov 2022
CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog
  Evaluation
CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog Evaluation
Yinpei Dai
Wanwei He
Bowen Li
Yuchuan Wu
Zhen Cao
Zhongqi An
Jian Sun
Yongbin Li
ELM
ALM
41
12
0
21 Nov 2022
Continuous Prompt Tuning Based Textual Entailment Model for E-commerce
  Entity Typing
Continuous Prompt Tuning Based Textual Entailment Model for E-commerce Entity Typing
Yibo Wang
Congying Xia
Guan Wang
Philip Yu
21
6
0
04 Nov 2022
Analogy Generation by Prompting Large Language Models: A Case Study of
  InstructGPT
Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT
B. Bhavya
Jinjun Xiong
Chengxiang Zhai
LRM
45
42
0
09 Oct 2022
MockingBERT: A Method for Retroactively Adding Resilience to NLP Models
MockingBERT: A Method for Retroactively Adding Resilience to NLP Models
Jan Jezabek
A. Singh
SILM
KELM
31
0
0
21 Aug 2022
Language Modelling with Pixels
Language Modelling with Pixels
Phillip Rust
Jonas F. Lotz
Emanuele Bugliarello
Elizabeth Salesky
Miryam de Lhoneux
Desmond Elliott
VLM
38
46
0
14 Jul 2022
Bridging the Gap Between Indexing and Retrieval for Differentiable
  Search Index with Query Generation
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation
Shengyao Zhuang
Houxing Ren
Linjun Shou
Jian Pei
Ming Gong
Guido Zuccon
Daxin Jiang
40
65
0
21 Jun 2022
Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the
  Research Manifold
Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold
Sebastian Ruder
Ivan Vulić
Anders Søgaard
41
29
0
20 Jun 2022
Local Byte Fusion for Neural Machine Translation
Local Byte Fusion for Neural Machine Translation
Makesh Narsimhan Sreedhar
Xiangpeng Wan
Yu-Jie Cheng
Junjie Hu
27
4
0
23 May 2022
Revisiting Pre-trained Language Models and their Evaluation for Arabic
  Natural Language Understanding
Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding
Abbas Ghaddar
Yimeng Wu
Sunyam Bagga
Ahmad Rashid
Khalil Bibi
...
Zhefeng Wang
Baoxing Huai
Xin Jiang
Qun Liu
Philippe Langlais
27
6
0
21 May 2022
Down and Across: Introducing Crossword-Solving as a New NLP Benchmark
Down and Across: Introducing Crossword-Solving as a New NLP Benchmark
Saurabh Kulshreshtha
Olga Kovaleva
Namrata Shivagunde
Anna Rumshisky
ELM
LRM
29
4
0
20 May 2022
Signal in Noise: Exploring Meaning Encoded in Random Character Sequences
  with Character-Aware Language Models
Signal in Noise: Exploring Meaning Encoded in Random Character Sequences with Character-Aware Language Models
Mark Chu
Bhargav Srinivasa Desikan
E. Nadler
Ruggerio L. Sardo
Elise Darragh-Ford
Douglas Guilbeault
20
0
0
15 Mar 2022
Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models
  Robust with Little Cost
Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
21
14
0
15 Mar 2022
Overlap-based Vocabulary Generation Improves Cross-lingual Transfer
  Among Related Languages
Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages
Vaidehi Patil
Partha P. Talukdar
Sunita Sarawagi
24
21
0
03 Mar 2022
Artificial Intelligence for the Metaverse: A Survey
Artificial Intelligence for the Metaverse: A Survey
Thien Huynh-The
Viet Quoc Pham
Xuan-Qui Pham
Thanh Thi Nguyen
Zhu Han
Dong-Seong Kim
30
408
0
15 Feb 2022
An Assessment of the Impact of OCR Noise on Language Models
An Assessment of the Impact of OCR Noise on Language Models
Konstantin Todorov
Giovanni Colavizza
9
7
0
26 Jan 2022
Between words and characters: A Brief History of Open-Vocabulary
  Modeling and Tokenization in NLP
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
32
142
0
20 Dec 2021
Using Distributional Principles for the Semantic Study of Contextual
  Language Models
Using Distributional Principles for the Semantic Study of Contextual Language Models
Olivier Ferret
25
1
0
23 Nov 2021
Character-level HyperNetworks for Hate Speech Detection
Character-level HyperNetworks for Hate Speech Detection
Tomer Wullach
A. Adler
Einat Minkov
16
12
0
11 Nov 2021
Can Character-based Language Models Improve Downstream Task Performance
  in Low-Resource and Noisy Language Scenarios?
Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
Arij Riabi
Benoît Sagot
Djamé Seddah
31
15
0
26 Oct 2021
BERT Cannot Align Characters
BERT Cannot Align Characters
Antonis Maronikolakis
Philipp Dufter
Hinrich Schütze
25
0
0
20 Sep 2021
Integrating Approaches to Word Representation
Integrating Approaches to Word Representation
Yuval Pinter
NAI
48
5
0
10 Sep 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural
  Language Processing
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
VLM
LM&MA
26
261
0
12 Aug 2021
LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence
  Semantic Matching
LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence Semantic Matching
Kun Zhang
Guangyi Lv
Le Wu
Enhong Chen
Qi Liu
Meng Wang
28
6
0
06 Aug 2021
12
Next