Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.05417
Cited By
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models
8 May 2024
Sander Land
Max Bartolo
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models"
5 / 5 papers shown
Title
Shared Global and Local Geometry of Language Model Embeddings
Andrew Lee
Melanie Weber
F. Viégas
Martin Wattenberg
FedML
76
3
0
27 Mar 2025
SuperBPE: Space Travel for Language Models
Alisa Liu
J. Hayase
Valentin Hofmann
Sewoong Oh
Noah A. Smith
Yejin Choi
43
3
0
17 Mar 2025
Enhancing Jailbreak Attack Against Large Language Models through Silent Tokens
Jiahao Yu
Haozheng Luo
Jerry Yao-Chieh Hu
Wenbo Guo
Han Liu
Xinyu Xing
40
18
0
31 May 2024
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Suraj Anand
Michael A. Lepori
Jack Merullo
Ellie Pavlick
CLL
31
6
0
28 May 2024
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
Dixuan Wang
Yanda Li
Junyuan Jiang
Zepeng Ding
Ziqin Luo
Guochao Jiang
Jiaqing Liang
Deqing Yang
27
11
0
27 May 2024
1