Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training

19 December 2022

Papers citing "Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training"

Inducing Causal Structure for Interpretable Neural Networks Applied to Glucose Prediction for T1DM Patients Ana Esponera Giovanni Cinnà BDL CML 57 0 0 18 Mar 2025
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models Julie Kallini Shikhar Murty Christopher D. Manning Christopher Potts Róbert Csordás 32 2 0 28 Oct 2024
Retrieval Augmented Spelling Correction for E-Commerce Applications Xuan Guo Rohit Patki Dante Everaert Christopher Potts 19 0 0 15 Oct 2024
CUTE: Measuring LLMs' Understanding of Their Tokens Lukas Edman Helmut Schmid Alexander M. Fraser 32 3 0 23 Sep 2024
ReFT: Representation Finetuning for Language Models Zhengxuan Wu Aryaman Arora Zheng Wang Atticus Geiger Daniel Jurafsky Christopher D. Manning Christopher Potts OffRL 30 33 0 04 Apr 2024
Knowledge of Pretrained Language Models on Surface Information of Tokens Tatsuya Hiraoka Naoaki Okazaki 24 1 0 15 Feb 2024
Multi-teacher Distillation for Multilingual Spelling Correction Jingfen Zhang Xuan Guo S. Bodapati Christopher Potts KELM 10 3 0 20 Nov 2023
Learning Mutually Informed Representations for Characters and Subwords Yilin Wang Xinyi Hu Matthew R. Gormley 31 0 0 14 Nov 2023
Rigorously Assessing Natural Language Explanations of Neurons Jing-ling Huang Atticus Geiger Karel D'Oosterlinck Zhengxuan Wu Christopher Potts MILM 21 25 0 19 Sep 2023
Causal interventions expose implicit situation models for commonsense language understanding Takateru Yamakoshi James L. McClelland A. Goldberg Robert D. Hawkins 17 6 0 06 Jun 2023
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning Jingyuan Selena She Christopher Potts Sam Bowman Atticus Geiger 22 13 0 30 May 2023
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca Zhengxuan Wu Atticus Geiger Thomas Icard Christopher Potts Noah D. Goodman MILM 30 81 0 15 May 2023
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations Atticus Geiger Zhengxuan Wu Christopher Potts Thomas F. Icard Noah D. Goodman CML 73 98 0 05 Mar 2023
Causal Proxy Models for Concept-Based Model Explanations Zhengxuan Wu Karel D'Oosterlinck Atticus Geiger Amir Zur Christopher Potts MILM 75 35 0 28 Sep 2022
Automated Crossword Solving Eric Wallace Nicholas Tomlin Albert Xu Kevin Kaichuang Yang Eshaan Pathak Matthew Ginsberg Dan Klein 40 12 0 19 May 2022
Causal Distillation for Language Models Zhengxuan Wu Atticus Geiger J. Rozner Elisa Kreiss Hanson Lu Thomas F. Icard Christopher Potts Noah D. Goodman 81 25 0 05 Dec 2021
Why don't people use character-level machine translation? Jindřich Libovický Helmut Schmid Alexander M. Fraser 65 28 0 15 Oct 2021
Integrating Approaches to Word Representation Yuval Pinter NAI 43 5 0 10 Sep 2021
Char2Subword: Extending the Subword Embedding Space Using Robust Character Compositionality Gustavo Aguilar Bryan McCann Tong Niu Nazneen Rajani N. Keskar Thamar Solorio 42 12 0 24 Oct 2020
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters Hicham El Boukkouri Olivier Ferret Thomas Lavergne Hiroshi Noji Pierre Zweigenbaum Junichi Tsujii 66 156 0 20 Oct 2020