ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.09897
  4. Cited By
Inducing Character-level Structure in Subword-based Language Models with
  Type-level Interchange Intervention Training

Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training

19 December 2022
Jing-ling Huang
Zhengxuan Wu
Kyle Mahowald
Christopher Potts
ArXivPDFHTML

Papers citing "Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training"

20 / 20 papers shown
Title
Inducing Causal Structure for Interpretable Neural Networks Applied to Glucose Prediction for T1DM Patients
Inducing Causal Structure for Interpretable Neural Networks Applied to Glucose Prediction for T1DM Patients
Ana Esponera
Giovanni Cinnà
BDL
CML
57
0
0
18 Mar 2025
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
32
2
0
28 Oct 2024
Retrieval Augmented Spelling Correction for E-Commerce Applications
Retrieval Augmented Spelling Correction for E-Commerce Applications
Xuan Guo
Rohit Patki
Dante Everaert
Christopher Potts
19
0
0
15 Oct 2024
CUTE: Measuring LLMs' Understanding of Their Tokens
CUTE: Measuring LLMs' Understanding of Their Tokens
Lukas Edman
Helmut Schmid
Alexander M. Fraser
32
3
0
23 Sep 2024
ReFT: Representation Finetuning for Language Models
ReFT: Representation Finetuning for Language Models
Zhengxuan Wu
Aryaman Arora
Zheng Wang
Atticus Geiger
Daniel Jurafsky
Christopher D. Manning
Christopher Potts
OffRL
30
33
0
04 Apr 2024
Knowledge of Pretrained Language Models on Surface Information of Tokens
Knowledge of Pretrained Language Models on Surface Information of Tokens
Tatsuya Hiraoka
Naoaki Okazaki
24
1
0
15 Feb 2024
Multi-teacher Distillation for Multilingual Spelling Correction
Multi-teacher Distillation for Multilingual Spelling Correction
Jingfen Zhang
Xuan Guo
S. Bodapati
Christopher Potts
KELM
10
3
0
20 Nov 2023
Learning Mutually Informed Representations for Characters and Subwords
Learning Mutually Informed Representations for Characters and Subwords
Yilin Wang
Xinyi Hu
Matthew R. Gormley
31
0
0
14 Nov 2023
Rigorously Assessing Natural Language Explanations of Neurons
Rigorously Assessing Natural Language Explanations of Neurons
Jing-ling Huang
Atticus Geiger
Karel DÓosterlinck
Zhengxuan Wu
Christopher Potts
MILM
21
25
0
19 Sep 2023
Causal interventions expose implicit situation models for commonsense
  language understanding
Causal interventions expose implicit situation models for commonsense language understanding
Takateru Yamakoshi
James L. McClelland
A. Goldberg
Robert D. Hawkins
17
6
0
06 Jun 2023
ScoNe: Benchmarking Negation Reasoning in Language Models With
  Fine-Tuning and In-Context Learning
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning
Jingyuan Selena She
Christopher Potts
Sam Bowman
Atticus Geiger
22
13
0
30 May 2023
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca
Zhengxuan Wu
Atticus Geiger
Thomas Icard
Christopher Potts
Noah D. Goodman
MILM
30
81
0
15 May 2023
Finding Alignments Between Interpretable Causal Variables and
  Distributed Neural Representations
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
Atticus Geiger
Zhengxuan Wu
Christopher Potts
Thomas F. Icard
Noah D. Goodman
CML
73
98
0
05 Mar 2023
Causal Proxy Models for Concept-Based Model Explanations
Causal Proxy Models for Concept-Based Model Explanations
Zhengxuan Wu
Karel DÓosterlinck
Atticus Geiger
Amir Zur
Christopher Potts
MILM
75
35
0
28 Sep 2022
Automated Crossword Solving
Automated Crossword Solving
Eric Wallace
Nicholas Tomlin
Albert Xu
Kevin Kaichuang Yang
Eshaan Pathak
Matthew Ginsberg
Dan Klein
40
12
0
19 May 2022
Causal Distillation for Language Models
Causal Distillation for Language Models
Zhengxuan Wu
Atticus Geiger
J. Rozner
Elisa Kreiss
Hanson Lu
Thomas F. Icard
Christopher Potts
Noah D. Goodman
81
25
0
05 Dec 2021
Why don't people use character-level machine translation?
Why don't people use character-level machine translation?
Jindrich Libovický
Helmut Schmid
Alexander M. Fraser
65
28
0
15 Oct 2021
Integrating Approaches to Word Representation
Integrating Approaches to Word Representation
Yuval Pinter
NAI
43
5
0
10 Sep 2021
Char2Subword: Extending the Subword Embedding Space Using Robust
  Character Compositionality
Char2Subword: Extending the Subword Embedding Space Using Robust Character Compositionality
Gustavo Aguilar
Bryan McCann
Tong Niu
Nazneen Rajani
N. Keskar
Thamar Solorio
42
12
0
24 Oct 2020
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary
  Representations From Characters
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
Hicham El Boukkouri
Olivier Ferret
Thomas Lavergne
Hiroshi Noji
Pierre Zweigenbaum
Junichi Tsujii
66
156
0
20 Oct 2020
1