ResearchTrend.AI
Locating and Editing Factual Associations in GPT

10 February 2022
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
    KELM

Papers citing "Locating and Editing Factual Associations in GPT"

50 / 1,056 papers shown
Retrieval-Augmented Generation for Natural Language Processing: A Survey
Shangyu Wu
Ying Xiong
Yufei Cui
Haolun Wu
Can Chen
...
Lianming Huang
Xue Liu
Tei-Wei Kuo
Nan Guan
Chun Jason Xue
3DV RALM
152
39
0
18 Jul 2024
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals
Jaden Fiotto-Kaufman
Alexander R. Loftus
Eric Todd
Jannik Brinkmann
Caden Juang
...
Carla Brodley
Arjun Guha
Jonathan Bell
Byron C. Wallace
David Bau
115
5
0
18 Jul 2024
Establishing Knowledge Preference in Language Models
Sizhe Zhou
Sha Li
Yu Meng
Yizhu Jiao
Heng Ji
Jiawei Han
KELM
135
0
0
17 Jul 2024
LLM Circuit Analyses Are Consistent Across Training and Scale
Curt Tigges
Michael Hanna
Qinan Yu
Stella Biderman
109
18
0
15 Jul 2024
How and where does CLIP process negation?
Vincent Quantmeyer
Pablo Mosteiro
Albert Gatt
CoGe
80
9
0
15 Jul 2024
Cross-Lingual Multi-Hop Knowledge Editing
Aditi Khandelwal
Harman Singh
Hengrui Gu
Tianlong Chen
Kaixiong Zhou
KELM
63
0
0
14 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
Haoze Song
SyDa
78
7
0
12 Jul 2024
Transformer Circuit Faithfulness Metrics are not Robust
Joseph Miller
Bilal Chughtai
William Saunders
104
7
0
11 Jul 2024
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Huanqian Wang
Yang Yue
Rui Lu
Jingxin Shi
Andrew Zhao
Shenzhi Wang
Shiji Song
Gao Huang
LM&Ro KELM
151
0
0
11 Jul 2024
Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models
Yuji Zhang
Sha Li
Jiateng Liu
Pengfei Yu
Yi R. Fung
Jing Li
Manling Li
Heng Ji
110
12
0
10 Jul 2024
Uncovering Layer-Dependent Activation Sparsity Patterns in ReLU Transformers
Cody Wild
Jesper Anderson
MoE
72
0
0
10 Jul 2024
Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)
K. Kenthapadi
M. Sameki
Ankur Taly
HILM ELM AILaw
83
15
0
10 Jul 2024
Composable Interventions for Language Models
Arinbjorn Kolbeinsson
Kyle O'Brien
Tianjin Huang
Shanghua Gao
Shiwei Liu
...
Anurag J. Vaidya
Faisal Mahmood
Marinka Zitnik
Tianlong Chen
Thomas Hartvigsen
KELM MU
206
4
0
09 Jul 2024
MUSE: Machine Unlearning Six-Way Evaluation for Language Models
Weijia Shi
Jaechan Lee
Yangsibo Huang
Sadhika Malladi
Jieyu Zhao
Ari Holtzman
Daogao Liu
Luke Zettlemoyer
Noah A. Smith
Chiyuan Zhang
MU ELM
113
84
0
08 Jul 2024
CodeUpdateArena: Benchmarking Knowledge Editing on API Updates
Zeyu Leo Liu
Shrey Pandit
Xi Ye
Eunsol Choi
Greg Durrett
KELM ALM
145
7
0
08 Jul 2024
Missed Causes and Ambiguous Effects: Counterfactuals Pose Challenges for Interpreting Neural Networks
Aaron Mueller
CML
77
10
0
05 Jul 2024
Concept Bottleneck Models Without Predefined Concepts
Simon Schrodi
Julian Schur
Max Argus
Thomas Brox
84
12
0
04 Jul 2024
Functional Faithfulness in the Wild: Circuit Discovery with Differentiable Computation Graph Pruning
Lei Yu
Jingcheng Niu
Zining Zhu
Gerald Penn
84
7
0
04 Jul 2024
Truth is Universal: Robust Detection of Lies in LLMs
Lennart Bürger
Fred Hamprecht
B. Nadler
HILM
108
22
0
03 Jul 2024
Why Does New Knowledge Create Messy Ripple Effects in LLMs?
Jiaxin Qin
Zixuan Zhang
Chi Han
Manling Li
Pengfei Yu
Heng Ji
KELM
95
12
0
02 Jul 2024
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models
Bozhong Tian
Xiaozhuan Liang
Siyuan Cheng
Qingbin Liu
Mengru Wang
Dianbo Sui
Xi Chen
Huajun Chen
Xin Xu
MU
89
14
0
02 Jul 2024
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
Daking Rai
Yilun Zhou
Shi Feng
Abulhair Saparov
Ziyu Yao
194
33
0
02 Jul 2024
PFME: A Modular Approach for Fine-grained Hallucination Detection and Editing of Large Language Models
Kunquan Deng
Zeyu Huang
Chen Li
Chenghua Lin
Min Gao
Wenge Rong
KELM
71
0
0
29 Jun 2024
Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs
Sheridan Feucht
David Atkinson
Byron C. Wallace
David Bau
109
8
0
28 Jun 2024
LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models
Renzhi Wang
Piji Li
KELM CLL
103
10
0
28 Jun 2024
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation
Zijun Yao
Weijian Qi
Liangming Pan
S. Cao
Linmei Hu
Weichuan Liu
Lei Hou
Juanzi Li
RALM
80
6
0
27 Jun 2024
AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
Aishwarya Agarwal
Srikrishna Karanam
Balaji Vasan Srinivasan
96
1
0
27 Jun 2024
The Remarkable Robustness of LLMs: Stages of Inference?
Vedang Lad
Wes Gurnee
Max Tegmark
136
53
0
27 Jun 2024
Evaluating Copyright Takedown Methods for Language Models
Boyi Wei
Weijia Shi
Yangsibo Huang
Noah A. Smith
Chiyuan Zhang
Luke Zettlemoyer
Kai Li
Peter Henderson
154
25
0
26 Jun 2024
IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons
Dan Shi
Renren Jin
Tianhao Shen
Weilong Dong
Xinwei Wu
Deyi Xiong
105
11
0
26 Jun 2024
Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers
Yibo Jiang
Goutham Rajendran
Pradeep Ravikumar
Bryon Aragam
CLL KELM
96
8
0
26 Jun 2024
Enhancing Data Privacy in Large Language Models through Private Association Editing
Davide Venditti
Elena Sofia Ruzzetti
Giancarlo A. Xompero
Cristina Giannone
Andrea Favalli
Raniero Romagnoli
Fabio Massimo Zanzotto
KELM
74
3
0
26 Jun 2024
Transformer Normalisation Layers and the Independence of Semantic Subspaces
S. Menary
Samuel Kaski
Andre Freitas
93
2
0
25 Jun 2024
How Well Can Knowledge Edit Methods Edit Perplexing Knowledge?
Huaizhi Ge
Frank Rudzicz
Zining Zhu
KELM
111
4
0
25 Jun 2024
BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning
Ercong Nie
Bo Shao
Zifeng Ding
Mingyang Wang
Helmut Schmid
Hinrich Schütze
KELM
211
8
0
25 Jun 2024
It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension
Sagi Shaier
Lawrence E Hunter
Katharina von der Wense
108
4
0
24 Jun 2024
Multilingual Knowledge Editing with Language-Agnostic Factual Neurons
Xue Zhang
Yunlong Liang
Fandong Meng
Songming Zhang
Yufeng Chen
Jinan Xu
Jie Zhou
KELM
76
9
0
24 Jun 2024
MD tree: a model-diagnostic tree grown on loss landscape
Yefan Zhou
Jianlong Chen
Qinxue Cao
Konstantin Schürholt
Yaoqing Yang
102
2
0
24 Jun 2024
Confidence Regulation Neurons in Language Models
Alessandro Stolfo
Ben Wu
Wes Gurnee
Yonatan Belinkov
Xingyi Song
Mrinmaya Sachan
Neel Nanda
82
20
0
24 Jun 2024
Large Language Models Are Cross-Lingual Knowledge-Free Reasoners
Peng Hu
Sizhe Liu
Changjiang Gao
Xin Huang
Xue Han
Junlan Feng
Chao Deng
Shujian Huang
LRM
157
5
0
24 Jun 2024
What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Gaussian-Noise-free Text-Image Corruption and Evaluation
Michal Golovanevsky
William Rudman
Vedant Palit
Ritambhara Singh
Carsten Eickhoff
132
3
0
24 Jun 2024
FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models
Junyi Zhu
Shuochen Liu
Yu Yu
Bo Tang
Yibo Yan
Zhiyu Li
Feiyu Xiong
Tong Xu
Matthew B. Blaschko
100
5
0
23 Jun 2024
Memorizing Documents with Guidance in Large Language Models
Bumjin Park
Jaesik Choi
KELM RALM
71
1
0
23 Jun 2024
Unveiling LLM Mechanisms Through Neural ODEs and Control Theory
Yukun Zhang
Qi Dong
121
0
0
23 Jun 2024
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
Michael A. Lepori
Alexa R. Tartaglini
Wai Keen Vong
Thomas Serre
Brenden M. Lake
Ellie Pavlick
91
4
0
22 Jun 2024
Beyond Individual Facts: Investigating Categorical Knowledge Locality of Taxonomy and Meronomy Concepts in GPT Models
Christopher Burger
Yifan Hu
Thai Le
KELM
86
0
0
22 Jun 2024
Steering Without Side Effects: Improving Post-Deployment Control of Language Models
Asa Cooper Stickland
Alexander Lyzhov
Jacob Pfau
Salsabila Mahdi
Samuel R. Bowman
LLMSV AAML
116
24
0
21 Jun 2024
Finding Safety Neurons in Large Language Models
Jianhui Chen
Xiaozhi Wang
Zijun Yao
Yushi Bai
Lei Hou
Juanzi Li
KELM LLMSV
93
18
0
20 Jun 2024
Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models
Shijie Han
Zhenyu Zhang
Andrei Arsene Simion
88
2
0
20 Jun 2024
Locating and Extracting Relational Concepts in Large Language Models
Zijian Wang
Britney White
Chang Xu
KELM
77
1
0
19 Jun 2024