Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.05262
Cited By
v1
v2
v3
v4
v5 (latest)
Locating and Editing Factual Associations in GPT
10 February 2022
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Locating and Editing Factual Associations in GPT"
50 / 1,056 papers shown
Title
A Human-Computer Collaborative Tool for Training a Single Large Language Model Agent into a Network through Few Examples
Lihang Pan
Yuxuan Li
Chun Yu
Yuanchun Shi
LLMAG
82
2
0
24 Apr 2024
How to use and interpret activation patching
Stefan Heimersheim
Neel Nanda
85
48
0
23 Apr 2024
TAXI: Evaluating Categorical Knowledge Editing for Language Models
Derek Powell
Walter Gerych
Thomas Hartvigsen
KELM
67
7
0
23 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
217
61
0
23 Apr 2024
Mechanistic Interpretability for AI Safety -- A Review
Leonard Bereska
E. Gavves
AI4CE
139
158
0
22 Apr 2024
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
Weili Zeng
Yichao Yan
Qi Zhu
Zhuo Chen
Pengzhi Chu
Weiming Zhao
Xiaokang Yang
171
10
0
22 Apr 2024
Neuron Specialization: Leveraging intrinsic task modularity for multilingual machine translation
Shaomu Tan
Di Wu
Christof Monz
MoMe
113
9
0
17 Apr 2024
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory
Ali Modarressi
Abdullatif Köksal
Ayyoob Imani
Mohsen Fayyaz
Hinrich Schütze
KELM
201
11
0
17 Apr 2024
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model
Hengyuan Zhang
Yanru Wu
Dawei Li
Zacc Yang
Rui Zhao
Yong Jiang
Fei Tan
ALM
142
1
0
16 Apr 2024
Explainable Generative AI (GenXAI): A Survey, Conceptualization, and Research Agenda
Johannes Schneider
142
35
0
15 Apr 2024
AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees
William Fleshman
Aleem Khan
Marc Marone
Benjamin Van Durme
CLL
KELM
126
4
0
12 Apr 2024
DyKnow:Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs
Seyed Mahed Mousavi
Simone Alghisi
Giuseppe Riccardi
KELM
92
7
0
10 Apr 2024
Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
Mingyu Jin
Qinkai Yu
Jingyuan Huang
Qingcheng Zeng
Zhenting Wang
...
Yanda Meng
Kaize Ding
Fan Yang
Jundong Li
Yongfeng Zhang
100
21
0
10 Apr 2024
Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning
Ruiqi Zhang
Licong Lin
Yu Bai
Song Mei
MU
145
193
0
08 Apr 2024
Finding Visual Task Vectors
Alberto Hojel
Yutong Bai
Trevor Darrell
Amir Globerson
Amir Bar
124
8
0
08 Apr 2024
Locating and Editing Factual Associations in Mamba
Arnab Sen Sharma
David Atkinson
David Bau
KELM
126
31
0
04 Apr 2024
Unveiling LLMs: The Evolution of Latent Representations in a Temporal Knowledge Graph
Marco Bronzini
Carlo Nicolini
Bruno Lepri
Jacopo Staiano
Andrea Passerini
KELM
78
0
0
04 Apr 2024
MuLan: A Study of Fact Mutability in Language Models
Constanza Fierro
Nicolas Garneau
Emanuele Bugliarello
Yova Kementchedjhieva
Anders Søgaard
KELM
HILM
69
9
0
03 Apr 2024
Empowering Biomedical Discovery with AI Agents
Shanghua Gao
Ada Fang
Yepeng Huang
Valentina Giunchiglia
Ayush Noori
Jonathan Richard Schwarz
Yasha Ektefaie
Jovana Kondic
Marinka Zitnik
LLMAG
AI4CE
112
100
0
03 Apr 2024
Scalable Model Editing via Customized Expert Networks
Zihan Yao
Yu He
Tianyu Qi
Ming Li
KELM
79
5
0
03 Apr 2024
UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing
Yijun Yang
Jie He
Pinzhen Chen
Víctor Gutiérrez-Basulto
Jeff Z. Pan
KELM
HILM
62
0
0
01 Apr 2024
Multi-hop Question Answering under Temporal Knowledge Editing
Keyuan Cheng
Gang Lin
Haoyang Fei
Yuxuan Zhai
Lu Yu
Muhammad Asif Ali
Lijie Hu
Di Wang
KELM
113
27
0
30 Mar 2024
PROMPT-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression
Muhammad Asif Ali
Zhengping Li
Shu Yang
Keyuan Cheng
Yang Cao
Tianhao Huang
Lijie Hu
Lu Yu
Di Wang
VLM
RALM
89
9
0
30 Mar 2024
On Large Language Models' Hallucination with Regard to Known Facts
Che Jiang
Biqing Qi
Xiangyu Hong
Dayuan Fu
Yang Cheng
Fandong Meng
Mo Yu
Bowen Zhou
Jie Zhou
HILM
LRM
75
22
0
29 Mar 2024
Localizing Paragraph Memorization in Language Models
Niklas Stoehr
Mitchell Gordon
Chiyuan Zhang
Owen Lewis
MU
68
15
0
28 Mar 2024
Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models
Ang Lv
Yuhan Chen
Kaiyi Zhang
Yulong Wang
Lifeng Liu
Ji-Rong Wen
Jian Xie
Rui Yan
KELM
76
18
0
28 Mar 2024
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Samuel Marks
Can Rager
Eric J. Michaud
Yonatan Belinkov
David Bau
Aaron Mueller
188
159
0
28 Mar 2024
Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations
Lei Yu
Meng Cao
Jackie Chi Kit Cheung
Yue Dong
HILM
86
15
0
27 Mar 2024
Robust and Scalable Model Editing for Large Language Models
Yingfa Chen
Zhengyan Zhang
Xu Han
Chaojun Xiao
Zhiyuan Liu
Chen Chen
Kuai Li
Tao Yang
Maosong Sun
KELM
52
2
0
26 Mar 2024
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov
Kushal Tirumala
Hassan Shapourian
Paolo Glorioso
Daniel A. Roberts
163
106
0
26 Mar 2024
CoLLEGe: Concept Embedding Generation for Large Language Models
Ryan Teehan
Brenden M. Lake
Mengye Ren
83
4
0
22 Mar 2024
KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation
Xindi Luo
Zequn Sun
Jing-xin Zhao
Zhe Zhao
Wei Hu
KELM
69
8
0
22 Mar 2024
MyVLM: Personalizing VLMs for User-Specific Queries
Yuval Alaluf
Elad Richardson
Sergey Tulyakov
Kfir Aberman
Daniel Cohen-Or
MLLM
VLM
110
23
0
21 Mar 2024
Detoxifying Large Language Models via Knowledge Editing
Meng Wang
Ningyu Zhang
Ziwen Xu
Zekun Xi
Shumin Deng
Yunzhi Yao
Qishen Zhang
Linyi Yang
Jindong Wang
Huajun Chen
KELM
114
66
0
21 Mar 2024
Locating and Mitigating Gender Bias in Large Language Models
Yuchen Cai
Ding Cao
Rongxi Guo
Yaqin Wen
Guiquan Liu
Enhong Chen
71
5
0
21 Mar 2024
Editing Knowledge Representation of Language Model via Rephrased Prefix Prompts
Yuchen Cai
Ding Cao
Rongxi Guo
Yaqin Wen
Guiquan Liu
Enhong Chen
KELM
100
5
0
21 Mar 2024
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
Yuren Mao
Xuemei Dong
Wenyi Xu
Yunjun Gao
Bin Wei
Ying Zhang
70
10
0
21 Mar 2024
WikiFactDiff: A Large, Realistic, and Temporally Adaptable Dataset for Atomic Factual Knowledge Update in Causal Language Models
Hichem Ammar Khodja
Frédéric Béchet
Quentin Brabant
Alexis Nasr
Gwénolé Lecorvé
HILM
KELM
SyDa
54
8
0
21 Mar 2024
A Unified Framework for Model Editing
Akshat Gupta
Dev Sajnani
Gopala Anumanchipalli
KELM
137
38
0
21 Mar 2024
Editing Massive Concepts in Text-to-Image Diffusion Models
Tianwei Xiong
Yue Wu
Enze Xie
Yue Wu
Zhenguo Li
Xihui Liu
148
11
0
20 Mar 2024
BadEdit: Backdooring large language models by model editing
Yanzhou Li
Tianlin Li
Kangjie Chen
Jian Zhang
Shangqing Liu
Wenhan Wang
Tianwei Zhang
Yang Liu
SyDa
AAML
KELM
119
67
0
20 Mar 2024
Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices
Sara Abdali
Richard Anarfi
C. Barberan
Jia He
Erfan Shayegani
PILM
146
31
0
19 Mar 2024
Larimar: Large Language Models with Episodic Memory Control
Payel Das
Subhajit Chaudhury
Elliot Nelson
Igor Melnyk
Sarath Swaminathan
...
Vijil Chenthamarakshan
Jiří
Jirí Navrátil
Soham Dan
Pin-Yu Chen
CLL
KELM
108
24
0
18 Mar 2024
Embedded Named Entity Recognition using Probing Classifiers
Nicholas Popovic
Michael Färber
85
1
0
18 Mar 2024
Enhancing Event Causality Identification with Rationale and Structure-Aware Causal Question Answering
Baiyan Zhang
Qin Chen
Jie Zhou
Jian Jin
Liang He
54
3
0
17 Mar 2024
SelfIE: Self-Interpretation of Large Language Model Embeddings
Haozhe Chen
Carl Vondrick
Chengzhi Mao
77
27
0
16 Mar 2024
Monotonic Representation of Numeric Properties in Language Models
Benjamin Heinzerling
Kentaro Inui
KELM
MILM
110
10
0
15 Mar 2024
Scaling Behavior of Machine Translation with Large Language Models under Prompt Injection Attacks
Zhifan Sun
Antonio Valerio Miceli Barone
43
2
0
14 Mar 2024
Ethos: Rectifying Language Models in Orthogonal Parameter Space
Lei Gao
Yue Niu
Tingting Tang
A. Avestimehr
Murali Annavaram
MU
87
12
0
13 Mar 2024
The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models
Carlo Nicolini
Jacopo Staiano
Bruno Lepri
Raffaele Marino
MoE
67
1
0
13 Mar 2024
Previous
1
2
3
...
13
14
15
...
20
21
22
Next