Detecting Edit Failures In Large Language Models: An Improved
Specificity Benchmark

v1v2 (latest)

Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark

27 May 2023

J. Hoelscher-Obermaier

Ioannis Konstas

ArXiv (abs)PDF HTML

Papers citing "Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark"

14 / 14 papers shown

Title
ThinkEval: Practical Evaluation of Knowledge Preservation and Consistency in LLM Editing with Thought-based Knowledge Graphs Manit Baser D. Divakaran M. Gurusamy KELM 78 0 0 02 Jun 2025
SEPS: A Separability Measure for Robust Unlearning in LLMs Wonje Jeung Sangyeon Yoon Albert No MU VLM 228 1 0 20 May 2025
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection Yuwei Zhang Wenhao Yu Shangbin Feng Yifan Zhu Letian Peng Jayanth Srinivasa Gaowen Liu Jingbo Shang KELM 73 2 0 18 May 2025
Revealing and Mitigating Over-Attention in Knowledge Editing Pinzheng Wang Zecheng Tang Keyan Zhou Junlin Li Qiaoming Zhu Hao Fei KELM 175 3 0 21 Feb 2025
The Knowledge Microscope: Features as Better Analytical Lenses than Neurons Yuheng Chen Pengfei Cao Kang Liu Jun Zhao 85 2 0 18 Feb 2025
Uncovering Overfitting in Large Language Model Editing Mengqi Zhang Xiaotian Ye Qiang Liu Pengjie Ren Shu Wu Zhumin Chen KELM 78 16 0 10 Oct 2024
Relation Also Knows: Rethinking the Recall and Editing of Factual Associations in Auto-Regressive Transformer Language Models Xiyu Liu Zhengxiao Liu Naibin Gu Zheng Lin Wanli Ma Ji Xiang Weiping Wang KELM 93 2 0 27 Aug 2024
Composable Interventions for Language Models Arinbjorn Kolbeinsson Kyle O'Brien Tianjin Huang Shanghua Gao Shiwei Liu ... Anurag J. Vaidya Faisal Mahmood Marinka Zitnik Tianlong Chen Thomas Hartvigsen KELM MU 197 4 0 09 Jul 2024
Foundational Models for Pathology and Endoscopy Images: Application for Gastric Inflammation H. Kerdegari Kyle Higgins Dennis Veselkov I. Laponogov I. Poļaka ... Junior Andrea Pescino M. Leja M. Dinis-Ribeiro T. F. Kanonnikoff Kirill Veselkov 106 5 0 26 Jun 2024
How Well Can Knowledge Edit Methods Edit Perplexing Knowledge? Huaizhi Ge Frank Rudzicz Zining Zhu KELM 99 4 0 25 Jun 2024
In-Context Editing: Learning Knowledge from Self-Induced Distributions Siyuan Qi Bangcheng Yang Kailin Jiang Xiaobo Wang Jiaqi Li Yifan Zhong Yaodong Yang Zilong Zheng KELM 181 10 0 17 Jun 2024
Long-form evaluation of model editing Domenic Rosati Robie Gonzales Jinkun Chen Xuemin Yu Melis Erkan Yahya Kayani Satya Deepika Chavatapalli Frank Rudzicz Hassan Sajjad KELM 68 15 0 14 Feb 2024
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances Zihan Zhang Meng Fang Lingxi Chen Mohammad-Reza Namazi-Rad Jun Wang KELM 96 24 0 11 Oct 2023
Editing Large Language Models: Problems, Methods, and Opportunities Yunzhi Yao Peng Wang Bo Tian Shuyang Cheng Zhoubo Li Shumin Deng Huajun Chen Ningyu Zhang KELM 120 314 0 22 May 2023

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.