Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.15476
Cited By
v1
v2
v3 (latest)
Editable Concept Bottleneck Models
24 May 2024
Lijie Hu
Chenyang Ren
Zhengyu Hu
Cheng-Long Wang
Di Wang
Hui Xiong
Jingfeng Zhang
Di Wang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Editable Concept Bottleneck Models"
42 / 42 papers shown
Title
Semi-supervised Concept Bottleneck Models
Lijie Hu
Tianhao Huang
Huanyi Xie
Chenyang Ren
Zhengyu Hu
Lu Yu
Lu Yu
Ping Ma
Di Wang
123
8
0
27 Jun 2024
Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning
Somnath Basu Roy Chowdhury
Krzysztof Choromanski
Arijit Sehanobish
Avinava Dubey
Snigdha Chaturvedi
MU
87
8
0
24 Jun 2024
Multi-hop Question Answering under Temporal Knowledge Editing
Keyuan Cheng
Gang Lin
Haoyang Fei
Yuxuan Zhai
Lu Yu
Muhammad Asif Ali
Lijie Hu
Di Wang
KELM
94
26
0
30 Mar 2024
PROMPT-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression
Muhammad Asif Ali
Zhengping Li
Shu Yang
Keyuan Cheng
Yang Cao
Tianhao Huang
Lijie Hu
Lu Yu
Di Wang
VLM
RALM
72
9
0
30 Mar 2024
Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs
Shu Yang
Jiayuan Su
Han Jiang
Mengdi Li
Keyuan Cheng
Muhammad Asif Ali
Lijie Hu
Di Wang
76
6
0
30 Mar 2024
MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning
Shu Yang
Muhammad Asif Ali
Cheng-Long Wang
Lijie Hu
Di Wang
CLL
MoE
98
44
0
17 Feb 2024
Low-Cost High-Power Membership Inference Attacks
Sajjad Zarifzadeh
Philippe Liu
Reza Shokri
94
44
0
06 Dec 2023
Auxiliary Losses for Learning Generalizable Concept-based Models
Ivaxi Sheth
Samira Ebrahimi Kahou
77
28
0
18 Nov 2023
An LLM can Fool Itself: A Prompt-Based Adversarial Attack
Xilie Xu
Keyi Kong
Ning Liu
Li-zhen Cui
Di Wang
Jingfeng Zhang
Mohan Kankanhalli
AAML
SILM
99
88
0
20 Oct 2023
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models
Yongchan Kwon
Eric Wu
K. Wu
James Zou
DiffM
TDI
83
68
0
02 Oct 2023
A Survey on Multimodal Large Language Models
Shukang Yin
Chaoyou Fu
Sirui Zhao
Ke Li
Xing Sun
Tong Xu
Enhong Chen
MLLM
LRM
119
591
0
23 Jun 2023
Probabilistic Concept Bottleneck Models
Eunji Kim
Dahuin Jung
Sangha Park
Siwon Kim
Sung-Hoon Yoon
121
70
0
02 Jun 2023
Label-Free Concept Bottleneck Models
Tuomas P. Oikarinen
Subhro Das
Lam M. Nguyen
Tsui-Wei Weng
86
177
0
12 Apr 2023
Interactive Concept Bottleneck Models
Kushal Chauhan
Rishabh Tiwari
Jan Freyberg
Pradeep Shenoy
Krishnamurthy Dvijotham
56
55
0
14 Dec 2022
SEAT: Stable and Explainable Attention
Lijie Hu
Yixin Liu
Ninghao Liu
Mengdi Huai
Lichao Sun
Di Wang
OOD
71
19
0
23 Nov 2022
Post-hoc Concept Bottleneck Models
Mert Yuksekgonul
Maggie Wang
James Zou
212
196
0
31 May 2022
Deep Unlearning via Randomized Conditionally Independent Hessians
Ronak R. Mehta
Sourav Pal
Vikas Singh
Sathya Ravi
MU
59
87
0
15 Apr 2022
Concept Bottleneck Model with Additional Unsupervised Concepts
Yoshihide Sawada
Keigo Nakamura
SSL
62
73
0
03 Feb 2022
Recommendation Unlearning
C. L. Philip Chen
Fei Sun
Hao Fei
Bolin Ding
MU
77
90
0
18 Jan 2022
Unrolling SGD: Understanding Factors Influencing Machine Unlearning
Anvith Thudi
Gabriel Deza
Varun Chandrasekaran
Nicolas Papernot
MU
109
182
0
27 Sep 2021
Machine Unlearning of Features and Labels
Alexander Warnecke
Lukas Pirch
Christian Wressnegger
Konrad Rieck
MU
87
186
0
26 Aug 2021
AI in Finance: Challenges, Techniques and Opportunities
LongBing Cao
AIFin
83
259
0
20 Jul 2021
Graph Unlearning
Min Chen
Zhikun Zhang
Tianhao Wang
Michael Backes
Mathias Humbert
Yang Zhang
MU
60
146
0
27 Mar 2021
FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging
Han Guo
Nazneen Rajani
Peter Hase
Joey Tianyi Zhou
Caiming Xiong
TDI
110
114
0
31 Dec 2020
Mixed-Privacy Forgetting in Deep Networks
Aditya Golatkar
Alessandro Achille
Avinash Ravichandran
M. Polito
Stefano Soatto
CLL
MU
194
165
0
24 Dec 2020
Machine Unlearning for Random Forests
Jonathan Brophy
Daniel Lowd
MU
70
161
0
11 Sep 2020
Multi-Stage Influence Function
Hongge Chen
Si Si
Yongqian Li
Ciprian Chelba
Sanjiv Kumar
Duane S. Boning
Cho-Jui Hsieh
TDI
52
17
0
17 Jul 2020
Concept Bottleneck Models
Pang Wei Koh
Thao Nguyen
Y. S. Tang
Stephen Mussmann
Emma Pierson
Been Kim
Percy Liang
96
828
0
09 Jul 2020
Influence Functions in Deep Learning Are Fragile
S. Basu
Phillip E. Pope
Soheil Feizi
TDI
125
235
0
25 Jun 2020
Opportunities and Challenges in Explainable Artificial Intelligence (XAI): A Survey
Arun Das
P. Rad
XAI
155
603
0
16 Jun 2020
Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions
Xiaochuang Han
Byron C. Wallace
Yulia Tsvetkov
MILM
FAtt
AAML
TDI
77
174
0
14 May 2020
Forgetting Outside the Box: Scrubbing Deep Networks of Information Accessible from Input-Output Observations
Aditya Golatkar
Alessandro Achille
Stefano Soatto
MU
OOD
133
195
0
05 Mar 2020
Approximate Data Deletion from Machine Learning Models
Zachary Izzo
Mary Anne Smart
Kamalika Chaudhuri
James Zou
MU
66
263
0
24 Feb 2020
Machine Unlearning
Lucas Bourtoule
Varun Chandrasekaran
Christopher A. Choquette-Choo
Hengrui Jia
Adelin Travers
Baiwu Zhang
David Lie
Nicolas Papernot
MU
128
868
0
09 Dec 2019
Eternal Sunshine of the Spotless Net: Selective Forgetting in Deep Networks
Aditya Golatkar
Alessandro Achille
Stefano Soatto
CLL
MU
76
495
0
12 Nov 2019
Certified Data Removal from Machine Learning Models
Chuan Guo
Tom Goldstein
Awni Y. Hannun
Laurens van der Maaten
MU
110
446
0
08 Nov 2019
Repairing without Retraining: Avoiding Disparate Impact with Counterfactual Distributions
Hao Wang
Berk Ustun
Flavio du Pin Calmon
FaML
87
85
0
29 Jan 2019
Understanding the Origins of Bias in Word Embeddings
Marc-Etienne Brunet
Colleen Alkalay-Houlihan
Ashton Anderson
R. Zemel
FaML
76
202
0
08 Oct 2018
Fast Approximate Natural Gradient Descent in a Kronecker-factored Eigenbasis
Thomas George
César Laurent
Xavier Bouthillier
Nicolas Ballas
Pascal Vincent
ODL
72
154
0
11 Jun 2018
Understanding Black-box Predictions via Influence Functions
Pang Wei Koh
Percy Liang
TDI
213
2,894
0
14 Mar 2017
Membership Inference Attacks against Machine Learning Models
Reza Shokri
M. Stronati
Congzheng Song
Vitaly Shmatikov
SLR
MIALM
MIACV
261
4,135
0
18 Oct 2016
Deep Learning Face Attributes in the Wild
Ziwei Liu
Ping Luo
Xiaogang Wang
Xiaoou Tang
CVBM
244
8,408
0
28 Nov 2014
1