Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior

27 June 2022

Papers citing "Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior"


ALMANACS: A Simulatability Benchmark for Language Model Explainability | Edmund Mills, Shiye Su, Stuart J. Russell, Scott Emmons | 48 7 0 | 20 Dec 2023
Efficient Shapley Values Estimation by Amortization for Text Classification | Chenghao Yang, Fan Yin, He He, Kai-Wei Chang, Xiaofei Ma, Bing Xiang | FAtt, VLM | 18 4 0 | 31 May 2023
Red Teaming Deep Neural Networks with Feature Synthesis Tools | Stephen Casper, Yuxiao Li, Jiawei Li, Tong Bu, Ke Zhang, K. Hariharan, Dylan Hadfield-Menell | AAML | 29 15 0 | 08 Feb 2023
ModelDiff: A Framework for Comparing Learning Algorithms | Harshay Shah, Sung Min Park, Andrew Ilyas, A. Madry | SyDa | 51 26 0 | 22 Nov 2022
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks | Tilman Raukur, A. Ho, Stephen Casper, Dylan Hadfield-Menell | AAML, AI4CE | 23 124 0 | 27 Jul 2022
Natural Language Descriptions of Deep Visual Features | Evan Hernandez, Sarah Schwettmann, David Bau, Teona Bagashvili, Antonio Torralba, Jacob Andreas | MILM | 201 117 0 | 26 Jan 2022
Editing a classifier by rewriting its prediction rules | Shibani Santurkar, Dimitris Tsipras, Mahalaxmi Elango, David Bau, Antonio Torralba, A. Madry | KELM | 175 89 0 | 02 Dec 2021
"Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification | Jasmijn Bastings, Sebastian Ebert, Polina Zablotskaia, Anders Sandholm, Katja Filippova | 115 75 0 | 14 Nov 2021
Towards A Rigorous Science of Interpretable Machine Learning | Finale Doshi-Velez, Been Kim | XAI, FaML | 251 3,683 0 | 28 Feb 2017