Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.12664
Cited By
Multilevel Interpretability Of Artificial Neural Networks: Leveraging Framework And Methods From Neuroscience
22 August 2024
Zhonghao He
Jascha Achterberg
Katie Collins
Kevin K. Nejad
Danyal Akarca
Yinzhu Yang
Wes Gurnee
Ilia Sucholutsky
Yuhan Tang
Rebeca Ianov
George Ogden
Chole Li
Kai J. Sandbrink
Stephen Casper
Anna Ivanova
Grace W. Lindsay
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multilevel Interpretability Of Artificial Neural Networks: Leveraging Framework And Methods From Neuroscience"
32 / 82 papers shown
Title
Knowledge Neurons in Pretrained Transformers
Damai Dai
Li Dong
Y. Hao
Zhifang Sui
Baobao Chang
Furu Wei
KELM
MU
79
449
0
18 Apr 2021
Probing artificial neural networks: insights from neuroscience
Anna A. Ivanova
John Hewitt
Noga Zaslavsky
38
16
0
16 Apr 2021
Neural population geometry: An approach for understanding biological and artificial neural networks
SueYeon Chung
L. F. Abbott
AI4CE
39
209
0
14 Apr 2021
Disentangling Syntax and Semantics in the Brain with Deep Networks
Charlotte Caucheteux
Alexandre Gramfort
J. King
93
72
0
02 Mar 2021
Probing Classifiers: Promises, Shortcomings, and Advances
Yonatan Belinkov
256
443
0
24 Feb 2021
Understanding the Role of Individual Units in a Deep Neural Network
David Bau
Jun-Yan Zhu
Hendrik Strobelt
Àgata Lapedriza
Bolei Zhou
Antonio Torralba
GAN
65
449
0
10 Sep 2020
Intelligence plays dice: Stochasticity is essential for machine learning
M. Sabuncu
122
6
0
17 Aug 2020
Levels of Analysis for Machine Learning
Jessica B. Hamrick
S. Mohamed
43
16
0
06 Apr 2020
Information-Theoretic Probing with Minimum Description Length
Elena Voita
Ivan Titov
71
274
0
27 Mar 2020
Deep Randomized Neural Networks
Claudio Gallicchio
Simone Scardapane
OOD
67
64
0
27 Feb 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
539
4,773
0
23 Jan 2020
Designing and Interpreting Probes with Control Tasks
John Hewitt
Percy Liang
58
536
0
08 Sep 2019
What does it mean to understand a neural network?
Timothy Lillicrap
Konrad Paul Kording
43
43
0
15 Jul 2019
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
209
1,592
0
11 Jun 2019
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned
Elena Voita
David Talbot
F. Moiseev
Rico Sennrich
Ivan Titov
106
1,134
0
23 May 2019
BERT Rediscovers the Classical NLP Pipeline
Ian Tenney
Dipanjan Das
Ellie Pavlick
MILM
SSeg
129
1,469
0
15 May 2019
Similarity of Neural Network Representations Revisited
Simon Kornblith
Mohammad Norouzi
Honglak Lee
Geoffrey E. Hinton
136
1,408
0
01 May 2019
A Review of Modularization Techniques in Artificial Neural Networks
Mohammed Amer
Tomás Maul
54
80
0
29 Apr 2019
The emergence of number and syntax units in LSTM language models
Yair Lakretz
Germán Kruszewski
T. Desbordes
Dieuwke Hupkes
S. Dehaene
Marco Baroni
44
170
0
18 Mar 2019
ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness
Robert Geirhos
Patricia Rubisch
Claudio Michaelis
Matthias Bethge
Felix Wichmann
Wieland Brendel
96
2,662
0
29 Nov 2018
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
BDL
210
1,272
0
30 May 2018
Deep Learning Scaling is Predictable, Empirically
Joel Hestness
Sharan Narang
Newsha Ardalani
G. Diamos
Heewoo Jun
Hassan Kianinejad
Md. Mostofa Ali Patwary
Yang Yang
Yanqi Zhou
87
736
0
01 Dec 2017
Classification and Geometry of General Perceptual Manifolds
SueYeon Chung
Daniel D. Lee
H. Sompolinsky
52
154
0
17 Oct 2017
Learning to Generate Reviews and Discovering Sentiment
Alec Radford
Rafal Jozefowicz
Ilya Sutskever
93
508
0
05 Apr 2017
Towards A Rigorous Science of Interpretable Machine Learning
Finale Doshi-Velez
Been Kim
XAI
FaML
376
3,776
0
28 Feb 2017
Understanding intermediate layers using linear classifier probes
Guillaume Alain
Yoshua Bengio
FAtt
137
941
0
05 Oct 2016
Causal interpretation rules for encoding and decoding models in neuroimaging
S. Weichwald
Timm Meyer
Ozan Özdenizci
Bernhard Schölkopf
T. Ball
Moritz Grosse-Wentrup
AI4CE
CML
39
107
0
15 Nov 2015
Demixed principal component analysis of population activity in higher cortical areas reveals independent representation of task parameters
D. Kobak
Wieland Brendel
C. Constantinidis
C. Feierstein
Adam Kepecs
Z. Mainen
R. Romo
Xue-Lian Qi
N. Uchida
C. Machens
79
466
0
22 Oct 2014
Intriguing properties of neural networks
Christian Szegedy
Wojciech Zaremba
Ilya Sutskever
Joan Bruna
D. Erhan
Ian Goodfellow
Rob Fergus
AAML
249
14,912
1
21 Dec 2013
Visualizing and Understanding Convolutional Networks
Matthew D. Zeiler
Rob Fergus
FAtt
SSL
541
15,874
0
12 Nov 2013
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
365
33,500
0
16 Oct 2013
The evolutionary origins of modularity
Jeff Clune
Jean-Baptiste Mouret
Hod Lipson
73
579
0
11 Jul 2012
Previous
1
2