Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.06581
Cited By
Grokking Group Multiplication with Cosets
11 December 2023
Dashiell Stander
Qinan Yu
Honglu Fan
Stella Biderman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Grokking Group Multiplication with Cosets"
4 / 4 papers shown
Title
Evaluating Explanations: An Explanatory Virtues Framework for Mechanistic Interpretability -- The Strange Science Part I.ii
Kola Ayonrinde
Louis Jaburi
XAI
80
1
0
02 May 2025
How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model
Michael Hanna
Ollie Liu
Alexandre Variengien
LRM
189
119
0
30 Apr 2023
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small
Kevin Wang
Alexandre Variengien
Arthur Conmy
Buck Shlegeris
Jacob Steinhardt
212
494
0
01 Nov 2022
Towards A Rigorous Science of Interpretable Machine Learning
Finale Doshi-Velez
Been Kim
XAI
FaML
251
3,683
0
28 Feb 2017
1