Relational Composition in Neural Networks: A Survey and Call to Action

19 July 2024

Papers citing "Relational Composition in Neural Networks: A Survey and Call to Action"

Title | Authors | Counts | Date
From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit | Valérie Costa, Thomas Fel, Ekdeep Singh Lubana, Bahareh Tolooshams, Demba Ba | 63 0 0 | 03 Jun 2025
Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models | Thomas Fel, Ekdeep Singh Lubana, Jacob S. Prince, M. Kowal, Victor Boutin, Isabel Papadimitriou, Binxu Wang, Martin Wattenberg, Demba Ba, Talia Konkle | 76 8 0 | 18 Feb 2025
Towards Unifying Interpretability and Control: Evaluation via Intervention | Usha Bhalla, Suraj Srinivas, Asma Ghandeharioun, Himabindu Lakkaraju | 110 11 0 | 07 Nov 2024
Residual Stream Analysis with Multi-Layer SAEs | Tim Lawson, Lucy Farnik, Conor Houghton, Laurence Aitchison | 82 5 0 | 06 Sep 2024
Talking Heads: Understanding Inter-layer Communication in Transformer Language Models | Jack Merullo, Carsten Eickhoff, Ellie Pavlick | 146 16 0 | 13 Jun 2024