arXiv:2312.10091
Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models
13 December 2023
Alexandre Variengien
Eric Winsor
LRM
ReLM
Papers citing
"Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models"
10 / 10 papers shown
Title
Activation Steering in Neural Theorem Provers
Shashank Kirtania
LLMSV
465
0
0
21 Feb 2025
Discovering Variable Binding Circuitry with Desiderata
Xander Davies
Max Nadeau
Nikhil Prakash
Tamar Rott Shaham
David Bau
47
15
0
07 Jul 2023
Faith and Fate: Limits of Transformers on Compositionality
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jiang
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
138
377
0
29 May 2023
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca
Zhengxuan Wu
Atticus Geiger
Thomas Icard
Christopher Potts
Noah D. Goodman
MILM
75
92
0
15 May 2023
The Alignment Problem from a Deep Learning Perspective
Richard Ngo
Lawrence Chan
Sören Mindermann
107
192
0
30 Aug 2022
Locating and Editing Factual Associations in GPT
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
KELM
248
1,357
0
10 Feb 2022
Training Verifiers to Solve Math Word Problems
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
...
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLM
OffRL
LRM
308
4,408
0
27 Oct 2021
Are Sixteen Heads Really Better than One?
Paul Michel
Omer Levy
Graham Neubig
MoE
103
1,062
0
25 May 2019
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
413
10,494
0
21 Jul 2016
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
172
5,011
0
27 Jun 2016