Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.01702
Cited By
Fluent dreaming for language models
24 January 2024
T. B. Thompson
Zygimantas Straznickas
Michael Sklar
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fluent dreaming for language models"
3 / 3 papers shown
Title
Patterns and Mechanisms of Contrastive Activation Engineering
Yixiong Hao
Ayush Panda
Stepan Shabalin
Sheikh Abdur Raheem Ali
LLMSV
67
0
0
06 May 2025
Finding Neurons in a Haystack: Case Studies with Sparse Probing
Wes Gurnee
Neel Nanda
Matthew Pauly
Katherine Harvey
Dmitrii Troitskii
Dimitris Bertsimas
MILM
165
190
0
02 May 2023
Toy Models of Superposition
Nelson Elhage
Tristan Hume
Catherine Olsson
Nicholas Schiefer
T. Henighan
...
Sam McCandlish
Jared Kaplan
Dario Amodei
Martin Wattenberg
C. Olah
AAML
MILM
133
326
0
21 Sep 2022
1