ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.02193
  4. Cited By
Improving Steering Vectors by Targeting Sparse Autoencoder Features

Improving Steering Vectors by Targeting Sparse Autoencoder Features

4 November 2024
Sviatoslav Chalnev
Matthew Siu
Arthur Conmy
    LLMSV
ArXivPDFHTML

Papers citing "Improving Steering Vectors by Targeting Sparse Autoencoder Features"

3 / 3 papers shown
Title
Patterns and Mechanisms of Contrastive Activation Engineering
Patterns and Mechanisms of Contrastive Activation Engineering
Yixiong Hao
Ayush Panda
Stepan Shabalin
Sheikh Abdur Raheem Ali
LLMSV
62
0
0
06 May 2025
Tracking the Feature Dynamics in LLM Training: A Mechanistic Study
Tracking the Feature Dynamics in LLM Training: A Mechanistic Study
Yang Xu
Yixuan Wang
Hao Wang
114
1
0
23 Dec 2024
Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models
Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models
Michael Lan
Philip H. S. Torr
Austin Meek
Ashkan Khakzar
David M. Krueger
Fazl Barez
43
10
0
09 Oct 2024
1