ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Communities
  3. ...

Neighbor communities

0 / 0 papers shown
Title
Top Contributors
Name# Papers# Citations
Social Events
DateLocationEvent
  1. Home
  2. Communities
  3. LLMSV

Large Language Model Steering Vectors

LLMSV
More data

Large Language Model Steering Vectors are techniques used to guide or control the behavior of large language models by directly manipulating their internal representations. This approach, part of the broader representation engineering framework, involves identifying and modifying activation vectors within the model to achieve desired outputs without requiring additional training or fine-tuning.

Neighbor communities

51015

Featured Papers

0 / 0 papers shown
Title

All papers

50 / 624 papers shown
Title
Angular Steering: Behavior Control via Rotation in Activation Space
Angular Steering: Behavior Control via Rotation in Activation Space
Hieu M. Vu
Tan M. Nguyen
LLMSV
79
0
0
30 Oct 2025
SteerVLM: Robust Model Control through Lightweight Activation Steering for Vision Language Models
SteerVLM: Robust Model Control through Lightweight Activation Steering for Vision Language Models
Anushka Sivakumar
Andrew Zhang
Zaber Hakim
Chris Thomas
LLMSV
24
0
0
30 Oct 2025
On Steerability Factors for Growing Vine Robots
On Steerability Factors for Growing Vine Robots
Ciera McFarland
Antonio Alvarez
Sarah Taher
Nathaniel Hanson
Margaret McGuinness
LLMSV
24
0
0
26 Oct 2025
SteerX: Disentangled Steering for LLM Personalization
SteerX: Disentangled Steering for LLM Personalization
Xiaoyan Zhao
Ming Yan
Yilun Qiu
Haoting Ni
Y. Zhang
Fuli Feng
Hong Cheng
Tat-Seng Chua
LLMSV
28
0
0
25 Oct 2025
Fixed Horizon Linear Quadratic Covariance Steering in Continuous Time with Hilbert-Schmidt Terminal Cost
Fixed Horizon Linear Quadratic Covariance Steering in Continuous Time with Hilbert-Schmidt Terminal Cost
Tushar Sial
Abhishek Halder
LLMSV
20
0
0
24 Oct 2025
Steering Evaluation-Aware Language Models to Act Like They Are Deployed
Steering Evaluation-Aware Language Models to Act Like They Are Deployed
Tim Tian Hua
Andrew Qin
Samuel Marks
Neel Nanda
LLMSV
62
0
0
23 Oct 2025
ActivationReasoning: Logical Reasoning in Latent Activation Spaces
ActivationReasoning: Logical Reasoning in Latent Activation Spaces
Lukas Helff
Ruben Härle
Wolfgang Stammer
Felix Friedrich
Manuel Brack
Antonia Wüst
Hikaru Shindo
P. Schramowski
Kristian Kersting
LLMSVLRMAI4CE
57
0
0
21 Oct 2025
Motion Planning and Control of an Overactuated 4-Wheel Drive with Constrained Independent Steering
Motion Planning and Control of an Overactuated 4-Wheel Drive with Constrained Independent Steering
Shiyu Liu
Ilija Hadzic
Akshay Gupta
Aliasghar Arab
LLMSV
43
0
0
21 Oct 2025
SARSteer: Safeguarding Large Audio Language Models via Safe-Ablated Refusal Steering
SARSteer: Safeguarding Large Audio Language Models via Safe-Ablated Refusal Steering
Weilin Lin
Jianze Li
Hui Xiong
Li Liu
LLMSV
21
0
0
20 Oct 2025
Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification
Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification
Yilin Wu
Anqi Li
Tucker Hermans
F. Ramos
Andrea Bajcsy
Claudia Pérez-DÁrpino
LM&RoLRMLLMSV
61
0
0
18 Oct 2025
Navigating through the hidden embedding space: steering LLMs to improve mental health assessment
Navigating through the hidden embedding space: steering LLMs to improve mental health assessment
Federico Ravenda
Seyed Ali Bahrainian
Andrea Raballo
Antonietta Mira
LLMSV
37
0
0
18 Oct 2025
HarmRLVR: Weaponizing Verifiable Rewards for Harmful LLM Alignment
HarmRLVR: Weaponizing Verifiable Rewards for Harmful LLM Alignment
Y. Liu
Lijun Li
X. Wang
Jing Shao
LLMSV
48
0
0
17 Oct 2025
SteeringTTA: Guiding Diffusion Trajectories for Robust Test-Time-Adaptation
SteeringTTA: Guiding Diffusion Trajectories for Robust Test-Time-Adaptation
Jihyun Yu
Yoojin Oh
Wonho Bae
Mingyu Kim
Junhyug Noh
TTALLMSV
80
0
0
16 Oct 2025
Steer-MoE: Efficient Audio-Language Alignment with a Mixture-of-Experts Steering Module
Steer-MoE: Efficient Audio-Language Alignment with a Mixture-of-Experts Steering Module
Ruitao Feng
Bixi Zhang
Sheng Liang
Zheng Yuan
AuLLMMoELLMSV
25
0
0
15 Oct 2025
To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models
To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models
Anna Hedström
Salim I. Amoukou
Tom Bewley
Saumitra Mishra
Manuela Veloso
LLMSV
36
2
0
15 Oct 2025
Risk-adaptive Activation Steering for Safe Multimodal Large Language Models
Risk-adaptive Activation Steering for Safe Multimodal Large Language Models
Jonghyun Park
Minhyuk Seo
Jonghyun Choi
AAMLLLMSV
40
0
0
15 Oct 2025
In-Distribution Steering: Balancing Control and Coherence in Language Model Generation
In-Distribution Steering: Balancing Control and Coherence in Language Model Generation
Arthur Vogels
Benjamin Wong
Yann Choho
A. Blangero
Milan Bhan
LLMSV
72
0
0
15 Oct 2025
Adaptive vector steering: A training-free, layer-wise intervention for hallucination mitigation in large audio and multimodal models
Adaptive vector steering: A training-free, layer-wise intervention for hallucination mitigation in large audio and multimodal models
Tsung-En Lin
Kuan-Yi Lee
Hung-yi Lee
LLMSV
52
0
0
14 Oct 2025
Steering Over-refusals Towards Safety in Retrieval Augmented Generation
Steering Over-refusals Towards Safety in Retrieval Augmented Generation
Utsav Maskey
Mark Dras
Usman Naseem
LLMSV
8
0
0
12 Oct 2025
PIXEL: Adaptive Steering Via Position-wise Injection with eXact Estimated Levels under Subspace Calibration
PIXEL: Adaptive Steering Via Position-wise Injection with eXact Estimated Levels under Subspace Calibration
Manjiang Yu
Hongji Li
Priyanka Singh
X. Li
Di Wang
Lijie Hu
LLMSV
8
0
0
11 Oct 2025
Language steering in latent space to mitigate unintended code-switching
Language steering in latent space to mitigate unintended code-switching
Andrey Goncharov
Nikolai Kondusov
Alexey Zaytsev
LLMSV
36
0
0
11 Oct 2025
Steering Embedding Models with Geometric Rotation: Mapping Semantic Relationships Across Languages and Models
Steering Embedding Models with Geometric Rotation: Mapping Semantic Relationships Across Languages and Models
Michael Freenor
Lauren Alvarez
LLMSV
33
0
0
10 Oct 2025
Chain-of-Trigger: An Agentic Backdoor that Paradoxically Enhances Agentic Robustness
Chain-of-Trigger: An Agentic Backdoor that Paradoxically Enhances Agentic Robustness
Jiyang Qiu
Xinbei Ma
Yunqing Xu
Zhuosheng Zhang
Hai Zhao
LLMSV
64
0
0
09 Oct 2025
Energy-Driven Steering: Reducing False Refusals in Large Language Models
Energy-Driven Steering: Reducing False Refusals in Large Language Models
Eric Hanchen Jiang
Weixuan Ou
Run Liu
Shengyuan Pang
Guancheng Wan
...
Wei Dong
Kai-Wei Chang
Xiaofeng Wang
Ying Nian Wu
Xinfeng Li
LLMSV
52
0
0
09 Oct 2025
Geometry-Aware Backdoor Attacks: Leveraging Curvature in Hyperbolic Embeddings
Geometry-Aware Backdoor Attacks: Leveraging Curvature in Hyperbolic Embeddings
Ali Baheri
AAMLLLMSV
16
0
0
07 Oct 2025
EVALUESTEER: Measuring Reward Model Steerability Towards Values and Preferences
EVALUESTEER: Measuring Reward Model Steerability Towards Values and Preferences
Kshitish Ghate
Andy Liu
Devansh Jain
Taylor Sorensen
Atoosa Kasirzadeh
Aylin Caliskan
Mona Diab
Maarten Sap
LLMSV
84
0
0
07 Oct 2025
The DISTANT Design for Remote Transmission and Steering Systems for Planetary Robotics
The DISTANT Design for Remote Transmission and Steering Systems for Planetary Robotics
Cristina Luna
Alba Guerra
Almudena Moreno
Manuel Esquer
Willy Roa
Mateusz Krawczak
Robert Popela
Piotr Osica
Davide Nicolis
LLMSV
24
0
0
07 Oct 2025
Prototype-Based Dynamic Steering for Large Language Models
Prototype-Based Dynamic Steering for Large Language Models
Ceyhun Efe Kayan
Li Zhang
LLMSVLRM
60
0
0
07 Oct 2025
Psychological Steering in LLMs: An Evaluation of Effectiveness and Trustworthiness
Psychological Steering in LLMs: An Evaluation of Effectiveness and Trustworthiness
Amin Banayeeanzade
Ala Nekouvaght Tak
Fatemeh Bahrani
Anahita Bolourani
Leonardo Blas
Emilio Ferrara
Jonathan Gratch
Sai Praneeth Karimireddy
LLMSV
28
0
0
06 Oct 2025
Activation Steering with a Feedback Controller
Activation Steering with a Feedback Controller
Dung V. Nguyen
Hieu M. Vu
Nhi Y. Pham
Lei Zhang
T. Nguyen
LLMSV
32
0
0
05 Oct 2025
Does higher interpretability imply better utility? A Pairwise Analysis on Sparse Autoencoders
Does higher interpretability imply better utility? A Pairwise Analysis on Sparse Autoencoders
Xu Wang
Yan Hu
Benyou Wang
Difan Zou
LLMSV
68
0
0
04 Oct 2025
Mechanistic Interpretability of Code Correctness in LLMs via Sparse Autoencoders
Mechanistic Interpretability of Code Correctness in LLMs via Sparse Autoencoders
Kriz Tahimic
Charibeth Cheng
LLMSV
28
0
0
03 Oct 2025
VISOR++: Universal Visual Inputs based Steering for Large Vision Language Models
VISOR++: Universal Visual Inputs based Steering for Large Vision Language Models
Ravikumar Balakrishnan
Mansi Phute
LLMSV
67
0
0
29 Sep 2025
Hyperdimensional Probe: Decoding LLM Representations via Vector Symbolic Architectures
Hyperdimensional Probe: Decoding LLM Representations via Vector Symbolic Architectures
Marco Bronzini
Carlo Nicolini
Bruno Lepri
Jacopo Staiano
Andrea Passerini
LLMSV
8
0
0
29 Sep 2025
EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering
EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering
Haolei Xu
Xinyu Mei
Yuchen Yan
Rui Zhou
Wenqi Zhang
Weiming Lu
Yueting Zhuang
Yongliang Shen
LLMSV
24
1
0
29 Sep 2025
Integrator Forwading Design for Unicycles with Constant and Actuated Velocity in Polar Coordinates
Integrator Forwading Design for Unicycles with Constant and Actuated Velocity in Polar Coordinates
Miroslav Krstic
Velimir Todorovski
Kwang Hak Kim
Alessandro Astolfi
LLMSV
12
0
0
29 Sep 2025
Toward Preference-aligned Large Language Models via Residual-based Model Steering
Toward Preference-aligned Large Language Models via Residual-based Model Steering
Lucio La Cava
Andrea Tagarelli
LLMSV
28
0
0
28 Sep 2025
Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement
Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement
Anyi Wang
Xuansheng Wu
Dong Shu
Yunpu Ma
Ninghao Liu
LLMSV
41
0
0
28 Sep 2025
Steering Prepositional Phrases in Language Models: A Case of with-headed Adjectival and Adverbial Complements in Gemma-2
Steering Prepositional Phrases in Language Models: A Case of with-headed Adjectival and Adverbial Complements in Gemma-2
Stefan Arnold
Rene Gröbner
LLMSV
20
0
0
27 Sep 2025
The Rogue Scalpel: Activation Steering Compromises LLM Safety
The Rogue Scalpel: Activation Steering Compromises LLM Safety
Anton Korznikov
Andrey V. Galichin
Alexey Dontsov
Oleg Y. Rogov
Ivan Oseledets
Elena Tutubalina
LLMSVAAML
40
0
0
26 Sep 2025
Stochastic activations
Stochastic activations
Maria Lomeli
Matthijs Douze
Gergely Szilvasy
Loic Cabannes
Jade Copet
Sainbayar Sukhbaatar
Jason Weston
Gabriel Synnaeve
Pierre-Emmanuel Mazaré
Hervé Jégou
LLMSV
8
0
0
26 Sep 2025
Backdoor Attribution: Elucidating and Controlling Backdoor in Language Models
Backdoor Attribution: Elucidating and Controlling Backdoor in Language Models
Miao Yu
Zhenhong Zhou
Moayad Aloqaily
Kun Wang
Biwei Huang
S. Wang
Yueming Jin
Qingsong Wen
AAMLLLMSV
94
0
0
26 Sep 2025
We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong
We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong
Gautam Siddharth Kashyap
Mark Dras
Usman Naseem
LLMSV
24
0
0
26 Sep 2025
Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
Sasha Cui
Zhongren Chen
LLMSV
62
0
0
25 Sep 2025
Preemptive Detection and Steering of LLM Misalignment via Latent Reachability
Preemptive Detection and Steering of LLM Misalignment via Latent Reachability
Sathwik Karnik
Somil Bansal
LLMSV
6
1
0
25 Sep 2025
A Comparative Analysis of Sparse Autoencoder and Activation Difference in Language Model Steering
A Comparative Analysis of Sparse Autoencoder and Activation Difference in Language Model Steering
Jiaqing Xie
LLMSV
67
0
0
24 Sep 2025
Latent Activation Editing: Inference-Time Refinement of Learned Policies for Safer Multirobot Navigation
Latent Activation Editing: Inference-Time Refinement of Learned Policies for Safer Multirobot Navigation
Satyajeet Das
Darren Chiu
Zhehui Huang
Lars Lindemann
Gaurav Sukhatme
LLMSV
72
0
0
24 Sep 2025
LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation
LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation
Huizhen Shu
Xuying Li
Zhuo Li
LLMSV
36
0
0
24 Sep 2025
SafeSteer: Adaptive Subspace Steering for Efficient Jailbreak Defense in Vision-Language Models
SafeSteer: Adaptive Subspace Steering for Efficient Jailbreak Defense in Vision-Language Models
Xiyu Zeng
Siyuan Liang
Liming Lu
Haotian Zhu
Enguang Liu
Jisheng Dang
Yongbin Zhou
Shuchao Pang
AAMLLLMSV
24
0
0
24 Sep 2025
Prompt-Guided Dual Latent Steering for Inversion Problems
Prompt-Guided Dual Latent Steering for Inversion Problems
Yichen Wu
Xu Liu
Chenxuan Zhao
Xinyu Wu
DiffMLLMSV
32
3
0
23 Sep 2025
Loading #Papers per Month with "LLMSV"
Past speakers
Name (-)
Top Contributors
Name (-)
Top Organizations at ResearchTrend.AI
Name (-)
Social Events
DateLocationEvent
No social events available