Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering
Versions: v1, v2 (latest)

Papers citing "Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering"

Steering Llama 2 via Contrastive Activation Addition
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
305
400
0
09 Dec 2023
Mass-Editing Memory in a Transformer
International Conference on Learning Representations (ICLR), 2022
337
761
0
13 Oct 2022
Locating and Editing Factual Associations in GPT
Neural Information Processing Systems (NeurIPS), 2022
811
1,851
0
10 Feb 2022