Contrastive Prompting Enhances Sentence Embeddings in LLMs through Inference-Time Steering

Extracting sentence embeddings from large language models (LLMs) is a practical direction, as it requires neither additional data nor fine-tuning. Previous studies usually focus on prompt engineering to guide LLMs to encode the core semantic information of the sentence into the embedding of the last token. However, the last token in these methods still encodes an excess of non-essential information, such as stop words, limiting its encoding capacity. To this end, we propose a Contrastive Prompting (CP) method that introduces an extra auxiliary prompt to elicit better sentence embedding. By contrasting with the auxiliary prompt, CP can steer existing prompts to encode the core semantics of the sentence, rather than non-essential information. CP is a plug-and-play inference-time intervention method that can be combined with various prompt-based methods. Extensive experiments on Semantic Textual Similarity (STS) tasks and downstream classification tasks demonstrate that our method can improve the performance of existing prompt-based methods across different LLMs. Our code will be released atthis https URL.
View on arXiv@article{cheng2025_2505.12831, title={ Contrastive Prompting Enhances Sentence Embeddings in LLMs through Inference-Time Steering }, author={ Zifeng Cheng and Zhonghui Wang and Yuchen Fu and Zhiwei Jiang and Yafeng Yin and Cong Wang and Qing Gu }, journal={arXiv preprint arXiv:2505.12831}, year={ 2025 } }