Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.05587
Cited By
Creativity Has Left the Chat: The Price of Debiasing Language Models
8 June 2024
Behnam Mohammadi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Creativity Has Left the Chat: The Price of Debiasing Language Models"
6 / 6 papers shown
Title
AttentionDefense: Leveraging System Prompt Attention for Explainable Defense Against Novel Jailbreaks
Charlotte Siska
Anush Sankaran
AAML
45
0
0
10 Apr 2025
Preference Optimization with Multi-Sample Comparisons
Chaoqi Wang
Zhuokai Zhao
Chen Zhu
Karthik Abinav Sankararaman
Michal Valko
...
Zhaorun Chen
Madian Khabsa
Yuxin Chen
Hao Ma
Sinong Wang
69
10
0
16 Oct 2024
Diversity-Rewarded CFG Distillation
Geoffrey Cideron
A. Agostinelli
Johan Ferret
Sertan Girgin
Romuald Elie
Olivier Bachem
Sarah Perrin
Alexandre Ramé
41
2
0
08 Oct 2024
Towards a Science Exocortex
Kevin G. Yager
80
0
0
24 Jun 2024
Cascade Reward Sampling for Efficient Decoding-Time Alignment
Bolian Li
Yifan Wang
A. Grama
Ruqi Zhang
Ruqi Zhang
AI4TS
49
9
0
24 Jun 2024
Wait, It's All Token Noise? Always Has Been: Interpreting LLM Behavior Using Shapley Value
Behnam Mohammadi
24
2
0
29 Mar 2024
1