Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2508.20015
Cited By
Decomposing Behavioral Phase Transitions in LLMs: Order Parameters for Emergent Misalignment
27 August 2025
Julian Arnold
Niels Lörch
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Decomposing Behavioral Phase Transitions in LLMs: Order Parameters for Emergent Misalignment"
1 / 1 papers shown
Title
LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions
Xuhao Hu
Peng Wang
Xiaoya Lu
Dongrui Liu
Xuanjing Huang
Jing Shao
116
1
0
09 Oct 2025
1