Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.08657
Cited By
Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs with Minimal Human Interventions
8 February 2025
Jingxin Xu
Guoshun Nan
Sheng Guan
Sicong Leng
Yong-Jin Liu
Zixiao Wang
Yuyang Ma
Zhili Zhou
Yanzhao Hou
Xiaofeng Tao
LM&MA
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs with Minimal Human Interventions"
Title
No papers