2112.08346
Cited By
Simple Text Detoxification by Identifying a Linear Toxic Subspace in Language Model Embeddings
15 December 2021
Andrew Wang
Mohit Sudhakar
Yangfeng Ji
Papers citing
"Simple Text Detoxification by Identifying a Linear Toxic Subspace in Language Model Embeddings"
1 / 1 papers shown
Title
Weakly Supervised Detection of Hallucinations in LLM Activations
Miriam Rateike
C. Cintas
John Wamburu
Tanya Akumu
Skyler Speakman
05 Dec 2023