ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.08346
  4. Cited By
Simple Text Detoxification by Identifying a Linear Toxic Subspace in
  Language Model Embeddings

Simple Text Detoxification by Identifying a Linear Toxic Subspace in Language Model Embeddings

15 December 2021
Andrew Wang
Mohit Sudhakar
Yangfeng Ji
ArXivPDFHTML

Papers citing "Simple Text Detoxification by Identifying a Linear Toxic Subspace in Language Model Embeddings"

1 / 1 papers shown
Title
Weakly Supervised Detection of Hallucinations in LLM Activations
Weakly Supervised Detection of Hallucinations in LLM Activations
Miriam Rateike
C. Cintas
John Wamburu
Tanya Akumu
Skyler Speakman
28
11
0
05 Dec 2023
1