ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.16447
  4. Cited By
Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models

Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models

19 June 2025
Biao Yi
Tiansheng Huang
Sishuo Chen
Tong Li
Zheli Liu
Zhixuan Chu
Yiming Li
    AAML
ArXiv (abs)PDFHTML

Papers citing "Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models"

1 / 51 papers shown
Title
BERTScore: Evaluating Text Generation with BERT
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
374
5,872
0
21 Apr 2019
Previous
12