Evaluating Shutdown Avoidance of Language Models in Textual Scenarios


3 July 2023
Teun van der Weij
Simon Lermen
Leon Lang
    LLMAG

Papers citing "Evaluating Shutdown Avoidance of Language Models in Textual Scenarios"

3 / 3 papers shown
Exploring Advanced Methodologies in Security Evaluation for LLMs
Junming Huang
Jiawei Zhang
Qi Wang
Weihong Han
Yanchun Zhang
104
0
0
28 Feb 2024
Exploring the Robustness of Model-Graded Evaluations and Automated Interpretability
Simon Lermen
Ondřej Kvapil
ELM, AAML
44
3
0
26 Nov 2023
Large Language Models can Strategically Deceive their Users when Put Under Pressure
Jérémy Scheurer
Mikita Balesni
Marius Hobbhahn
LLMAG
121
60
0
09 Nov 2023