Testing Language Model Agents Safely in the Wild

17 November 2023

Douglas Schonholtz

Adam Tauman Kalai

Papers citing "Testing Language Model Agents Safely in the Wild"

Testing and Understanding Erroneous Planning in LLM Agents through Synthesized User Inputs. Zhenlan Ji, Daoyuan Wu, Pingchuan Ma, Zongjie Li, Shuai Wang. LLMAG. 27 Apr 2024.

Black-Box Access is Insufficient for Rigorous AI Audits. Stephen Casper, Carson Ezell, Charlotte Siegmann, Noam Kolt, Taylor Lynn Curtis, ..., Michael Gerovitch, David Bau, Max Tegmark, David M. Krueger, Dylan Hadfield-Menell. AAML. 25 Jan 2024.

AI Control: Improving Safety Despite Intentional Subversion. Ryan Greenblatt, Buck Shlegeris, Kshitij Sachan, Fabien Roger. 12 Dec 2023.

Formal Scenario-Based Testing of Autonomous Vehicles: From Simulation to the Real World. Daniel J. Fremont, Edward Kim, Yash Vardhan Pant, S. Seshia, Atul Acharya, Xantha Bruso, Paul Wells, Steve Lemke, Q. Lu, Shalin Mehta. 17 Mar 2020.