2311.10538
Testing Language Model Agents Safely in the Wild
17 November 2023
Silen Naihin
David Atkinson
Marc Green
Merwane Hamadi
Craig Swift
Douglas Schonholtz
Adam Tauman Kalai
David Bau
LLMAG
Papers citing
"Testing Language Model Agents Safely in the Wild"
4 / 4 papers shown
Title
Testing and Understanding Erroneous Planning in LLM Agents through Synthesized User Inputs
Zhenlan Ji
Daoyuan Wu
Pingchuan Ma
Zongjie Li
Shuai Wang
LLMAG
48
3
0
27 Apr 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
34
78
0
25 Jan 2024
AI Control: Improving Safety Despite Intentional Subversion
Ryan Greenblatt
Buck Shlegeris
Kshitij Sachan
Fabien Roger
31
40
0
12 Dec 2023
Formal Scenario-Based Testing of Autonomous Vehicles: From Simulation to the Real World
Daniel J. Fremont
Edward Kim
Yash Vardhan Pant
S. Seshia
Atul Acharya
Xantha Bruso
Paul Wells
Steve Lemke
Q. Lu
Shalin Mehta
76
125
0
17 Mar 2020