
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems
Richard Ren
Arunim Agarwal
Mantas Mazeika
Cristina Menghini
Robert Vacareanu
Brad Kenstler
Mick Yang
Isabelle Barrass
Alice Gatti
Xuwang Yin
Eduardo Trevino
Matias Geralnik
Adam Khoja
Dean Lee
Summer Yue
Dan Hendrycks