2503.05336
Cited By
v1
v2
v3 (latest)
Toward an Evaluation Science for Generative AI Systems
7 March 2025
Laura Weidinger
Deb Raji
Hanna M. Wallach
Margaret Mitchell
Angelina Wang
Olawale Salaudeen
Rishi Bommasani
Sayash Kapoor
Deep Ganguli
Sanmi Koyejo
EGVM
ELM
Papers citing
"Toward an Evaluation Science for Generative AI Systems"
5 / 5 papers shown
Title
Correlated Errors in Large Language Models
Elliot Kim
Avi Garg
Kenny Peng
Nikhil Garg
24
0
0
09 Jun 2025
Real-World Gaps in AI Governance Research
Ilan Strauss
Isobel Moure
Tim O'Reilly
Sruly Rosenblat
160
1
0
30 Apr 2025
LLM Social Simulations Are a Promising Research Method
Jacy Reese Anthis
Ryan Liu
Sean M. Richardson
Austin C. Kozlowski
Bernard Koch
James A. Evans
Erik Brynjolfsson
Michael S. Bernstein
ALM
111
15
0
03 Apr 2025
The Impossibility of Fair LLMs
Jacy Reese Anthis
Kristian Lum
Michael Ekstrand
Avi Feller
Alexander D’Amour
FaML
130
14
0
28 May 2024
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
Kristian Lum
Jacy Reese Anthis
Chirag Nagpal
Alexander D’Amour
122
17
0
20 Feb 2024