2601.14691
Cited By
v1
v2 (latest)
Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation
21 January 2026
Muhammad Khalifa
Lajanugen Logeswaran
Jaekyeom Kim
Sungryull Sohn
Yunxiang Zhang
Moontae Lee
Hao Peng
Lu Wang
Honglak Lee
LLMAG
LRM
ELM
Papers citing
"Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation"
No papers found