

Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation


21 January 2026
Muhammad Khalifa
Lajanugen Logeswaran
Jaekyeom Kim
Sungryull Sohn
Yunxiang Zhang
Moontae Lee
Hao Peng
Lu Wang
Honglak Lee
LLMAG, LRM, ELM

Papers citing "Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation"


No papers found
