ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.07440
  4. Cited By
Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric

Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric

10 April 2025
Yixin Cao
Jiahao Ying
Y. Wang
Xipeng Qiu
Xuanjing Huang
Yugang Jiang
    ELM
ArXivPDFHTML

Papers citing "Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric"

2 / 2 papers shown
Title
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents
Takyoung Kim
Janvijay Singh
Shuhaib Mehri
Emre Can Acikgoz
Sagnik Mukherjee
Nimet Beyza Bozdag
Sumuk Shashidhar
Gökhan Tür
Dilek Hakkani-Tür
LLMAG
27
0
0
02 May 2025
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Yixin Cao
Shibo Hong
X. Li
Jiahao Ying
Yubo Ma
...
Juanzi Li
Aixin Sun
Xuanjing Huang
Tat-Seng Chua
Yu Jiang
ALM
ELM
84
1
0
26 Apr 2025
1