INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large
  Language Models
v1v2v3 (latest)

INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models

    ELM

Papers citing "INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models"

48 / 48 papers shown
Title
Beyond Metrics: A Critical Analysis of the Variability in Large Language
  Model Evaluation Frameworks
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks
Marco AF Pimentel
Clément Christophe
Tathagata Raha
Prateek Munjal
Praveen K Kanithi
Shadab Khan
82
3
0
29 Jul 2024

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.