
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models
Papers citing "INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models"
48 / 48 papers shown
Title |
---|
![]() The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models Seungone Kim Juyoung Suk Ji Yong Cho Shayne Longpre Chaeeun Kim ...Sean Welleck Graham Neubig Moontae Lee Kyungjae Lee Minjoon Seo |