Title |
---|
![]() HelloBench: Evaluating Long Text Generation Capabilities of Large
Language Models Haoran Que Feiyu Duan Liqun He Yutao Mou Wangchunshu Zhou ...Ge Zhang Junran Peng Zhaoxiang Zhang Songyang Zhang Kai Chen |
![]() CREAM: Comparison-Based Reference-Free ELO-Ranked Automatic Evaluation
for Meeting Summarization Ziwei Gong Lin Ai Harshsaiprasad Deshpande Alexander Johnson Emmy Phung Zehui Wu Ahmad Emami Julia Hirschberg |