2406.10621
StrucText-Eval: An Autogenerated Benchmark for Evaluating Large Language Model's Ability in Structure-Rich Text Understanding
15 June 2024
Zhouhong Gu, Haoning Ye, Zeyang Zhou, Hongwei Feng, Yanghua Xiao
ELM
Papers citing "StrucText-Eval: An Autogenerated Benchmark for Evaluating Large Language Model's Ability in Structure-Rich Text Understanding"
2 / 2 papers shown
Title
Don't Make Your LLM an Evaluation Benchmark Cheater
Kun Zhou, Yutao Zhu, Zhipeng Chen, Wentong Chen, Wayne Xin Zhao, Xu Chen, Yankai Lin, Ji-Rong Wen, Jiawei Han
ELM
107
136
0
03 Nov 2023
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, F. Xia, Ed H. Chi, Quoc Le, Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
364
8,495
0
28 Jan 2022