
LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles
Papers citing "LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles"
Title |
---|
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering. Xianjie Wu, Jian Yang, Linzheng Chai, Ge Zhang, Jiaheng Liu, ..., Xianfu Cheng, Tianzhen Sun, Guanglin Niu, Tongliang Li, Zhoujun Li |