MastermindEval: A Simple But Scalable Reasoning Benchmark
Papers citing "MastermindEval: A Simple But Scalable Reasoning Benchmark"
Title |
---|
Faith and Fate: Limits of Transformers on Compositionality. Nouha Dziri, Ximing Lu, Melanie Sclar, Xiang Lorraine Li, Liwei Jian ...Sean Welleck, Xiang Ren, Allyson Ettinger, Zaïd Harchaoui, Yejin Choi |