Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the
Direct-Answer AI2 Reasoning Challenge

Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge

5 February 2021

Sumithra Bhakthavatsalam

Daniel Khashabi

Kyle Richardson

Ashish Sabharwal

Carissa Schoenick

Oyvind Tafjord

Papers citing "Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge"

15 / 15 papers shown

Title
Enhancing LLMs via High-Knowledge Data Selection Feiyu Duan Xuemiao Zhang Sirui Wang Haoran Que Y. Liu Wenge Rong Xunliang Cai 12 0 0 20 May 2025
ExpertSteer: Intervening in LLMs through Expert Knowledge Weixuan Wang Minghao Wu Barry Haddow Alexandra Birch LLMSV 17 0 0 18 May 2025
Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing Yi-Kai Zhang De-Chuan Zhan Han-Jia Ye ALM ELM LRM 44 1 0 24 Feb 2025
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act Philipp Guldimann Alexander Spiridonov Robin Staab Nikola Jovanović Mark Vero ... Mislav Balunović Nikola Konstantinov Pavol Bielik Petar Tsankov Martin Vechev ELM 58 5 0 10 Oct 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation Catherine Tony Nicolás E. Díaz Ferreyra Markus Mutas Salem Dhiff Riccardo Scandariato SILM 82 9 0 09 Jul 2024
VRSD: Rethinking Similarity and Diversity for Retrieval in Large Language Models Hang Gao Yongfeng Zhang 43 2 0 05 Jul 2024
SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading Tu Anh Dinh Carlos Mullov Leonard Barmann Zhaolin Li Danni Liu ... Michael Beigl Rainer Stiefelhagen Carsten Dachsbacher Klemens Bohm Jan Niehues ELM 42 8 0 14 Jun 2024
DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation Sunghyeon Woo Baeseong Park Byeongwook Kim Minjung Jo S. Kwon Dongsuk Jeon Dongsoo Lee 65 2 0 27 Feb 2024
$Se^2$ : Sequential Example Selection for In-Context Learning Haoyu Liu Jianfeng Liu Shaohan Huang Yuefeng Zhan Hao Sun Weiwei Deng Furu Wei Qi Zhang 38 3 0 21 Feb 2024
Teaching Smaller Language Models To Generalise To Unseen Compositional Questions Tim Hartill N. Tan Michael Witbrock Patricia J. Riddle ReLM KELM LRM 34 2 0 02 Aug 2023
Zero-shot and Few-shot Learning with Knowledge Graphs: A Comprehensive Survey Jiaoyan Chen Yuxia Geng Zhuo Chen Jeff Z. Pan Yuan He Wen Zhang Ian Horrocks Hua-zeng Chen 30 43 0 18 Dec 2021
WebGPT: Browser-assisted question-answering with human feedback Reiichiro Nakano Jacob Hilton S. Balaji Jeff Wu Ouyang Long ... Gretchen Krueger Kevin Button Matthew Knight B. Chess John Schulman ALM RALM 117 1,207 0 17 Dec 2021
TruthfulQA: Measuring How Models Mimic Human Falsehoods Stephanie C. Lin Jacob Hilton Owain Evans HILM 57 1,750 0 08 Sep 2021
General-Purpose Question-Answering with Macaw Oyvind Tafjord Peter Clark SyDa ELM MLLM 30 59 0 06 Sep 2021
Temporal Reasoning on Implicit Events from Distant Supervision Ben Zhou Kyle Richardson Qiang Ning Tushar Khot Ashish Sabharwal Dan Roth 170 74 0 24 Oct 2020