RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering

25 October 2022
Victor Zhong
Weijia Shi
Wen-tau Yih
Luke Zettlemoyer

Papers citing "RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering"

16 / 16 papers shown
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?
Zongmeng Zhang
Jinhua Zhu
Wengang Zhou
Xiang Qi
Peng Zhang
H. Li
70
1
0
19 Nov 2024
Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?
Zhaochen Su
Juntao Li
Jun Zhang
Tong Zhu
Xiaoye Qu
Pan Zhou
Yan Bowen
Yu Cheng
Min Zhang
LRM
39
20
0
13 Jun 2024
INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval Models
Hanseok Oh
Hyunji Lee
Seonghyeon Ye
Haebin Shin
Hansol Jang
Changwook Jun
Minjoon Seo
46
19
0
22 Feb 2024
Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning
Qingyu Tan
Hwee Tou Ng
Lidong Bing
26
6
0
16 Nov 2023
KTRL+F: Knowledge-Augmented In-Document Search
Hanseok Oh
Haebin Shin
Miyoung Ko
Hyunji Lee
Minjoon Seo
28
3
0
14 Nov 2023
SEMQA: Semi-Extractive Multi-Source Question Answering
Tal Schuster
Á. Lelkes
Haitian Sun
Jai Gupta
Jonathan Berant
W. Cohen
Donald Metzler
28
13
0
08 Nov 2023
NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval
Uri Katz
Matan Vetzler
Amir D. N. Cohen
Yoav Goldberg
9
9
0
22 Oct 2023
Answering Ambiguous Questions with a Database of Questions, Answers, and Revisions
Haitian Sun
William W. Cohen
Ruslan Salakhutdinov
19
3
0
16 Aug 2023
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets
Seonghyeon Ye
Doyoung Kim
Sungdong Kim
Hyeonbin Hwang
Seungone Kim
Yongrae Jo
James Thorne
Juho Kim
Minjoon Seo
ALM
35
97
0
20 Jul 2023
Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction
Ji Qi
Chuchu Zhang
Xiaozhi Wang
Kaisheng Zeng
Jifan Yu
...
Jiu Sun
Yuxiang Chen
Lei Hou
Juanzi Li
Bin Xu
27
9
0
23 May 2023
QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations
Chaitanya Malaviya
Peter Shaw
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
11
14
0
19 May 2023
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen
Hexiang Hu
Yi Luan
Haitian Sun
Soravit Changpinyo
Alan Ritter
Ming-Wei Chang
39
80
0
23 Feb 2023
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Omar Khattab
Keshav Santhanam
Xiang Lisa Li
David Leo Wright Hall
Percy Liang
Christopher Potts
Matei A. Zaharia
RALM
KELM
22
244
0
28 Dec 2022
QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs
S. Amouyal
Tomer Wolfson
Ohad Rubin
Ori Yoran
Jonathan Herzig
Jonathan Berant
RALM
VLM
13
21
0
25 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,915
0
04 Mar 2022
Adversarial Example Generation with Syntactically Controlled Paraphrase Networks
Mohit Iyyer
John Wieting
Kevin Gimpel
Luke Zettlemoyer
AAML
GAN
193
711
0
17 Apr 2018