1905.13453
MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension
31 May 2019
Alon Talmor
Jonathan Berant
Papers citing "MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension"
45 / 45 papers shown
Title
Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Lifan Yuan
Yangyi Chen
Ganqu Cui
Hongcheng Gao
Fangyuan Zou
Xingyi Cheng
Heng Ji
Zhiyuan Liu
Maosong Sun
41
73
0
07 Jun 2023
Evaluating the Robustness of Machine Reading Comprehension Models to Low Resource Entity Renaming
Clemencia Siro
T. Ajayi
26
2
0
06 Apr 2023
UKP-SQuARE v3: A Platform for Multi-Agent QA Research
Haritz Puerto
Tim Baumgärtner
Rachneet Sachdeva
Haishuo Fang
Haotian Zhang
Sewin Tariverdian
Kexin Wang
Iryna Gurevych
28
2
0
31 Mar 2023
Understanding Finetuning for Factual Knowledge Extraction from Language Models
Mehran Kazemi
Sid Mittal
Deepak Ramachandran
KELM
34
10
0
26 Jan 2023
State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Ryan Cotterell
Zhijing Jin
127
94
0
06 Oct 2022
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts
Akari Asai
Mohammadreza Salehi
Matthew E. Peters
Hannaneh Hajishirzi
130
100
0
24 May 2022
When to Use Multi-Task Learning vs Intermediate Fine-Tuning for Pre-Trained Encoder Transfer Learning
Orion Weller
Kevin Seppi
Matt Gardner
22
21
0
17 May 2022
Match-Prompt: Improving Multi-task Generalization Ability for Neural Text Matching via Prompt Learning
Shicheng Xu
Liang Pang
Huawei Shen
Xueqi Cheng
VLM
35
17
0
06 Apr 2022
Investigating Selective Prediction Approaches Across Several Tasks in IID, OOD, and Adversarial Settings
Neeraj Varshney
Swaroop Mishra
Chitta Baral
27
55
0
01 Mar 2022
Active Learning Over Multiple Domains in Natural Language Tasks
Shayne Longpre
Julia Reisler
E. G. Huang
Yi Lu
Andrew J. Frank
Nikhil Ramesh
Chris DuBois
OOD
27
13
0
01 Feb 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
33
141
0
14 Jan 2022
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
Haritz Puerto
Gözde Gül Sahin
Iryna Gurevych
LLMAG
33
20
0
03 Dec 2021
Can Explanations Be Useful for Calibrating Black Box Models?
Xi Ye
Greg Durrett
FAtt
24
25
0
14 Oct 2021
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering
Minghan Li
Jimmy J. Lin
AI4CE
27
9
0
04 Oct 2021
Single-dataset Experts for Multi-dataset Question Answering
Dan Friedman
Ben Dodge
Danqi Chen
MoMe
134
26
0
28 Sep 2021
PPT: Pre-trained Prompt Tuning for Few-shot Learning
Yuxian Gu
Xu Han
Zhiyuan Liu
Minlie Huang
VLM
54
403
0
09 Sep 2021
SyGNS: A Systematic Generalization Testbed Based on Natural Language Semantics
Hitomi Yanaka
K. Mineshima
Kentaro Inui
NAI
AI4CE
38
11
0
02 Jun 2021
What to Pre-Train on? Efficient Intermediate Task Selection
Clifton A. Poth
Jonas Pfeiffer
Andreas Rucklé
Iryna Gurevych
21
94
0
16 Apr 2021
Exploring Transitivity in Neural NLI Models through Veridicality
Hitomi Yanaka
K. Mineshima
Kentaro Inui
19
23
0
26 Jan 2021
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning
Yujia Qin
Yankai Lin
Ryuichi Takanobu
Zhiyuan Liu
Peng Li
Heng Ji
Minlie Huang
Maosong Sun
Jie Zhou
57
125
0
30 Dec 2020
BERT Goes Shopping: Comparing Distributional Models for Product Representations
Federico Bianchi
Bingqing Yu
Jacopo Tagliabue
25
15
0
17 Dec 2020
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task
Dmitry Tsarkov
Tibor Tihon
Nathan Scales
Nikola Momchev
Danila Sinopalnikov
Nathanael Schärli
18
17
0
15 Dec 2020
Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora
M. Bugert
Nils Reimers
Iryna Gurevych
29
17
0
24 Nov 2020
XOR QA: Cross-lingual Open-Retrieval Question Answering
Akari Asai
Jungo Kasai
J. Clark
Kenton Lee
Eunsol Choi
Hannaneh Hajishirzi
14
145
0
22 Oct 2020
CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems
Yiran Chen
Pengfei Liu
Ming Zhong
Zi-Yi Dou
Danqing Wang
Xipeng Qiu
Xuanjing Huang
ELM
30
24
0
11 Oct 2020
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks
Stephen Mussmann
Robin Jia
Percy Liang
29
15
0
10 Oct 2020
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
Andreas Rucklé
Jonas Pfeiffer
Iryna Gurevych
27
37
0
02 Oct 2020
Knowledge-Aware Procedural Text Understanding with Multi-Stage Training
Zhihan Zhang
Xiubo Geng
Tao Qin
Yunfang Wu
Daxin Jiang
35
22
0
28 Sep 2020
Transferability of Natural Language Inference to Biomedical Question Answering
Minbyul Jeong
Mujeen Sung
Gangwoo Kim
Donghyeon Kim
Wonjin Yoon
J. Yoo
Jaewoo Kang
19
38
0
01 Jul 2020
Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge
Alon Talmor
Oyvind Tafjord
Peter Clark
Yoav Goldberg
Jonathan Berant
ReLM
LRM
36
39
0
11 Jun 2020
UnifiedQA: Crossing Format Boundaries With a Single QA System
Daniel Khashabi
Sewon Min
Tushar Khot
Ashish Sabharwal
Oyvind Tafjord
Peter Clark
Hannaneh Hajishirzi
35
721
0
02 May 2020
Syntactic Data Augmentation Increases Robustness to Inference Heuristics
Junghyun Min
R. Thomas McCoy
Dipanjan Das
Emily Pitler
Tal Linzen
30
175
0
24 Apr 2020
Training Question Answering Models From Synthetic Data
Raul Puri
Ryan Spring
M. Patwary
M. Shoeybi
Bryan Catanzaro
ELM
24
159
0
22 Feb 2020
A Survey on Machine Reading Comprehension Systems
Razieh Baradaran
Razieh Ghiasi
Hossein Amirkhani
FaML
13
85
0
06 Jan 2020
ORB: An Open Reading Benchmark for Comprehensive Evaluation of Machine Reading Comprehension
Dheeru Dua
Ananth Gottumukkala
Alon Talmor
Sameer Singh
Matt Gardner
23
10
0
29 Dec 2019
SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis
Pavel Efimov
Andrey Chertok
Leonid Boytsov
Pavel Braslavski
60
59
0
20 Dec 2019
An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering
Shayne Longpre
Yi Lu
Zhucheng Tu
Christopher DuBois
21
70
0
04 Dec 2019
The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents
Kurt Shuster
Da Ju
Stephen Roller
Emily Dinan
Y-Lan Boureau
Jason Weston
32
81
0
09 Nov 2019
Coreference Resolution as Query-based Span Prediction
Wei Wu
Fei Wang
Arianna Yuan
Fei Wu
Jiwei Li
LRM
36
180
0
05 Nov 2019
MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension
Adam Fisch
Alon Talmor
Robin Jia
Minjoon Seo
Eunsol Choi
Danqi Chen
27
301
0
22 Oct 2019
MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension
Di Jin
Shuyang Gao
Jiun-Yu Kao
Tagyoung Chung
Dilek Z. Hakkani-Tür
24
69
0
01 Oct 2019
A Constructive Prediction of the Generalization Error Across Scales
Jonathan S. Rosenfeld
Amir Rosenfeld
Yonatan Belinkov
Nir Shavit
36
205
0
27 Sep 2019
Pre-trained Language Model for Biomedical Question Answering
Wonjin Yoon
Jinhyuk Lee
Donghyeon Kim
Minbyul Jeong
Jaewoo Kang
AI4MH
11
85
0
18 Sep 2019
A Discrete Hard EM Approach for Weakly Supervised Question Answering
Sewon Min
Danqi Chen
Hannaneh Hajishirzi
Luke Zettlemoyer
12
162
0
11 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018