Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1903.00161
Cited By
v1
v2 (latest)
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs
1 March 2019
Dheeru Dua
Yizhong Wang
Pradeep Dasigi
Gabriel Stanovsky
Sameer Singh
Matt Gardner
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs"
50 / 376 papers shown
Title
Reliability Testing for Natural Language Processing Systems
Samson Tan
Shafiq Joty
K. Baxter
Araz Taeihagh
G. Bennett
Min-Yen Kan
98
41
0
06 May 2021
InfographicVQA
Minesh Mathew
Viraj Bagal
Rubèn Pérez Tito
Dimosthenis Karatzas
Ernest Valveny
C. V. Jawahar
115
242
0
26 Apr 2021
Cross-Task Generalization via Natural Language Crowdsourcing Instructions
Swaroop Mishra
Daniel Khashabi
Chitta Baral
Hannaneh Hajishirzi
LRM
192
756
0
18 Apr 2021
Constrained Language Models Yield Few-Shot Semantic Parsers
Richard Shin
C. H. Lin
Sam Thomson
Charles C. Chen
Subhro Roy
Emmanouil Antonios Platanios
Adam Pauls
Dan Klein
J. Eisner
Benjamin Van Durme
399
206
0
18 Apr 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
858
4,128
0
18 Apr 2021
Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation
Max Bartolo
Tristan Thrush
Robin Jia
Sebastian Riedel
Pontus Stenetorp
Douwe Kiela
AAML
104
106
0
18 Apr 2021
Competency Problems: On Finding and Removing Artifacts in Language Data
Matt Gardner
William Merrill
Jesse Dodge
Matthew E. Peters
Alexis Ross
Sameer Singh
Noah A. Smith
251
111
0
17 Apr 2021
What to Pre-Train on? Efficient Intermediate Task Selection
Clifton A. Poth
Jonas Pfeiffer
Andreas Rucklé
Iryna Gurevych
113
100
0
16 Apr 2021
Time-Stamped Language Model: Teaching Language Models to Understand the Flow of Events
Hossein Rajaby Faghihi
Parisa Kordjamshidi
65
25
0
15 Apr 2021
NT5?! Training T5 to Perform Numerical Reasoning
Peng Yang
Ying Chen
Yuechan Chen
Daniel Cer
AIMat
LRM
75
15
0
15 Apr 2021
TransferNet: An Effective and Transparent Framework for Multi-hop Question Answering over Relation Graph
Jiaxin Shi
S. Cao
Lei Hou
Juan-Zi Li
Hanwang Zhang
GNN
93
112
0
15 Apr 2021
TWEAC: Transformer with Extendable QA Agent Classifiers
Gregor Geigle
Nils Reimers
Andreas Rucklé
Iryna Gurevych
ViT
154
27
0
14 Apr 2021
AR-LSAT: Investigating Analytical Reasoning of Text
Wanjun Zhong
Siyuan Wang
Duyu Tang
Zenan Xu
Daya Guo
Jiahai Wang
Jian Yin
Ming Zhou
Nan Duan
ELM
137
44
0
14 Apr 2021
What's in your Head? Emergent Behaviour in Multi-Task Transformer Models
Mor Geva
Uri Katz
Aviv Ben-Arie
Jonathan Berant
LRM
84
11
0
13 Apr 2021
MultiModalQA: Complex Question Answering over Text, Tables and Images
Alon Talmor
Ori Yoran
Amnon Catav
Dan Lahav
Yizhong Wang
Akari Asai
Gabriel Ilharco
Hannaneh Hajishirzi
Jonathan Berant
LMTD
102
163
0
13 Apr 2021
SpartQA: : A Textual Question Answering Benchmark for Spatial Reasoning
Roshanak Mirzaee
Hossein Rajaby Faghihi
Qiang Ning
Parisa Kordjmashidi
56
83
0
12 Apr 2021
Detecting of a Patient's Condition From Clinical Narratives Using Natural Language Representation
Thanh-Dung Le
R. Noumeir
J. Rambaud
Guillaume Sans
P. Jouvet
57
18
0
08 Apr 2021
Dynabench: Rethinking Benchmarking in NLP
Douwe Kiela
Max Bartolo
Yixin Nie
Divyansh Kaushik
Atticus Geiger
...
Pontus Stenetorp
Robin Jia
Joey Tianyi Zhou
Christopher Potts
Adina Williams
218
411
0
07 Apr 2021
Discrete Reasoning Templates for Natural Language Understanding
Hadeel Al-Negheimish
Pranava Madhyastha
A. Russo
31
4
0
05 Apr 2021
Paired Examples as Indirect Supervision in Latent Decision Models
Nitish Gupta
Sameer Singh
Matt Gardner
Dan Roth
85
7
0
05 Apr 2021
Representing Numbers in NLP: a Survey and a Vision
Avijit Thawani
Jay Pujara
Pedro A. Szekely
Filip Ilievski
97
119
0
24 Mar 2021
Hopper: Multi-hop Transformer for Spatiotemporal Reasoning
Honglu Zhou
Asim Kadav
Farley Lai
Alexandru Niculescu-Mizil
Martin Renqiang Min
Mubbasir Kapadia
H. Graf
LRM
89
18
0
19 Mar 2021
Investigating the Limitations of Transformers with Simple Arithmetic Tasks
Rodrigo Nogueira
Zhiying Jiang
Jimmy J. Li
LRM
127
130
0
25 Feb 2021
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer
Rafal Powalski
Łukasz Borchmann
Dawid Jurkiewicz
Tomasz Dwojak
Michal Pietruszka
Gabriela Pałka
ViT
96
160
0
18 Feb 2021
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge
Sumithra Bhakthavatsalam
Daniel Khashabi
Tushar Khot
Bhavana Dalvi
Kyle Richardson
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
Peter Clark
RALM
AI4CE
80
66
0
05 Feb 2021
Weakly Supervised Neuro-Symbolic Module Networks for Numerical Reasoning
Amrita Saha
Shafiq Joty
Guosheng Lin
NAI
AIMat
LRM
52
20
0
28 Jan 2021
Muppet: Massive Multi-task Representations with Pre-Finetuning
Armen Aghajanyan
Anchit Gupta
Akshat Shrivastava
Xilun Chen
Luke Zettlemoyer
Sonal Gupta
100
270
0
26 Jan 2021
English Machine Reading Comprehension Datasets: A Survey
Daria Dzendzik
Carl Vogel
Jennifer Foster
RALM
AIMat
90
49
0
25 Jan 2021
ComQA:Compositional Question Answering via Hierarchical Graph Neural Networks
Bingning Wang
Ting Yao
Weipeng Chen
Jingfang Xu
Xiaochuan Wang
CoGe
70
6
0
16 Jan 2021
Grid Search Hyperparameter Benchmarking of BERT, ALBERT, and LongFormer on DuoRC
Alex John Quijano
Sam Nguyen
Juanita Ordoñez
55
7
0
15 Jan 2021
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
RALM
433
743
0
06 Jan 2021
Retrieving and Reading: A Comprehensive Survey on Open-domain Question Answering
Fengbin Zhu
Wenqiang Lei
Chao Wang
Jianming Zheng
Soujanya Poria
Tat-Seng Chua
RALM
286
257
0
04 Jan 2021
Few-Shot Question Answering by Pretraining Span Selection
Ori Ram
Yuval Kirstain
Jonathan Berant
Amir Globerson
Omer Levy
104
98
0
02 Jan 2021
NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned
Sewon Min
Jordan L. Boyd-Graber
Chris Alberti
Danqi Chen
Eunsol Choi
...
Dmytro Okhonko
Michael Schlichtkrull
Sonal Gupta
Yashar Mehdad
Wen-tau Yih
85
62
0
01 Jan 2021
DynaSent: A Dynamic Benchmark for Sentiment Analysis
Christopher Potts
Zhengxuan Wu
Atticus Geiger
Douwe Kiela
302
80
0
30 Dec 2020
Learning by Fixing: Solving Math Word Problems with Weak Supervision
Yining Hong
Qing Li
Daniel Ciao
Siyuan Huang
Song-Chun Zhu
AIMat
95
60
0
19 Dec 2020
Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension
Preslav Nakov
Zhi Cui
Jiayi Zhang
Chen Wei
Jianwei Cui
Bin Wang
Dongyan Zhao
Rui Yan
85
15
0
14 Dec 2020
Learning from Task Descriptions
Orion Weller
Nicholas Lourie
Matt Gardner
Matthew E. Peters
115
91
0
16 Nov 2020
IIRC: A Dataset of Incomplete Information Reading Comprehension Questions
James Ferguson
Matt Gardner
Hannaneh Hajishirzi
Tushar Khot
Pradeep Dasigi
RALM
54
55
0
13 Nov 2020
Synonym Knowledge Enhanced Reader for Chinese Idiom Reading Comprehension
Siyu Long
Ran Wang
Kun Tao
Jiali Zeng
Xinyu Dai
42
7
0
09 Nov 2020
AI Marker-based Large-scale AI Literature Mining
Rujing Yao
Yingchun Ye
Ji Zhang
Shuxiao Li
Ou Wu
29
2
0
01 Nov 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions
Radhika Dua
Sai Srinivas Kancheti
V. Balasubramanian
LRM
88
22
0
24 Oct 2020
ANLIzing the Adversarial Natural Language Inference Dataset
Adina Williams
Tristan Thrush
Douwe Kiela
AAML
255
47
0
24 Oct 2020
Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering
Arij Riabi
Thomas Scialom
Rachel Keraron
Benoît Sagot
Djamé Seddah
Jacopo Staiano
222
54
0
23 Oct 2020
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval
Akari Asai
Eunsol Choi
RALM
115
54
0
22 Oct 2020
An Empirical Investigation of Contextualized Number Prediction
Daniel M. Spokoyny
Taylor Berg-Kirkpatrick
AI4TS
83
38
0
20 Oct 2020
Neural Databases
James Thorne
Majid Yazdani
Marzieh Saeidi
Fabrizio Silvestri
Sebastian Riedel
A. Halevy
NAI
99
9
0
14 Oct 2020
Improving Compositional Generalization in Semantic Parsing
I. Oren
Jonathan Herzig
Nitish Gupta
Matt Gardner
Jonathan Berant
89
63
0
12 Oct 2020
MOCHA: A Dataset for Training and Evaluating Generative Reading Comprehension Metrics
Anthony Chen
Gabriel Stanovsky
Sameer Singh
Matt Gardner
101
51
0
07 Oct 2020
Improving QA Generalization by Concurrent Modeling of Multiple Biases
Mingzhu Wu
N. Moosavi
Andreas Rucklé
Iryna Gurevych
AI4CE
72
17
0
07 Oct 2020
Previous
1
2
3
4
5
6
7
8
Next