1911.09241
Assessing the Benchmarking Capacity of Machine Reading Comprehension Datasets
21 November 2019
Saku Sugawara
Pontus Stenetorp
Kentaro Inui
Akiko Aizawa
Papers citing
"Assessing the Benchmarking Capacity of Machine Reading Comprehension Datasets"
25 / 25 papers shown
Improving Model Evaluation using SMART Filtering of Benchmark Datasets
Vipul Gupta
Candace Ross
David Pantoja
R. Passonneau
Megan Ung
Adina Williams
94
1
0
26 Oct 2024
Analyzing Multiple-Choice Reading and Listening Comprehension Tests
Vatsal Raina
Adian Liusie
Mark Gales
ELM
43
2
0
03 Jul 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Jindong Wang
Jennifer Foster
Yue Zhang
OOD
37
2
0
23 May 2023
APPLS: Evaluating Evaluation Metrics for Plain Language Summarization
Yue Guo
Tal August
Gondy Leroy
T. Cohen
Lucy Lu Wang
57
9
0
23 May 2023
SMoA: Sparse Mixture of Adapters to Mitigate Multiple Dataset Biases
Yanchen Liu
Jing Yang
Yan Chen
Jing Liu
Huaqin Wu
MoE
47
2
0
28 Feb 2023
Cross-Lingual Question Answering over Knowledge Base as Reading Comprehension
Chen Zhang
Yuxuan Lai
Yansong Feng
Xingyu Shen
Haowei Du
Dongyan Zhao
21
3
0
26 Feb 2023
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation
Tianxing He
Jingyu Zhang
Tianle Wang
Sachin Kumar
Kyunghyun Cho
James R. Glass
Yulia Tsvetkov
40
44
0
20 Dec 2022
Statistical Dataset Evaluation: Reliability, Difficulty, and Validity
Chengwen Wang
Qingxiu Dong
Xiaochen Wang
Haitao Wang
Zhifang Sui
XAI
29
3
0
19 Dec 2022
Validating Large Language Models with ReLM
Michael Kuchnik
Virginia Smith
George Amvrosiadis
36
27
0
21 Nov 2022
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
35
4
0
05 Sep 2022
Down and Across: Introducing Crossword-Solving as a New NLP Benchmark
Saurabh Kulshreshtha
Olga Kovaleva
Namrata Shivagunde
Anna Rumshisky
ELM
LRM
32
4
0
20 May 2022
Towards Fine-grained Causal Reasoning and QA
Linyi Yang
Zhen Wang
Yuxiang Wu
Jie Yang
Yue Zhang
41
15
0
15 Apr 2022
VLSP 2021 - ViMRC Challenge: Vietnamese Machine Reading Comprehension
Kiet Van Nguyen
Son Quoc Tran
Luan Thanh Nguyen
Tin Van Huynh
Son T. Luu
Ngan Luu-Thuy Nguyen
32
12
0
22 Mar 2022
Feeding What You Need by Understanding What You Learned
Xiaoqiang Wang
Bang Liu
Fangli Xu
Bowei Long
Siliang Tang
Lingfei Wu
65
6
0
05 Mar 2022
WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation
Alisa Liu
Swabha Swayamdipta
Noah A. Smith
Yejin Choi
82
212
0
16 Jan 2022
AI and the Everything in the Whole Wide World Benchmark
Inioluwa Deborah Raji
Emily M. Bender
Amandalynne Paullada
Emily L. Denton
A. Hanna
30
291
0
26 Nov 2021
Shaking Syntactic Trees on the Sesame Street: Multilingual Probing with Controllable Perturbations
Ekaterina Taktasheva
Vladislav Mikhailov
Ekaterina Artemova
24
13
0
28 Sep 2021
Designing Multimodal Datasets for NLP Challenges
James Pustejovsky
E. Holderness
Jingxuan Tu
Parker Glenn
Kyeongmin Rim
Kelley Lynch
R. Brutti
26
5
0
12 May 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding?
Samuel R. Bowman
George E. Dahl
ELM
ALM
30
156
0
05 Apr 2021
Out of Order: How Important Is The Sequential Order of Words in a Sentence in Natural Language Understanding Tasks?
Thang M. Pham
Trung Bui
Long Mai
Anh Totti Nguyen
220
122
0
30 Dec 2020
Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
Patrick Lewis
Pontus Stenetorp
Sebastian Riedel
OOD
ELM
21
184
0
06 Aug 2020
A Survey on Machine Reading Comprehension: Tasks, Evaluation Metrics and Benchmark Datasets
Chengchang Zeng
Shaobo Li
Qin Li
Jie Hu
Jianjun Hu
8
101
0
21 Jun 2020
Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension
Max Bartolo
A. Roberts
Johannes Welbl
Sebastian Riedel
Pontus Stenetorp
AAML
28
167
0
02 Feb 2020
A Survey on Machine Reading Comprehension Systems
Razieh Baradaran
Razieh Ghiasi
Hossein Amirkhani
FaML
13
85
0
06 Jan 2020
What Question Answering can Learn from Trivia Nerds
Jordan L. Boyd-Graber
Benjamin Borschinger
24
36
0
31 Oct 2019