ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.12708
  4. Cited By
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering
  and Reading Comprehension
v1v2 (latest)

QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension

ACM Computing Surveys (CSUR), 2021
27 July 2021
Anna Rogers
Matt Gardner
Isabelle Augenstein
ArXiv (abs)PDFHTMLGithub

Papers citing "QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension"

50 / 89 papers shown
FarsiMCQGen: a Persian Multiple-choice Question Generation Framework
FarsiMCQGen: a Persian Multiple-choice Question Generation Framework
Mohammad Heydari Rad
Rezvan Afari
Saeedeh Momtazi
AI4Ed
304
0
0
16 Oct 2025
ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification
ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification
Utsav Nareti
Suraj Kumar
Soumya Pandey
S. Chattopadhyay
Chandranath Adak
VLM
221
0
0
14 Oct 2025
ProMQA-Assembly: Multimodal Procedural QA Dataset on Assembly
ProMQA-Assembly: Multimodal Procedural QA Dataset on Assembly
Kimihiro Hasegawa
Wiradee Imrattanatrai
Masaki Asada
Susan Holm
Yuran Wang
Vincent Zhou
Ken Fukuda
Teruko Mitamura
282
2
0
03 Sep 2025
Meet Your New Client: Writing Reports for AI -- Benchmarking Information Loss in Market Research Deliverables
Meet Your New Client: Writing Reports for AI -- Benchmarking Information Loss in Market Research Deliverables
Paul F. Simmering
Benedikt Schulz
Oliver Tabino
Georg Wittenburg
191
0
0
17 Aug 2025
Metric assessment protocol in the context of answer fluctuation on MCQ tasks
Metric assessment protocol in the context of answer fluctuation on MCQ tasks
Ekaterina Goliakova
X. Renard
Marie-Jeanne Lesot
Thibault Laugel
Christophe Marsala
Marcin Detyniecki
216
0
0
21 Jul 2025
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization
Chen Xu
Yuxuan Yue
Zukang Xu
Yan Chen
Jiangyong Yu
Zhixuan Chen
Sifan Zhou
Zhihang Yuan
Dawei Yang
MQ
362
3
0
02 May 2025
MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems
MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation SystemsTransactions of the Association for Computational Linguistics (TACL), 2025
Yannis Katsis
Sara Rosenthal
Kshitij P. Fadnis
Chulaka Gunasekara
Young-Suk Lee
Lucian Popa
Vraj Shah
Khoi-Nguyen Tran
Danish Contractor
Marina Danilevsky
RALMLRM
305
37
0
08 Jan 2025
QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance
QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA PerformanceIEEE Access (IEEE Access), 2025
Binita Saha
Utsha Saha
Muhammad Zubair Malik
RALM3DV
351
29
0
06 Jan 2025
RAG-based Question Answering over Heterogeneous Data and Text
RAG-based Question Answering over Heterogeneous Data and TextIEEE Data Engineering Bulletin (DEB), 2024
Philipp Christmann
Gerhard Weikum
LMTDRALM
417
13
0
10 Dec 2024
CaLMQA: Exploring culturally specific long-form question answering across 23 languages
CaLMQA: Exploring culturally specific long-form question answering across 23 languages
Shane Arora
Marzena Karpinska
Hung-Ting Chen
Ipsita Bhattacharjee
Mohit Iyyer
Eunsol Choi
HILM
557
30
0
25 Jun 2024
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and
  Metrics for Open Domain Question Answering in the Era of Large Language
  Models
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language ModelsIEEE Access (IEEE Access), 2024
Akchay Srivastava
Atif Memon
ELM
267
7
0
19 Jun 2024
emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0
  Framework, Enriched with emrQA Medical Information
emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0 Framework, Enriched with emrQA Medical Information
Jimenez Eladio
Hao Wu
284
5
0
18 Apr 2024
Data Augmentation with In-Context Learning and Comparative Evaluation in
  Math Word Problem Solving
Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving
Gulsum Yigit
M. Amasyalı
AIMat
219
1
0
05 Apr 2024
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions
  for RAG systems
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systemsTransactions of the Association for Computational Linguistics (TACL), 2024
Sara Rosenthal
Avirup Sil
Radu Florian
Salim Roukos
365
33
0
02 Apr 2024
TriviaHG: A Dataset for Automatic Hint Generation from Factoid Questions
TriviaHG: A Dataset for Automatic Hint Generation from Factoid Questions
Jamshid Mozafari
Anubhav Jangra
Adam Jatowt
343
6
1
27 Mar 2024
Reasoning Runtime Behavior of a Program with LLM: How Far Are We?
Reasoning Runtime Behavior of a Program with LLM: How Far Are We?
Junkai Chen
Zhiyuan Pan
Xing Hu
Zhenhao Li
Ge Li
Xin Xia
LRM
349
64
0
25 Mar 2024
A Question Answering Based Pipeline for Comprehensive Chinese EHR
  Information Extraction
A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction
Huaiyuan Ying
Sheng Yu
MedIm
178
1
0
17 Feb 2024
FinLLMs: A Framework for Financial Reasoning Dataset Generation with
  Large Language Models
FinLLMs: A Framework for Financial Reasoning Dataset Generation with Large Language Models
Ziqiang Yuan
Kaiyuan Wang
Shoutai Zhu
Ye Yuan
Jingya Zhou
Yanlin Zhu
Wenqi Wei
305
15
0
19 Jan 2024
One Pass Streaming Algorithm for Super Long Token Attention
  Approximation in Sublinear Space
One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space
Raghav Addanki
Chenyang Li
Zhao Song
Chiwun Yang
423
3
0
24 Nov 2023
NEWTON: Are Large Language Models Capable of Physical Reasoning?
NEWTON: Are Large Language Models Capable of Physical Reasoning?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yi Ru Wang
Jiafei Duan
Dieter Fox
S. Srinivasa
ELMLRMAIMatReLM
377
60
0
10 Oct 2023
Graph Neural Prompting with Large Language Models
Graph Neural Prompting with Large Language ModelsAAAI Conference on Artificial Intelligence (AAAI), 2023
Yijun Tian
Huan Song
Zichen Wang
Haozhu Wang
Ziqing Hu
Fang Wang
Nitesh Chawla
Panpan Xu
AI4CE
529
85
0
27 Sep 2023
Using Large Language Models for Knowledge Engineering (LLMKE): A Case
  Study on Wikidata
Using Large Language Models for Knowledge Engineering (LLMKE): A Case Study on Wikidata
Bohui Zhang
Ioannis Reklos
Nitisha Jain
Albert Meroño-Peñuela
Elena Simperl
239
11
0
15 Sep 2023
BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation
  Suite for Large Language Models
BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models
Wei Qi Leong
Jian Gang Ngui
Yosephine Susanto
Hamsawardhini Rengarajan
Kengatharaiyer Sarveswaran
William-Chandra Tjhi
366
18
0
12 Sep 2023
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Position: Key Claims in LLM Research Have a Long Tail of FootnotesInternational Conference on Machine Learning (ICML), 2023
Anna Rogers
A. Luccioni
552
24
0
14 Aug 2023
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill
  Sets
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill SetsInternational Conference on Learning Representations (ICLR), 2023
Seonghyeon Ye
Doyoung Kim
Sungdong Kim
Hyeonbin Hwang
Seungone Kim
Yongrae Jo
James Thorne
Juho Kim
Minjoon Seo
ALM
680
170
0
20 Jul 2023
The Extractive-Abstractive Axis: Measuring Content "Borrowing" in
  Generative Language Models
The Extractive-Abstractive Axis: Measuring Content "Borrowing" in Generative Language Models
Nedelina Teneva
243
1
0
20 Jul 2023
When Do Annotator Demographics Matter? Measuring the Influence of
  Annotator Demographics with the POPQUORN Dataset
When Do Annotator Demographics Matter? Measuring the Influence of Annotator Demographics with the POPQUORN DatasetLaw (LAW), 2023
Jiaxin Pei
David Jurgens
337
59
0
12 Jun 2023
Benchmarking Foundation Models with Language-Model-as-an-Examiner
Benchmarking Foundation Models with Language-Model-as-an-ExaminerNeural Information Processing Systems (NeurIPS), 2023
Yushi Bai
Jiahao Ying
Yixin Cao
Xin Lv
Yuze He
...
Yijia Xiao
Haozhe Lyu
Jiayin Zhang
Juanzi Li
Lei Hou
ALMELM
419
217
0
07 Jun 2023
On Degrees of Freedom in Defining and Testing Natural Language
  Understanding
On Degrees of Freedom in Defining and Testing Natural Language UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Saku Sugawara
S. Tsugita
ELM
381
2
0
24 May 2023
Getting MoRE out of Mixture of Language Model Reasoning Experts
Getting MoRE out of Mixture of Language Model Reasoning ExpertsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chenglei Si
Weijia Shi
Chen Zhao
Luke Zettlemoyer
Jordan L. Boyd-Graber
LRM
318
51
0
24 May 2023
Few-shot Unified Question Answering: Tuning Models or Prompts?
Few-shot Unified Question Answering: Tuning Models or Prompts?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Srijan Bansal
Semih Yavuz
Bo Pang
Meghana Moorthy Bhat
Yingbo Zhou
455
3
0
23 May 2023
Out-of-Distribution Generalization in Text Classification: Past,
  Present, and Future
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Yongfeng Zhang
Jennifer Foster
Yue Zhang
OOD
355
3
0
23 May 2023
On the Risk of Misinformation Pollution with Large Language Models
On the Risk of Misinformation Pollution with Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yikang Pan
Liangming Pan
Wenhu Chen
Preslav Nakov
Min-Yen Kan
Wenjie Wang
DeLMO
594
194
0
23 May 2023
Evaluating Open-Domain Question Answering in the Era of Large Language
  Models
Evaluating Open-Domain Question Answering in the Era of Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Ehsan Kamalloo
Nouha Dziri
C. Clarke
Davood Rafiei
ELM
547
168
0
11 May 2023
MAUPQA: Massive Automatically-created Polish Question Answering Dataset
MAUPQA: Massive Automatically-created Polish Question Answering DatasetWorkshop on Balto-Slavic Natural Language Processing (BSNLP), 2023
Piotr Rybak
254
13
0
09 May 2023
NorQuAD: Norwegian Question Answering Dataset
NorQuAD: Norwegian Question Answering DatasetNordic Conference of Computational Linguistics (NODALIDA), 2023
Sardana Ivanova
Fredrik Aas Andreassen
Matias Jentoft
Sondre Wold
Lilja Ovrelid
233
9
0
03 May 2023
In ChatGPT We Trust? Measuring and Characterizing the Reliability of
  ChatGPT
In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
Xinyue Shen
Sihao Lin
Michael Backes
Yang Zhang
261
77
0
18 Apr 2023
LINGO : Visually Debiasing Natural Language Instructions to Support Task
  Diversity
LINGO : Visually Debiasing Natural Language Instructions to Support Task Diversity
Anjana Arunkumar
Sanjay Kariyappa
Rakhi Agrawal
Sriramakrishnan Chandrasekaran
Chris Bryan
225
1
0
12 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature
  Review
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
377
54
0
07 Apr 2023
Querying Large Language Models with SQL
Querying Large Language Models with SQLInternational Conference on Extending Database Technology (EDBT), 2023
Mohammed Saeed
Nicola De Cao
Paolo Papotti
323
44
0
02 Apr 2023
UKP-SQuARE v3: A Platform for Multi-Agent QA Research
UKP-SQuARE v3: A Platform for Multi-Agent QA ResearchAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Haritz Puerto
Tim Baumgärtner
Rachneet Sachdeva
Haishuo Fang
Haotian Zhang
Sewin Tariverdian
Kexin Wang
Iryna Gurevych
334
2
0
31 Mar 2023
Integrating Image Features with Convolutional Sequence-to-sequence
  Network for Multilingual Visual Question Answering
Integrating Image Features with Convolutional Sequence-to-sequence Network for Multilingual Visual Question AnsweringJournal of Computer Science and Cybernetics (JCSC), 2023
T. M. Thai
Son T. Luu
273
0
0
22 Mar 2023
Secret-Keeping in Question Answering
Secret-Keeping in Question Answering
Nathaniel W. Rollings
Kent O'Sullivan
Sakshum Kulshrestha
KELM
211
1
0
16 Mar 2023
Generating multiple-choice questions for medical question answering with
  distractors and cue-masking
Generating multiple-choice questions for medical question answering with distractors and cue-maskingInternational Conference on Language Resources and Evaluation (LREC), 2023
Damien Sileo
Kanimozhi Uma
Marie-Francine Moens
256
5
0
13 Mar 2023
AnoMalNet: Outlier Detection based Malaria Cell Image Classification
  Method Leveraging Deep Autoencoder
AnoMalNet: Outlier Detection based Malaria Cell Image Classification Method Leveraging Deep AutoencoderInternational Journal of Reconfigurable and Embedded Systems (IJRES) (IJRES), 2023
A. Huq
Md. Tanzim Reza
Shahriar Hossain
Shakib Mahmud Dipto
198
2
0
10 Mar 2023
AmQA: Amharic Question Answering Dataset
AmQA: Amharic Question Answering Dataset
Tilahun Abedissa
Ricardo Usbeck
Yaregal Assabie
224
3
0
06 Mar 2023
Make Every Example Count: On the Stability and Utility of Self-Influence
  for Learning from Noisy NLP Datasets
Make Every Example Count: On the Stability and Utility of Self-Influence for Learning from Noisy NLP DatasetsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Irina Bejan
Artem Sokolov
Katja Filippova
TDI
423
23
0
27 Feb 2023
Complex QA and language models hybrid architectures, Survey
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
846
17
0
17 Feb 2023
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on
  Reasoning, Hallucination, and Interactivity
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and InteractivityInternational Joint Conference on Natural Language Processing (IJCNLP), 2023
Yejin Bang
Samuel Cahyawijaya
Nayeon Lee
Wenliang Dai
Jane Polak Scowcroft
...
Tiezheng Yu
Willy Chung
Quyet V. Do
Yan Xu
Pascale Fung
ReLMLRM
973
1,702
0
08 Feb 2023
Beyond Counting Datasets: A Survey of Multilingual Dataset Construction
  and Necessary Resources
Beyond Counting Datasets: A Survey of Multilingual Dataset Construction and Necessary ResourcesConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Xinyan Velocity Yu
Akari Asai
Trina Chatterjee
Junjie Hu
Eunsol Choi
283
31
0
28 Nov 2022
12
Next
Page 1 of 2