ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.09753
  4. Cited By
MRQA 2019 Shared Task: Evaluating Generalization in Reading
  Comprehension

MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension

22 October 2019
Adam Fisch
Alon Talmor
Robin Jia
Minjoon Seo
Eunsol Choi
Danqi Chen
ArXivPDFHTML

Papers citing "MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension"

50 / 81 papers shown
Title
CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning
CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning
S. Wang
L. Zhang
Zheren Fu
Zhendong Mao
27
0
0
15 May 2025
UniKnow: A Unified Framework for Reliable Language Model Behavior across Parametric and External Knowledge
UniKnow: A Unified Framework for Reliable Language Model Behavior across Parametric and External Knowledge
Youna Kim
Hyuhng Joon Kim
Minjoon Choi
Hyuhng Joon Kim
Sang-goo Lee
Sang-goo Lee
Taeuk Kim
KELM
61
0
0
19 Feb 2025
Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models
Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models
Yanwen Huang
Yong Zhang
Ning Cheng
Zhitao Li
Shaojun Wang
Jing Xiao
91
0
0
02 Jan 2025
MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences
MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences
Genta Indra Winata
David Anugraha
Lucky Susanto
Garry Kuwanto
Derry Wijaya
42
8
0
03 Oct 2024
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
Yifei Ming
Senthil Purushwalkam
Shrey Pandit
Zixuan Ke
Xuan-Phi Nguyen
Caiming Xiong
Chenyu You
HILM
114
16
0
30 Sep 2024
Evaluation of Retrieval-Augmented Generation: A Survey
Evaluation of Retrieval-Augmented Generation: A Survey
Hao Yu
Aoran Gan
Kai Zhang
Shiwei Tong
Qi Liu
Zhaofeng Liu
3DV
62
83
0
13 May 2024
FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning
  Leveraging Weight Decomposition
FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition
Yuxuan Yan
Qianqian Yang
Shunpu Tang
Zhiguo Shi
38
14
0
29 Apr 2024
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot
  Question Answering
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
Xiusi Chen
Jyun-Yu Jiang
Wei-Cheng Chang
Cho-Jui Hsieh
Hsiang-Fu Yu
Wei Wang
21
11
0
08 Oct 2023
AGent: A Novel Pipeline for Automatically Creating Unanswerable
  Questions
AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions
Son Quoc Tran
Gia-Huy Do
Phong Nguyen-Thuan Do
Matt Kretchmar
Xinya Du
29
0
0
10 Sep 2023
Investigating the Factual Knowledge Boundary of Large Language Models
  with Retrieval Augmentation
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
Ruiyang Ren
Yuhao Wang
Yingqi Qu
Wayne Xin Zhao
Qingbin Liu
Hao Tian
Huaqin Wu
Ji-Rong Wen
Haifeng Wang
RALM
KELM
46
125
0
20 Jul 2023
Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis,
  and LLMs Evaluations
Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Lifan Yuan
Yangyi Chen
Ganqu Cui
Hongcheng Gao
Fangyuan Zou
Xingyi Cheng
Heng Ji
Zhiyuan Liu
Maosong Sun
41
73
0
07 Jun 2023
A Frustratingly Easy Improvement for Position Embeddings via Random
  Padding
A Frustratingly Easy Improvement for Position Embeddings via Random Padding
Mingxu Tao
Yansong Feng
Dongyan Zhao
34
6
0
08 May 2023
UKP-SQuARE v3: A Platform for Multi-Agent QA Research
UKP-SQuARE v3: A Platform for Multi-Agent QA Research
Haritz Puerto
Tim Baumgärtner
Rachneet Sachdeva
Haishuo Fang
Haotian Zhang
Sewin Tariverdian
Kexin Wang
Iryna Gurevych
28
2
0
31 Mar 2023
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Zhen Wang
Yikang Shen
Leonid Karlinsky
Rogerio Feris
Huan Sun
Yoon Kim
VLM
VPVLM
44
107
0
06 Mar 2023
Robust Question Answering against Distribution Shifts with Test-Time
  Adaptation: An Empirical Study
Robust Question Answering against Distribution Shifts with Test-Time Adaptation: An Empirical Study
Hai Ye
Yuyang Ding
Juntao Li
Hwee Tou Ng
OOD
TTA
29
9
0
09 Feb 2023
Understanding Finetuning for Factual Knowledge Extraction from Language
  Models
Understanding Finetuning for Factual Knowledge Extraction from Language Models
Mehran Kazemi
Sid Mittal
Deepak Ramachandran
KELM
34
10
0
26 Jan 2023
PIE-QG: Paraphrased Information Extraction for Unsupervised Question
  Generation from Small Corpora
PIE-QG: Paraphrased Information Extraction for Unsupervised Question Generation from Small Corpora
D. Nagumothu
B. Ofoghi
G. Huang
Peter W. Eklund
RALM
16
5
0
03 Jan 2023
TASA: Deceiving Question Answering Models by Twin Answer Sentences
  Attack
TASA: Deceiving Question Answering Models by Twin Answer Sentences Attack
Yu Cao
Dianqi Li
Meng Fang
Dinesh Manocha
Jun Gao
Yibing Zhan
Dacheng Tao
AAML
26
15
0
27 Oct 2022
DyREx: Dynamic Query Representation for Extractive Question Answering
DyREx: Dynamic Query Representation for Extractive Question Answering
Urchade Zaratiana
Niama El Khbir
Dennis Núñez
Pierre Holat
Nadi Tomeh
Thierry Charnois
89
2
0
26 Oct 2022
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating
  Models to Reflect Conflicting Evidence
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence
Hung-Ting Chen
Michael J.Q. Zhang
Eunsol Choi
RALM
HILM
47
92
0
25 Oct 2022
Pre-training Language Models with Deterministic Factual Knowledge
Pre-training Language Models with Deterministic Factual Knowledge
Shaobo Li
Xiaoguang Li
Lifeng Shang
Chengjie Sun
Bingquan Liu
Zhenzhou Ji
Xin Jiang
Qun Liu
KELM
47
11
0
20 Oct 2022
Prompting GPT-3 To Be Reliable
Prompting GPT-3 To Be Reliable
Chenglei Si
Zhe Gan
Zhengyuan Yang
Shuohang Wang
Jianfeng Wang
Jordan L. Boyd-Graber
Lijuan Wang
KELM
LRM
50
283
0
17 Oct 2022
Are Sample-Efficient NLP Models More Robust?
Are Sample-Efficient NLP Models More Robust?
Nelson F. Liu
Ananya Kumar
Percy Liang
Robin Jia
VLM
OOD
30
6
0
12 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Ryan Cotterell
Zhijing Jin
129
95
0
06 Oct 2022
Honest Students from Untrusted Teachers: Learning an Interpretable
  Question-Answering Pipeline from a Pretrained Language Model
Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model
Jacob Eisenstein
D. Andor
Bernd Bohnet
Michael Collins
David M. Mimno
LRM
194
24
0
05 Oct 2022
Using contradictions improves question answering systems
Using contradictions improves question answering systems
Étienne Fortier-Dubois
Domenic Rosati
15
0
0
28 Sep 2022
Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?
Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?
Jiawen Wu
Xinyu Zhang
Yutao Zhu
Zheng Liu
Zikai Guo
Zhaoye Fei
Ruofei Lai
Yongkang Wu
Bo Zhao
Zhicheng Dou
33
5
0
14 Sep 2022
Continual Machine Reading Comprehension via Uncertainty-aware Fixed
  Memory and Adversarial Domain Adaptation
Continual Machine Reading Comprehension via Uncertainty-aware Fixed Memory and Adversarial Domain Adaptation
Zhijing Wu
Hua Xu
Jingliang Fang
Kai Gao
CLL
28
1
0
10 Aug 2022
BioTABQA: Instruction Learning for Biomedical Table Question Answering
BioTABQA: Instruction Learning for Biomedical Table Question Answering
Man Luo
S. Saxena
Swaroop Mishra
Mihir Parmar
Chitta Baral
LMTD
157
15
0
06 Jul 2022
QAGAN: Adversarial Approach To Learning Domain Invariant Language
  Features
QAGAN: Adversarial Approach To Learning Domain Invariant Language Features
Shubham Shrivastava
Kaiyue Wang
OOD
29
2
0
24 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjD
VLM
MLLM
77
393
0
17 Jun 2022
Can Foundation Models Help Us Achieve Perfect Secrecy?
Can Foundation Models Help Us Achieve Perfect Secrecy?
Simran Arora
Christopher Ré
FedML
24
6
0
27 May 2022
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures
  of Soft Prompts
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts
Akari Asai
Mohammadreza Salehi
Matthew E. Peters
Hannaneh Hajishirzi
130
100
0
24 May 2022
Not to Overfit or Underfit the Source Domains? An Empirical Study of
  Domain Generalization in Question Answering
Not to Overfit or Underfit the Source Domains? An Empirical Study of Domain Generalization in Question Answering
Md Arafat Sultan
Avirup Sil
Radu Florian
OOD
32
6
0
15 May 2022
KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive
  Question Answering
KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive Question Answering
Jie Wang
Chengyu Wang
Minghui Qiu
Qiuhui Shi
Hongbin Wang
Jun Huang
Ming Gao
RALM
39
16
0
06 May 2022
On Continual Model Refinement in Out-of-Distribution Data Streams
On Continual Model Refinement in Out-of-Distribution Data Streams
Bill Yuchen Lin
Sida I. Wang
Xi Lin
Robin Jia
Lin Xiao
Xiang Ren
Wen-tau Yih
CLL
28
29
0
04 May 2022
A Thorough Examination on Zero-shot Dense Retrieval
A Thorough Examination on Zero-shot Dense Retrieval
Ruiyang Ren
Yingqi Qu
Qingbin Liu
Wayne Xin Zhao
Qifei Wu
Yuchen Ding
Hua Wu
Haifeng Wang
Ji-Rong Wen
37
41
0
27 Apr 2022
ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking
  Inference
ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference
Kai Hui
Honglei Zhuang
Tao Chen
Zhen Qin
Jing Lu
...
Ji Ma
Jai Gupta
Cicero Nogueira dos Santos
Yi Tay
Donald Metzler
34
16
0
25 Apr 2022
LinkBERT: Pretraining Language Models with Document Links
LinkBERT: Pretraining Language Models with Document Links
Michihiro Yasunaga
J. Leskovec
Percy Liang
KELM
29
353
0
29 Mar 2022
VLSP 2021 - ViMRC Challenge: Vietnamese Machine Reading Comprehension
VLSP 2021 - ViMRC Challenge: Vietnamese Machine Reading Comprehension
Kiet Van Nguyen
Son Quoc Tran
Luan Thanh Nguyen
Tin Van Huynh
Son T. Luu
Ngan Luu-Thuy Nguyen
32
12
0
22 Mar 2022
Hyperdecoders: Instance-specific decoders for multi-task NLP
Hyperdecoders: Instance-specific decoders for multi-task NLP
Hamish Ivison
Matthew E. Peters
AI4CE
34
20
0
15 Mar 2022
Generalized but not Robust? Comparing the Effects of Data Modification
  Methods on Out-of-Domain Generalization and Adversarial Robustness
Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial Robustness
Tejas Gokhale
Swaroop Mishra
Man Luo
Bhavdeep Singh Sachdeva
Chitta Baral
52
29
0
15 Mar 2022
Choose Your QA Model Wisely: A Systematic Study of Generative and
  Extractive Readers for Question Answering
Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering
Man Luo
Kazuma Hashimoto
Semih Yavuz
Zhiwei Liu
Chitta Baral
Yingbo Zhou
29
21
0
14 Mar 2022
What Makes Reading Comprehension Questions Difficult?
What Makes Reading Comprehension Questions Difficult?
Saku Sugawara
Nikita Nangia
Alex Warstadt
Sam Bowman
ELM
RALM
20
13
0
12 Mar 2022
Read before Generate! Faithful Long Form Question Answering with Machine
  Reading
Read before Generate! Faithful Long Form Question Answering with Machine Reading
Dan Su
Xiaoguang Li
Jindi Zhang
Lifeng Shang
Xin Jiang
Qun Liu
Pascale Fung
HILM
19
59
0
01 Mar 2022
Active Learning Over Multiple Domains in Natural Language Tasks
Active Learning Over Multiple Domains in Natural Language Tasks
Shayne Longpre
Julia Reisler
E. G. Huang
Yi Lu
Andrew J. Frank
Nikhil Ramesh
Chris DuBois
OOD
27
13
0
01 Feb 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
33
141
0
14 Jan 2022
SCROLLS: Standardized CompaRison Over Long Language Sequences
SCROLLS: Standardized CompaRison Over Long Language Sequences
Uri Shaham
Elad Segal
Maor Ivgi
Avia Efrat
Ori Yoran
...
Ankit Gupta
Wenhan Xiong
Mor Geva
Jonathan Berant
Omer Levy
RALM
37
133
0
10 Jan 2022
Models in the Loop: Aiding Crowdworkers with Generative Annotation
  Assistants
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants
Max Bartolo
Tristan Thrush
Sebastian Riedel
Pontus Stenetorp
Robin Jia
Douwe Kiela
24
33
0
16 Dec 2021
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
Haritz Puerto
Gözde Gül Sahin
Iryna Gurevych
LLMAG
33
20
0
03 Dec 2021
12
Next