ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.03551
  4. Cited By
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for
  Reading Comprehension
v1v2 (latest)

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

9 May 2017
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
    RALM
ArXiv (abs)PDFHTML

Papers citing "TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension"

50 / 1,823 papers shown
Title
WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large
  Language Models
WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models
Youssef Benchekroun
Megi Dervishi
Mark Ibrahim
Jean-Baptiste Gaya
Xavier Martinet
Grégoire Mialon
Thomas Scialom
Emmanuel Dupoux
Dieuwke Hupkes
Pascal Vincent
LRM
68
8
0
27 Nov 2023
Boot and Switch: Alternating Distillation for Zero-Shot Dense Retrieval
Boot and Switch: Alternating Distillation for Zero-Shot Dense Retrieval
Fan Jiang
Xingliang Yuan
Tom Drummond
Trevor Cohn
73
2
0
27 Nov 2023
Efficient Transformer Knowledge Distillation: A Performance Review
Efficient Transformer Knowledge Distillation: A Performance Review
Nathan Brown
Ashton Williamson
Tahj Anderson
Logan Lawrence
VLM
58
5
0
22 Nov 2023
Drilling Down into the Discourse Structure with LLMs for Long Document
  Question Answering
Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering
Inderjeet Nair
Shwetha Somasundaram
Apoorv Saxena
Koustava Goswami
RALM
78
9
0
22 Nov 2023
Unifying Corroborative and Contributive Attributions in Large Language
  Models
Unifying Corroborative and Contributive Attributions in Large Language Models
Theodora Worledge
Judy Hanwen Shen
Nicole Meister
Caleb Winston
Carlos Guestrin
TDI
95
13
0
20 Nov 2023
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
David Rein
Betty Li Hou
Asa Cooper Stickland
Jackson Petty
Richard Yuanzhe Pang
Julien Dirani
Julian Michael
Samuel R. Bowman
AI4MHELM
183
738
0
20 Nov 2023
Empirical evaluation of Uncertainty Quantification in
  Retrieval-Augmented Language Models for Science
Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science
S. Wagle
Sai Munikoti
Anurag Acharya
Sara Smith
Sameera Horawalavithana
28
5
0
15 Nov 2023
Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language
  Models
Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models
Wenhao Yu
Hongming Zhang
Xiaoman Pan
Kaixin Ma
Hongwei Wang
Dong Yu
KELMRALMLRM
125
119
0
15 Nov 2023
Memory Augmented Language Models through Mixture of Word Experts
Memory Augmented Language Models through Mixture of Word Experts
Cicero Nogueira dos Santos
James Lee-Thorp
Isaac Noble
Chung-Ching Chang
David C. Uthus
MoE
110
8
0
15 Nov 2023
Ever: Mitigating Hallucination in Large Language Models through
  Real-Time Verification and Rectification
Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification
Haoqiang Kang
Juntong Ni
Huaxiu Yao
HILMLRM
113
37
0
15 Nov 2023
Safer-Instruct: Aligning Language Models with Automated Preference Data
Safer-Instruct: Aligning Language Models with Automated Preference Data
Taiwei Shi
Kai Chen
Jieyu Zhao
ALMSyDa
115
28
0
15 Nov 2023
Learning to Filter Context for Retrieval-Augmented Generation
Learning to Filter Context for Retrieval-Augmented Generation
Zhiruo Wang
Jun Araki
Zhengbao Jiang
Md. Rizwan Parvez
Graham Neubig
RALM
76
52
0
14 Nov 2023
KTRL+F: Knowledge-Augmented In-Document Search
KTRL+F: Knowledge-Augmented In-Document Search
Hanseok Oh
Haebin Shin
Miyoung Ko
Hyunji Lee
Minjoon Seo
62
3
0
14 Nov 2023
Hallucination Augmented Recitations for Language Models
Hallucination Augmented Recitations for Language Models
Abdullatif Köksal
Renat Aksitov
Chung-Ching Chang
HILM
69
5
0
13 Nov 2023
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs
Yassir Fathullah
Chunyang Wu
Egor Lakomkin
Ke Li
Junteng Jia
Shangguan Yuan
Jay Mahadeokar
Ozlem Kalinli
Christian Fuegen
Michael Seltzer
LM&MAMLLMAuLLM
118
44
0
12 Nov 2023
Trends in Integration of Knowledge and Large Language Models: A Survey
  and Taxonomy of Methods, Benchmarks, and Applications
Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications
Zhangyin Feng
Weitao Ma
Weijiang Yu
Lei Huang
Haotian Wang
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
KELM
80
40
0
10 Nov 2023
Agent Lumos: Unified and Modular Training for Open-Source Language
  Agents
Agent Lumos: Unified and Modular Training for Open-Source Language Agents
Da Yin
Faeze Brahman
Abhilasha Ravichander
Khyathi Chandu
Kai-Wei Chang
Yejin Choi
Bill Yuchen Lin
LLMAG
126
44
0
09 Nov 2023
SEMQA: Semi-Extractive Multi-Source Question Answering
SEMQA: Semi-Extractive Multi-Source Question Answering
Tal Schuster
Á. Lelkes
Haitian Sun
Jai Gupta
Jonathan Berant
W. Cohen
Donald Metzler
73
14
0
08 Nov 2023
MTGER: Multi-view Temporal Graph Enhanced Temporal Reasoning over
  Time-Involved Document
MTGER: Multi-view Temporal Graph Enhanced Temporal Reasoning over Time-Involved Document
Zheng Chu
Zekun Wang
Jiafeng Liang
Ming Liu
Bing Qin
77
2
0
08 Nov 2023
Noisy Pair Corrector for Dense Retrieval
Noisy Pair Corrector for Dense Retrieval
Hang Zhang
Yeyun Gong
Xingwei He
Dayiheng Liu
Daya Guo
Jiancheng Lv
Jian Guo
87
7
0
07 Nov 2023
Adapting Pre-trained Generative Models for Extractive Question Answering
Adapting Pre-trained Generative Models for Extractive Question Answering
Prabir Mallick
Tapas Nayak
Indrajit Bhattacharya
57
4
0
06 Nov 2023
Perturbation-based Active Learning for Question Answering
Perturbation-based Active Learning for Question Answering
Fan Luo
Mihai Surdeanu
83
0
0
04 Nov 2023
Post Turing: Mapping the landscape of LLM Evaluation
Post Turing: Mapping the landscape of LLM Evaluation
Alexey Tikhonov
Ivan P. Yamshchikov
ELM
102
4
0
03 Nov 2023
Hint-enhanced In-Context Learning wakes Large Language Models up for
  knowledge-intensive tasks
Hint-enhanced In-Context Learning wakes Large Language Models up for knowledge-intensive tasks
Yifan Wang
Qingyan Guo
Xinzhe Ni
Chufan Shi
Lemao Liu
Haiyun Jiang
Yujiu Yang
ReLMRALM
52
8
0
03 Nov 2023
Plot Retrieval as an Assessment of Abstract Semantic Association
Plot Retrieval as an Assessment of Abstract Semantic Association
Shicheng Xu
Liang Pang
JiangNan Li
Mo Yu
Fandong Meng
Huawei Shen
Xueqi Cheng
Jie Zhou
87
3
0
03 Nov 2023
From Image to Language: A Critical Analysis of Visual Question Answering
  (VQA) Approaches, Challenges, and Opportunities
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
Md Farhan Ishmam
Md Sakib Hossain Shovon
M. F. Mridha
Nilanjan Dey
158
44
0
01 Nov 2023
DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain
  Question Answering over Knowledge Base and Text
DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text
Wenting Zhao
Ye Liu
Tong Niu
Yao Wan
Philip S. Yu
Shafiq Joty
Yingbo Zhou
Semih Yavuz
LRM
93
7
0
31 Oct 2023
MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties
  in Generative Language Models
MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models
Zhenpeng Su
Xing Wu
Xue Bai
Zijia Lin
Hui Chen
Guiguang Ding
Wei Zhou
Songlin Hu
138
5
0
30 Oct 2023
A Lightweight Method to Generate Unanswerable Questions in English
A Lightweight Method to Generate Unanswerable Questions in English
Vagrant Gautam
Miaoran Zhang
Dietrich Klakow
75
1
0
30 Oct 2023
Skywork: A More Open Bilingual Foundation Model
Skywork: A More Open Bilingual Foundation Model
Tianwen Wei
Liang Zhao
Lichang Zhang
Bo Zhu
Lijie Wang
...
Yongyi Peng
Xiaojuan Liang
Shuicheng Yan
Han Fang
Yahui Zhou
104
102
0
30 Oct 2023
Fusing Temporal Graphs into Transformers for Time-Sensitive Question
  Answering
Fusing Temporal Graphs into Transformers for Time-Sensitive Question Answering
Xin Su
Phillip Howard
Nagib Hakim
Steven Bethard
99
3
0
30 Oct 2023
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context
  Evaluation Benchmark for Large Language Models
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Wai-Chung Kwan
Xingshan Zeng
Yufei Wang
Yusen Sun
Liangyou Li
Lifeng Shang
Qun Liu
Kam-Fai Wong
ELM
154
11
0
30 Oct 2023
N-Critics: Self-Refinement of Large Language Models with Ensemble of
  Critics
N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics
Sajad Mousavi
Ricardo Luna Gutierrez
Desik Rengarajan
Vineet Gundecha
Ashwin Ramesh Babu
Avisek Naug
Antonio Guillen-Perez
Soumyendu Sarkar
LRMHILMKELM
50
7
0
28 Oct 2023
Personas as a Way to Model Truthfulness in Language Models
Personas as a Way to Model Truthfulness in Language Models
Nitish Joshi
Javier Rando
Abulhair Saparov
Najoung Kim
He He
HILM
126
34
0
27 Oct 2023
Detrimental Contexts in Open-Domain Question Answering
Detrimental Contexts in Open-Domain Question Answering
Philhoon Oh
James Thorne
60
1
0
27 Oct 2023
An Open Source Data Contamination Report for Large Language Models
An Open Source Data Contamination Report for Large Language Models
Yucheng Li
Frank Guerin
Chenghua Lin
ELM
104
19
0
26 Oct 2023
Improving Zero-shot Reader by Reducing Distractions from Irrelevant
  Documents in Open-Domain Question Answering
Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering
Sukmin Cho
Jeongyeon Seo
Soyeong Jeong
Jong C. Park
RALM
69
2
0
26 Oct 2023
Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained
  Language Models
Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models
Paul Youssef
Osman Alperen Koracs
Meijie Li
Jorg Schlotterer
Christin Seifert
KELM
81
19
0
25 Oct 2023
SoK: Memorization in General-Purpose Large Language Models
SoK: Memorization in General-Purpose Large Language Models
Valentin Hartmann
Anshuman Suri
Vincent Bindschaedler
David Evans
Shruti Tople
Robert West
KELMLLMAG
98
24
0
24 Oct 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without
  Full Large Language Model
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
Kaiyan Zhang
Ning Ding
Biqing Qi
Xuekai Zhu
Xinwei Long
Bowen Zhou
100
5
0
24 Oct 2023
FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering
FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering
Md Rafi Ur Rashid
Vishnu Asutosh Dasu
Kang Gu
Najrin Sultana
Shagufta Mehnaz
AAMLFedML
179
12
0
24 Oct 2023
Specialist or Generalist? Instruction Tuning for Specific NLP Tasks
Specialist or Generalist? Instruction Tuning for Specific NLP Tasks
Chufan Shi
Yixuan Su
Cheng Yang
Yujiu Yang
Deng Cai
120
18
0
23 Oct 2023
Large Language Models are Visual Reasoning Coordinators
Large Language Models are Visual Reasoning Coordinators
Liangyu Chen
Bo Li
Sheng Shen
Jingkang Yang
Chunyuan Li
Kurt Keutzer
Trevor Darrell
Ziwei Liu
VLMLRM
130
58
0
23 Oct 2023
Merging Generated and Retrieved Knowledge for Open-Domain QA
Merging Generated and Retrieved Knowledge for Open-Domain QA
Yunxiang Zhang
Muhammad Khalifa
Lajanugen Logeswaran
Moontae Lee
Honglak Lee
Lu Wang
RALM
91
38
0
22 Oct 2023
Chainpoll: A high efficacy method for LLM hallucination detection
Chainpoll: A high efficacy method for LLM hallucination detection
Robert Friel
Atindriyo Sanyal
LRMHILM
80
28
0
22 Oct 2023
Self-prompted Chain-of-Thought on Large Language Models for Open-domain
  Multi-hop Reasoning
Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning
Jinyuan Wang
Junlong Li
Hai Zhao
LRMReLM
114
25
0
20 Oct 2023
A Diachronic Perspective on User Trust in AI under Uncertainty
A Diachronic Perspective on User Trust in AI under Uncertainty
Shehzaad Dhuliawala
Vilém Zouhar
Mennatallah El-Assady
Mrinmaya Sachan
85
16
0
20 Oct 2023
Explicit Alignment and Many-to-many Entailment Based Reasoning for
  Conversational Machine Reading
Explicit Alignment and Many-to-many Entailment Based Reasoning for Conversational Machine Reading
Yangyang Luo
Shiyu Tian
Caixia Yuan
Fangkun Zhao
62
1
0
20 Oct 2023
Test-Time Self-Adaptive Small Language Models for Question Answering
Test-Time Self-Adaptive Small Language Models for Question Answering
Soyeong Jeong
Jinheon Baek
Sukmin Cho
Sung Ju Hwang
Jong C. Park
64
2
0
20 Oct 2023
Towards Understanding Sycophancy in Language Models
Towards Understanding Sycophancy in Language Models
Mrinank Sharma
Meg Tong
Tomasz Korbak
David Duvenaud
Amanda Askell
...
Oliver Rausch
Nicholas Schiefer
Da Yan
Miranda Zhang
Ethan Perez
369
247
0
20 Oct 2023
Previous
123...181920...353637
Next