1705.03551
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
9 May 2017
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
Papers citing
"TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension"
50 / 1,823 papers shown
Title
Context versus Prior Knowledge in Language Models
Kevin Du
Vésteinn Snaebjarnarson
Niklas Stoehr
Jennifer C. White
Aaron Schein
Ryan Cotterell
104
13
0
06 Apr 2024
Q-PEFT: Query-dependent Parameter Efficient Fine-tuning for Text Reranking with Large Language Models
Zhiyuan Peng
Xuyang Wu
Qifan Wang
Sravanthi Rajanala
Yi Fang
111
4
0
06 Apr 2024
Teaching Llama a New Language Through Cross-Lingual Knowledge Transfer
Hele-Andra Kuulmets
Taido Purason
Agnes Luhtaru
Mark Fishel
81
19
0
05 Apr 2024
BuDDIE: A Business Document Dataset for Multi-task Information Extraction
Ran Zmigrod
Dongsheng Wang
Mathieu Sibue
Yulong Pei
Petr Babkin
...
Antony Papadimitriou
William Watson
Zhiqiang Ma
Armineh Nourbakhsh
Sameena Shah
66
5
0
05 Apr 2024
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping
Ajay Jaiswal
Bodun Hu
Lu Yin
Yeonju Ro
Shiwei Liu
Tianlong Chen
Aditya Akella
132
17
0
05 Apr 2024
SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses
Dongwei Jiang
Jingyu Zhang
Orion Weller
Nathaniel Weir
Benjamin Van Durme
Daniel Khashabi
104
3
0
04 Apr 2024
Mitigating LLM Hallucinations via Conformal Abstention
Yasin Abbasi-Yadkori
Ilja Kuzborskij
David Stutz
András György
Adam Fisch
...
Wei-Hung Weng
Yao-Yuan Yang
Csaba Szepesvári
A. Cemgil
Nenad Tomašev
HILM
88
19
0
04 Apr 2024
Uncertainty in Language Models: Assessment through Rank-Calibration
Xinmeng Huang
Shuo Li
Mengxin Yu
Matteo Sesia
Hamed Hassani
Insup Lee
Osbert Bastani
Yan Sun
97
20
0
04 Apr 2024
Multi-Granularity Guided Fusion-in-Decoder
Eunseong Choi
Hyeri Lee
Jongwuk Lee
62
1
0
03 Apr 2024
Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts
Zhuo Chen
Xinyu Wang
Yong Jiang
Pengjun Xie
Fei Huang
Kewei Tu
RALM
86
3
0
02 Apr 2024
HyperCLOVA X Technical Report
Kang Min Yoo
Jaegeun Han
Sookyo In
Heewon Jeon
Jisu Jeong
...
Hyunkyung Noh
Se-Eun Choi
Sang-Woo Lee
Jung Hwa Lim
Nako Sung
VLM
88
9
0
02 Apr 2024
Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization
Zixuan Zhang
R. Reddy
Kevin Small
Tong Zhang
Heng Ji
118
1
0
02 Apr 2024
The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis
Chen Yang
Junzhuo Li
Xinyao Niu
Xinrun Du
Songyang Gao
...
Stephen W. Huang
Shawn Yue
Wenhu Chen
Jie Fu
Ge Zhang
78
2
0
01 Apr 2024
Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs
Xiaoze Liu
Feijie Wu
Tianyang Xu
Zhuo Chen
Yichi Zhang
Xiaoqian Wang
Jing Gao
HILM
100
10
0
01 Apr 2024
CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs
Jingzhe Shi
Jialuo Li
Qinwei Ma
Zaiwen Yang
Huan Ma
Lei Li
85
7
0
31 Mar 2024
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Taishi Nakamura
Mayank Mishra
Simone Tedeschi
Yekun Chai
Jason T Stillerman
...
Virendra Mehta
Matthew Blumberg
Victor May
Huu Nguyen
S. Pyysalo
LRM
93
8
0
30 Mar 2024
TriviaHG: A Dataset for Automatic Hint Generation from Factoid Questions
Jamshid Mozafari
Anubhav Jangra
Adam Jatowt
94
4
1
27 Mar 2024
Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback
Hongshen Xu
Zichen Zhu
Situo Zhang
Da Ma
Shuai Fan
Lu Chen
Kai Yu
HILM
107
45
0
27 Mar 2024
Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art
Neeloy Chakraborty
Melkior Ornik
Katherine Driggs-Campbell
LRM
252
12
0
25 Mar 2024
Understanding Emergent Abilities of Language Models from the Loss Perspective
Zhengxiao Du
Aohan Zeng
Yuxiao Dong
Jie Tang
UQCV
LRM
166
56
0
23 Mar 2024
KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation
Xindi Luo
Zequn Sun
Jing-xin Zhao
Zhe Zhao
Wei Hu
KELM
69
8
0
22 Mar 2024
Detoxifying Large Language Models via Knowledge Editing
Meng Wang
Ningyu Zhang
Ziwen Xu
Zekun Xi
Shumin Deng
Yunzhi Yao
Qishen Zhang
Linyi Yang
Jindong Wang
Huajun Chen
KELM
112
66
0
21 Mar 2024
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity
Soyeong Jeong
Jinheon Baek
Sukmin Cho
Sung Ju Hwang
Jong C. Park
RALM
126
187
0
21 Mar 2024
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
Yuren Mao
Xuemei Dong
Wenyi Xu
Yunjun Gao
Bin Wei
Ying Zhang
62
10
0
21 Mar 2024
Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering
Kosuke Akimoto
Kunihiro Takeoka
Masafumi Oyamada
75
1
0
21 Mar 2024
M3: A Multi-Task Mixed-Objective Learning Framework for Open-Domain Multi-Hop Dense Sentence Retrieval
Yang Bai
Anthony Colas
Christan Earl Grant
Daisy Zhe Wang
77
1
0
21 Mar 2024
Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference
Baolin Li
Yankai Jiang
V. Gadepally
Devesh Tiwari
101
15
0
19 Mar 2024
What Are Tools Anyway? A Survey from the Language Model Perspective
Zhiruo Wang
Zhoujun Cheng
Hao Zhu
Daniel Fried
Graham Neubig
130
33
0
18 Mar 2024
RAFT: Adapting Language Model to Domain Specific RAG
Tianjun Zhang
Shishir G. Patil
Naman Jain
Sheng Shen
Matei A. Zaharia
Ion Stoica
Joseph E. Gonzalez
RALM
108
213
0
15 Mar 2024
Self-Consistency Boosts Calibration for Math Reasoning
Ante Wang
Linfeng Song
Ye Tian
Baolin Peng
Lifeng Jin
Haitao Mi
Jinsong Su
Dong Yu
LRM
68
5
0
14 Mar 2024
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Brandon McKinzie
Zhe Gan
J. Fauconnier
Sam Dodge
Bowen Zhang
...
Zirui Wang
Ruoming Pang
Peter Grasch
Alexander Toshev
Yinfei Yang
MLLM
138
209
0
14 Mar 2024
Semiparametric Token-Sequence Co-Supervision
Hyunji Lee
Doyoung Kim
Jihoon Jun
Se June Joo
Joel Jang
Kyoung-Woon On
Minjoon Seo
114
1
0
14 Mar 2024
Simple and Scalable Strategies to Continually Pre-train Large Language Models
Adam Ibrahim
Benjamin Thérien
Kshitij Gupta
Mats L. Richter
Quentin Anthony
Timothée Lesort
Eugene Belilovsky
Irina Rish
KELM
CLL
109
63
0
13 Mar 2024
Knowledge Conflicts for LLMs: A Survey
Rongwu Xu
Zehan Qi
Zhijiang Guo
Cunxiang Wang
Hongru Wang
Yue Zhang
Wei Xu
313
122
0
13 Mar 2024
Gemma: Open Models Based on Gemini Research and Technology
Gemma Team
Thomas Mesnard
Cassidy Hardin
Robert Dadashi
Surya Bhupatiraju
...
Armand Joulin
Noah Fiedel
Evan Senter
Alek Andreev
Kathleen Kenealy
VLM
LLMAG
245
515
0
13 Mar 2024
BAGEL: Bootstrapping Agents by Guiding Exploration with Language
Shikhar Murty
Christopher D. Manning
Peter Shaw
Mandar Joshi
Kenton Lee
LM&Ro
LLMAG
72
20
0
12 Mar 2024
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Sainbayar Sukhbaatar
O. Yu. Golovneva
Vasu Sharma
Hu Xu
Xi Lin
...
Jacob Kahn
Shang-Wen Li
Wen-tau Yih
Jason Weston
Xian Li
MoMe
OffRL
MoE
98
69
0
12 Mar 2024
RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback
Yanming Liu
Xinyue Peng
Xuhong Zhang
Weihao Liu
Jianwei Yin
Jiannan Cao
Tianyu Du
RALM
71
45
0
11 Mar 2024
Calibrating Large Language Models Using Their Generations Only
Dennis Ulmer
Martin Gubri
Hwaran Lee
Sangdoo Yun
Seong Joon Oh
UQLM
506
28
1
09 Mar 2024
Unfamiliar Finetuning Examples Control How Language Models Hallucinate
Katie Kang
Eric Wallace
Claire Tomlin
Aviral Kumar
Sergey Levine
HILM
LRM
109
58
0
08 Mar 2024
HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild
Zhiying Zhu
Yiming Yang
Zhiqing Sun
HILM
VLM
96
14
0
07 Mar 2024
Backtracing: Retrieving the Cause of the Query
Rose E. Wang
Pawan Wirawarn
Omar Khattab
Noah D. Goodman
Dorottya Demszky
59
2
0
06 Mar 2024
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation
Shiqi Chen
Miao Xiong
Junteng Liu
Zhengxuan Wu
Teng Xiao
Siyang Gao
Junxian He
HILM
138
26
0
03 Mar 2024
Infusing Knowledge into Large Language Models with Contextual Prompts
Kinshuk Vasisht
Balaji Ganesan
Vikas Kumar
Vasudha Bhatnagar
KELM
90
3
0
03 Mar 2024
Automatic Question-Answer Generation for Long-Tail Knowledge
Rohan Kumar
Youngmin Kim
Sunitha Ravi
Haitian Sun
Christos Faloutsos
Ruslan Salakhutdinov
Minji Yoon
53
8
0
03 Mar 2024
API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access
Jiayuan Su
Jing Luo
Hongwei Wang
Lu Cheng
247
23
0
02 Mar 2024
LocalRQA: From Generating Data to Locally Training, Testing, and Deploying Retrieval-Augmented QA Systems
Xiao Yu
Yunan Lu
Zhou Yu
RALM
73
7
0
01 Mar 2024
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Saurabh Srivastava
Annarose M B
Anto P V
Shashank Menon
Ajay Sukumar
Adwaith Samod T
Alan Philipose
Stevin Prince
Sooraj Thomas
ELM
ReLM
LRM
79
56
0
29 Feb 2024
Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark
Zhikun Xu
Hai-Tao Zheng
Ruixue Ding
Xinyu Wang
Boli Chen
Yong Jiang
Hai-Tao Zheng
Wenlian Lu
Pengjun Xie
Fei Huang
120
11
0
29 Feb 2024
CogBench: a large language model walks into a psychology lab
Julian Coda-Forno
Marcel Binz
Jane X. Wang
Eric Schulz
ELM
ALM
LLMAG
LM&MA
121
39
0
28 Feb 2024