Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.03551
Cited By
v1
v2 (latest)
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
9 May 2017
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension"
50 / 1,823 papers shown
Title
Chain-of-Thought Prompting Obscures Hallucination Cues in Large Language Models: An Empirical Evaluation
Jiahao Cheng
Tiancheng Su
Jia Yuan
Guoxiu He
Jiawei Liu
Xinqi Tao
Jingwen Xie
Huaxia Li
HILM
LRM
28
0
0
20 Jun 2025
The Role of Model Confidence on Bias Effects in Measured Uncertainties
Xinyi Liu
Weiguang Wang
Hangfeng He
23
0
0
20 Jun 2025
Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?
Adithya Bhaskar
Alexander Wettig
Tianyu Gao
Yihe Dong
Danqi Chen
22
0
0
20 Jun 2025
A Vietnamese Dataset for Text Segmentation and Multiple Choices Reading Comprehension
Toan Nguyen Hai
Ha Nguyen Viet
Truong Quan Xuan
Duc Do Minh
29
0
0
19 Jun 2025
RATTENTION: Towards the Minimal Sliding Window Size in Local-Global Attention Models
Bailin Wang
Chang Lan
Chong-Jun Wang
Ruoming Pang
32
0
0
18 Jun 2025
Context-Informed Grounding Supervision
Hyunji Lee
Seunghyun Yoon
Yunjae Won
Hanseok Oh
Geewook Kim
Trung H. Bui
Franck Dernoncourt
Elias Stengel-Eskin
Mohit Bansal
Minjoon Seo
LRM
43
0
0
18 Jun 2025
Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality
Yuto Harada
Yusuke Yamauchi
Yusuke Oda
Yohei Oseki
Yusuke Miyao
Yu Takagi
ALM
33
0
0
17 Jun 2025
CrEst: Credibility Estimation for Contexts in LLMs via Weak Supervision
Dyah Adila
Shuai Zhang
Boran Han
Bonan Min
Yuyang Wang
41
0
0
17 Jun 2025
Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence
Yibo Yang
Sihao Liu
Chuan Rao
Bang An
Tiancheng Shen
Philip Torr
Ming-Hsuan Yang
Bernard Ghanem
31
0
0
16 Jun 2025
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation
Zhuocheng Zhang
Yang Feng
Min Zhang
41
0
0
14 Jun 2025
MALM: A Multi-Information Adapter for Large Language Models to Mitigate Hallucination
Ao Jia
Haiming Wu
Guohui Yao
D. Song
Songkun Ji
Yazhou Zhang
26
0
0
14 Jun 2025
Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Xiaoran Fan
Zhichao Sun
Yangfan Gao
Jingfei Xiong
Hang Yan
...
Shaokang Dong
Tao Ji
Tao Gui
Qi Zhang
Xuanjing Huang
40
0
0
14 Jun 2025
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback
Dongwei Jiang
Alvin Zhang
Andrew Wang
Nicholas Andrews
Daniel Khashabi
LRM
31
0
0
13 Jun 2025
Constructing and Evaluating Declarative RAG Pipelines in PyTerrier
Craig Macdonald
Jinyuan Fang
Andrew Parry
Zaiqiao Meng
AI4TS
125
0
0
12 Jun 2025
Lifting Data-Tracing Machine Unlearning to Knowledge-Tracing for Foundation Models
Yuwen Tan
Boqing Gong
MU
46
0
0
12 Jun 2025
Query-Level Uncertainty in Large Language Models
Lihu Chen
Gaël Varoquaux
74
0
0
11 Jun 2025
Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models
Haoyi Song
Ruihan Ji
Naichen Shi
Fan Lai
Raed Al Kontar
84
0
0
11 Jun 2025
OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive Alignment
Chao-Hong Tan
Qian Chen
Wen Wang
Chong Deng
Qinglin Zhang
...
Yukun Ma
Yafeng Chen
Hui Wang
Jiaqing Liu
Jieping Ye
AuLLM
89
0
0
11 Jun 2025
TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding
Bingheng Wu
Jingze Shi
Yifan Wu
Nan Tang
Yuyu Luo
106
0
0
11 Jun 2025
The Geometries of Truth Are Orthogonal Across Tasks
Waiss Azizian
Michael Kirchhof
Eugène Ndiaye
Louis Béthune
Michal Klein
Pierre Ablin
Marco Cuturi
34
0
0
10 Jun 2025
Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models
Wentao Shi
Yiqing Shen
KELM
LRM
26
0
0
10 Jun 2025
Flow Matching Meets PDEs: A Unified Framework for Physics-Constrained Generation
Giacomo Baldan
Qiang Liu
Alberto Guardone
Nils Thuerey
AI4CE
32
1
0
10 Jun 2025
PropMEND: Hypernetworks for Knowledge Propagation in LLMs
Zeyu Leo Liu
Greg Durrett
Eunsol Choi
KELM
36
0
0
10 Jun 2025
Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive-
k
k
k
Chihiro Taguchi
Seiji Maekawa
Nikita Bhutani
RALM
35
0
0
10 Jun 2025
LEANN: A Low-Storage Vector Index
Yichuan Wang
Shu Liu
Zhifei Li
Yongji Wu
Ziming Mao
...
Yang Zhou
Ion Stoica
Sewon Min
Matei A. Zaharia
Joseph E. Gonzalez
34
0
0
09 Jun 2025
LLM Unlearning Should Be Form-Independent
Xiaotian Ye
Mengqi Zhang
Shu Wu
MU
27
0
0
09 Jun 2025
From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered
Siddartha Devic
Tejas Srinivasan
Jesse Thomason
Willie Neiswanger
Vatsal Sharan
30
0
0
09 Jun 2025
Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models
Haoyu Wang
Peihao Wang
Mufei Li
Shikun Liu
Siqi Miao
Zhangyang Wang
P. Li
20
0
0
09 Jun 2025
GaRAGe: A Benchmark with Grounding Annotations for RAG Evaluation
Ionut Teodor Sorodoc
Leonardo F. R. Ribeiro
Rexhina Blloshmi
Christopher Davis
Adria de Gispert
21
0
0
09 Jun 2025
dots.llm1 Technical Report
Bi Huo
Bin Tu
Cheng Qin
Da Zheng
Debing Zhang
...
Yuqiu Ji
Ze Wen
Zhenhai Liu
Zichao Li
Zilong Liao
MoE
61
0
0
06 Jun 2025
Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge
Yi Sui
Chaozhuo Li
Chen Zhang
D. Song
Qiuchi Li
58
0
0
06 Jun 2025
Multidimensional Analysis of Specific Language Impairment Using Unsupervised Learning Through PCA and Clustering
Niruthiha Selvanayagam
39
0
0
05 Jun 2025
ECoRAG: Evidentiality-guided Compression for Long Context RAG
Yeonseok Jeong
Jinsu Kim
Dohyeon Lee
S. Hwang
163
0
0
05 Jun 2025
From Understanding to Generation: An Efficient Shortcut for Evaluating Language Models
Viktor Hangya
Fabian Küch
Darina Gold
ELM
63
0
0
04 Jun 2025
R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning
Qingfei Zhao
Ruobing Wang
Dingling Xu
Daren Zha
Limin Liu
AI4TS
KELM
LRM
77
0
0
04 Jun 2025
RedDebate: Safer Responses through Multi-Agent Red Teaming Debates
Ali Asad
Stephen Obadinma
Radin Shayanfar
Xiaodan Zhu
AAML
LLMAG
29
0
0
04 Jun 2025
Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model
Haibin Wu
Yuxuan Hu
Ruchao Fan
Xiaofei Wang
K. Kumatani
...
J. Yu
Heng Lu
Lijuan Wang
Y. Qian
Jinyu Li
AuLLM
67
0
0
04 Jun 2025
SOVA-Bench: Benchmarking the Speech Conversation Ability for LLM-based Voice Assistant
Yixuan Hou
Heyang Liu
Yuhao Wang
Ziyang Cheng
Ronghua Wu
Qunshan Gu
Yanfeng Wang
Yu Wang
AuLLM
55
0
0
03 Jun 2025
Shaking to Reveal: Perturbation-Based Detection of LLM Hallucinations
Jinyuan Luo
Zhen Fang
Yixuan Li
Seongheon Park
Ling Chen
AAML
HILM
65
0
0
03 Jun 2025
IP-Dialog: Evaluating Implicit Personalization in Dialogue Systems with Synthetic Data
Bo Peng
Zhiheng Wang
Heyang Gong
Chaochao Lu
80
0
0
03 Jun 2025
Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes
Meng Li
Michael Vrazitulis
David Schlangen
63
0
0
02 Jun 2025
IndicRAGSuite: Large-Scale Datasets and a Benchmark for Indian Language RAG Systems
Pasunuti Prasanjith
Prathmesh B More
Anoop Kunchukuttan
Raj Dabre
RALM
61
0
0
02 Jun 2025
Reconsidering LLM Uncertainty Estimation Methods in the Wild
Yavuz Faruk Bakman
D. Yaldiz
Sungmin Kang
Tuo Zhang
Baturalp Buyukates
Salman Avestimehr
Sai Praneeth Karimireddy
59
0
0
01 Jun 2025
Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering Tasks
Yuntai Bao
Xuhong Zhang
Tianyu Du
Xinkui Zhao
Zhengwen Feng
Hao Peng
Jianwei Yin
HILM
63
0
0
01 Jun 2025
RLAE: Reinforcement Learning-Assisted Ensemble for LLMs
Y. Fu
Yuanheng Zhu
Jiajun Chai
Guojun Yin
Wei Lin
Qichao Zhang
Dongbin Zhao
27
0
0
31 May 2025
Efficient Latent Semantic Clustering for Scaling Test-Time Computation of LLMs
Sungjae Lee
Hoyoung Kim
Jeongyeon Hwang
Eunhyeok Park
Jungseul Ok
LRM
27
0
0
31 May 2025
ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation
Hao Chen
Yukun Yan
Sen Mei
Wanxiang Che
Zhenghao Liu
...
Yuchun Fan
Pengcheng Huang
Qiushi Xiong
Zhiyuan Liu
Maosong Sun
LRM
46
0
0
30 May 2025
Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity
Dang Nguyen
Ali Payani
Baharan Mirzasoleiman
22
0
0
30 May 2025
HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs
Qing Li
Jiahui Geng
Zongxiong Chen
Derui Zhu
Yuxia Wang
Congbo Ma
Chenyang Lyu
Fakhri Karray
21
0
0
30 May 2025
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs
Gabrielle Kaili-May Liu
Gal Yona
Avi Caciularu
Idan Szpektor
Tim G. J. Rudner
Arman Cohan
46
0
0
30 May 2025
1
2
3
4
...
35
36
37
Next