Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.03551
Cited By
v1
v2 (latest)
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
9 May 2017
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension"
50 / 1,823 papers shown
Title
Self-Routing RAG: Binding Selective Retrieval with Knowledge Verbalization
Di Wu
Jia-Chen Gu
Kai-Wei Chang
Nanyun Peng
121
0
0
01 Apr 2025
Focus Directions Make Your Language Models Pay More Attention to Relevant Contexts
Youxiang Zhu
Ruochen Li
Danqing Wang
Daniel Haehn
Xiaohui Liang
LRM
133
2
0
30 Mar 2025
A Refined Analysis of Massive Activations in LLMs
Louis Owen
Nilabhra Roy Chowdhury
Abhay Kumar
Fabian Güra
55
1
0
28 Mar 2025
Resona: Improving Context Copying in Linear Recurrence Models with Retrieval
Xinyu Wang
Linrui Ma
Jerry Huang
Peng Lu
Prasanna Parthasarathi
Xiao-Wen Chang
Boxing Chen
Yufei Cui
KELM
130
1
0
28 Mar 2025
Gemma 3 Technical Report
Gemma Team
Aishwarya B Kamath
Johan Ferret
Shreya Pathak
Nino Vieillard
...
Harshal Tushar Lehri
Hussein Hazimeh
Ian Ballantyne
Idan Szpektor
Ivan Nardini
VLM
195
137
0
25 Mar 2025
Reverse-Engineering the Retrieval Process in GenIR Models
Anja Reusch
Yonatan Belinkov
RALM
103
0
0
25 Mar 2025
Understanding and Improving Information Preservation in Prompt Compression for LLMs
Weronika Łajewska
Momchil Hardalov
Laura Aina
Neha Anna John
Hang Su
Lluís Marquez
162
1
0
24 Mar 2025
ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices
Aneesh Vathul
Daniel Lee
Sheryl Chen
Arthi Tasmia
HILM
127
0
0
23 Mar 2025
Enhancing Persona Consistency for LLMs' Role-Playing using Persona-Aware Contrastive Learning
Ke Ji
Yixin Lian
Linxu Li
Jingsheng Gao
Weiyuan Li
Bin Dai
81
2
0
22 Mar 2025
Variance Control via Weight Rescaling in LLM Pre-training
Louis Owen
Abhay Kumar
Nilabhra Roy Chowdhury
Fabian Güra
75
0
0
21 Mar 2025
Dense Passage Retrieval in Conversational Search
Ahmed H. Salamah
Pierre McWhannel
Nicole Yan
71
0
0
21 Mar 2025
Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey
Xiaoou Liu
Tiejin Chen
Longchao Da
Chacha Chen
Zhen Lin
Hua Wei
HILM
146
8
0
20 Mar 2025
Typed-RAG: Type-aware Multi-Aspect Decomposition for Non-Factoid Question Answering
DongGeon Lee
Ahjeong Park
Hyeri Lee
Hyeonseo Nam
Yunho Maeng
66
3
0
20 Mar 2025
LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates
Ying Shen
Lifu Huang
104
2
0
20 Mar 2025
ECLAIR: Enhanced Clarification for Interactive Responses
John Murzaku
Zifan Liu
Md Mehrab Tanjim
Vaishnavi Muppala
Xiang Chen
Yunyao Li
76
0
0
19 Mar 2025
SkyLadder: Better and Faster Pretraining via Context Window Scheduling
Tongyao Zhu
Qian Liu
Haonan Wang
Shiqi Chen
Xiangming Gu
Tianyu Pang
Min-Yen Kan
102
0
0
19 Mar 2025
RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving
Wenqi Jiang
Suvinay Subramanian
Cat Graves
Gustavo Alonso
Amir Yazdanbakhsh
Vidushi Dadu
132
4
0
18 Mar 2025
GPU-Accelerated Motion Planning of an Underactuated Forestry Crane in Cluttered Environments
M. Vu
Gerald Ebmer
Alexander Watcher
Marc-Philip Ecker
Giang Nguyen
Tobias Glueck
133
3
0
18 Mar 2025
Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations
Ziwei Ji
L. Yu
Yeskendir Koishekenov
Yejin Bang
Anthony Hartshorn
Alan Schelten
Cheng Zhang
Pascale Fung
Nicola Cancedda
123
6
0
18 Mar 2025
SuperBPE: Space Travel for Language Models
Alisa Liu
J. Hayase
Valentin Hofmann
Sewoong Oh
Noah A. Smith
Yejin Choi
157
10
0
17 Mar 2025
Pensez: Less Data, Better Reasoning -- Rethinking French LLM
Huy Hoang Ha
ReLM
LRM
96
1
0
17 Mar 2025
OSCAR: Online Soft Compression And Reranking
Maxime Louis
Thibault Formal
Hervé Déjean
Stéphane Clinchant
120
0
0
17 Mar 2025
A Survey on Transformer Context Extension: Approaches and Evaluation
Yijun Liu
Jinzheng Yu
Yang Xu
Zhongyang Li
Qingfu Zhu
LLMAG
132
3
0
17 Mar 2025
A Survey on the Optimization of Large Language Model-based Agents
Shangheng Du
Jiabao Zhao
Jinxin Shi
Zhentao Xie
Xin Jiang
Yanhong Bai
Liang He
LLMAG
LM&Ro
LM&MA
544
5
0
16 Mar 2025
Key, Value, Compress: A Systematic Exploration of KV Cache Compression Techniques
Neusha Javidnia
B. Rouhani
F. Koushanfar
558
0
0
14 Mar 2025
AttentionRAG: Attention-Guided Context Pruning in Retrieval-Augmented Generation
Yixiong Fang
Tianran Sun
Yuling Shi
Xiaodong Gu
98
0
0
13 Mar 2025
Compute Optimal Scaling of Skills: Knowledge vs Reasoning
Nicholas Roberts
Niladri S. Chatterji
Sharan Narang
Mike Lewis
Dieuwke Hupkes
119
2
0
13 Mar 2025
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Bowen Jin
Hansi Zeng
Zhenrui Yue
Dong Wang
Sercan O. Arik
Dong Wang
Hamed Zamani
Jiawei Han
RALM
ReLM
KELM
OffRL
AI4TS
LRM
236
122
0
12 Mar 2025
Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation
Yu Wang
Jiaxin Zhang
Xiang Gao
Wendi Cui
Peng Li
Kamalika Das
86
1
0
11 Mar 2025
Mellow: a small audio language model for reasoning
Soham Deshmukh
Satvik Dixit
Rita Singh
Bhiksha Raj
AuLLM
ReLM
LRM
115
4
0
11 Mar 2025
Exploiting Instruction-Following Retrievers for Malicious Information Retrieval
Parishad BehnamGhader
Nicholas Meade
Siva Reddy
145
1
0
11 Mar 2025
DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering
Sher Badshah
Hassan Sajjad
134
1
0
11 Mar 2025
MapQA: Open-domain Geospatial Question Answering on Map Data
Zekun Li
Malcolm Grossman
Eric
Qasemi
Mihir Kulkarni
Muhao Chen
Yao-Yi Chiang
106
1
0
10 Mar 2025
Delusions of Large Language Models
Hongshen Xu
Zixv yang
Zichen Zhu
Kunyao Lan
Zihan Wang
Mengyue Wu
Ziwei Ji
Lu Chen
Pascale Fung
Kai Yu
LRM
HILM
135
0
0
09 Mar 2025
Alignment for Efficient Tool Calling of Large Language Models
Hongshen Xu
Zihan Wang
Zichen Zhu
Lei Pan
Xingyu Chen
Lu Chen
Kai Yu
92
1
0
09 Mar 2025
Agent models: Internalizing Chain-of-Action Generation into Reasoning models
Yuxiang Zhang
Yuqi Yang
Jiangming Shu
Xinyan Wen
Jitao Sang
LRM
LLMAG
LM&Ro
102
4
0
09 Mar 2025
No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding
Michael Krumdick
Charles Lovering
Varshini Reddy
Seth Ebner
Chris Tanner
ALM
ELM
112
6
0
07 Mar 2025
SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs
Samir Abdaljalil
Hasan Kurban
Parichit Sharma
Erchin Serpedin
Rachad Atat
HILM
99
3
0
07 Mar 2025
IDEA Prune: An Integrated Enlarge-and-Prune Pipeline in Generative Language Model Pretraining
Yixiao Li
Xianzhi Du
Ajay Jaiswal
Tao Lei
T. Zhao
Chong-Jun Wang
Jianyu Wang
79
1
0
07 Mar 2025
Understanding the Limits of Lifelong Knowledge Editing in LLMs
Lukas Thede
Karsten Roth
Matthias Bethge
Zeynep Akata
Tom Hartvigsen
KELM
CLL
123
4
0
07 Mar 2025
Robust Data Watermarking in Language Models by Injecting Fictitious Knowledge
Xinyue Cui
Johnny Tian-Zheng Wei
Swabha Swayamdipta
Robin Jia
WaLM
148
2
0
06 Mar 2025
Continual Pre-training of MoEs: How robust is your router?
Benjamin Thérien
Charles-Étienne Joseph
Zain Sarwar
Ashwinee Panda
Anirban Das
Shi-Xiong Zhang
Stephen Rawls
Siyang Song
Eugene Belilovsky
Irina Rish
MoE
118
0
0
06 Mar 2025
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Siyang Song
Mohammed Irfan Kurpath
Sahal Shaji Mullappilly
Jean Lahoud
Fahad A Khan
Rao Muhammad Anwer
Salman Khan
Hisham Cholakkal
AuLLM
373
2
0
06 Mar 2025
DeLTa: A Decoding Strategy based on Logit Trajectory Prediction Improves Factuality and Reasoning Ability
Yunzhen He
Yusuke Takase
Yoichi Ishibashi
Hidetoshi Shimodaira
78
1
0
04 Mar 2025
LoRA-Null: Low-Rank Adaptation via Null Space for Large Language Models
Pengwei Tang
Yebin Liu
Dongjie Zhang
Xing Wu
Debing Zhang
114
2
0
04 Mar 2025
Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers
Zicong He
Boxuan Zhang
Lu Cheng
114
1
0
04 Mar 2025
Rewarding Doubt: A Reinforcement Learning Approach to Calibrated Confidence Expression of Large Language Models
Paul Stangel
David Bani-Harouni
Chantal Pellegrini
Ege Özsoy
Kamilia Zaripova
Matthias Keicher
Nassir Navab
84
1
0
04 Mar 2025
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction
Lu Dai
Yijie Xu
Jinhui Ye
Hao Liu
Hui Xiong
3DV
RALM
215
3
0
03 Mar 2025
Steer LLM Latents for Hallucination Detection
Seongheon Park
Xuefeng Du
Min-Hsuan Yeh
Haobo Wang
Yixuan Li
LLMSV
112
3
0
01 Mar 2025
Optimizing Large Language Models for ESG Activity Detection in Financial Texts
Mattia Birti
Francesco Osborne
Andrea Maurino
86
0
0
28 Feb 2025
Previous
1
2
3
4
5
6
...
35
36
37
Next