ResearchTrend.AI
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

9 May 2017
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
    RALM

Papers citing "TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension"

50 / 1,823 papers shown
Title
Mis-prompt: Benchmarking Large Language Models for Proactive Error Handling
Jiayi Zeng
Yizhe Feng
Mengliang He
Wenhui Lei
Wei Zhang
Zeming Liu
Xiaoming Shi
Aimin Zhou
LRM
31
0
0
29 May 2025
SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA
Minrui Luo
Fuhang Kuang
Yu Wang
Zirui Liu
Tianxing He
CLL
64
0
0
29 May 2025
From Chat Logs to Collective Insights: Aggregative Question Answering
Wentao Zhang
Woojeong Kim
Yuntian Deng
LMTD
50
0
0
29 May 2025
Are Reasoning Models More Prone to Hallucination?
Zijun Yao
Y. Liu
Yanxu Chen
Jianhui Chen
Junfeng Fang
Lei Hou
Juanzi Li
Tat-Seng Chua
ReLM HILM LRM
132
0
0
29 May 2025
UAQFact: Evaluating Factual Knowledge Utilization of LLMs on Unanswerable Questions
Chuanyuan Tan
Wenbiao Shao
Hao Xiong
Tong Zhu
Zhenhua Liu
Kai Shi
Wenliang Chen
51
0
0
29 May 2025
Revisiting Uncertainty Estimation and Calibration of Large Language Models
Linwei Tao
Yi-Fan Yeh
Minjing Dong
Tao Huang
Philip Torr
Chang Xu
38
0
0
29 May 2025
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation
Caiqi Zhang
Xiaochen Zhu
Chengzu Li
Nigel Collier
Andreas Vlachos
OffRL HILM
55
1
0
29 May 2025
Safeguarding Privacy of Retrieval Data against Membership Inference Attacks: Is This Query Too Close to Home?
Yujin Choi
Youngjoo Park
Junyoung Byun
Jaewook Lee
Jinseong Park
AAML
54
0
0
28 May 2025
Stratified Selective Sampling for Instruction Tuning with Dedicated Scoring Strategy
Paramita Mirza
Lucas Weber
Fabian Küch
51
0
0
28 May 2025
RISE: Reasoning Enhancement via Iterative Self-Exploration in Multi-hop Question Answering
Bolei He
Xinran He
Mengke Chen
Xianwei Xue
Ying Zhu
Zhenhua Ling
ReLM LRM
45
0
0
28 May 2025
LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents
Taro Yano
Yoichi Ishibashi
Masafumi Oyamada
LM&Ro
64
1
0
28 May 2025
ChatPD: An LLM-driven Paper-Dataset Networking System
Anjie Xu
Ruiqing Ding
Leye Wang
46
0
0
28 May 2025
Retrieval-Augmented Generation: A Comprehensive Survey of Architectures, Enhancements, and Robustness Frontiers
Chaitanya Sharma
RALM 3DV
44
0
0
28 May 2025
Adaptive Detoxification: Safeguarding General Capabilities of LLMs through Toxicity-Aware Knowledge Editing
Yifan Lu
Jing Li
Yigeng Zhou
Yihui Zhang
Wenya Wang
Xiucheng Li
Meishan Zhang
Fangming Liu
Jun-chen Yu
Min Zhang
KELM CLL
56
1
0
28 May 2025
BLUR: A Benchmark for LLM Unlearning Robust to Forget-Retain Overlap
Shengyuan Hu
Neil Kale
Pratiksha Thaker
Yiwei Fu
Steven Wu
Virginia Smith
MU AAML CLL
23
0
0
28 May 2025
EvolveSearch: An Iterative Self-Evolving Search Agent
Dingchu Zhang
Yida Zhao
Jialong Wu
Baixuan Li
Wenbiao Yin
...
Yong Jiang
Yufeng Li
Kewei Tu
Pengjun Xie
Fei Huang
LLMAG KELM
78
0
0
28 May 2025
Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs
Jakub Podolak
Rajeev Verma
ReLM LRM
27
0
0
28 May 2025
Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems
Hoang Pham
Thuy-Duong Nguyen
Khac-Hoai Nam Bui
LLMAG
40
0
0
28 May 2025
Faithfulness-Aware Uncertainty Quantification for Fact-Checking the Output of Retrieval Augmented Generation
Ekaterina Fadeeva
Aleksandr Rubashevskii
Roman Vashurin
Shehzaad Dhuliawala
Artem Shelmanov
Timothy Baldwin
Preslav Nakov
Mrinmaya Sachan
Maxim Panov
HILM
77
0
0
27 May 2025
RelationalFactQA: A Benchmark for Evaluating Tabular Fact Retrieval from Large Language Models
Dario Satriani
Enzo Veltri
Donatello Santoro
Paolo Papotti
LMTD HILM
53
0
0
27 May 2025
Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG
Xin Sun
Jianan Xie
Zhongqi Chen
Qiang Liu
Shu Wu
Yuehe Chen
Bowen Song
Weiqiang Wang
Zilei Wang
Liang Wang
RALM
41
0
0
27 May 2025
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA
Sergey Pletenev
Maria Marina
Nikolay Ivanov
Daria Galimzianova
Nikita Krayko
Mikhail Salnikov
Vasily Konovalov
Alexander Panchenko
Viktor Moskvoretskii
54
0
0
27 May 2025
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities
Junyan Zhang
Yubo Gao
Yibo Yan
Jungang Li
Zhaorui Hou
...
Shuliang Liu
Song Dai
Yonghua Hei
Junzhuo Li
Xuming Hu
67
0
0
27 May 2025
Pretrained LLMs Learn Multiple Types of Uncertainty
Roi Cohen
Omri Fahn
Gerard de Melo
43
0
0
27 May 2025
Self-reflective Uncertainties: Do LLMs Know Their Internal Answer Distribution?
Michael Kirchhof
Luca Füger
Adam Goliński
Eeshan Gunesh Dhekane
Arno Blaas
Sinead Williamson
37
1
0
26 May 2025
Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs
Artem Vazhentsev
Lyudmila Rvanova
Gleb Kuzmin
Ekaterina Fadeeva
Ivan Lazichny
...
Maxim Panov
Timothy Baldwin
Mrinmaya Sachan
Preslav Nakov
Artem Shelmanov
EDL HILM
84
0
0
26 May 2025
ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining
Melis Ilayda Bal
Volkan Cevher
Michael Muehlebach
41
0
0
26 May 2025
MOLE: Metadata Extraction and Validation in Scientific Papers Using LLMs
Zaid Alyafeai
Maged S. Al-Shaibani
Bernard Ghanem
30
0
0
26 May 2025
MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning
Thang Nguyen
Peter Chin
Yu-Wing Tai
LRM
82
1
0
26 May 2025
InFact: Informativeness Alignment for Improved LLM Factuality
Roi Cohen
Russa Biswas
Gerard de Melo
22
0
0
26 May 2025
GenKI: Enhancing Open-Domain Question Answering with Knowledge Integration and Controllable Generation in Large Language Models
Tingjia Shen
Hao Wang
Chuan Qin
Ruijun Sun
Yang Song
Defu Lian
Hengshu Zhu
Enhong Chen
57
0
0
26 May 2025
Towards Harmonized Uncertainty Estimation for Large Language Models
Rui Li
Jing Long
Muge Qi
Heming Xia
Lei Sha
Peiyi Wang
Zhifang Sui
UQCV
66
0
0
25 May 2025
ReadBench: Measuring the Dense Text Visual Reading Ability of Vision-Language Models
Benjamin Clavié
Florian Brand
VLM CoGe
64
0
0
25 May 2025
Hybrid Latent Reasoning via Reinforcement Learning
Zhenrui Yue
Bowen Jin
Huimin Zeng
Honglei Zhuang
Zhen Qin
Jinsung Yoon
Lanyu Shang
Jiawei Han
Dong Wang
OffRL BDL LRM
72
0
0
24 May 2025
Removal of Hallucination on Hallucination: Debate-Augmented RAG
Wentao Hu
Wengyu Zhang
Yiyang Jiang
C. Zhang
Xiaoyong Wei
Qing Li
63
0
0
24 May 2025
Adaptive Prediction-Powered AutoEval with Reliability and Efficiency Guarantees
Sangwoo Park
Matteo Zecchin
Osvaldo Simeone
32
0
0
24 May 2025
GainRAG: Preference Alignment in Retrieval-Augmented Generation through Gain Signal Synthesis
Yi Jiang
Sendong Zhao
Jianbo Li
Haochun Wang
Bing Qin
RALM
186
0
0
24 May 2025
AI-Driven Climate Policy Scenario Generation for Sub-Saharan Africa
Rafiu Adekoya Badekale
Adewale Akinfaderin
46
0
0
24 May 2025
Too Consistent to Detect: A Study of Self-Consistent Errors in LLMs
Hexiang Tan
Fei Sun
Sha Liu
Du Su
Qi Cao
...
Jingang Wang
Xunliang Cai
Yuanzhuo Wang
Huawei Shen
Xueqi Cheng
HILM
160
0
0
23 May 2025
T$^2$: An Adaptive Test-Time Scaling Strategy for Contextual Question Answering
Zhengyi Zhao
Shubo Zhang
Zezhong Wang
Huimin Wang
Yutian Zhao
Bin Liang
Yefeng Zheng
Binyang Li
Kam-Fai Wong
X. Wu
LRM
92
0
0
23 May 2025
LeTS: Learning to Think-and-Search via Process-and-Outcome Reward Hybridization
Qi Zhang
Shouqing Yang
Lirong Gao
Hao Chen
Xiaomeng Hu
...
Jiexiang Wang
Sheng Guo
Bo Zheng
Haobo Wang
Junbo Zhao
LRM
56
0
0
23 May 2025
HASH-RAG: Bridging Deep Hashing with Retriever for Efficient, Fine Retrieval and Augmented Generation
Jinyu Guo
Xunlei Chen
Qiyang Xia
Zhaokun Wang
Jie Ou
Libo Qin
Shunyu Yao
Wenhong Tian
207
0
0
22 May 2025
CUB: Benchmarking Context Utilisation Techniques for Language Models
Lovisa Hagström
Youna Kim
Haeun Yu
Sang-goo Lee
Richard Johansson
Hyunsoo Cho
Isabelle Augenstein
67
1
0
22 May 2025
CAIN: Hijacking LLM-Humans Conversations via a Two-Stage Malicious System Prompt Generation and Refining Framework
Viet Pham
Thai Le
SILM
28
0
0
22 May 2025
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning
Shuzheng Si
Haozhe Zhao
Cheng Gao
Yuzhuo Bai
Zhitong Wang
...
Gang Chen
Fanchao Qi
Minjia Zhang
Baobao Chang
Maosong Sun
SyDa HILM
45
0
0
22 May 2025
Small-to-Large Generalization: Data Influences Models Consistently Across Scale
Alaa Khaddaj
Logan Engstrom
Aleksander Madry
TDI AI4CE
83
0
0
22 May 2025
When Do LLMs Admit Their Mistakes? Understanding the Role of Model Belief in Retraction
Yuqing Yang
Robin Jia
KELM LRM
122
1
0
22 May 2025
ScholarBench: A Bilingual Benchmark for Abstraction, Comprehension, and Reasoning Evaluation in Academic Contexts
Dongwon Noh
Donghyeok Koh
Junghun Yuk
Gyuwan Kim
Jaeyong Lee
Kyungtae Lim
Cheoneum Park
ELM
75
0
0
22 May 2025
Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty
Peilin Wu
Mian Zhang
Xinlu Zhang
Xinya Du
Zhiyu Zoey Chen
49
0
0
22 May 2025
UNCLE: Uncertainty Expressions in Long-Form Generation
Ruihan Yang
Caiqi Zhang
Zhisong Zhang
Xinting Huang
Dong Yu
Nigel Collier
Deqing Yang
ELM
69
2
0
22 May 2025