ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.03551
  4. Cited By
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for
  Reading Comprehension
v1v2 (latest)

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

9 May 2017
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
    RALM
ArXiv (abs)PDFHTML

Papers citing "TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension"

50 / 1,823 papers shown
Title
Context Filtering with Reward Modeling in Question Answering
Context Filtering with Reward Modeling in Question Answering
Sangryul Kim
James Thorne
157
0
0
16 Dec 2024
Let your LLM generate a few tokens and you will reduce the need for
  retrieval
Let your LLM generate a few tokens and you will reduce the need for retrieval
Hervé Déjean
155
0
0
16 Dec 2024
UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
Boyang Xue
Fei Mi
Qi Zhu
Hongru Wang
Rui Wang
Sheng Wang
Erxin Yu
Xuming Hu
Kam-Fai Wong
HILM
230
2
0
16 Dec 2024
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Jiale Cheng
Xiao-Chang Liu
C. Wang
Xiaotao Gu
Yaojie Lu
Dan Zhang
Yuxiao Dong
J. Tang
Hongning Wang
Minlie Huang
LRM
189
4
0
16 Dec 2024
DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE Inference
DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE Inference
Yujie Zhang
Shivam Aggarwal
T. Mitra
MoE
167
1
0
16 Dec 2024
Phi-4 Technical Report
Phi-4 Technical Report
Marah Abdin
J. Aneja
Harkirat Singh Behl
Sébastien Bubeck
Ronen Eldan
...
Rachel A. Ward
Yue Wu
Dingli Yu
Cyril Zhang
Yi Zhang
ALMSyDa
195
154
0
12 Dec 2024
JuStRank: Benchmarking LLM Judges for System Ranking
JuStRank: Benchmarking LLM Judges for System Ranking
Ariel Gera
Odellia Boni
Yotam Perlitz
Roy Bar-Haim
Lilach Eden
Asaf Yehudai
ALMELM
173
5
0
12 Dec 2024
HalluCana: Fixing LLM Hallucination with A Canary Lookahead
HalluCana: Fixing LLM Hallucination with A Canary Lookahead
Tianyi Li
Erenay Dayanik
Shubhi Tyagi
Andrea Pierleoni
HILM
124
0
0
10 Dec 2024
Label-Confidence-Aware Uncertainty Estimation in Natural Language
  Generation
Label-Confidence-Aware Uncertainty Estimation in Natural Language Generation
Qinhong Lin
Linna Zhou
Zhongliang Yang
Yuang Cai
HILM
99
0
0
10 Dec 2024
Breaking the Stage Barrier: A Novel Single-Stage Approach to Long
  Context Extension for Large Language Models
Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models
Haoran Lian
Junmin Chen
Wei Huang
Yizhe Xiong
Wenping Hu
...
Hui Chen
Jianwei Niu
Zijia Lin
Fuzheng Zhang
Di Zhang
129
0
0
10 Dec 2024
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token
Roi Cohen
Konstantin Dobler
Eden Biran
Gerard de Melo
194
9
0
09 Dec 2024
A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions
A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions
Ola Shorinwa
Zhiting Mei
Justin Lidard
Allen Z. Ren
Anirudha Majumdar
HILMLRM
155
19
0
07 Dec 2024
Smoothie: Label Free Language Model Routing
Smoothie: Label Free Language Model Routing
Neel Guha
Mayee F. Chen
Trevor Chow
Ishan S. Khare
Christopher Ré
132
5
0
06 Dec 2024
Enhancing Trust in Large Language Models with Uncertainty-Aware
  Fine-Tuning
Enhancing Trust in Large Language Models with Uncertainty-Aware Fine-Tuning
R. Krishnan
Piyush Khanna
Omesh Tickoo
HILM
116
1
0
03 Dec 2024
AI Benchmarks and Datasets for LLM Evaluation
AI Benchmarks and Datasets for LLM Evaluation
Todor Ivanov
Valeri Penchev
163
2
0
02 Dec 2024
Towards Adaptive Mechanism Activation in Language Agent
Towards Adaptive Mechanism Activation in Language Agent
Ziyang Huang
Jun Zhao
Kang Liu
LLMAGAI4CE
129
1
0
01 Dec 2024
DynRank: Improving Passage Retrieval with Dynamic Zero-Shot Prompting
  Based on Question Classification
DynRank: Improving Passage Retrieval with Dynamic Zero-Shot Prompting Based on Question Classification
Abdelrahman Abdallah
Jamshid Mozafari
Bhawna Piryani
Mohammed M. Abdelgwad
Adam Jatowt
150
1
0
30 Nov 2024
Does Self-Attention Need Separate Weights in Transformers?
Md. Kowsher
Nusrat Jahan Prottasha
Chun-Nam Yu
O. Garibay
Niloofar Yousefi
549
1
0
30 Nov 2024
Quantized Delta Weight Is Safety Keeper
Quantized Delta Weight Is Safety Keeper
Yule Liu
Zhen Sun
Xinlei He
Xinyi Huang
141
6
0
29 Nov 2024
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language
  Models
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models
Tian Yu
Shaolei Zhang
Yang Feng
RALM3DVAIFinLRM
137
11
0
29 Nov 2024
SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world
  Human-Machine Interactions
SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world Human-Machine Interactions
Yaqi Wang
Haipei Xu
LLMAG
107
0
0
21 Nov 2024
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
Zhaopeng Tu
VLM
270
0
0
21 Nov 2024
Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues
Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues
Riccardo Grazzi
Julien N. Siems
Jörg Franke
Arber Zela
Frank Hutter
Massimiliano Pontil
214
26
0
19 Nov 2024
Addressing Hallucinations in Language Models with Knowledge Graph Embeddings as an Additional Modality
Viktoriia Chekalina
Anton Razzigaev
Elizaveta Goncharova
Andrey Kuznetsov
KELM
142
0
0
18 Nov 2024
Information Anxiety in Large Language Models
Prasoon Bajpai
Sarah Masud
Tanmoy Chakraborty
69
0
0
16 Nov 2024
Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Yutao Hou
Yajing Luo
Zhiwen Ruan
Hongru Wang
Weifeng Ge
Yuxiao Chen
Guanhua Chen
ELM
84
0
0
15 Nov 2024
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Weiyun Wang
Zhe Chen
Wenhai Wang
Yue Cao
Yangzhou Liu
...
Jinguo Zhu
X. Zhu
Lewei Lu
Yu Qiao
Jifeng Dai
LRM
148
93
1
15 Nov 2024
AMXFP4: Taming Activation Outliers with Asymmetric Microscaling Floating-Point for 4-bit LLM Inference
AMXFP4: Taming Activation Outliers with Asymmetric Microscaling Floating-Point for 4-bit LLM Inference
Janghwan Lee
Jiwoong Park
Jinseok Kim
Yongjik Kim
Jungju Oh
Jinwook Oh
Jungwook Choi
80
2
0
15 Nov 2024
Continual Memorization of Factoids in Language Models
Continual Memorization of Factoids in Language Models
Howard Chen
Jiayi Geng
Adithya Bhaskar
Dan Friedman
Danqi Chen
KELM
129
1
0
11 Nov 2024
Exploring Knowledge Boundaries in Large Language Models for Retrieval
  Judgment
Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment
Zhen Zhang
Xinyu Wang
Yong Jiang
Zhuo Chen
Feiteng Mu
Mengting Hu
Pengjun Xie
Fei Huang
KELM
102
3
0
09 Nov 2024
Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small
  Language Model
Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small Language Model
Ben Koska
Mojmír Horváth
MoE
64
1
0
08 Nov 2024
Measuring short-form factuality in large language models
Measuring short-form factuality in large language models
Jason W. Wei
Nguyen Karina
Hyung Won Chung
Yunxin Joy Jiao
Spencer Papay
Amelia Glaese
John Schulman
W. Fedus
ELMKELMHILM
80
78
0
07 Nov 2024
Evaluation data contamination in LLMs: how do we measure it and (when)
  does it matter?
Evaluation data contamination in LLMs: how do we measure it and (when) does it matter?
Aaditya K. Singh
Muhammed Yusuf Kocyigit
Andrew Poulton
David Esiobu
Maria Lomeli
Gergely Szilvasy
Dieuwke Hupkes
82
13
0
06 Nov 2024
VERITAS: A Unified Approach to Reliability Evaluation
VERITAS: A Unified Approach to Reliability Evaluation
Rajkumar Ramamurthy
Meghana Arakkal Rajeev
Oliver Molenschot
James Zou
Nazneen Rajani
HILM
106
1
0
05 Nov 2024
PersianRAG: A Retrieval-Augmented Generation System for Persian Language
PersianRAG: A Retrieval-Augmented Generation System for Persian Language
Hossein Hosseini
Mohammad Sobhan Zare
Amir Hossein Mohammadi
Arefeh Kazemi
Zahra Zojaji
Mohammad Ali Nematbakhsh
VLMRALM
69
0
0
05 Nov 2024
Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI
Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI
R. Kaur
Colin Samplawski
Adam Cobb
Anirban Roy
Brian Matejek
...
Daniel Elenius
Alexander M. Berenbeim
John A. Pavlik
Nathaniel D. Bastian
Susmit Jha
119
5
0
04 Nov 2024
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated
  Parameters by Tencent
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Xingwu Sun
Yanfeng Chen
Yanwen Huang
Ruobing Xie
Jiaqi Zhu
...
Zhanhui Kang
Yong Yang
Yuhong Liu
Di Wang
Jie Jiang
MoEALMELM
167
34
0
04 Nov 2024
Graph-based Confidence Calibration for Large Language Models
Graph-based Confidence Calibration for Large Language Models
Yukun Li
Sijia Wang
Lifu Huang
Li-Ping Liu
UQCV
198
2
0
03 Nov 2024
Transfer Learning for Finetuning Large Language Models
Transfer Learning for Finetuning Large Language Models
Tobias Strangmann
Lennart Purucker
Jörg Franke
Ivo Rapant
Fabio Ferreira
Frank Hutter
115
0
0
02 Nov 2024
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model
  with Frozen LLM
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
Xiong Wang
Yangze Li
Chaoyou Fu
Yunhang Shen
Lei Xie
Ke Li
Xing Sun
Long Ma
AuLLMMLLM
154
40
0
01 Nov 2024
SLED: Self Logits Evolution Decoding for Improving Factuality in Large
  Language Models
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models
Jianyi Zhang
Da-Cheng Juan
Cyrus Rashtchian
Chun-Sung Ferng
Heinrich Jiang
Yiran Chen
86
4
0
01 Nov 2024
E2E-AFG: An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation
E2E-AFG: An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation
Yun Jiang
Zilong Xie
Wei Zhang
Yun Fang
Shuai Pan
RALM
477
0
0
01 Nov 2024
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on
  AI Accelerators
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators
Krishna Teja Chitty-Venkata
Siddhisanket Raskar
B. Kale
Farah Ferdaus
Aditya Tanikanti
Ken Raffenetti
Valerie Taylor
M. Emani
V. Vishwanath
152
12
0
31 Oct 2024
Exploring the Knowledge Mismatch Hypothesis: Hallucination Propensity in
  Small Models Fine-tuned on Data from Larger Models
Exploring the Knowledge Mismatch Hypothesis: Hallucination Propensity in Small Models Fine-tuned on Data from Larger Models
Phil Wee
Riyadh Baghdadi
HILM
73
1
0
31 Oct 2024
Can Models Help Us Create Better Models? Evaluating LLMs as Data
  Scientists
Can Models Help Us Create Better Models? Evaluating LLMs as Data Scientists
Michał Pietruszka
Łukasz Borchmann
Aleksander Jędrosz
Paweł Morawiecki
ELM
47
1
0
30 Oct 2024
Eliciting Critical Reasoning in Retrieval-Augmented Language Models via
  Contrastive Explanations
Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations
Leonardo Ranaldi
Marco Valentino
André Freitas
RALMLRM
100
4
0
30 Oct 2024
Improving Uncertainty Quantification in Large Language Models via
  Semantic Embeddings
Improving Uncertainty Quantification in Large Language Models via Semantic Embeddings
Yashvir S. Grewal
Edwin V. Bonilla
Thang D. Bui
UQCV
84
9
0
30 Oct 2024
Retrieval-Augmented Generation with Estimation of Source Reliability
Retrieval-Augmented Generation with Estimation of Source Reliability
Jeongyeon Hwang
Junyoung Park
Hyejin Park
Dongwoo Kim
Sangdon Park
Jungseul Ok
RALM
100
1
0
30 Oct 2024
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Tanmay Parekh
Pradyot Prakash
Alexander Radovic
Akshay Shekher
Denis Savenkov
LRM
382
2
0
30 Oct 2024
Distinguishing Ignorance from Error in LLM Hallucinations
Distinguishing Ignorance from Error in LLM Hallucinations
Adi Simhi
Jonathan Herzig
Idan Szpektor
Yonatan Belinkov
HILM
95
4
0
29 Oct 2024
Previous
123...789...353637
Next