Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.06984
Cited By
Evaluating Open-Domain Question Answering in the Era of Large Language Models
11 May 2023
Ehsan Kamalloo
Nouha Dziri
C. Clarke
Davood Rafiei
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Evaluating Open-Domain Question Answering in the Era of Large Language Models"
24 / 24 papers shown
Title
Defending against Indirect Prompt Injection by Instruction Detection
Tongyu Wen
Chenglong Wang
Xiyuan Yang
Haoyu Tang
Yueqi Xie
Lingjuan Lyu
Zhicheng Dou
Fangzhao Wu
AAML
34
0
0
08 May 2025
Interpretable Zero-shot Learning with Infinite Class Concepts
Zihan Ye
Shreyank N Gowda
Shiming Chen
Yaochu Jin
Kaizhu Huang
Xiaobo Jin
VLM
37
0
0
06 May 2025
TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering
Zhonghao Li
Kunpeng Zhang
Jinghuai Ou
Shuliang Liu
Xuming Hu
64
0
0
28 Apr 2025
Investigating Retrieval-Augmented Generation in Quranic Studies: A Study of 13 Open-Source Large Language Models
Zahra Khalila
Arbi Haza Nasution
Winda Monika
Aytug Onan
Yohei Murakami
Yasir Bin Ismail Radi
Noor Mohammad Osmani
RALM
78
0
0
20 Mar 2025
CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning
Hao Cui
Zahra Shamsi
Gowoon Cheon
Xuejian Ma
Shutong Li
...
Eun-Ah Kim
M. Brenner
Viren Jain
Sameera Ponda
Subhashini Venugopalan
ELM
LRM
57
0
0
14 Mar 2025
Benchmarking Prompt Sensitivity in Large Language Models
Amirhossein Razavi
Mina Soltangheis
Negar Arabzadeh
Sara Salamat
Morteza Zihayat
Ebrahim Bagheri
69
1
0
09 Feb 2025
Optimizing Temperature for Language Models with Multi-Sample Inference
Weihua Du
Yiming Yang
Sean Welleck
62
2
0
07 Feb 2025
TableMaster: A Recipe to Advance Table Understanding with Language Models
Lang Cao
Hanbing Liu
LMTD
RALM
219
0
1
31 Jan 2025
Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding
Yiming Zhang
Zhuokai Zhao
Zhaorun Chen
Zenghui Ding
Xianjun Yang
Yining Sun
212
1
0
21 Nov 2024
What Makes a Maze Look Like a Maze?
Joy Hsu
Jiayuan Mao
J. Tenenbaum
Noah D. Goodman
Jiajun Wu
OCL
54
6
0
12 Sep 2024
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Shraman Pramanick
Rama Chellappa
Subhashini Venugopalan
50
13
0
12 Jul 2024
Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting
Zilong Wang
Zifeng Wang
Long Le
Huaixiu Steven Zheng
Swaroop Mishra
...
Anush Mattapalli
Ankur Taly
Jingbo Shang
Chen-Yu Lee
Tomas Pfister
RALM
80
32
0
11 Jul 2024
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu
Ziwei Ji
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai Chen
HILM
39
5
0
05 Jul 2024
Few-shot Personalization of LLMs with Mis-aligned Responses
Jaehyung Kim
Yiming Yang
50
7
0
26 Jun 2024
Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation
Adam Fisch
Joshua Maynez
R. A. Hofer
Bhuwan Dhingra
Amir Globerson
William W. Cohen
41
8
0
06 Jun 2024
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture
M. A. D. L. Balaguer
Vinamra Benara
Renato Luiz de Freitas Cunha
Roberto de M. Estevao Filho
Todd Hendry
...
Morris Sharp
B. Silva
Swati Sharma
Vijay Aski
Ranveer Chandra
FaML
38
81
0
16 Jan 2024
Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4
Sondos Mahmoud Bsharat
Aidar Myrzakhan
Zhiqiang Shen
ALM
LRM
31
69
0
26 Dec 2023
KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models
Yuyang Bai
Shangbin Feng
Vidhisha Balachandran
Zhaoxuan Tan
Shiqi Lou
Tianxing He
Yulia Tsvetkov
ELM
40
2
0
15 Oct 2023
Improving Automatic VQA Evaluation Using Large Language Models
Oscar Manas
Benno Krojer
Aishwarya Agrawal
21
21
0
04 Oct 2023
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
Ruiyang Ren
Yuhao Wang
Yingqi Qu
Wayne Xin Zhao
Jiaheng Liu
Hao Tian
Huaqin Wu
Ji-Rong Wen
Haifeng Wang
RALM
KELM
35
123
0
20 Jul 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
328
11,953
0
04 Mar 2022
RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking
Ruiyang Ren
Yingqi Qu
Jing Liu
Wayne Xin Zhao
Qiaoqiao She
Hua-Hong Wu
Haifeng Wang
Ji-Rong Wen
141
248
0
14 Oct 2021
What's in a Name? Answer Equivalence For Open-Domain Question Answering
Chenglei Si
Chen Zhao
Jordan L. Boyd-Graber
151
35
0
11 Sep 2021
Distilling Knowledge from Reader to Retriever for Question Answering
Gautier Izacard
Edouard Grave
RALM
185
251
0
08 Dec 2020
1