Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.10621
Cited By
Large language models can accurately predict searcher preferences
19 September 2023
Paul Thomas
S. Spielman
Nick Craswell
Bhaskar Mitra
ALM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Large language models can accurately predict searcher preferences"
28 / 28 papers shown
Title
QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines
Ohjoon Kwon
Changsu Lee
Jihye Back
Lim Sun Suk
Inho Kang
Donghyeon Jeon
40
0
0
12 May 2025
SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models
Peichao Lai
K. Zhang
Yi-Tun Lin
L. Zhang
Feiyang Ye
...
Y. Xu
Conghui He
Y. Wang
Wentao Zhang
Bin Cui
ELM
LRM
42
0
0
12 May 2025
Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models
Hongwei Shang
Nguyen Vo
Nitin Yadav
Tian Zhang
Ajit Puthenputhussery
Xunfan Cai
Shuyi Chen
Prijith Chandran
Changsung Kang
RALM
43
0
0
11 May 2025
To Judge or not to Judge: Using LLM Judgements for Advertiser Keyphrase Relevance at eBay
Soumik Dey
Hansi Wu
Binbin Li
45
0
0
07 May 2025
A Survey on Privacy Risks and Protection in Large Language Models
Kang Chen
Xiuze Zhou
Yuanguo Lin
Shibo Feng
Li Shen
Pengcheng Wu
AILaw
PILM
141
0
0
04 May 2025
LLM-Evaluation Tropes: Perspectives on the Validity of LLM-Evaluations
Laura Dietz
Oleg Zendel
P. Bailey
Charles L. A. Clarke
Ellese Cotterill
Jeff Dalton
Faegheh Hasibi
Mark Sanderson
Nick Craswell
ELM
48
0
0
27 Apr 2025
Generative Product Recommendations for Implicit Superlative Queries
Kaustubh D. Dhole
Nikhita Vedula
Saar Kuzi
Giuseppe Castellucci
Eugene Agichtein
S. Malmasi
52
0
0
26 Apr 2025
The Viability of Crowdsourcing for RAG Evaluation
Lukas Gienapp
Tim Hagen
Maik Frobe
Matthias Hagen
Benno Stein
Martin Potthast
Harrisen Scells
21
0
0
22 Apr 2025
LLM-Driven Usefulness Judgment for Web Search Evaluation
Mouly Dewan
Jiqun Liu
Aditya Gautam
Chirag Shah
45
0
0
19 Apr 2025
Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG
Hengran Zhang
Minghao Tang
Keping Bi
J. Guo
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
19
0
0
07 Apr 2025
Can LLMs Understand Time Series Anomalies?
Zihao Zhou
Rose Yu
AI4TS
82
8
0
13 Mar 2025
Judging the Judges: A Collection of LLM-Generated Relevance Judgements
Hossein A. Rahmani
Clemencia Siro
Mohammad Aliannejadi
Nick Craswell
Charles L. A. Clarke
Guglielmo Faggioli
Bhaskar Mitra
Paul Thomas
Emine Yilmaz
ELM
59
0
0
20 Feb 2025
LLMs can be Fooled into Labelling a Document as Relevant (best caf\é near me; this paper is perfectly relevant)
Marwah Alaofi
Paul Thomas
Falk Scholer
Mark Sanderson
46
15
0
29 Jan 2025
SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Hossein A. Rahmani
Xi Wang
Emine Yilmaz
Nick Craswell
Bhaskar Mitra
Paul Thomas
79
4
0
28 Jan 2025
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
Helia Hashemi
J. Eisner
Corby Rosset
Benjamin Van Durme
Chris Kedzie
68
1
0
03 Jan 2025
Perception of Visual Content: Differences Between Humans and Foundation Models
Nardiena A. Pratama
Shaoyang Fan
Gianluca Demartini
VLM
97
0
0
28 Nov 2024
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
Dawei Li
Bohan Jiang
Liangjie Huang
Alimohammad Beigi
Chengshuai Zhao
...
Canyu Chen
Tianhao Wu
Kai Shu
Lu Cheng
Huan Liu
ELM
AILaw
120
67
0
25 Nov 2024
LLM-Assisted Relevance Assessments: When Should We Ask LLMs for Help?
Rikiya Takehi
E. Voorhees
Tetsuya Sakai
I. Soboroff
129
2
0
11 Nov 2024
An Investigation of Prompt Variations for Zero-shot LLM-based Rankers
Shuoqi Sun
Shengyao Zhuang
Shuai Wang
Guido Zuccon
40
5
0
20 Jun 2024
Enhancing user experience in large language models through human-centered design: Integrating theoretical insights with an experimental study to meet diverse software learning needs with a single document knowledge base
Yuchen Wang
Yin-Shan Lin
Ruixin Huang
Jinyin Wang
Sensen Liu
21
7
0
19 May 2024
MCRanker: Generating Diverse Criteria On-the-Fly to Improve Point-wise LLM Rankers
Fang Guo
Wenyu Li
Honglei Zhuang
Yun Luo
Yafu Li
Qi Zhu
Le Yan
Yue Zhang
ALM
70
6
0
18 Apr 2024
Prediction-Powered Ranking of Large Language Models
Ivi Chatzi
Eleni Straitouri
Suhas Thejaswi
Manuel Gomez Rodriguez
ALM
29
5
0
27 Feb 2024
Large Language Models for Stemming: Promises, Pitfalls and Failures
Shuai Wang
Shengyao Zhuang
Guido Zuccon
33
1
0
19 Feb 2024
Task Supportive and Personalized Human-Large Language Model Interaction: A User Study
Ben Wang
Jiqun Liu
Jamshed Karimnazarov
Nicolas Thompson
29
16
0
09 Feb 2024
Can Large Language Models Be an Alternative to Human Evaluations?
Cheng-Han Chiang
Hung-yi Lee
ALM
LM&MA
224
572
0
03 May 2023
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
316
4,077
0
24 May 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
358
8,495
0
28 Jan 2022
Carbon Emissions and Large Neural Network Training
David A. Patterson
Joseph E. Gonzalez
Quoc V. Le
Chen Liang
Lluís-Miquel Munguía
D. Rothchild
David R. So
Maud Texier
J. Dean
AI4CE
244
643
0
21 Apr 2021
1