arXiv:2104.06967
Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
14 April 2021
Sebastian Hofstatter
Sheng-Chieh Lin
Jheng-Hong Yang
Jimmy J. Lin
Allan Hanbury
VLM
Papers citing
"Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling"
50 / 92 papers shown
Title
Pre-training vs. Fine-tuning: A Reproducibility Study on Dense Retrieval Knowledge Acquisition
Zheng Yao
Shuai Wang
Guido Zuccon
21
0
0
12 May 2025
QBR: A Question-Bank-Based Approach to Fine-Grained Legal Knowledge Retrieval for the General Public
Mingruo Yuan
Ben Kao
Tien-Hsuan Wu
AILaw
76
0
0
08 May 2025
Interpreting Multilingual and Document-Length Sensitive Relevance Computations in Neural Retrieval Models through Axiomatic Causal Interventions
Oliver Savolainen
Dur e Najaf Amjad
Roxana Petcu
AAML
35
0
0
04 May 2025
Effective Inference-Free Retrieval for Learned Sparse Representations
F. M. Nardini
Thong Nguyen
Cosimo Rulli
Rossano Venturini
Andrew Yates
RALM
45
0
0
30 Apr 2025
Unsupervised Corpus Poisoning Attacks in Continuous Space for Dense Retrieval
Yongkang Li
Panagiotis Eustratiadis
Simon Lupart
Evangelos Kanoulas
AAML
53
0
0
24 Apr 2025
Breaking the Lens of the Telescope: Online Relevance Estimation over Large Retrieval Sets
Mandeep Rathee
Venktesh V
Sean MacAvaney
Avishek Anand
KELM
32
1
0
12 Apr 2025
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Hengran Zhang
Keping Bi
J. Guo
Xiaojie Sun
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
RALM
215
0
0
07 Apr 2025
Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking
Chris Samarinas
Hamed Zamani
ALM
LRM
74
1
0
04 Apr 2025
Scaling Sparse and Dense Retrieval in Decoder-Only LLMs
Hansi Zeng
Julian Killingback
Hamed Zamani
RALM
78
2
0
24 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
KELM
3DV
73
2
0
21 Feb 2025
FactIR: A Real-World Zero-shot Open-Domain Retrieval Benchmark for Fact-Checking
Venktesh V
Vinay Setty
HILM
54
0
0
09 Feb 2025
Hypencoder: Hypernetworks for Information Retrieval
Julian Killingback
Hansi Zeng
Hamed Zamani
112
1
0
07 Feb 2025
Hierarchical Multi-field Representations for Two-Stage E-commerce Retrieval
Niklas Freymuth
Dong Liu
Thomas Ricatte
Saab Mansour
73
0
0
30 Jan 2025
GeAR: Generation Augmented Retrieval
Haoyu Liu
Shaohan Huang
Jianfeng Liu
Yuefeng Zhan
H. Sun
Weiwei Deng
Feng Sun
Furu Wei
Qi Zhang
47
1
0
06 Jan 2025
Boosting LLM-based Relevance Modeling with Distribution-Aware Robust Learning
Hong Liu
Saisai Gong
Yixin Ji
Kaixin Wu
Jia Xu
Jinjie Gu
9
1
0
17 Dec 2024
QAEncoder: Towards Aligned Representation Learning in Question Answering System
Zhengren Wang
Qinhan Yu
Shida Wei
Zhiyu Li
Xiaoxing Wang
Pengnian Qi
Hao Liang
Wentao Zhang
RALM
35
1
0
30 Sep 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely
Siyun Zhao
Yuqing Yang
Zilong Wang
Zhiyuan He
Luna Qiu
Lili Qiu
SyDa
RALM
3DV
46
36
0
23 Sep 2024
W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering
Jinming Nian
Zhiyuan Peng
Qifan Wang
Yi Fang
RALM
78
2
0
15 Aug 2024
Preserving Multilingual Quality While Tuning Query Encoder on English Only
Oleg V. Vasilyev
Randy Sawaya
John Bohannon
35
1
0
01 Jul 2024
Retrieval-Augmented Conversational Recommendation with Prompt-based Semi-Structured Natural Language State Tracking
Sara Kemper
Justin Cui
Kai Dicarlantonio
Kathy Lin
Danjie Tang
Anton Korikov
Scott Sanner
32
11
0
25 May 2024
Towards Completeness-Oriented Tool Retrieval for Large Language Models
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Jun Xu
Jirong Wen
KELM
33
7
0
25 May 2024
Words Blending Boxes. Obfuscating Queries in Information Retrieval using Differential Privacy
Francesco Luigi De Faveri
G. Faggioli
Nicola Ferro
AAML
44
0
0
15 May 2024
Measuring Bias in a Ranked List using Term-based Representations
Amin Abolghasemi
Leif Azzopardi
Arian Askari
Maarten de Rijke
Suzan Verberne
42
6
0
09 Mar 2024
Graph Regularized Encoder Training for Extreme Classification
Anshul Mittal
Shikhar Mohan
Deepak Saini
Suchith C. Prabhu
Jian Jiao
...
Sumeet Agarwal
Soumen Chakrabarti
Purushottam Kar
Manik Varma
GNN
46
0
0
28 Feb 2024
Unveiling the Magic: Investigating Attention Distillation in Retrieval-augmented Generation
Zizhong Li
Haopeng Zhang
Jiawei Zhang
RALM
56
1
0
19 Feb 2024
Robust Training of Temporal GNNs using Nearest Neighbours based Hard Negatives
Shubham Gupta
Srikanta J. Bedathur
OOD
30
1
0
14 Feb 2024
Domain Adaptation of Multilingual Semantic Search -- Literature Review
Anna Bringmann
Anastasia Zhukova
VLM
43
0
0
05 Feb 2024
ICXML: An In-Context Learning Framework for Zero-Shot Extreme Multi-Label Classification
Yaxin Zhu
Hamed Zamani
46
3
0
16 Nov 2023
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval
Nandan Thakur
Jianmo Ni
Gustavo Hernández Ábrego
John Wieting
Jimmy J. Lin
Daniel Cer
RALM
46
12
0
10 Nov 2023
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models
Ronak Pradeep
Sahel Sharifymoghaddam
Jimmy Lin
ALM
43
37
0
26 Sep 2023
Annotating Data for Fine-Tuning a Neural Ranker? Current Active Learning Strategies are not Better than Random Selection
Sophia Althammer
Guido Zuccon
Sebastian Hofstatter
Suzan Verberne
Allan Hanbury
39
4
0
12 Sep 2023
BERM: Training the Balanced and Extractable Representation for Matching to Improve Generalization Ability of Dense Retrieval
Shicheng Xu
Liang Pang
Huawei Shen
Xueqi Cheng
16
12
0
18 May 2023
Leveraging Large Language Models in Conversational Recommender Systems
Luke Friedman
Sameer Ahuja
David Allen
Zhenning Tan
Hakim Sidahmed
...
Ajay Patel
Harsh Lara
Brian Chu
Zexiang Chen
Manoj Kumar Tiwari
32
104
0
13 May 2023
Synergistic Interplay between Search and Large Language Models for Information Retrieval
Jiazhan Feng
Chongyang Tao
Xiubo Geng
Tao Shen
Can Xu
Guodong Long
Dongyan Zhao
Daxin Jiang
KELM
63
5
0
12 May 2023
Empowering Language Model with Guided Knowledge Fusion for Biomedical Document Re-ranking
D. Gupta
Dina Demner-Fushman
29
1
0
07 May 2023
CoT-MAE v2: Contextual Masked Auto-Encoder with Multi-view Modeling for Passage Retrieval
Xing Wu
Guangyuan Ma
Peng Wang
Meng Lin
Zijia Lin
Fuzheng Zhang
Songlin Hu
RALM
24
7
0
05 Apr 2023
Rethinking the Role of Token Retrieval in Multi-Vector Retrieval
Jinhyuk Lee
Zhuyun Dai
Sai Meher Karthik Duddu
Tao Lei
Iftekhar Naim
Ming-Wei Chang
Vincent Zhao
24
15
0
04 Apr 2023
ControversialQA: Exploring Controversy in Question Answering
Zhen Wang
Peide Zhu
Jie Yang
34
1
0
10 Feb 2023
An Experimental Study on Pretraining Transformers from Scratch for IR
Carlos Lassance
Hervé Déjean
S. Clinchant
28
11
0
25 Jan 2023
Multimodal Inverse Cloze Task for Knowledge-based Visual Question Answering
Paul Lerner
O. Ferret
C. Guinaudeau
21
9
0
11 Jan 2023
Query-as-context Pre-training for Dense Passage Retrieval
Xing Wu
Guangyuan Ma
Wanhui Qian
Zijia Lin
Songlin Hu
35
9
0
19 Dec 2022
CAPSTONE: Curriculum Sampling for Dense Retrieval with Document Expansion
Xingwei He
Yeyun Gong
Alex Jin
Hang Zhang
Anlei Dong
Jian Jiao
Siu-Ming Yiu
Nan Duan
RALM
54
3
0
18 Dec 2022
LEAD: Liberal Feature-based Distillation for Dense Retrieval
Hao Sun
Xiao Liu
Yeyun Gong
Anlei Dong
Jing Lu
Yan Zhang
Linjun Yang
Rangan Majumder
Nan Duan
67
2
0
10 Dec 2022
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval
Minghan Li
Sheng-Chieh Lin
Barlas Oğuz
Asish Ghoshal
Jimmy J. Lin
Yashar Mehdad
Wen-tau Yih
Xilun Chen
30
26
0
18 Nov 2022
Task-aware Retrieval with Instructions
Akari Asai
Timo Schick
Patrick Lewis
Xilun Chen
Gautier Izacard
Sebastian Riedel
Hannaneh Hajishirzi
Wen-tau Yih
45
88
0
16 Nov 2022
Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Negatives
Si Sun
Chenyan Xiong
Yue Yu
Arnold Overwijk
Zhiyuan Liu
Jie Bao
45
6
0
31 Oct 2022
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning
Yue Yu
Chenyan Xiong
Si Sun
Chao Zhang
Arnold Overwijk
VLM
OOD
50
22
0
27 Oct 2022
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval
Kun Zhou
Yeyun Gong
Xiao Liu
Wayne Xin Zhao
Yelong Shen
...
Jing Lu
Rangan Majumder
Ji-Rong Wen
Nan Duan
Weizhu Chen
44
33
0
21 Oct 2022
Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking
Tim Baumgärtner
Leonardo F. R. Ribeiro
Nils Reimers
Iryna Gurevych
37
6
0
19 Oct 2022
FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation
Sebastian Hofstatter
Jiecao Chen
K. Raman
Hamed Zamani
RALM
65
77
0
28 Sep 2022