Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.10411
Cited By
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval
18 November 2022
Minghan Li
Sheng-Chieh Lin
Barlas Oğuz
Asish Ghoshal
Jimmy J. Lin
Yashar Mehdad
Wen-tau Yih
Xilun Chen
Re-assign community
ArXiv (abs)
PDF
HTML
Github (292★)
Papers citing
"CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval"
42 / 42 papers shown
Title
Hypencoder: Hypernetworks for Information Retrieval
Julian Killingback
Hansi Zeng
Hamed Zamani
166
1
0
07 Feb 2025
WARP: An Efficient Engine for Multi-Vector Retrieval
Jan Luca Scheerer
Matei A. Zaharia
Christopher Potts
Gustavo Alonso
Omar Khattab
83
0
0
29 Jan 2025
RuleRAG: Rule-Guided Retrieval-Augmented Generation with Language Models for Question Answering
Zhongwu Chen
Chengjin Xu
Dingmin Wang
Zhen Huang
Yong Dou
Xuhui Jiang
Jian Guo
RALM
428
3
0
15 Oct 2024
Rethinking the Role of Token Retrieval in Multi-Vector Retrieval
Jinhyuk Lee
Zhuyun Dai
Sai Meher Karthik Duddu
Tao Lei
Iftekhar Naim
Ming-Wei Chang
Vincent Zhao
63
17
0
04 Apr 2023
Multi-Vector Retrieval as Sparse Alignment
Yujie Qian
Jinhyuk Lee
Sai Meher Karthik Duddu
Zhuyun Dai
Siddhartha Brahma
Iftekhar Naim
Tao Lei
Vincent Zhao
35
13
0
02 Nov 2022
LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval
Kai Zhang
Chongyang Tao
Tao Shen
Can Xu
Xiubo Geng
Binxing Jiao
Daxin Jiang
RALM
48
25
0
29 Aug 2022
Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts
Basil Mustafa
C. Riquelme
J. Puigcerver
Rodolphe Jenatton
N. Houlsby
VLM
MoE
165
197
0
06 Jun 2022
UnifieR: A Unified Retriever for Large-Scale Retrieval
Tao Shen
Xiubo Geng
Chongyang Tao
Can Xu
Guodong Long
Kai Zhang
Daxin Jiang
RALM
53
29
0
23 May 2022
PLAID: An Efficient Engine for Late Interaction Retrieval
Keshav Santhanam
Omar Khattab
Christopher Potts
Matei A. Zaharia
VLM
105
76
0
19 May 2022
Introducing Neural Bag of Whole-Words with ColBERTer: Contextualized Late Interactions using Enhanced Reduction
Sebastian Hofstatter
Omar Khattab
Sophia Althammer
Mete Sertkan
Allan Hanbury
62
34
0
24 Mar 2022
Multi-View Document Representation Learning for Open-Domain Dense Retrieval
Shunyu Zhang
Yaobo Liang
Ming Gong
Daxin Jiang
Nan Duan
RALM
3DV
AI4TS
53
62
0
16 Mar 2022
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval
Luyu Gao
Xueguang Ma
Jimmy J. Lin
Jamie Callan
87
77
0
11 Mar 2022
Unsupervised Dense Information Retrieval with Contrastive Learning
Gautier Izacard
Mathilde Caron
Lucas Hosseini
Sebastian Riedel
Piotr Bojanowski
Armand Joulin
Edouard Grave
RALM
195
907
0
16 Dec 2021
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Keshav Santhanam
Omar Khattab
Jon Saad-Falcon
Christopher Potts
Matei A. Zaharia
103
414
0
02 Dec 2021
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?
Xilun Chen
Kushal Lakhotia
Barlas Oğuz
Anchit Gupta
Patrick Lewis
Stanislav Peshterliev
Yashar Mehdad
Sonal Gupta
Wen-tau Yih
106
69
0
13 Oct 2021
SPLADE v2: Sparse Lexical and Expansion Model for Information Retrieval
Thibault Formal
Carlos Lassance
Benjamin Piwowarski
Stéphane Clinchant
263
189
0
21 Sep 2021
Simple Entity-Centric Questions Challenge Dense Retrievers
Christopher Sciavolino
Zexuan Zhong
Jinhyuk Lee
Danqi Chen
RALM
78
167
0
17 Sep 2021
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval
Luyu Gao
Jamie Callan
RALM
277
336
0
12 Aug 2021
A Few Brief Notes on DeepImpact, COIL, and a Conceptual Framework for Information Retrieval Techniques
Jimmy J. Lin
Xueguang Ma
90
148
0
28 Jun 2021
SimCSE: Simple Contrastive Learning of Sentence Embeddings
Tianyu Gao
Xingcheng Yao
Danqi Chen
AILaw
SSL
261
3,396
0
18 Apr 2021
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
Nandan Thakur
Nils Reimers
Andreas Rucklé
Abhishek Srivastava
Iryna Gurevych
VLM
425
1,041
0
17 Apr 2021
Condenser: a Pre-training Architecture for Dense Retrieval
Luyu Gao
Jamie Callan
AI4CE
59
262
0
16 Apr 2021
COIL: Revisit Exact Lexical Match in Information Retrieval with Contextualized Inverted List
Luyu Gao
Zhuyun Dai
Jamie Callan
63
218
0
15 Apr 2021
Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
Sebastian Hofstatter
Sheng-Chieh Lin
Jheng-Hong Yang
Jimmy J. Lin
Allan Hanbury
VLM
83
400
0
14 Apr 2021
Overview of the TREC 2020 deep learning track
Nick Craswell
Bhaskar Mitra
Emine Yilmaz
Daniel Fernando Campos
117
387
0
15 Feb 2021
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
W. Fedus
Barret Zoph
Noam M. Shazeer
MoE
88
2,187
0
11 Jan 2021
CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims
Thomas Diggelmann
Jordan L. Boyd-Graber
Jannis Bulian
Massimiliano Ciaramita
Markus Leippold
79
205
0
01 Dec 2020
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
Lee Xiong
Chenyan Xiong
Ye Li
Kwok-Fung Tang
Jialin Liu
Paul N. Bennett
Junaid Ahmed
Arnold Overwijk
139
1,225
0
01 Jul 2020
Sparse, Dense, and Attentional Representations for Text Retrieval
Y. Luan
Jacob Eisenstein
Kristina Toutanova
M. Collins
66
408
0
01 May 2020
Fact or Fiction: Verifying Scientific Claims
David Wadden
Shanchuan Lin
Kyle Lo
Lucy Lu Wang
Madeleine van Zuylen
Arman Cohan
Hannaneh Hajishirzi
HAI
140
455
0
30 Apr 2020
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
Omar Khattab
Matei A. Zaharia
138
1,370
0
27 Apr 2020
SPECTER: Document-level Representation Learning using Citation-informed Transformers
Arman Cohan
Sergey Feldman
Iz Beltagy
Doug Downey
Daniel S. Weld
AI4TS
81
552
0
15 Apr 2020
Dense Passage Retrieval for Open-Domain Question Answering
Vladimir Karpukhin
Barlas Oğuz
Sewon Min
Patrick Lewis
Ledell Yu Wu
Sergey Edunov
Danqi Chen
Wen-tau Yih
RALM
195
3,762
0
10 Apr 2020
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
665
24,464
0
26 Jul 2019
Latent Retrieval for Weakly Supervised Open Domain Question Answering
Kenton Lee
Ming-Wei Chang
Kristina Toutanova
RALM
112
1,014
0
01 Jun 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
94,891
0
11 Oct 2018
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
180
2,655
0
25 Sep 2018
FEVER: a large-scale dataset for Fact Extraction and VERification
James Thorne
Andreas Vlachos
Christos Christodoulopoulos
Arpit Mittal
HILM
148
1,657
0
14 Mar 2018
Reading Wikipedia to Answer Open-Domain Questions
Danqi Chen
Adam Fisch
Jason Weston
Antoine Bordes
RALM
116
2,015
0
31 Mar 2017
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
257
3,723
0
28 Feb 2017
MS MARCO: A Human Generated MAchine Reading COmprehension Dataset
Payal Bajaj
Daniel Fernando Campos
Nick Craswell
Li Deng
Jianfeng Gao
...
Mir Rosenberg
Xia Song
Alina Stoica
Saurabh Tiwary
Tong Wang
RALM
142
2,728
0
28 Nov 2016
Distilling the Knowledge in a Neural Network
Geoffrey E. Hinton
Oriol Vinyals
J. Dean
FedML
362
19,660
0
09 Mar 2015
1