2112.07899
Large Dual Encoders Are Generalizable Retrievers
15 December 2021
Jianmo Ni
Chen Qu
Jing Lu
Zhuyun Dai
Gustavo Hernández Ábrego
Ji Ma
Vincent Zhao
Yi Luan
Keith B. Hall
Ming-Wei Chang
Yinfei Yang
DML
Papers citing
"Large Dual Encoders Are Generalizable Retrievers"
50 / 174 papers shown
Title
References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation
Doyoung Kim
Youngjun Lee
Joeun Kim
Jihwan Bang
Hwanjun Song
Susik Yoon
Jae-Gil Lee
29
0
0
10 May 2025
Leveraging Decoder Architectures for Learned Sparse Retrieval
Jingfen Qiao
Thong Nguyen
Evangelos Kanoulas
Andrew Yates
51
0
0
25 Apr 2025
PropRAG: Guiding Retrieval with Beam Search over Proposition Paths
Jingjin Wang
LRM
143
0
0
25 Apr 2025
TIFIN India at SemEval-2025: Harnessing Translation to Overcome Multilingual IR Challenges in Fact-Checked Claim Retrieval
Prasanna Devadiga
Arya Suneesh
Pawan Kumar Rajpoot
Bharatdeep Hazarika
Aditya U Baliga
27
0
0
23 Apr 2025
CSPLADE: Learned Sparse Retrieval with Causal Language Models
Zhichao Xu
Aosong Feng
Yijun Tian
Haibo Ding
Lin Lee Cheong
RALM
40
0
0
15 Apr 2025
MURR: Model Updating with Regularized Replay for Searching a Document Stream
Eugene Yang
Nicola Tonellotto
Dawn J Lawrie
Sean MacAvaney
James Mayfield
Douglas W. Oard
Scott Miller
KELM
33
0
0
14 Apr 2025
Talking Point based Ideological Discourse Analysis in News Events
Nishanth Nakshatri
Nikhil Mehta
Siyi Liu
Sihao Chen
Daniel J. Hopkins
Dan Roth
Dan Goldwasser
34
0
0
10 Apr 2025
Causal Retrieval with Semantic Consideration
Hyunseo Shin
Wonseok Hwang
28
0
0
07 Apr 2025
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Hengran Zhang
Keping Bi
J. Guo
Xiaojie Sun
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
RALM
141
0
0
07 Apr 2025
Real-time Ad retrieval via LLM-generative Commercial Intention for Sponsored Search Advertising
Tongtong Liu
Zhaohui Wang
Meiyue Qin
Zenghui Lu
Xudong Chen
Yuekui Yang
Peng Shu
38
0
0
02 Apr 2025
Generative Retrieval and Alignment Model: A New Paradigm for E-commerce Retrieval
Ming Pang
Chunyuan Yuan
Xiaoyu He
Zheng Fang
Donghao Xie
...
Xue Jiang
Changping Peng
Zhangang Lin
Zheng Luo
Jingping Shao
RALM
36
0
0
02 Apr 2025
Universal Zero-shot Embedding Inversion
Collin Zhang
John X. Morris
Vitaly Shmatikov
50
0
0
31 Mar 2025
Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack
Cheng Wang
Yiwei Wang
Yujun Cai
Bryan Hooi
AAML
54
0
0
27 Mar 2025
Safeguarding LLM Embeddings in End-Cloud Collaboration via Entropy-Driven Perturbation
Shuaifan Jin
Xiaoyi Pang
Zhibo Wang
He Wang
Jiacheng Du
Jiahui Hu
Kui Ren
SILM
AAML
78
0
0
17 Mar 2025
Word2winners at SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval
AmirMohammad Azadi
Sina Zamani
Mohammadmostafa Rostamkhani
Sauleh Eetemadi
LRM
49
2
0
12 Mar 2025
IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval
Tingyu Song
Guo Gan
Mingsheng Shang
Yilun Zhao
VLM
65
0
0
06 Mar 2025
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence
Mohsen Fayyaz
Ali Modarressi
Hinrich Schuetze
Nanyun Peng
57
0
0
06 Mar 2025
Large-Scale Data Selection for Instruction Tuning
Hamish Ivison
Muru Zhang
Faeze Brahman
Pang Wei Koh
Pradeep Dasigi
ALM
73
1
0
03 Mar 2025
Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution
K. Li
Tianhua Zhang
Yunxiang Li
Hongyin Luo
Abdalla Moustafa
Xixin Wu
James Glass
Helen Meng
61
0
0
03 Mar 2025
Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition
Yifei Duan
Raphael Shang
Deng Liang
Yongqiang Cai
87
0
0
28 Feb 2025
Hierarchical corpus encoder: Fusing generative retrieval and dense indices
Tongfei Chen
Ankita Sharma
Adam Pauls
Benjamin Van Durme
RALM
46
1
0
26 Feb 2025
Vector-ICL: In-context Learning with Continuous Vector Representations
Yufan Zhuang
Chandan Singh
Liyuan Liu
Jingbo Shang
Jianfeng Gao
52
3
0
21 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
KELM
3DV
58
2
0
21 Feb 2025
Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search
Yifan Ji
Zhipeng Xu
Zhenghao Liu
Yukun Yan
S. Yu
Yongqian Li
Zhiyuan Liu
Yu Gu
Ge Yu
Maosong Sun
RALM
63
0
0
18 Feb 2025
G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation
Yuhan Li
Xinni Zhang
Linhao Luo
Heng Chang
Yuxiang Ren
Irwin King
J. Li
57
3
0
18 Feb 2025
Following the Autoregressive Nature of LLM Embeddings via Compression and Alignment
Jingcheng Deng
Zhongtao Jiang
Liang Pang
Liwei Chen
Kun Xu
Zihao Wei
Huawei Shen
Xueqi Cheng
51
1
0
17 Feb 2025
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval
Ze Liu
Zhengyang Liang
Junjie Zhou
Zheng Liu
Defu Lian
OffRL
103
0
0
17 Feb 2025
ALGEN: Few-shot Inversion Attacks on Textual Embeddings using Alignment and Generation
Yiyi Chen
Qiongkai Xu
Johannes Bjerva
49
0
0
16 Feb 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
H. Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
W. Yu
Dong Yu
LLMAG
61
3
0
03 Jan 2025
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks
Ziyan Jiang
Rui Meng
Xinyi Yang
Semih Yavuz
Yingbo Zhou
Wenhu Chen
MLLM
VLM
51
19
0
03 Jan 2025
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models
Hieu Man
Nghia Trung Ngo
Viet Dac Lai
Ryan Rossi
Franck Dernoncourt
T. Nguyen
160
0
0
01 Jan 2025
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search
Matan Ben-Tov
Mahmood Sharif
RALM
40
0
0
31 Dec 2024
MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge
Jie He
Nan Hu
Wanqiu Long
Jiaoyan Chen
Jeff Z. Pan
ELM
LRM
96
6
0
22 Dec 2024
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling
Junyi Li
Hwee Tou Ng
LRM
90
1
0
19 Dec 2024
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
Jiayi Wu
Hengyi Cai
Lingyong Yan
Hao Sun
Xiang Li
S. Wang
Dawei Yin
Ming Gao
117
0
0
19 Dec 2024
LLMs are Also Effective Embedding Models: An In-depth Overview
Chongyang Tao
Tao Shen
Shen Gao
Junshuo Zhang
Zhen Li
Zhengwei Tao
Shuai Ma
83
7
0
17 Dec 2024
PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization
Yun Luo
Yingjie Li
Xiangkun Hu
Qinglin Qi
Fang Guo
Qipeng Guo
Zheng Zhang
Yue Zhang
67
0
0
17 Dec 2024
Generating a Low-code Complete Workflow via Task Decomposition and RAG
Orlando Marquez Ayala
Patrice Béchard
65
1
0
29 Nov 2024
Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers
Chancharik Mitra
Brandon Huang
Tianning Chai
Zhiqiu Lin
Assaf Arbelle
Rogerio Feris
Leonid Karlinsky
Trevor Darrell
Deva Ramanan
Roei Herzig
VLM
126
4
0
28 Nov 2024
Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models
Iacopo Ghinassi
Leonardo Catalano
Tommaso Colella
70
1
0
21 Nov 2024
Any2Any: Incomplete Multimodal Retrieval with Conformal Prediction
Po-han Li
Yunhao Yang
Mohammad Omama
Sandeep P. Chinchali
Ufuk Topcu
41
1
0
15 Nov 2024
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research
Sian-Yao Huang
Cheng-Lin Yang
C. Lin
Chun-Ying Huang
35
2
0
02 Nov 2024
Length-Induced Embedding Collapse in Transformer-based Models
Yuqi Zhou
Sunhao Dai
Zhanshuo Cao
Xiao Zhang
Jun Xu
45
0
0
31 Oct 2024
Sensor2Text: Enabling Natural Language Interactions for Daily Activity Tracking Using Wearable Sensors
Wenqiang Chen
Jiaxuan Cheng
Leyao Wang
Wei Zhao
Wojciech Matusik
33
1
0
26 Oct 2024
UniHGKR: Unified Instruction-aware Heterogeneous Knowledge Retrievers
Dehai Min
Zhiyang Xu
Guilin Qi
Lifu Huang
Chenyu You
RALM
73
1
0
26 Oct 2024
Multi-Field Adaptive Retrieval
Millicent Li
Tongfei Chen
Benjamin Van Durme
Patrick Xia
136
1
0
26 Oct 2024
Little Giants: Synthesizing High-Quality Embedding Data at Scale
Haonan Chen
Liang Wang
Nan Yang
Bo Li
Ziliang Zhao
Furu Wei
Zhicheng Dou
SyDa
34
1
0
24 Oct 2024
Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction
Sergio Burdisso
S. Madikeri
P. Motlícek
37
1
0
24 Oct 2024
Improving Model Factuality with Fine-grained Critique-based Evaluator
Yiqing Xie
Wenxuan Zhou
Pradyot Prakash
Di Jin
Yuning Mao
...
Sinong Wang
Han Fang
Carolyn Rose
Daniel Fried
Hejia Zhang
HILM
33
5
0
24 Oct 2024
Compute-Constrained Data Selection
Junjie Oscar Yin
Alexander M. Rush
39
0
0
21 Oct 2024