Billion-scale similarity search with GPUs

28 February 2017

Papers citing "Billion-scale similarity search with GPUs"

50 / 1,874 papers shown

REIS: A High-Performance and Energy-Efficient Retrieval System with In-Storage Processing Kangqi Chen Andreas Kosmas Kakolyris Rakesh Nadig Manos Frouzakis Nika Mansouri-Ghiasi Yu Liang Haiyu Mao Jisung Park Mohammad Sadrosadati Onur Mutlu RALM 38 0 0 19 Jun 2025
SCISSOR: Mitigating Semantic Bias through Cluster-Aware Siamese Networks for Robust Classification Shuo Yang Bardh Prenkaj Gjergji Kasneci 31 0 0 17 Jun 2025
Refining music sample identification with a self-supervised graph neural network Aditya Bhattacharjee Ivan Meresman Higgs Mark Sandler Emmanouil Benetos 36 0 0 17 Jun 2025
Assessing the Performance Gap Between Lexical and Semantic Models for Information Retrieval With Formulaic Legal Language Larissa Mori Carlos Sousa de Oliveira Yuehwern Yih Mario Ventresca AILaw RALM ELM 31 0 0 15 Jun 2025
How Grounded is Wikipedia? A Study on Structured Evidential Support William Walden Kathryn Ricci Miriam Wanner Zhengping Jiang Chandler May Rongkun Zhou Benjamin Van Durme HILM 20 0 0 14 Jun 2025
KEENHash: Hashing Programs into Function-Aware Embeddings for Large-Scale Binary Code Similarity Analysis Zhijie Liu Qiyi Tang Sen Nie Shi Wu Liang Feng Zhang Yutian Tang 14 1 0 13 Jun 2025
Constructing and Evaluating Declarative RAG Pipelines in PyTerrier Craig Macdonald Jinyuan Fang Andrew Parry Zaiqiao Meng AI4TS 115 0 0 12 Jun 2025
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models Xuanchi Ren Y. Lu Tianshi Cao Ruiyuan Gao S. Huang ... Jun Gao Laura Leal-Taixe Mike Chen Sanja Fidler Huan Ling VGen 76 0 0 10 Jun 2025
RAISE: Enhancing Scientific Reasoning in LLMs via Step-by-Step Retrieval Minhae Oh Jeonghye Kim Nakyung Lee Donggeon Seo Taeuk Kim Jungwoo Lee ReLM LRM 29 0 0 10 Jun 2025
When Simple Model Just Works: Is Network Traffic Classification in Crisis? Kamil Jeřábek Jan Luxemburk Richard Plný Josef Koumar Jaroslav Pesek Karel Hynek 23 0 0 10 Jun 2025
Protriever: End-to-End Differentiable Protein Homology Search for Fitness Prediction Ruben Weitzman Peter Mørch Groth Lood Van Niekerk Aoi Otani Y. Gal D. Marks Pascal Notin 32 0 0 10 Jun 2025
CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems Aniket Rege Zinnia Nie Mahesh Ramesh Unmesh Raskar Zhuoran Yu Aditya Kusupati Yong Jae Lee Ramya Korlakai Vinayak 28 0 0 09 Jun 2025
No Stupid Questions: An Analysis of Question Query Generation for Citation Recommendation Brian D. Zimmerman Julien Aubert-Béduchaud Florian Boudin Akiko Aizawa Olga Vechtomova 12 0 0 09 Jun 2025
The State-of-the-Art in Lifelog Retrieval: A Review of Progress at the ACM Lifelog Search Challenge Workshop 2022-24 Allie Tran Werner Bailer Duc-Tien Dang-Nguyen Graham Healy Steve Hodges ... Luca Rossetto Klaus Schoeffmann Minh-Triet Tran Lucia Vadicamo C. Gurrin 15 0 0 07 Jun 2025
Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning Junqi Gao Xiang Zou Ying Ai Dong Li Yichen Niu Biqing Qi Jianxing Liu 59 0 0 04 Jun 2025
Product Quantization for Surface Soil Similarity Haley Dozier Althea Henslee Ashley Abraham Andrew Strelzoff Mark Chappell 25 0 0 03 Jun 2025
Contrast & Compress: Learning Lightweight Embeddings for Short Trajectories Abhishek Vivekanandan Christian Hubschneider J. M. Zöllner 45 0 0 03 Jun 2025
AdaRewriter: Unleashing the Power of Prompting-based Conversational Query Reformulation via Test-Time Adaptation Yilong Lai Jialong Wu Zhenglin Wang Deyu Zhou 55 0 0 02 Jun 2025
Position: The Future of Bayesian Prediction Is Prior-Fitted Samuel G. Müller Arik Reuter Noah Hollmann David Rügamer Frank Hutter 30 0 0 29 May 2025
Deep Retrieval at CheckThat! 2025: Identifying Scientific Papers from Implicit Social Media Mentions via Hybrid Retrieval and Re-Ranking Pascal Sager Ashwini Kamaraj Benjamin Grewe Thilo Stadelmann 21 0 0 29 May 2025
Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule San Jiang Kan You Wanshou Jiang Qingquan Li 39 0 0 28 May 2025
Towards a More Generalized Approach in Open Relation Extraction Qing Wang Yuepei Li Qiao Qiao Kang Zhou Qi Li NAI 26 0 0 28 May 2025
Diagnosing and Resolving Cloud Platform Instability with Multi-modal RAG LLMs Yifan Wang Kenneth P. Birman 98 0 0 27 May 2025
ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision Dosung Lee Wonjun Oh Boyoung Kim Minyoung Kim Joonsuk Park Paul Hongsuck Seo LRM 25 0 0 27 May 2025
MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning Thang Nguyen Peter Chin Yu-Wing Tai LRM 80 1 0 26 May 2025
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning Yuan Li Qi Luo Xiaonan Li B. Li Qinyuan Cheng Bo Wang Y. Zheng Yuxin Wang Zhangyue Yin Xipeng Qiu RALM LRM 36 0 0 26 May 2025
BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM Xun Gong Anqi Lv Zhiming Wang Huijia Zhu Y. Qian 56 0 0 25 May 2025
Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval Kidist Amde Mekonnen Yosef Worku Alemneh Maarten de Rijke RALM 50 0 0 25 May 2025
Enhancing Training Data Attribution with Representational Optimization W. Sun Haokun Liu Nikhil Kandpal Colin Raffel Yiming Yang TDI 46 0 0 24 May 2025
Improving Ad matching via Cluster-Adaptive Keyword Expansion and Relevance tuning Dipanwita Saha Anis Zaman Hua Zou Ning Chen Xinxin Shu Nadia Vase Abraham Bagherjeiran 21 0 0 24 May 2025
Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation Li Zhong Ahmed Ghazal Jun-Jun Wan Frederik Zilly Patrick Mackens Joachim E. Vollrath Bogdan Sorin Coseriu 247 0 0 23 May 2025
Less Context, Same Performance: A RAG Framework for Resource-Efficient LLM-Based Clinical NLP Satya Narayana Cheetirala Ganesh Raut Dhavalkumar Patel Fabio Sanatana Robert Freeman ... Omar Dawkins Reba Miller Randolph M. Steinhagen Eyal Klang Prem Timsina RALM 48 0 0 23 May 2025
VIBE: Vector Index Benchmark for Embeddings Elias Jääsaari Ville Hyvönen Matteo Ceccarello Teemu Roos Martin Aumüller VLM 88 0 0 23 May 2025
Neighbour-Driven Gaussian Process Variational Autoencoders for Scalable Structured Latent Modelling Xinxing Shi Xiaoyu Jiang Mauricio A. Álvarez BDL 112 0 0 22 May 2025
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning Changtai Zhu Siyin Wang Ruijun Feng Kai Song Xipeng Qiu LRM 88 0 0 21 May 2025
HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving Zhiwen Chen Bo Leng Zhuoren Li Hanming Deng Guizhe Jin Ran Yu Huanxi Wen 231 0 0 21 May 2025
Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data Faeze Ghorbanpour Daryna Dementieva Alexander Fraser 81 0 0 20 May 2025
LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference Guangyuan Ma Yongliang Ma Xuanrui Gou Zhenpeng Su Ming Zhou Songlin Hu RALM 86 0 0 18 May 2025
Telco-oRAG: Optimizing Retrieval-augmented Generation for Telecom Queries via Hybrid Retrieval and Neural Routing Andrei-Laurentiu Bornea Fadhel Ayed Antonio De Domenico Nicola Piovesan Tareq Si Salem Ali Maatouk 54 0 0 17 May 2025
Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models Camille Couturier Spyros Mastorakis Haiying Shen Saravan Rajmohan Victor Rühle KELM 61 0 0 16 May 2025
Nearest Neighbor Multivariate Time Series Forecasting Huiliang Zhang Ping Nie Lijun Sun Benoit Boulet AI4TS 109 1 0 16 May 2025
Boosting Text-to-Chart Retrieval through Training with Synthesized Semantic Insights Yifan Wu Lutao Yan Yizhang Zhu Yinan Mei Jiannan Wang Nan Tang Yuyu Luo 117 1 0 15 May 2025
VLM-KG: Multimodal Radiology Knowledge Graph Generation Abdullah Abdullah Seong Tae Kim 89 0 0 13 May 2025
Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration Rishabh Agrawal Himanshu Kumar 92 0 0 13 May 2025
Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency Adel Ammar Anis Koubaa Omer Nacar W. Boulila RALM 3DV 99 0 0 13 May 2025
References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation Doyoung Kim Youngjun Lee Joeun Kim Jihwan Bang Hwanjun Song Susik Yoon Jae-Gil Lee 203 0 0 10 May 2025
OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval Wei Yang Jingjing Fu Rongpin Wang Jinyu Wang Lei Song Jiang Bian 63 1 0 10 May 2025
Cost-Effective, Low Latency Vector Search with Azure Cosmos DB Nitish Upreti Krishnan Sundaram Hari Sudan Sundar Samer Boshra Balachandar Perumalswamy ... Kevin Pilch Simon Moreno Aayush Kataria Vipul Vishal H. Simhadri 62 0 0 09 May 2025
VR-RAG: Open-vocabulary Species Recognition with RAG-Assisted Large Multi-Modal Models Fahad Shahbaz Khan Jun Chen Youssef Mohamed Chun-Mei Feng Mohamed Elhoseiny VLM 133 1 0 08 May 2025
RAN Cortex: Memory-Augmented Intelligence for Context-Aware Decision-Making in AI-Native Networks Sebastian Barros AI4TS 67 0 0 06 May 2025