Billion-scale similarity search with GPUs

28 February 2017
Jeff Johnson
Matthijs Douze
Hervé Jégou

Papers citing "Billion-scale similarity search with GPUs"

50 / 1,822 papers shown
Title
Telco-oRAG: Optimizing Retrieval-augmented Generation for Telecom Queries via Hybrid Retrieval and Neural Routing
Andrei-Laurentiu Bornea
Fadhel Ayed
Antonio De Domenico
Nicola Piovesan
Tareq Si Salem
Ali Maatouk
7
0
0
17 May 2025
Nearest Neighbor Multivariate Time Series Forecasting
Huiliang Zhang
Ping Nie
Lijun Sun
Benoit Boulet
AI4TS
4
0
0
16 May 2025
Boosting Text-to-Chart Retrieval through Training with Synthesized Semantic Insights
Yifan Wu
Lutao Yan
Yizhang Zhu
Yinan Mei
Jiannan Wang
Nan Tang
Yuyu Luo
27
0
0
15 May 2025
Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency
Adel Ammar
Anis Koubaa
Omer Nacar
W. Boulila
RALM
3DV
40
0
0
13 May 2025
Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration
Rishabh Agrawal
Himanshu Kumar
21
0
0
13 May 2025
OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval
Wei Yang
Jingjing Fu
R. Wang
Jinyu Wang
Lei Song
Jiang Bian
24
0
0
10 May 2025
References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation
Doyoung Kim
Youngjun Lee
Joeun Kim
Jihwan Bang
Hwanjun Song
Susik Yoon
Jae-Gil Lee
31
0
0
10 May 2025
Cost-Effective, Low Latency Vector Search with Azure Cosmos DB
Nitish Upreti
Krishnan Sundaram
Hari Sudan Sundar
Samer Boshra
Balachandar Perumalswamy
...
Kevin Pilch
Simon Moreno
Aayush Kataria
Vipul Vishal
H. Simhadri
21
0
0
09 May 2025
VR-RAG: Open-vocabulary Species Recognition with RAG-Assisted Large Multi-Modal Models
Fahad Shahbaz Khan
Jun Chen
Youssef Mohamed
Chun-Mei Feng
Mohamed Elhoseiny
VLM
33
0
0
08 May 2025
Polar Coordinate-Based 2D Pose Prior with Neural Distance Field
Qi Gan
Sao Mai Nguyen
Eric Fenaux
Stephan Clémençon
Mounîm El Yacoubi
3DH
57
0
0
06 May 2025
RAN Cortex: Memory-Augmented Intelligence for Context-Aware Decision-Making in AI-Native Networks
Sebastian Barros
AI4TS
36
0
0
06 May 2025
30DayGen: Leveraging LLMs to Create a Content Corpus for Habit Formation
Franklin Zhang
Sonya Zhang
Alon Halevy
CLL
37
0
0
02 May 2025
Efficient Recommendation with Millions of Items by Dynamic Pruning of Sub-Item Embeddings
Aleksandr V. Petrov
Craig MacDonald
Nicola Tonellotto
36
0
0
01 May 2025
Efficient Conversational Search via Topical Locality in Dense Retrieval
Cristina Ioana Muntean
F. M. Nardini
R. Perego
Guido Rocchietti
Cosimo Rulli
27
0
0
30 Apr 2025
Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity
Tygo Bloem
Filip Ilievski
21
0
0
30 Apr 2025
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
Linjuan Wu
Haoran Wei
Huan Lin
Tianhao Li
Baosong Yang
Weiming Lu
38
0
0
29 Apr 2025
Building Scalable AI-Powered Applications with Cloud Databases: Architectures, Best Practices and Performance Considerations
Santosh Bhupathi
AI4TS
GNN
32
0
0
26 Apr 2025
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation
Yangxinyu Xie
Bowen Jiang
Tanwi Mallick
Joshua Bergerson
John K Hutchison
...
Robert B. Ross
Yan Feng
L. Levy
Weijie J. Su
Camillo J Taylor
32
1
0
24 Apr 2025
Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation
Yuanpeng Qu
Hajime Nobuhara
DiffM
AI4TS
32
0
0
22 Apr 2025
DataS^3: Dataset Subset Selection for Specialization
Neha Hulkund
Alaa Maalouf
Levi Cai
Daniel Yang
Tsun-Hsuan Wang
...
Ken Goldberg
Hannah Kerner
Irene Chen
Yogesh A. Girdhar
Sara Beery
33
0
0
22 Apr 2025
From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
Yaxiong Wu
Sheng Liang
Chen Zhang
Yucheng Wang
Wenjie Qu
Huifeng Guo
Ruiming Tang
Yong Liu
KELM
49
1
0
22 Apr 2025
ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring
Kaili Huang
Thejas Venkatesh
Uma Dingankar
Antonio Mallia
Daniel Campos
...
Matei A. Zaharia
Kwabena Boahen
Omar Khattab
Saarthak Sarup
Keshav Santhanam
37
0
0
21 Apr 2025
FinSage: A Multi-aspect RAG System for Financial Filings Question Answering
Xinyu Wang
Jijun Chi
Zhenghan Tai
Tung Sum Thomas Kwok
Muzhi Li
...
Suyuchen Wang
Yihong Wu
Jerry Huang
Jingrui Tian
Ling Zhou
79
0
0
20 Apr 2025
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
Jiliang Ni
Jiachen Pu
Zhongyi Yang
Kun Zhou
Hui Wang
Xiaoliang Xiao
Dakui Wang
Xin Li
Jingfeng Luo
Conggang Hu
37
0
0
18 Apr 2025
CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based Retrieval
Hao Deng
Haibo Xing
Kanefumi Matsuyama
Moyu Zhang
Jinxin Hu
Hong Wen
Yu Zhang
Xiaoyi Zeng
Jing-Xuan Zhang
36
0
0
17 Apr 2025
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Shizhe Diao
Yu Yang
Y. Fu
Xin Dong
Dan Su
...
Hongxu Yin
M. Patwary
Yingyan
Jan Kautz
Pavlo Molchanov
40
0
0
17 Apr 2025
Towards Lossless Token Pruning in Late-Interaction Retrieval Models
Yuxuan Zong
Benjamin Piwowarski
41
0
0
17 Apr 2025
Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs
Hyungwoo Lee
Kihyun Kim
Jinwoo Kim
Jungmin So
Myung-Hoon Cha
H. Kim
James J. Kim
Youngjae Kim
37
0
0
16 Apr 2025
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance
Shixuan Liu
Zhenzhe Zheng
Xiaoyao Huang
Fan Wu
Guihai Chen
Jie Wu
35
0
0
15 Apr 2025
Enhancing Document Retrieval for Curating N-ary Relations in Knowledge Bases
Xing David Wang
Ulf Leser
31
0
0
14 Apr 2025
MURR: Model Updating with Regularized Replay for Searching a Document Stream
Eugene Yang
Nicola Tonellotto
Dawn J Lawrie
Sean MacAvaney
James Mayfield
Douglas W. Oard
Scott Miller
KELM
33
0
0
14 Apr 2025
Understanding and Optimizing Multi-Stage AI Inference Pipelines
Abhimanyu Bambhaniya
Hanjiang Wu
Suvinay Subramanian
Sudarshan Srinivasan
Souvik Kundu
Amir Yazdanbakhsh
Madhu Kumar
Tushar Krishna
159
0
0
14 Apr 2025
An Adaptive Vector Index Partitioning Scheme for Low-Latency RAG Pipeline
Joo-Young Kim
Divya Mahajan
VLM
151
0
0
11 Apr 2025
Impact of Language Guidance: A Reproducibility Study
Cherish Puniani
Advika Sinha
Shree Singhi
Aayan Yadav
VLM
47
0
0
10 Apr 2025
Automating quantum feature map design via large language models
Kenya Sakka
K. Mitarai
Keisuke Fujii
33
2
0
10 Apr 2025
Decentralizing AI Memory: SHIMI, a Semantic Hierarchical Memory Index for Scalable Agent Reasoning
Tooraj Helmi
26
0
0
08 Apr 2025
RETROcode: Leveraging a Code Database for Improved Natural Language to Code Generation
Nathanael Beau
Benoît Crabbé
31
0
0
08 Apr 2025
MicroNN: An On-device Disk-resident Updatable Vector Database
Jeffrey Pound
Floris Chabert
Arjun Bhushan
Ankur Goswami
Anil Pacaci
S. R. Chowdhury
26
0
0
08 Apr 2025
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Hengran Zhang
Keping Bi
J. Guo
Xiaojie Sun
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
RALM
165
0
0
07 Apr 2025
Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG
Hengran Zhang
Minghao Tang
Keping Bi
J. Guo
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
21
0
0
07 Apr 2025
Efficient Constant-Space Multi-Vector Retrieval
Sean MacAvaney
Antonio Mallia
Nicola Tonellotto
33
1
0
02 Apr 2025
Knowledge-Base based Semantic Image Transmission Using CLIP
Chongyang Li
Yanmei He
Tianqian Zhang
Mingjian He
Shouyin Liu
31
0
0
01 Apr 2025
LLM-Assisted Proactive Threat Intelligence for Automated Reasoning
Shuva Paul
Farhad Alemi
Richard Macwan
50
1
0
01 Apr 2025
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
J. Lin
Tian Wang
Kun Qian
LRM
47
2
0
31 Mar 2025
MetaCLBench: Meta Continual Learning Benchmark on Resource-Constrained Edge Devices
Sijia Li
Young D. Kwon
Lik-Hang Lee
Pan Hui
36
0
0
31 Mar 2025
LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search
Ximu Zeng
Liwei Deng
Penghao Chen
Xu Chen
Han Su
Kai Zheng
39
0
0
30 Mar 2025
Long-Tail Crisis in Nearest Neighbor Language Models
Yuto Nishida
Makoto Morishita
Hiroyuki Deguchi
Hidetaka Kamigaito
Taro Watanabe
RALM
63
0
0
28 Mar 2025
MemInsight: Autonomous Memory Augmentation for LLM Agents
Rana Salama
Jason (Jinglun) Cai
Michelle Yuan
Anna Currey
Monica Sunkara
Yi Zhang
Yassine Benajiba
LLMAG
RALM
89
1
0
27 Mar 2025
GridMind: A Multi-Agent NLP Framework for Unified, Cross-Modal NFL Data Insights
Jordan Chipka
Chris Moyer
Clay Troyer
Tyler Fuelling
Jeremy Hochstedler
AI4CE
32
0
0
24 Mar 2025
Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Deepayan Das
Davide Talon
Yiming Wang
Massimiliano Mancini
Elisa Ricci
VLM
LRM
50
0
0
24 Mar 2025