ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1702.08734
  4. Cited By
Billion-scale similarity search with GPUs

Billion-scale similarity search with GPUs

28 February 2017
Jeff Johnson
Matthijs Douze
Hervé Jégou
ArXiv (abs)PDFHTML

Papers citing "Billion-scale similarity search with GPUs"

50 / 1,874 papers shown
Title
Polar Coordinate-Based 2D Pose Prior with Neural Distance Field
Polar Coordinate-Based 2D Pose Prior with Neural Distance Field
Qi Gan
Sao Mai Nguyen
Eric Fenaux
Stephan Clémençon
Mounîm El Yacoubi
3DH
104
0
0
06 May 2025
30DayGen: Leveraging LLMs to Create a Content Corpus for Habit Formation
30DayGen: Leveraging LLMs to Create a Content Corpus for Habit Formation
Franklin Zhang
Sonya Zhang
Alon Halevy
CLL
61
0
0
02 May 2025
Efficient Recommendation with Millions of Items by Dynamic Pruning of Sub-Item Embeddings
Efficient Recommendation with Millions of Items by Dynamic Pruning of Sub-Item Embeddings
Aleksandr V. Petrov
Craig MacDonald
Nicola Tonellotto
66
0
0
01 May 2025
Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity
Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity
Tygo Bloem
Filip Ilievski
85
0
0
30 Apr 2025
Efficient Conversational Search via Topical Locality in Dense Retrieval
Efficient Conversational Search via Topical Locality in Dense Retrieval
Cristina Ioana Muntean
F. M. Nardini
R. Perego
Guido Rocchietti
Cosimo Rulli
44
0
0
30 Apr 2025
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
Linjuan Wu
Haoran Wei
Huan Lin
Tianhao Li
Baosong Yang
Weiming Lu
75
0
0
29 Apr 2025
Building Scalable AI-Powered Applications with Cloud Databases: Architectures, Best Practices and Performance Considerations
Building Scalable AI-Powered Applications with Cloud Databases: Architectures, Best Practices and Performance Considerations
Santosh Bhupathi
AI4TSGNN
54
0
0
26 Apr 2025
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation
Yangxinyu Xie
Bowen Jiang
Tanwi Mallick
Joshua Bergerson
John K Hutchison
...
Robert B. Ross
Yan Feng
L. Levy
Weijie J. Su
Camillo J Taylor
102
2
0
24 Apr 2025
Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation
Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation
Yuanpeng Qu
Hajime Nobuhara
DiffMAI4TS
65
1
0
22 Apr 2025
DataS^3: Dataset Subset Selection for Specialization
DataS^3: Dataset Subset Selection for Specialization
Neha Hulkund
Alaa Maalouf
Levi Cai
Daniel Yang
Tsun-Hsuan Wang
...
Ken Goldberg
Hannah Kerner
Irene Chen
Yogesh A. Girdhar
Sara Beery
75
0
0
22 Apr 2025
From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
Yaxiong Wu
Sheng Liang
Chen Zhang
Yucheng Wang
Yanzhe Zhang
Huifeng Guo
Ruiming Tang
Yong Liu
KELM
136
7
0
22 Apr 2025
ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring
ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring
Kaili Huang
Thejas Venkatesh
Uma Dingankar
Antonio Mallia
Daniel Campos
...
Matei A. Zaharia
Kwabena Boahen
Omar Khattab
Saarthak Sarup
Keshav Santhanam
115
0
0
21 Apr 2025
FinSage: A Multi-aspect RAG System for Financial Filings Question Answering
FinSage: A Multi-aspect RAG System for Financial Filings Question Answering
Xinyu Wang
Jijun Chi
Zhenghan Tai
Tung Sum Thomas Kwok
Muzhi Li
...
Jerry Huang
Jingrui Tian
Fengran Mo
Yufei Cui
Ling Zhou
155
0
0
20 Apr 2025
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
Jiliang Ni
Jiachen Pu
Zhongyi Yang
Kun Zhou
Hui Wang
Xiaoliang Xiao
Dakui Wang
Xin Li
Jingfeng Luo
Conggang Hu
129
0
0
18 Apr 2025
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Shizhe Diao
Yu Yang
Y. Fu
Xin Dong
Jane Polak Scowcroft
...
Hongxu Yin
M. Patwary
Yingyan
Jan Kautz
Pavlo Molchanov
122
2
0
17 Apr 2025
Towards Lossless Token Pruning in Late-Interaction Retrieval Models
Towards Lossless Token Pruning in Late-Interaction Retrieval Models
Yuxuan Zong
Benjamin Piwowarski
81
0
0
17 Apr 2025
CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based Retrieval
CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based Retrieval
Hao Deng
Haibo Xing
Kanefumi Matsuyama
Moyu Zhang
Jinxin Hu
Hong Wen
Yu Zhang
Xiaoyi Zeng
Jing-Xuan Zhang
75
0
0
17 Apr 2025
Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs
Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs
Hyungwoo Lee
Kihyun Kim
Jinwoo Kim
Jungmin So
Myung-Hoon Cha
H. Kim
James J. Kim
Youngjae Kim
79
0
0
16 Apr 2025
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance
Shixuan Liu
Zhenzhe Zheng
Xiaoyao Huang
Fan Wu
Guihai Chen
Jie Wu
103
0
0
15 Apr 2025
MURR: Model Updating with Regularized Replay for Searching a Document Stream
MURR: Model Updating with Regularized Replay for Searching a Document Stream
Eugene Yang
Nicola Tonellotto
Dawn J Lawrie
Sean MacAvaney
James Mayfield
Douglas W. Oard
Scott Miller
KELM
72
0
0
14 Apr 2025
Enhancing Document Retrieval for Curating N-ary Relations in Knowledge Bases
Enhancing Document Retrieval for Curating N-ary Relations in Knowledge Bases
Xing David Wang
Ulf Leser
59
0
0
14 Apr 2025
An Adaptive Vector Index Partitioning Scheme for Low-Latency RAG Pipeline
An Adaptive Vector Index Partitioning Scheme for Low-Latency RAG Pipeline
Joo-Young Kim
Divya Mahajan
VLM
410
0
0
11 Apr 2025
Automating quantum feature map design via large language models
Automating quantum feature map design via large language models
Kenya Sakka
K. Mitarai
Keisuke Fujii
73
2
0
10 Apr 2025
Impact of Language Guidance: A Reproducibility Study
Impact of Language Guidance: A Reproducibility Study
Cherish Puniani
Advika Sinha
Shree Singhi
Aayan Yadav
VLM
208
0
0
10 Apr 2025
MicroNN: An On-device Disk-resident Updatable Vector Database
MicroNN: An On-device Disk-resident Updatable Vector Database
Jeffrey Pound
Floris Chabert
Arjun Bhushan
Ankur Goswami
Anil Pacaci
S. R. Chowdhury
52
1
0
08 Apr 2025
RETROcode: Leveraging a Code Database for Improved Natural Language to Code Generation
RETROcode: Leveraging a Code Database for Improved Natural Language to Code Generation
Nathanael Beau
Benoît Crabbé
104
0
0
08 Apr 2025
Decentralizing AI Memory: SHIMI, a Semantic Hierarchical Memory Index for Scalable Agent Reasoning
Decentralizing AI Memory: SHIMI, a Semantic Hierarchical Memory Index for Scalable Agent Reasoning
Tooraj Helmi
57
0
0
08 Apr 2025
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Hengran Zhang
Keping Bi
Jiafeng Guo
Xiaojie Sun
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
RALM
481
0
0
07 Apr 2025
Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG
Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG
Hengran Zhang
Minghao Tang
Keping Bi
Jiafeng Guo
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
74
0
0
07 Apr 2025
Efficient Constant-Space Multi-Vector Retrieval
Efficient Constant-Space Multi-Vector Retrieval
Sean MacAvaney
Antonio Mallia
Nicola Tonellotto
62
1
0
02 Apr 2025
LLM-Assisted Proactive Threat Intelligence for Automated Reasoning
LLM-Assisted Proactive Threat Intelligence for Automated Reasoning
Shuva Paul
Farhad Alemi
Richard Macwan
109
1
0
01 Apr 2025
Knowledge-Base based Semantic Image Transmission Using CLIP
Knowledge-Base based Semantic Image Transmission Using CLIP
Chongyang Li
Yanmei He
Tianqian Zhang
Mingjian He
Shouyin Liu
63
0
0
01 Apr 2025
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
J. Lin
Tian Wang
Kun Qian
LRM
127
7
0
31 Mar 2025
MetaCLBench: Meta Continual Learning Benchmark on Resource-Constrained Edge Devices
MetaCLBench: Meta Continual Learning Benchmark on Resource-Constrained Edge Devices
Sijia Li
Young D. Kwon
Lik-Hang Lee
Pan Hui
91
0
0
31 Mar 2025
LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search
LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search
Ximu Zeng
Liwei Deng
Penghao Chen
Xu Chen
Han Su
Kai Zheng
88
0
0
30 Mar 2025
Long-Tail Crisis in Nearest Neighbor Language Models
Long-Tail Crisis in Nearest Neighbor Language Models
Yuto Nishida
Makoto Morishita
Hiroyuki Deguchi
Hidetaka Kamigaito
Taro Watanabe
RALM
100
0
0
28 Mar 2025
MemInsight: Autonomous Memory Augmentation for LLM Agents
MemInsight: Autonomous Memory Augmentation for LLM Agents
Rana Salama
Jason (Jinglun) Cai
Michelle Yuan
Anna Currey
Monica Sunkara
Yi Zhang
Yassine Benajiba
LLMAGRALM
152
3
0
27 Mar 2025
GridMind: A Multi-Agent NLP Framework for Unified, Cross-Modal NFL Data Insights
GridMind: A Multi-Agent NLP Framework for Unified, Cross-Modal NFL Data Insights
Jordan Chipka
Chris Moyer
Clay Troyer
Tyler Fuelling
Jeremy Hochstedler
AI4CE
42
0
0
24 Mar 2025
Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Deepayan Das
Davide Talon
Yiming Wang
Massimiliano Mancini
Elisa Ricci
VLMLRM
144
0
0
24 Mar 2025
What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images
What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images
Dongheng Lin
Han Hu
Jianbo Jiao
63
0
0
23 Mar 2025
RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation
RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation
Linxi Liang
Jing Gong
Wentai Deng
Chong Wang
Guangsheng Ou
Yanlin Wang
Xin Peng
Zibin Zheng
ALM
100
0
0
21 Mar 2025
RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving
RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving
Wenqi Jiang
Suvinay Subramanian
Cat Graves
Gustavo Alonso
Amir Yazdanbakhsh
Vidushi Dadu
132
4
0
18 Mar 2025
Genicious: Contextual Few-shot Prompting for Insights Discovery
Genicious: Contextual Few-shot Prompting for Insights Discovery
Vineet Kumar
Ronald Tony
Darshita Rathore
Vipasha Rana
Bhuvanesh Mandora
Kanishka
Chetna Bansal
Anindya Moitra
76
0
0
15 Mar 2025
Speedy MASt3R
Jingxing Li
Yongjae Lee
Abhay Kumar Yadav
Cheng-Fang Peng
Rama Chellappa
Deliang Fan
3DGS
134
0
0
13 Mar 2025
Semantic Synergy: Unlocking Policy Insights and Learning Pathways Through Advanced Skill Mapping
Phoebe Koundouri
Conrad Landis
Georgios Feretzakis
86
0
0
13 Mar 2025
Continual Text-to-Video Retrieval with Frame Fusion and Task-Aware Routing
Continual Text-to-Video Retrieval with Frame Fusion and Task-Aware Routing
Zecheng Zhao
Zhi Chen
Zi-Rui Huang
S. Sadiq
Tong Chen
105
0
0
13 Mar 2025
M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper
M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper
Jiaming Zhou
Songtao Zhao
Jiabei He
Hui Wang
Wenjia Zeng
Yong Chen
Haoqin Sun
Aobo Kong
Yong Qin
149
1
0
13 Mar 2025
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng
Yucong Luo
Jie Ouyang
Qiang Liu
Huijie Liu
...
Bohou Zhang
Jiawei Cao
Jie Ma
Daoyu Wang
Enhong Chen
3DV
159
7
0
11 Mar 2025
Towards Scalable and Cross-Lingual Specialist Language Models for Oncology
Morteza Rohanian
Tarun Mehra
Nicola Miglino
Farhad Nooralahzadeh
Michael Krauthammer
Andreas Wicki
LM&MA
51
0
0
11 Mar 2025
RoboDesign1M: A Large-scale Dataset for Robot Design Understanding
T. H. Le
T. H. Nguyen
Quang-Dieu Tran
Quang Minh Nguyen
Baoru Huang
Hoan Nguyen
M. Vu
Tung D. Ta
A. Nguyen
3DV
121
0
0
09 Mar 2025
Previous
12345...363738
Next