TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s

28 June 2022
Felix Chern
Blake A. Hechtman
Andy Davis
Ruiqi Guo
David Majnemer
Surinder Kumar

Papers citing "TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s"

16 / 16 papers shown
An Adaptive Vector Index Partitioning Scheme for Low-Latency RAG Pipeline
J. Kim
Divya Mahajan
VLM
126
0
0
11 Apr 2025
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval
Chien-Yu Lin
Keisuke Kamahori
Yiyu Liu
Xiaoxiang Shi
Madhav Kashyap
...
Stephanie Wang
Arvind Krishnamurthy
Rohan Kadekodi
Luis Ceze
Baris Kasikci
3DV
VLM
143
1
0
28 Feb 2025
Machine learning and high dimensional vector search
Matthijs Douze
63
0
0
24 Feb 2025
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation
Mohammad Mahdi Abootorabi
Amirhosein Zobeiri
Mahdi Dehghani
Mohammadali Mohammadkhani
Bardia Mohammadi
Omid Ghahroodi
M. Baghshah
Ehsaneddin Asgari
RALM
105
4
0
12 Feb 2025
BANG: Billion-Scale Approximate Nearest Neighbor Search using a Single GPU
V. Karthik
Saim Khan
Somesh Singh
H. Simhadri
Jyothi Vedurada
GNN
15
8
0
20 Jan 2024
The Faiss library
Matthijs Douze
Alexandr Guzhva
Chengqi Deng
Jeff Johnson
Gergely Szilvasy
Pierre-Emmanuel Mazaré
Maria Lomeli
Lucas Hosseini
Hervé Jégou
32
148
0
16 Jan 2024
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
54
16
0
15 Oct 2023
DeDrift: Robust Similarity Search under Content Drift
Dmitry Baranchuk
Matthijs Douze
Yash Upadhyay
I. Z. Yalniz
22
8
0
05 Aug 2023
Co-design Hardware and Algorithm for Vector Search
Wenqi Jiang
Shigang Li
Yu Zhu
Johannes de Fine Licht
Zhenhao He
...
Cédric Renggli
Shuai Zhang
Theodoros Rekatsinas
Torsten Hoefler
Gustavo Alonso
84
20
0
19 Jun 2023
AVIS: Autonomous Visual Information Seeking with Large Language Model Agent
Ziniu Hu
Ahmet Iscen
Chen Sun
Kai-Wei Chang
Yizhou Sun
David A. Ross
Cordelia Schmid
Alireza Fathi
31
11
0
13 Jun 2023
Revisiting Neural Retrieval on Accelerators
Jiaqi Zhai
Zhaojie Gong
Yueming Wang
Xiao Sun
Zheng Yan
Fu Li
Xing Liu
13
9
0
06 Jun 2023
REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Ziniu Hu
Ahmet Iscen
Chen Sun
Zirui Wang
Kai-Wei Chang
Yizhou Sun
Cordelia Schmid
David A. Ross
Alireza Fathi
RALM
VLM
40
88
0
10 Dec 2022
Augmentation with Projection: Towards an Effective and Efficient Data Augmentation Paradigm for Distillation
Ziqi Wang
Yuexin Wu
Frederick Liu
Daogao Liu
Le Hou
Hongkun Yu
Jing Li
Heng Ji
32
5
0
21 Oct 2022
The Lazy Neuron Phenomenon: On Emergence of Activation Sparsity in Transformers
Zong-xiao Li
Chong You
Srinadh Bhojanapalli
Daliang Li
A. S. Rawat
...
Kenneth Q Ye
Felix Chern
Felix X. Yu
Ruiqi Guo
Surinder Kumar
MoE
27
87
0
12 Oct 2022
Sparsity-Constrained Optimal Transport
Tianlin Liu
J. Puigcerver
Mathieu Blondel
OT
21
22
0
30 Sep 2022
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
304
3,708
0
11 Feb 2021