ResearchTrend.AI


JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping

4 December 2023
Zihan Liu
Wentao Ni
Jingwen Leng
Yu Feng
Cong Guo
Quan Chen
Chao Li
Minyi Guo
Yuhao Zhu

Papers citing "JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping"

8 / 8 papers shown
An Adaptive Vector Index Partitioning Scheme for Low-Latency RAG Pipeline
Joo-Young Kim
Divya Mahajan
VLM
145
0
0
11 Apr 2025
RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving
Wenqi Jiang
Suvinay Subramanian
Cat Graves
Gustavo Alonso
Amir Yazdanbakhsh
Vidushi Dadu
49
6
0
18 Mar 2025
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval
Chien-Yu Lin
Keisuke Kamahori
Yiyu Liu
Xiaoxiang Shi
Madhav Kashyap
...
Stephanie Wang
Arvind Krishnamurthy
Rohan Kadekodi
Luis Ceze
Baris Kasikci
3DV
VLM
158
1
0
28 Feb 2025
Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture
Yu Feng
Weikai Lin
Zihan Liu
Jingwen Leng
Minyi Guo
Han Zhao
Xiaofeng Hou
Jieru Zhao
Yuhao Zhu
36
3
0
13 Aug 2024
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
54
16
0
15 Oct 2023
Unlimiformer: Long-Range Transformers with Unlimited Length Input
Amanda Bertsch
Uri Alon
Graham Neubig
Matthew R. Gormley
RALM
99
122
0
02 May 2023
Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks
Torsten Hoefler
Dan Alistarh
Tal Ben-Nun
Nikoli Dryden
Alexandra Peste
MQ
141
684
0
31 Jan 2021
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas J. Guibas
3DH
3DPC
3DV
PINN
222
14,109
0
02 Dec 2016