ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.03213
  4. Cited By
LUT-NN: Empower Efficient Neural Network Inference with Centroid
  Learning and Table Lookup

LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup

7 February 2023
Xiaohu Tang
Yang Wang
Ting Cao
Li Zhang
Qi Chen
Deng Cai
Yunxin Liu
Mao Yang
ArXivPDFHTML

Papers citing "LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup"

10 / 10 papers shown
Title
Bitnet.cpp: Efficient Edge Inference for Ternary LLMs
Bitnet.cpp: Efficient Edge Inference for Ternary LLMs
J. Wang
Hansong Zhou
Ting Song
Shijie Cao
Yan Xia
Ting Cao
Jianyu Wei
Shuming Ma
Hongyu Wang
Furu Wei
61
0
0
17 Feb 2025
VoLUT: Efficient Volumetric streaming enhanced by LUT-based super-resolution
VoLUT: Efficient Volumetric streaming enhanced by LUT-based super-resolution
Chendong Wang
Anlan Zhang
Yifan Yang
Lili Qiu
Yuqing Yang
Xinyang Jiang
Feng Qian
Suman Banerjee
72
1
0
17 Feb 2025
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator
Guoyu Li
Shengyu Ye
Cen Chen
Yang Wang
Fan Yang
Ting Cao
Cheng Liu
Mohamed M. Sabry
Mao Yang
MQ
187
0
0
18 Jan 2025
AdaShadow: Responsive Test-time Model Adaptation in Non-stationary
  Mobile Environments
AdaShadow: Responsive Test-time Model Adaptation in Non-stationary Mobile Environments
Cheng Fang
Sicong Liu
Zimu Zhou
Bin Guo
Jiaqi Tang
Ke Ma
Zhiwen Yu
TTA
39
1
0
10 Oct 2024
Fast, Scalable, Energy-Efficient Non-element-wise Matrix Multiplication
  on FPGA
Fast, Scalable, Energy-Efficient Non-element-wise Matrix Multiplication on FPGA
Xuqi Zhu
Huaizhi Zhang
JunKyu Lee
Jiacheng Zhu
Chandrajit Pal
S. Saha
Klaus D. McDonald-Maier
X. Zhai
26
0
0
02 Jul 2024
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge
Jianyu Wei
Shijie Cao
Ting Cao
Lingxiao Ma
Lei Wang
Yanyong Zhang
Mao Yang
MQ
53
11
0
25 Jun 2024
GPTVQ: The Blessing of Dimensionality for LLM Quantization
GPTVQ: The Blessing of Dimensionality for LLM Quantization
M. V. Baalen
Andrey Kuzmin
Markus Nagel
Peter Couperus
Cédric Bastoul
E. Mahurin
Tijmen Blankevoort
Paul N. Whatmough
MQ
36
28
0
23 Feb 2024
Boosting Mobile CNN Inference through Semantic Memory
Boosting Mobile CNN Inference through Semantic Memory
Yun Li
Chen Zhang
S. Han
Li Zhang
B. Yin
Yunxin Liu
Mengwei Xu
ObjD
44
16
0
05 Dec 2021
What is the State of Neural Network Pruning?
What is the State of Neural Network Pruning?
Davis W. Blalock
Jose Javier Gonzalez Ortiz
Jonathan Frankle
John Guttag
191
1,032
0
06 Mar 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,996
0
20 Apr 2018
1