Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.00462
Cited By
Designing Efficient LLM Accelerators for Edge Devices
1 August 2024
Jude Haris
Rappy Saha
Wenhao Hu
José Cano
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Designing Efficient LLM Accelerators for Edge Devices"
4 / 4 papers shown
Title
Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency
E. J. Husom
Arda Goknil
Merve Astekin
Lwin Khin Shar
Andre Kåsen
S. Sen
Benedikt Andreas Mithassel
Ahmet Soylu
MQ
43
1
0
04 Apr 2025
Exploiting Unstructured Sparsity in Fully Homomorphic Encrypted DNNs
Aidan Ferguson
Perry Gibson
Lara DÁgata
Parker McLeod
Ferhat Yaman
Amitabh Das
Ian Colbert
José Cano
63
0
0
12 Mar 2025
Does Acceleration Cause Hidden Instability in Vision Language Models? Uncovering Instance-Level Divergence Through a Large-Scale Empirical Study
Yizheng Sun
Hao Li
Chang Xu
Hongpeng Zhou
Chenghua Lin
R. Batista-Navarro
Jingyuan Sun
62
0
0
09 Mar 2025
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
62
17
0
06 Oct 2024
1