Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.03384
Cited By
Hardware Acceleration of LLMs: A comprehensive survey and comparison
5 September 2024
Nikoletta Koilia
C. Kachris
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hardware Acceleration of LLMs: A comprehensive survey and comparison"
3 / 3 papers shown
Title
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
62
16
0
06 Oct 2024
DFX: A Low-latency Multi-FPGA Appliance for Accelerating Transformer-based Text Generation
Seongmin Hong
Seungjae Moon
Junsoo Kim
Sungjae Lee
Minsub Kim
Dongsoo Lee
Joo-Young Kim
69
76
0
22 Sep 2022
Energon: Towards Efficient Acceleration of Transformers Using Dynamic Sparse Attention
Zhe Zhou
Junling Liu
Zhenyu Gu
Guangyu Sun
64
42
0
18 Oct 2021
1