
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Papers citing "Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective"
50 / 136 papers shown
Title |
---|
![]() Qwen2 Technical Report An Yang Baosong Yang Binyuan Hui Jian Xu Bowen Yu ...Yuqiong Liu Zeyu Cui Zhenru Zhang Zhifang Guo Zhi-Wei Fan |
![]() ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity
within Large Language Models Chenyang Song Xu Han Zhengyan Zhang Shengding Hu Xiyu Shi ...Chen Chen Zhiyuan Liu Guanglin Li Tao Yang Maosong Sun |