2411.17178
Cited By
LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization
26 November 2024
Rui Xie
Tianchen Zhao
Zhihang Yuan
Rui Wan
Wenxi Gao
Zhenhua Zhu
Xuefei Ning
Yu Wang
VGen
MQ
Papers citing
"LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization"
2 / 2 papers shown
Title
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
Tianchen Zhao
Tongcheng Fang
Haofeng Huang
Enshu Liu
Widyadewi Soedarmadji
...
Shengen Yan
Huazhong Yang
Xuefei Ning
Yu Wang
MQ
VGen
191
35
0
04 Jun 2024
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Chengyue Wu
Haotian Tang
Shang Yang
Zhekai Zhang
Guangxuan Xiao
Chuang Gan
Song Han
168
98
0
07 May 2024