
LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
Papers citing "LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning"
42 / 42 papers shown
Title |
---|
![]() Agentic Retrieval-Augmented Generation for Time Series Analysis Chidaksh Ravuru Sagar Srinivas Sakhinana Venkataramana Runkana |
![]() LLM Inference Unveiled: Survey and Roofline Model Insights Zhihang Yuan Yuzhang Shang Yang Zhou Zhen Dong Zhe Zhou ...Yong Jae Lee Yan Yan Beidi Chen Guangyu Sun Kurt Keutzer |