Title |
---|
Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding<br>Weilin Zhao, Yuxiang Huang, Xu Han, Wang Xu, Chaojun Xiao, Xinrong Zhang, Yewei Fang, Kaihuo Zhang, Zhiyuan Liu, Maosong Sun |
Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules<br>Chaojun Xiao, Yuqi Luo, Wenbin Zhang, Pengle Zhang, Xu Han, ..., Zhengyan Zhang, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou |