Title |
---|
![]() ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity
within Large Language Models Chenyang Song Xu Han Zhengyan Zhang Shengding Hu Xiyu Shi ...Chen Chen Zhiyuan Liu Guanglin Li Tao Yang Maosong Sun |
![]() Learning to Edit: Aligning LLMs with Knowledge Editing Yuxin Jiang Yufei Wang Chuhan Wu Wanjun Zhong Xingshan Zeng ...Xin Jiang Lifeng Shang Ruiming Tang Qun Liu Wei Wang |
![]() ReLU Wins: Discovering Efficient Activation Functions for Sparse
LLMs Zhengyan Zhang Yixin Song Guanghui Yu Xu Han Yankai Lin Chaojun Xiao Chenyang Song Zhiyuan Liu Zeyu Mi Maosong Sun |
![]() OLMo: Accelerating the Science of Language Models Dirk Groeneveld Iz Beltagy Pete Walsh Akshita Bhagia Rodney Michael Kinney ...Jesse Dodge Kyle Lo Luca Soldaini Noah A. Smith Hanna Hajishirzi |