Title |
---|
![]() SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight
Compression Tim Dettmers Ruslan Svirschevski Vage Egiazarian Denis Kuznedelev Elias Frantar Saleh Ashkboos Alexander Borzunov Torsten Hoefler Dan Alistarh |
![]() DreamShard: Generalizable Embedding Table Placement for Recommender
Systems Daochen Zha Louis Feng Qiaoyu Tan Zirui Liu Kwei-Herng Lai Bhargav Bhushanam Yuandong Tian A. Kejariwal Xia Hu |
![]() AutoShard: Automated Embedding Table Sharding for Recommender Systems Daochen Zha Louis Feng Bhargav Bhushanam Dhruv Choudhary Jade Nie Yuandong Tian Jay Chae Yi-An Ma A. Kejariwal Xia Hu |