Model Compression in Practice: Lessons Learned from Practitioners
Creating On-device Machine Learning Experiences

Model Compression in Practice: Lessons Learned from Practitioners Creating On-device Machine Learning Experiences

6 October 2023

Dominik Moritz

Papers citing "Model Compression in Practice: Lessons Learned from Practitioners Creating On-device Machine Learning Experiences"

13 / 13 papers shown

Title
PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing Cheng Deng Luoyang Sun Jiwen Jiang Yongcheng Zeng Xinjian Wu ... Haoyang Li Lei Chen Lionel M. Ni Jun Wang Jun Wang 189 0 0 15 Mar 2025
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models Xubin Wang Zhiqing Tang Jianxiong Guo Tianhui Meng Chenhao Wang Tian-sheng Wang Weijia Jia 54 1 0 08 Mar 2025
OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering Jiahao Nick Li Zhuohao Jerry Zhang Zhang 56 1 0 24 Feb 2025
Optimization Strategies for Enhancing Resource Efficiency in Transformers & Large Language Models Tom Wallace Naser Ezzati-Jivan Beatrice Ombuki-Berman MQ 38 1 0 16 Jan 2025
Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments Angie Boggust Venkatesh Sivaraman Yannick Assogba Donghao Ren Dominik Moritz Fred Hohman VLM 55 3 0 06 Aug 2024
Combining Relevance and Magnitude for Resource-Aware DNN Pruning C. Chiasserini F. Malandrino Nuria Molner Zhiqiang Zhao 35 0 0 21 May 2024
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Keivan Alizadeh-Vahid Iman Mirzadeh Dmitry Belenko Karen Khatamifard Minsik Cho C. C. D. Mundo Mohammad Rastegari Mehrdad Farajtabar 77 112 0 12 Dec 2023
Designing and Training of Lightweight Neural Networks on Edge Devices using Early Halting in Knowledge Distillation Rahul Mishra Hari Prabhat Gupta 40 8 0 30 Sep 2022
Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks Torsten Hoefler Dan Alistarh Tal Ben-Nun Nikoli Dryden Alexandra Peste MQ 141 684 0 31 Jan 2021
Trust in Data Science: Collaboration, Translation, and Accountability in Corporate Data Science Projects Samir Passi S. Jackson 171 108 0 09 Feb 2020
Human-AI Collaboration in Data Science: Exploring Data Scientists' Perceptions of Automated AI Dakuo Wang Justin D. Weisz Michael J. Muller Parikshit Ram Werner Geyer Casey Dugan Y. Tausczik Horst Samulowitz Alexander G. Gray 178 308 0 05 Sep 2019
Improving fairness in machine learning systems: What do industry practitioners need? Kenneth Holstein Jennifer Wortman Vaughan Hal Daumé Miroslav Dudík Hanna M. Wallach FaML HAI 192 743 0 13 Dec 2018
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications Andrew G. Howard Menglong Zhu Bo Chen Dmitry Kalenichenko Weijun Wang Tobias Weyand M. Andreetto Hartwig Adam 3DH 950 20,572 0 17 Apr 2017