v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown

Title
ERNet Family: Hardware-Oriented CNN Models for Computational Imaging Using Block-Based Inference Chao-Tsung Huang 48 5 0 13 Oct 2019
eCNN: A Block-Based and Highly-Parallel CNN Accelerator for Edge Inference Chao-Tsung Huang Yu-Chun Ding Huan-Ching Wang Chi-Wen Weng Kai-Ping Lin Li-Wei Wang Li-De Chen 77 44 0 13 Oct 2019
JSDoop and TensorFlow.js: Volunteer Distributed Web Browser-Based Neural Network Training José Á. Morell Andrés Camero Enrique Alba 63 9 0 12 Oct 2019
EDEN: Enabling Energy-Efficient, High-Performance Deep Neural Network Inference Using Approximate DRAM Skanda Koppula Lois Orosa A. G. Yaglikçi Roknoddin Azizi Taha Shahroodi Konstantinos Kanellopoulos O. Mutlu 82 108 0 12 Oct 2019
SiPPing Neural Networks: Sensitivity-informed Provable Pruning of Neural Networks Cenk Baykal Lucas Liebenwein Igor Gilitschenski Dan Feldman Daniela Rus 92 18 0 11 Oct 2019
Noise as a Resource for Learning in Knowledge Distillation Elahe Arani F. Sarfraz Bahram Zonooz 64 6 0 11 Oct 2019
Structured Pruning of Large Language Models Ziheng Wang Jeremy Wohlwend Tao Lei 98 293 0 10 Oct 2019
Knowledge Distillation from Internal Representations Gustavo Aguilar Yuan Ling Yu Zhang Benjamin Yao Xing Fan Edward Guo 110 181 0 08 Oct 2019
Differentiable Sparsification for Deep Neural Networks Yognjin Lee 87 7 0 08 Oct 2019
Energy-Aware Neural Architecture Optimization with Fast Splitting Steepest Descent Dilin Wang Meng Li Lemeng Wu Vikas Chandra Qiang Liu 111 21 0 07 Oct 2019
Deep Neural Network Compression for Image Classification and Object Detection Georgios Tzelepis A. Asif Saimir Baci Selçuk Çavdar E. Aksoy 64 13 0 07 Oct 2019
Splitting Steepest Descent for Growing Neural Architectures Qiang Liu Lemeng Wu Dilin Wang 113 63 0 06 Oct 2019
Distilling BERT into Simple Neural Networks with Unlabeled Transfer Data Subhabrata Mukherjee Ahmed Hassan Awadallah 93 25 0 04 Oct 2019
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing En Li Liekang Zeng Zhi Zhou Xu Chen 85 634 0 04 Oct 2019
SAFA: a Semi-Asynchronous Protocol for Fast Federated Learning with Low Overhead A. Masullo Ligang He Toby Perrett Rui Mao Carsten Maple Majid Mirmehdi 117 319 0 03 Oct 2019
On the Efficacy of Knowledge Distillation Ligang He Rui Mao 103 622 0 03 Oct 2019
Piracy Resistant Watermarks for Deep Neural Networks Huiying Li Emily Willson Shawn Shan Bing Ye Shehroz S. Khan 88 26 0 02 Oct 2019
AntMan: Sparse Low-Rank Compression to Accelerate RNN inference Samyam Rajbhandari H. Shrivastava J. Rho MQ 57 8 0 02 Oct 2019
Neural networks on microcontrollers: saving memory at inference via operator reordering Edgar Liberis Nicholas D. Lane 66 46 0 02 Oct 2019
XNOR-Net++: Improved Binary Neural Networks Adrian Bulat Georgios Tzimiropoulos MQ 92 205 0 30 Sep 2019
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs Caiwen Ding Shuo Wang Ning Liu Kaidi Xu Yanzhi Wang Yun Liang MQ 55 90 0 29 Sep 2019
AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference Thierry Tambe En-Yu Yang Zishen Wan Yuntian Deng Vijay Janapa Reddi Alexander M. Rush David Brooks Gu-Yeon Wei MQ 69 21 0 29 Sep 2019
Learning Efficient Convolutional Networks through Irregular Convolutional Kernels Weiyu Guo Jiabin Ma Liang Wang Yongzhen Huang 28 5 0 29 Sep 2019
Additive Powers-of-Two Quantization: An Efficient Non-uniform Discretization for Neural Networks Yuhang Li Xin Dong Wei Wang MQ 86 260 0 28 Sep 2019
Training convolutional neural networks with cheap convolutions and online distillation Jiao Xie Shaohui Lin Yichen Zhang Linkai Luo 65 12 0 28 Sep 2019
A Dual Camera System for High Spatiotemporal Resolution Video Acquisition Ming Cheng Zhan Ma M. Salman Asif Yiling Xu Haojie Liu Wenbo Bao Jun Sun 65 21 0 28 Sep 2019
Training-Free Uncertainty Estimation for Dense Regression: Sensitivity as a Surrogate Lu Mi Hao Wang Yonglong Tian Hao He Nir Shavit UQCV 62 32 0 28 Sep 2019
Robust Membership Encoding: Inference Attacks and Copyright Protection for Deep Learning Congzheng Song Reza Shokri MIACV 33 5 0 27 Sep 2019
MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution Taojiannan Yang Sijie Zhu Chong Chen Shen Yan Mi Zhang Andrew Willis OOD 93 75 0 27 Sep 2019
Global Sparse Momentum SGD for Pruning Very Deep Neural Networks Xiaohan Ding Guiguang Ding Xiangxin Zhou Yuchen Guo Jungong Han Ji Liu 131 165 0 27 Sep 2019
Impact of Low-bitwidth Quantization on the Adversarial Robustness for Embedded Neural Networks Rémi Bernhard Pierre-Alain Moëllic J. Dutertre AAML MQ 98 18 0 27 Sep 2019
Pruning from Scratch Yulong Wang Xiaolu Zhang Lingxi Xie Jun Zhou Hang Su Bo Zhang Xiaolin Hu 77 196 0 27 Sep 2019
Balanced Binary Neural Networks with Gated Residual Mingzhu Shen Xianglong Liu Ruihao Gong Kai Han MQ 79 36 0 26 Sep 2019
CAT: Compression-Aware Training for bandwidth reduction Chaim Baskin Brian Chmiel Evgenii Zheltonozhskii Ron Banner A. Bronstein A. Mendelson MQ 69 12 0 25 Sep 2019
FALCON: Lightweight and Accurate Convolution Jun-Gi Jang Chun Quan Hyun Dong Lee U. Kang 13 1 0 25 Sep 2019
Learning to Seek: Autonomous Source Seeking with Deep Reinforcement Learning Onboard a Nano Drone Microcontroller Bardienus P. Duisterhof Srivatsan Krishnan Jonathan J. Cruz Colby R. Banbury William Fu Aleksandra Faust Guido de Croon Vijay Janapa Reddi 125 25 0 25 Sep 2019
Forward and Backward Information Retention for Accurate Binary Neural Networks Haotong Qin Ruihao Gong Xianglong Liu Mingzhu Shen Ziran Wei F. Yu Jingkuan Song MQ 224 334 0 24 Sep 2019
TinyBERT: Distilling BERT for Natural Language Understanding Xiaoqi Jiao Yichun Yin Lifeng Shang Xin Jiang Xiao Chen Linlin Li F. Wang Qun Liu VLM 148 1,881 0 23 Sep 2019
A generalization of regularized dual averaging and its dynamics Shih-Kang Chao Guang Cheng 65 18 0 22 Sep 2019
Structured Binary Neural Networks for Image Recognition Bohan Zhuang Chunhua Shen Mingkui Tan Peng Chen Lingqiao Liu Ian Reid MQ 137 19 0 22 Sep 2019
SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems Xiaofan Zhang Haoming Lu Cong Hao Jiachen Li Bowen Cheng ... Jinjun Xiong Thomas Huang Humphrey Shi Wen-mei W. Hwu Deming Chen 106 92 0 20 Sep 2019
Gate Decorator: Global Filter Pruning Method for Accelerating Deep Convolutional Neural Networks Zhonghui You Kun Yan Jinmian Ye Meng Ma Ping Wang 3DPC 93 252 0 18 Sep 2019
Ensemble Knowledge Distillation for Learning Improved and Efficient Networks Umar Asif Jianbin Tang S. Harrer FedML 103 76 0 17 Sep 2019
Searching for Accurate Binary Neural Architectures Mingzhu Shen Kai Han Chunjing Xu Yunhe Wang MQ 156 64 0 16 Sep 2019
Comparison of UNet, ENet, and BoxENet for Segmentation of Mast Cells in Scans of Histological Slices A. Karimov A. Razumov Ruslana Manbatchurina Ksenia Simonova Irina Donets A. Vlasova Y. Khramtsova K. Ushenin SSeg 15 10 0 15 Sep 2019
Neural Machine Translation with 4-Bit Precision and Beyond Alham Fikri Aji Kenneth Heafield MQ 31 7 0 13 Sep 2019
DASNet: Dynamic Activation Sparsity for Neural Network Efficiency Improvement Qing Yang Jiachen Mao Zuoguan Wang H. Li 65 15 0 13 Sep 2019
Characterizing the Deep Neural Networks Inference Performance of Mobile Applications Samuel S. Ogden Tian Guo 49 17 0 10 Sep 2019
VACL: Variance-Aware Cross-Layer Regularization for Pruning Deep Residual Networks Shuang Gao Xin Liu Lung-Sheng Chien William Zhang J. Álvarez VLM 3DPC 63 15 0 10 Sep 2019
DeepObfuscator: Obfuscating Intermediate Representations with Privacy-Preserving Adversarial Learning on Smartphones Ang Li Jiayi Guo Huanrui Yang Flora D. Salim Yiran Chen AAML 55 37 0 09 Sep 2019