v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown

Title
Efficient Inferencing of Compressed Deep Neural Networks Dharma Teja Vooturi Saurabh Goyal Anamitra R. Choudhury Yogish Sabharwal Ashish Verma 42 6 0 01 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks Bohan Zhuang Chunhua Shen Mingkui Tan Lingqiao Liu Ian Reid MQ 110 234 0 01 Nov 2017
Tensorizing Generative Adversarial Nets Xingwei Cao Xuyang Zhao Qibin Zhao GAN 56 9 0 30 Oct 2017
Knowledge Projection for Deep Neural Networks Zhi Zhang G. Ning Zhihai He 62 15 0 26 Oct 2017
Trace norm regularization and faster inference for embedded speech recognition RNNs Markus Kliegl Siddharth Goyal Kexin Zhao Kavya Srinet Mohammad Shoeybi 67 8 0 25 Oct 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks Yu Cheng Duo Wang Pan Zhou Zhang Tao 179 1,101 0 23 Oct 2017
Learning Discrete Weights Using the Local Reparameterization Trick Oran Shayer Dan Levi Ethan Fetaya 90 88 0 21 Oct 2017
Data-Free Knowledge Distillation for Deep Neural Networks Raphael Gontijo-Lopes Stefano Fenu Thad Starner 100 273 0 19 Oct 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization D. Loroch Norbert Wehn Franz-Josef Pfreundt J. Keuper MQ 57 23 0 13 Oct 2017
STDP Based Pruning of Connections and Weight Quantization in Spiking Neural Networks for Energy Efficient Recognition Nitin Rathi Priyadarshini Panda Kaushik Roy 83 114 0 12 Oct 2017
Energy-efficient Amortized Inference with Cascaded Deep Classifiers Jiaqi Guan Yang Liu Qiang Liu Jian-wei Peng 68 33 0 10 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial Mingzhe Chen Ursula Challita Walid Saad Changchuan Yin Mérouane Debbah 110 209 0 09 Oct 2017
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with Small Deep-Neural-Network Architectures F. Iandola Kurt Keutzer 77 37 0 07 Oct 2017
To prune, or not to prune: exploring the efficacy of pruning for model compression Michael Zhu Suyog Gupta 213 1,289 0 05 Oct 2017
Improving Efficiency in Convolutional Neural Network with Multilinear Filters D. Tran Alexandros Iosifidis Moncef Gabbouj 60 40 0 28 Sep 2017
Connectivity Learning in Multi-Branch Networks Karim Ahmed Lorenzo Torresani 73 26 0 27 Sep 2017
Machine Learning Models that Remember Too Much Congzheng Song Thomas Ristenpart Vitaly Shmatikov VLM 77 522 0 22 Sep 2017
Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design Zhourui Song Zhenyu Liu Dongsheng Wang 57 42 0 22 Sep 2017
Structured Probabilistic Pruning for Convolutional Neural Network Acceleration Huan Wang Qiming Zhang Yuehai Wang Roland Hu 117 11 0 20 Sep 2017
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks Julian Faraone Nicholas J. Fraser Giulio Gambardella Michaela Blott Philip H. W. Leong MQ UQCV 62 12 0 19 Sep 2017
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning A. Ashok Nicholas Rhinehart Fares N. Beainy Kris Kitani 98 171 0 18 Sep 2017
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage Requirement Tianchan Guan Xiaoyang Zeng Mingoo Seok MQ 31 6 0 15 Sep 2017
A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications Yuan Du Li Du Yilei Li Junjie Su Mau-Chung Frank Chang 36 6 0 15 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory W. Wen Yuxiong He Samyam Rajbhandari Minjia Zhang Wenhan Wang Fang Liu Bin Hu Yiran Chen H. Li MQ 141 142 0 15 Sep 2017
Supervising Unsupervised Learning Vikas Garg Adam Kalai SSL FedML 51 30 0 14 Sep 2017
Binary-decomposed DCNN for accelerating computation and compressing model without retraining Ryuji Kamiya Takayoshi Yamashita Mitsuru Ambai Ikuro Sato Yuji Yamauchi H. Fujiyoshi MQ 26 5 0 14 Sep 2017
Flexible Network Binarization with Layer-wise Priority He Wang Yi Tian Xu Bingbing Ni Hongteng Xu MQ 72 10 0 13 Sep 2017
Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification Chong-Jun Wang Xipeng Lan Yang Zhang CVBM 87 26 0 09 Sep 2017
Real-time convolutional networks for sonar image classification in low-power embedded systems Matias Valdenegro-Toro 42 10 0 07 Sep 2017
The Mating Rituals of Deep Neural Networks: Learning Compact Feature Representations through Sexual Evolutionary Synthesis A. Chung M. Shafiee Paul Fieguth A. Wong 34 4 0 07 Sep 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks Surat Teerapittayanon Bradley McDanel H. T. Kung UQCV 121 1,153 0 06 Sep 2017
Domain-adaptive deep network compression Marc Masana Joost van de Weijer Luis Herranz Andrew D. Bagdanov J. Álvarez 82 62 0 04 Sep 2017
Fast Image Processing with Fully-Convolutional Networks Qifeng Chen Jia Xu V. Koltun 83 323 0 02 Sep 2017
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network Simulation Expansion and STDP Convergence Predictions Toby Lightheart S. Grainger Tien-Fu Lu 21 0 0 30 Aug 2017
Performance Guaranteed Network Acceleration via High-Order Residual Quantization Zefan Li Bingbing Ni Wenjun Zhang Xiaokang Yang Wen Gao MQ 91 107 0 29 Aug 2017
CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices Caiwen Ding Siyu Liao Yanzhi Wang Zhe Li Ning Liu ... Yipeng Zhang Jian Tang Qinru Qiu Xinyu Lin Bo Yuan GNN 73 260 0 29 Aug 2017
Deep Learning Sparse Ternary Projections for Compressed Sensing of Images Duc Minh Nguyen Evaggelia Tsiligianni Nikos Deligiannis 37 27 0 28 Aug 2017
The Convergence of Machine Learning and Communications Wojciech Samek S. Stańczak Thomas Wiegand AI4CE 46 29 0 28 Aug 2017
Learning Efficient Convolutional Networks through Network Slimming Zhuang Liu Jianguo Li Zhiqiang Shen Gao Huang Shoumeng Yan Changshui Zhang 229 2,431 0 22 Aug 2017
Neural Networks Compression for Language Modeling Artem M. Grachev D. Ignatov Andrey V. Savchenko 72 30 0 20 Aug 2017
Deep Neural Network Capacity Aosen Wang Huan Zhou Wenyao Xu Xin Chen 20 4 0 16 Aug 2017
BitNet: Bit-Regularized Deep Neural Networks Aswin Raghavan Mohamed R. Amer S. Chai Graham Taylor MQ 77 10 0 16 Aug 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices Dawei Li Xiaolong Wang Deguang Kong 81 99 0 16 Aug 2017
Enabling Massive Deep Neural Networks with the GraphBLAS J. Kepner Manoj Kumar José Moreira P. Pattnaik M. Serrano H. Tufo GNN 98 33 0 09 Aug 2017
Prune the Convolutional Neural Networks with Sparse Shrink Xuzhao Li Changsong Liu CVBM 20 4 0 08 Aug 2017
Natural Language Processing with Small Feed-Forward Networks Jan A. Botha Emily Pitler Ji Ma A. Bakalov Alexandru Salcianu David J. Weiss Ryan T. McDonald Slav Petrov HAI 75 38 0 01 Aug 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform Chaim Baskin Natan Liss Evgenii Zheltonozhskii A. Bronstein A. Mendelson GNN MQ 139 35 0 31 Jul 2017
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization Frederick Tung S. Muralidharan Greg Mori 72 36 0 28 Jul 2017
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks Nikolaos Passalis Anastasios Tefas 86 70 0 25 Jul 2017
Towards Evolutional Compression Yunhe Wang Chang Xu Jiayan Qiu Chao Xu Dacheng Tao 63 14 0 25 Jul 2017