Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown

Title
The Loss Surface of XOR Artificial Neural Networks D. Mehta Xiaojun Zhao Edgar A. Bernal D. Wales 167 19 0 06 Apr 2018
Structured Evolution with Compact Architectures for Scalable Policy Optimization K. Choromanski Mark Rowland Vikas Sindhwani Richard Turner Adrian Weller 118 149 0 06 Apr 2018
Building Efficient CNN Architecture for Offline Handwritten Chinese Character Recognition Zhiyuan Li Nanjun Teng Min Jin Huaxiang Lu 39 52 0 04 Apr 2018
DeepSigns: A Generic Watermarking Framework for IP Protection of Deep Learning Models B. Rouhani Huili Chen F. Koushanfar 130 48 0 02 Apr 2018
Structured Weight Matrices-Based Hardware Accelerators in Deep Neural Networks: FPGAs and ASICs Caiwen Ding Ao Ren Geng Yuan Xiaolong Ma Jiayu Li Ning Liu Bo Yuan Yanzhi Wang 77 23 0 28 Mar 2018
Adversarial Network Compression Vasileios Belagiannis Azade Farshad Fabio Galasso GAN AAML 69 58 0 28 Mar 2018
FPGA Implementations of 3D-SIMD Processor Architecture for Deep Neural Networks Using Relative Indexed Compressed Sparse Filter Encoding Format and Stacked Filters Stationary Flow Yuechao Gao Nianhong Liu Shenmin Zhang 98 1 0 28 Mar 2018
Incremental Training of Deep Convolutional Neural Networks R. Istrate A. Malossi C. Bekas Dimitrios S. Nikolopoulos CLL 68 21 0 27 Mar 2018
Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions Zheng Qin Zhaoning Zhang Dongsheng Li Yiming Zhang Yuxing Peng 62 28 0 27 Mar 2018
Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs Chuanhao Zhuge Xinheng Liu Xiaofan Zhang S. Gummadi Jinjun Xiong Deming Chen CVBM 64 36 0 23 Mar 2018
Iterative Low-Rank Approximation for CNN Compression Maksym Kholiavchenko 36 9 0 23 Mar 2018
SqueezeNext: Hardware-Aware Neural Network Design A. Gholami K. Kwon Bichen Wu Zizheng Tai Xiangyu Yue Peter H. Jin Sicheng Zhao Kurt Keutzer 69 300 0 23 Mar 2018
Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network Namhyuk Ahn Byungkon Kang Kyung-ah Sohn SupR 123 1,131 0 23 Mar 2018
Design Principles for Sparse Matrix Multiplication on the GPU Carl Yang A. Buluç John Douglas Owens 61 109 0 22 Mar 2018
Task dependent Deep LDA pruning of neural networks Qing Tian Tal Arbel James J. Clark 31 0 0 21 Mar 2018
Efficient Recurrent Neural Networks using Structured Matrices in FPGAs Zhe Li Shuo Wang Caiwen Ding Qinru Qiu Yanzhi Wang Yun Liang GNN 41 21 0 20 Mar 2018
Local Binary Pattern Networks Jeng-Hau Lin Yunfan Yang Rajesh K. Gupta Zhuowen Tu MQ 50 13 0 19 Mar 2018
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation Sachin Mehta Mohammad Rastegari A. Caspi Linda G. Shapiro Hannaneh Hajishirzi SSeg 156 784 0 19 Mar 2018
Constrained Deep Learning using Conditional Gradient and Applications in Computer Vision Sathya Ravi Tuan Dinh Vishnu Suresh Lokhande Vikas Singh AI4CE 71 22 0 17 Mar 2018
EVA²: Exploiting Temporal Redundancy in Live Computer Vision Mark Buckler Philip Bedoukian Suren Jayasuriya Adrian Sampson 126 79 0 16 Mar 2018
TBD: Benchmarking and Analyzing Deep Neural Network Training Hongyu Zhu Mohamed Akrout Bojian Zheng Andrew Pelegris Amar Phanishayee Bianca Schroeder Gennady Pekhimenko 103 81 0 16 Mar 2018
Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning Maurice Yang Mahmoud Faraj Assem Hussein V. Gaudet CVBM 69 12 0 15 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions Stylianos I. Venieris Alexandros Kouris C. Bouganis 76 185 0 15 Mar 2018
Exploring Linear Relationship in Feature Map Subspace for ConvNets Compression Dong Wang Lei Zhou Xueni Zhang Xiao Bai Jun Zhou 80 47 0 15 Mar 2018
C-LSTM: Enabling Efficient LSTM using Structured Compression Techniques on FPGAs Shuo Wang Zhe Li Caiwen Ding Bo Yuan Yanzhi Wang Qinru Qiu Yun Liang 61 197 0 14 Mar 2018
LCANet: End-to-End Lipreading with Cascaded Attention-CTC Kai Xu Dawei Li N. Cassimatis Xiaolong Wang 86 97 0 13 Mar 2018
Quantization of Fully Convolutional Networks for Accurate Biomedical Image Segmentation Xiaowei Xu Q. Lu Yu Hu Lin Yang X. S. Hu Benlin Liu Yiyu Shi MedIm 84 85 0 13 Mar 2018
FeTa: A DCA Pruning Algorithm with Generalization Error Guarantees Konstantinos Pitas Mike Davies P. Vandergheynst 27 2 0 12 Mar 2018
ShuffleSeg: Real-time Semantic Segmentation Network M. Gamal Mennatullah Siam Moemen Abdel-Razek SSeg 68 60 0 10 Mar 2018
Bit-Tactical: Exploiting Ineffectual Computations in Convolutional Neural Networks: Which, Why, and How A. Delmas Patrick Judd Dylan Malone Stuart Zissis Poulos Mostafa Mahmoud Sayeh Sharify Milos Nikolic Andreas Moshovos 59 24 0 09 Mar 2018
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks Jonathan Frankle Michael Carbin 445 3,497 0 09 Mar 2018
High-Accuracy Low-Precision Training Christopher De Sa Megan Leszczynski Jian Zhang Alana Marzoev Christopher R. Aberger K. Olukotun Christopher Ré 94 109 0 09 Mar 2018
Exponential Discriminative Metric Embedding in Deep Learning Bowen Wu Zhangling Chen Jun Wang Hua-Ming Wu 73 11 0 07 Mar 2018
Learning SMaLL Predictors Vikas Garg O. Dekel Lin Xiao 76 3 0 06 Mar 2018
Personalized Exposure Control Using Adaptive Metering and Reinforcement Learning Huan Yang Baoyuan Wang Noranart Vesdapunt Minyi Guo S. B. Kang 62 22 0 06 Mar 2018
Deep Neural Network Compression with Single and Multiple Level Quantization Yuhui Xu Yongzhuang Wang Aojun Zhou Weiyao Lin H. Xiong MQ 70 115 0 06 Mar 2018
Stochastic Activation Pruning for Robust Adversarial Defense Guneet Singh Dhillon Kamyar Azizzadenesheli Zachary Chase Lipton Jeremy Bernstein Jean Kossaifi Aran Khanna Anima Anandkumar AAML 107 548 0 05 Mar 2018
An Optimal Control Approach to Deep Learning and Applications to Discrete-Weight Neural Networks Qianxiao Li Shuji Hao 103 76 0 04 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches Md. Zahangir Alom T. Taha C. Yakopcic Stefan Westberg P. Sidike Mst Shamima Nasrin B. Van Essen A. Awwal V. Asari VLM 133 883 0 03 Mar 2018
Scalar Quantization as Sparse Least Square Optimization Chen Wang Xiaomei Yang Shaomin Fei Kai Zhou Xiaofeng Gong Miao Du Ruisen Luo MQ 40 3 0 01 Mar 2018
Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning Yichi Zhang Zhijian Ou 64 0 0 01 Mar 2018
Compressing Neural Networks using the Variational Information Bottleneck Bin Dai Chen Zhu David Wipf MLT 72 182 0 28 Feb 2018
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs Xuhao Chen 109 25 0 28 Feb 2018
Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse Coding Dong Liu Ke Sun Zhangyang Wang Runsheng Liu Zhengjun Zha 107 12 0 28 Feb 2018
Recurrent Residual Module for Fast Inference in Videos Bowen Pan Wuwei Lin Xiaolin Fang Chaoqin Huang Bolei Zhou Cewu Lu ObjD 94 34 0 27 Feb 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis Tal Ben-Nun Torsten Hoefler GNN 91 713 0 26 Feb 2018
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence Jinglan Liu Jiaxin Zhang Yukun Ding Xiaowei Xu Meng Jiang Yiyu Shi 103 4 0 26 Feb 2018
Wide Compression: Tensor Ring Nets Wenqi Wang Yifan Sun Brian Eriksson Wenlin Wang Vaneet Aggarwal 69 171 0 25 Feb 2018
Loss-aware Weight Quantization of Deep Networks Lu Hou James T. Kwok MQ 111 127 0 23 Feb 2018
Training wide residual networks for deployment using a single bit for each weight Mark D Mcdonnell MQ 96 71 0 23 Feb 2018