v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown

Title
Focus: Querying Large Video Datasets with Low Latency and Low Cost Kevin Hsieh Ganesh Ananthanarayanan P. Bodík P. Bahl Matthai Philipose Phillip B. Gibbons O. Mutlu 98 280 0 10 Jan 2018
Learning $3$ D-FilterMap for Deep Convolutional Neural Networks Yingzhen Yang Jianchao Yang N. Xu Wei Han 3DV MQ 30 1 0 05 Jan 2018
Overcoming catastrophic forgetting with hard attention to the task Joan Serrà Dídac Surís M. Miron Alexandros Karatzoglou CLL 201 1,087 0 04 Jan 2018
Learning a Wavelet-like Auto-Encoder to Accelerate Deep Neural Networks Tianshui Chen Liang Lin W. Zuo Xiaonan Luo Lei Zhang 64 56 0 20 Dec 2017
DeepPicar: A Low-cost Deep Neural Network-based Autonomous Car Michael Bechtel Elise McEllhiney Minje Kim H. Yun 91 103 0 19 Dec 2017
Squeezed Convolutional Variational AutoEncoder for Unsupervised Anomaly Detection in Edge Device Industrial Internet of Things Dohyung Kim Hyochang Yang Minki Chung Sungzoon Cho DRL 57 32 0 18 Dec 2017
Automated flow for compressing convolution neural networks for efficient edge-computation with FPGA F. Shafiq Takato Yamada Antonio T. Vilchez Sakyasingha Dasgupta MQ 46 3 0 18 Dec 2017
clcNet: Improving the Efficiency of Convolutional Neural Network using Channel Local Convolutions Dong-Qing Zhang 60 10 0 17 Dec 2017
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference Benoit Jacob S. Kligys Bo Chen Menglong Zhu Matthew Tang Andrew G. Howard Hartwig Adam Dmitry Kalenichenko MQ 294 3,159 0 15 Dec 2017
BT-Nets: Simplifying Deep Neural Networks via Block Term Decomposition Guangxi Li Jinmian Ye Haiqin Yang Di Chen Shuicheng Yan Zenglin Xu 68 11 0 15 Dec 2017
FFT-Based Deep Learning Deployment in Embedded Systems Sheng Lin Ning Liu M. Nazemi Hongjia Li Caiwen Ding Yanzhi Wang Massoud Pedram 62 54 0 13 Dec 2017
NestedNet: Learning Nested Sparse Structures in Deep Neural Networks Eunwoo Kim Chanho Ahn Songhwai Oh 52 2 0 11 Dec 2017
AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training Chia-Yu Chen Jungwook Choi D. Brand A. Agrawal Wei Zhang K. Gopalakrishnan ODL 81 174 0 07 Dec 2017
Automated Pruning for Deep Neural Network Compression Franco Manessi A. Rozza Simone Bianco Paolo Napoletano Raimondo Schettini 94 57 0 05 Dec 2017
Learning Sparse Neural Networks through $L_0$ Regularization Christos Louizos Max Welling Diederik P. Kingma 561 1,150 0 04 Dec 2017
Adaptive Quantization for Deep Neural Network Yiren Zhou Seyed-Mohsen Moosavi-Dezfooli Ngai-Man Cheung P. Frossard MQ 102 185 0 04 Dec 2017
Homomorphic Parameter Compression for Distributed Deep Learning Training Jaehee Jang Byunggook Na Sungroh Yoon FedML 57 1 0 28 Nov 2017
WSNet: Compact and Efficient Networks Through Weight Sampling Xiaojie Jin Yingzhen Yang N. Xu Jianchao Yang Nebojsa Jojic Jiashi Feng Shuicheng Yan 49 2 0 28 Nov 2017
Slim Embedding Layers for Recurrent Neural Language Models Zhongliang Li Raymond Kulhanek Shaojun Wang Yunxin Zhao Shuang Wu KELM 76 23 0 27 Nov 2017
SkipNet: Learning Dynamic Routing in Convolutional Networks Xin Wang Feng Yu Zi-Yi Dou Trevor Darrell Joseph E. Gonzalez 154 640 0 26 Nov 2017
CondenseNet: An Efficient DenseNet using Learned Group Convolutions Gao Huang Shichen Liu Laurens van der Maaten Kilian Q. Weinberger 146 800 0 25 Nov 2017
Deep Expander Networks: Efficient Deep Networks from Graph Theory Ameya Prabhu G. Varma A. Namboodiri GNN 142 72 0 23 Nov 2017
BlockDrop: Dynamic Inference Paths in Residual Networks Zuxuan Wu Tushar Nagarajan Abhishek Kumar Steven J. Rennie L. Davis Kristen Grauman Rogerio Feris 117 470 0 22 Nov 2017
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions Bichen Wu Alvin Wan Xiangyu Yue Peter H. Jin Sicheng Zhao Noah Golmant A. Gholaminejad Joseph E. Gonzalez Kurt Keutzer 3DPC 121 365 0 22 Nov 2017
Evaluating Robustness of Neural Networks with Mixed Integer Programming Vincent Tjeng Kai Y. Xiao Russ Tedrake AAML 127 117 0 20 Nov 2017
Interleaver Design for Deep Neural Networks Sourya Dey Peter A. Beerel K. Chugg 36 6 0 18 Nov 2017
Training Simplification and Model Simplification for Deep Learning: A Minimal Effort Back Propagation Method Xu Sun Xuancheng Ren Shuming Ma Bingzhen Wei Wei Li Jingjing Xu Houfeng Wang Yi Zhang 58 24 0 17 Nov 2017
Improved Bayesian Compression Marco Federici Karen Ullrich Max Welling UQCV BDL 79 19 0 17 Nov 2017
Mobile Video Object Detection with Temporally-Aware Feature Maps Mason Liu Menglong Zhu ObjD 89 197 0 17 Nov 2017
NISP: Pruning Networks using Neuron Importance Score Propagation Ruichi Yu Ang Li Chun-Fu Chen Jui-Hsin Lai Vlad I. Morariu Xintong Han M. Gao Ching-Yung Lin L. Davis 78 801 0 16 Nov 2017
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy Asit K. Mishra Debbie Marr FedML 93 331 0 15 Nov 2017
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler Yu Ji Youhui Zhang Wenguang Chen Yuan Xie 102 56 0 15 Nov 2017
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning Arun Mallya Svetlana Lazebnik CLL 166 1,313 0 15 Nov 2017
Deep Rewiring: Training very sparse deep networks G. Bellec David Kappel Wolfgang Maass Robert Legenstein BDL 210 281 0 14 Nov 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation Moritz B. Milde Daniel Neil Alessandro Aimar T. Delbruck Giacomo Indiveri MQ 78 10 0 13 Nov 2017
Weightless: Lossy Weight Encoding For Deep Neural Network Compression Brandon Reagen Udit Gupta Bob Adolf Michael Mitzenmacher Alexander M. Rush Gu-Yeon Wei David Brooks 63 38 0 13 Nov 2017
CT-SRCNN: Cascade Trained and Trimmed Deep Convolutional Neural Networks for Image Super Resolution Haoyu Ren Mostafa El-Khamy Jungwon Lee SupR 59 27 0 11 Nov 2017
Learning K-way D-dimensional Discrete Code For Compact Embedding Representations Ting Chen Martin Renqiang Min Yizhou Sun 80 10 0 08 Nov 2017
Revealing structure components of the retina by deep learning networks Qianyu Yan Zhaofei Yu Feng Chen Jian K. Liu FAtt 36 7 0 08 Nov 2017
Block-Sparse Recurrent Neural Networks Sharan Narang Eric Undersander G. Diamos 69 139 0 08 Nov 2017
Compression-aware Training of Deep Networks J. Álvarez Mathieu Salzmann 82 172 0 07 Nov 2017
Moonshine: Distilling with Cheap Convolutions Elliot J. Crowley Gavia Gray Amos Storkey 89 121 0 07 Nov 2017
Interpreting Convolutional Neural Networks Through Compression R. Abbasi-Asl Bin Yu FAtt 49 21 0 07 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks Sanchari Sen Shubham Jain Swagath Venkataramani A. Raghunathan 62 30 0 07 Nov 2017
Characterizing Sparse Connectivity Patterns in Neural Networks Sourya Dey Kuan-Wen Huang Peter A. Beerel K. Chugg 63 11 0 06 Nov 2017
Neural Speed Reading via Skim-RNN Minjoon Seo Sewon Min Ali Farhadi Hannaneh Hajishirzi 101 79 0 06 Nov 2017
Accelerating Training of Deep Neural Networks via Sparse Edge Processing Sourya Dey Yinan Shao K. Chugg Peter A. Beerel 70 16 0 03 Nov 2017
ReBNet: Residual Binarized Neural Network M. Ghasemzadeh Mohammad Samragh F. Koushanfar MQ 47 4 0 03 Nov 2017
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting Input and Output Sparsity Jingyang Zhu Jingbo Jiang Xizi Chen Chi-Ying Tsui 65 36 0 03 Nov 2017
Compressing Word Embeddings via Deep Compositional Code Learning Raphael Shu Hideki Nakayama 106 129 0 03 Nov 2017