v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown

Title
Relaxed Quantization for Discretized Neural Networks Christos Louizos M. Reisser Tijmen Blankevoort E. Gavves Max Welling MQ 110 132 0 03 Oct 2018
Learning with Random Learning Rates Léonard Blier Pierre Wolinski Yann Ollivier OOD 109 20 0 02 Oct 2018
Training compact deep learning models for video classification using circulant matrices Alexandre Araujo Benjamin Négrevergne Y. Chevaleyre Jamal Atif 75 14 0 02 Oct 2018
Target Aware Network Adaptation for Efficient Representation Learning Yang Zhong Vladimir Li R. Okada A. Maki 53 6 0 02 Oct 2018
LIT: Block-wise Intermediate Representation Training for Model Compression Animesh Koratana Daniel Kang Peter Bailis Matei A. Zaharia 50 12 0 02 Oct 2018
Simultaneously Optimizing Weight and Quantizer of Ternary Neural Network using Truncated Gaussian Approximation Zhezhi He Deliang Fan MQ 73 67 0 02 Oct 2018
Extended Bit-Plane Compression for Convolutional Neural Network Accelerators Lukas Cavigelli Luca Benini 66 20 0 01 Oct 2018
ProxQuant: Quantized Neural Networks via Proximal Operators Yu Bai Yu Wang Edo Liberty MQ 116 118 0 01 Oct 2018
Dynamic Sparse Graph for Efficient Deep Learning Liu Liu Lei Deng Xing Hu Maohua Zhu Guoqi Li Yufei Ding Yuan Xie GNN 90 42 0 01 Oct 2018
Benchmark Analysis of Representative Deep Neural Network Architectures Simone Bianco Rémi Cadène Luigi Celona Paolo Napoletano BDL 76 679 0 01 Oct 2018
Procedural Noise Adversarial Examples for Black-Box Attacks on Deep Convolutional Networks Kenneth T. Co Luis Muñoz-González Sixte de Maupeou Emil C. Lupu AAML 85 67 0 30 Sep 2018
Minimal Random Code Learning: Getting Bits Back from Compressed Model Parameters Marton Havasi Robert Peharz José Miguel Hernández-Lobato 82 82 0 30 Sep 2018
Mini-batch Serialization: CNN Training with Inter-layer Data Reuse Sangkug Lym Armand Behroozi W. Wen Ge Li Yongkee Kwon M. Erez 53 26 0 30 Sep 2018
To compress or not to compress: Understanding the Interactions between Adversarial Attacks and Neural Network Compression Yiren Zhao Ilia Shumailov Robert D. Mullins Ross J. Anderson AAML 82 43 0 29 Sep 2018
Knowledge-guided Semantic Computing Network Guangming Shi Zhongqiang Zhang Dahua Gao Xuemei Xie Yihao Feng Xinrui Ma Danhua Liu 44 10 0 29 Sep 2018
Throughput Optimizations for FPGA-based Deep Neural Network Inference Thorbjörn Posewsky Daniel Ziener 48 25 0 28 Sep 2018
Intelligence Beyond the Edge: Inference on Intermittent Embedded Systems Graham Gobieski Nathan Beckmann Brandon Lucia 74 207 0 28 Sep 2018
Deep learning systems as complex networks Alberto Testolin Michele Piccolini S. Suweis AI4CE BDL GNN 40 28 0 28 Sep 2018
Learning to Train a Binary Neural Network Joseph Bethge Haojin Yang Christian Bartz Christoph Meinel MQ 55 12 0 27 Sep 2018
Adaptive Pruning of Neural Language Models for Mobile Devices Raphael Tang Jimmy J. Lin 31 7 0 27 Sep 2018
Object Detection from Scratch with Deep Supervision Zhiqiang Shen Zhuang Liu Jianguo Li Yu-Gang Jiang Yurong Chen Xiangyang Xue ObjD 103 79 0 25 Sep 2018
No Multiplication? No Floating Point? No Problem! Training Networks for Efficient Inference S. Baluja David Marwood Michele Covell Nick Johnston MQ 40 8 0 24 Sep 2018
Shift-based Primitives for Efficient Convolutional Neural Networks Huasong Zhong Xianggen Liu Yihui He Yuchun Ma 77 20 0 22 Sep 2018
FastDeepIoT: Towards Understanding and Optimizing Neural Network Execution Time on Mobile and Embedded Devices Shuochao Yao Yiran Zhao Huajie Shao Shengzhong Liu Dongxin Liu Lu Su Tarek Abdelzaher HAI 77 133 0 19 Sep 2018
MBS: Macroblock Scaling for CNN Model Reduction Yu-Hsun Lin Chun-Nan Chou Edward Y. Chang MQ 22 4 0 18 Sep 2018
Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing Zhuo Chen Weisi Lin Shiqi Wang Ling-yu Duan Alex C. Kot 88 17 0 17 Sep 2018
FermiNets: Learning generative machines to generate efficient neural networks via generative synthesis A. Wong M. Shafiee Brendan Chwyl Francis Li 61 64 0 17 Sep 2018
Memristor-based Deep Convolution Neural Network: A Case Study Fan Zhang Miao Hu 19 6 0 14 Sep 2018
Hardware-Aware Machine Learning: Modeling and Optimization Diana Marculescu Dimitrios Stamoulis E. Cai 67 45 0 14 Sep 2018
Neural Network Topologies for Sparse Training Ryan A. Robinett J. Kepner 37 7 0 14 Sep 2018
Discretely Relaxing Continuous Variables for tractable Variational Inference Trefor W. Evans P. Nair BDL 59 0 0 12 Sep 2018
Deep Asymmetric Networks with a Set of Node-wise Variant Activation Functions Jinhyeok Jang Hyunjoong Cho Jaehong Kim Jaeyeon Lee Seungjoon Yang 29 2 0 11 Sep 2018
Deep Learning Towards Mobile Applications Ji Wang Bokai Cao Philip S. Yu Lichao Sun Weidong Bao Xiaomin Zhu HAI 92 99 0 10 Sep 2018
Not Just Privacy: Improving Performance of Private Deep Learning in Mobile Cloud Ji Wang Jianguo Zhang Weidong Bao Xiaomin Zhu Bokai Cao Philip S. Yu 76 196 0 10 Sep 2018
Probabilistic Binary Neural Networks Jorn W. T. Peters Max Welling BDL UQCV MQ 86 52 0 10 Sep 2018
Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks Shivang Agarwal Jean Ogier du Terrail F. Jurie ObjD 158 126 0 10 Sep 2018
Fast and Efficient Information Transmission with Burst Spikes in Deep Spiking Neural Networks Seongsik Park Seijoon Kim Hyeokjun Choe Sungroh Yoon 81 96 0 10 Sep 2018
Training for Faster Adversarial Robustness Verification via Inducing ReLU Stability Kai Y. Xiao Vincent Tjeng Nur Muhammad (Mahi) Shafiullah Aleksander Madry AAML OOD 76 202 0 09 Sep 2018
2PFPCE: Two-Phase Filter Pruning Based on Conditional Entropy Chuhan Min Aosen Wang Yiran Chen Wenyao Xu Xin Chen 83 41 0 06 Sep 2018
Deep Learning for Generic Object Detection: A Survey Li Liu Wanli Ouyang Xiaogang Wang Paul Fieguth Jie Chen Xinwang Liu M. Pietikäinen ObjD VLM OOD 289 2,469 0 06 Sep 2018
Pack and Detect: Fast Object Detection in Videos Using Region-of-Interest Packing Athindran Ramesh Kumar Balaraman Ravindran A. Raghunathan ObjD 106 13 0 05 Sep 2018
ChannelNets: Compact and Efficient Convolutional Neural Networks via Channel-Wise Convolutions Hongyang Gao Zhengyang Wang Shuiwang Ji 58 70 0 05 Sep 2018
DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing Xiaofei Xie Lei Ma Felix Juefei Xu Hongxu Chen Minhui Xue Yue Liu Yang Liu Jianjun Zhao Jianxiong Yin Simon See 116 41 0 04 Sep 2018
Learning Sparse Low-Precision Neural Networks With Learnable Regularization Yoojin Choi Mostafa El-Khamy Jungwon Lee MQ 73 31 0 01 Sep 2018
An Adaptive Locally Connected Neuron Model: Focusing Neuron F. Boray Tek 40 6 0 31 Aug 2018
Fixed-Point Convolutional Neural Network for Real-Time Video Processing in FPGA R. Solovyev A. Kustov D. Telpukhov V. S. Ruhlov Alexandr A Kalinin MQ 123 41 0 29 Aug 2018
Sparsity in Deep Neural Networks - An Empirical Investigation with TensorQuant D. Loroch Franz-Josef Pfreundt Norbert Wehn J. Keuper 50 5 0 27 Aug 2018
Predefined Sparseness in Recurrent Sequence Models T. Demeester Johannes Deleu Fréderic Godin Chris Develder 33 3 0 27 Aug 2018
Spectral Pruning: Compressing Deep Neural Networks via Spectral Analysis and its Generalization Error Taiji Suzuki Hiroshi Abe Tomoya Murata Shingo Horiuchi Kotaro Ito Tokuma Wachi So Hirai Masatoshi Yukishima Tomoaki Nishimura MLT 65 10 0 26 Aug 2018
DeepTracker: Visualizing the Training Process of Convolutional Neural Networks Dongyu Liu Weiwei Cui Kai Jin Yuxiao Guo Huamin Qu HAI 56 35 0 26 Aug 2018