Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown

Title
Recurrent Convolution for Compact and Cost-Adjustable Neural Networks: An Empirical Study Zhendong Zhang Cheolkon Jung 31 2 0 26 Feb 2019
Saec: Similarity-Aware Embedding Compression in Recommendation Systems Xiaorui Wu Hong Xu Honglin Zhang Huaming Chen Jian Wang 50 15 0 26 Feb 2019
Learning Implicitly Recurrent CNNs Through Parameter Sharing Pedro H. P. Savarese Michael Maire 96 70 0 26 Feb 2019
STFNets: Learning Sensing Signals from the Time-Frequency Perspective with Short-Time Fourier Neural Networks Shuochao Yao Ailing Piao Wenjun Jiang Yiran Zhao Huajie Shao ... Tianshi Wang Shaohan Hu Lu Su Jiawei Han Tarek Abdelzaher AI4TS 77 79 0 21 Feb 2019
Deep Multi-modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges Di Feng Christian Haase-Schuetz Lars Rosenbaum Heinz Hertlein Claudius Gläser Fabian Duffhauss W. Wiesbeck Klaus C. J. Dietmayer 3DPC 192 1,014 0 21 Feb 2019
Jointly Sparse Convolutional Neural Networks in Dual Spatial-Winograd Domains Yoojin Choi Mostafa El-Khamy Jungwon Lee 46 6 0 21 Feb 2019
Low-bit Quantization of Neural Networks for Efficient Inference Yoni Choukroun Eli Kravchik Fan Yang P. Kisilev MQ 110 366 0 18 Feb 2019
Mockingbird: Defending Against Deep-Learning-Based Website Fingerprinting Attacks with Adversarial Traces Mohammad Saidur Rahman Mohsen Imani Nate Mathews M. Wright AAML 86 81 0 18 Feb 2019
Parameter Efficient Training of Deep Convolutional Neural Networks by Dynamic Sparse Reparameterization Hesham Mostafa Xin Wang 129 315 0 15 Feb 2019
Superposition of many models into one Brian Cheung A. Terekhov Yubei Chen Pulkit Agrawal Bruno A. Olshausen MoMe 94 116 0 14 Feb 2019
MultiGrain: a unified image embedding for classes and instances Maxim Berman Hervé Jégou Andrea Vedaldi Iasonas Kokkinos Matthijs Douze 79 112 0 14 Feb 2019
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting Justice Amoh K. Odame 72 18 0 13 Feb 2019
Structured Bayesian Compression for Deep models in mobile enabled devices for connected healthcare Sijia Chen Bin Song Xiaojiang Du Nadra Guizani HAI MedIm 26 2 0 13 Feb 2019
Fast-SCNN: Fast Semantic Segmentation Network Rudra P. K. Poudel Stephan Liwicki R. Cipolla SSeg 64 516 0 12 Feb 2019
Effective Network Compression Using Simulation-Guided Iterative Pruning Dae-Woong Jeong Jaehun Kim Youngseok Kim Tae-Ho Kim Myungsu Chae 34 0 0 12 Feb 2019
Energy-recycling Blockchain with Proof-of-Deep-Learning Changhao Chenli Boyang Li Yiyu Shi Taeho Jung 42 57 0 11 Feb 2019
Model Compression with Adversarial Robustness: A Unified Optimization Framework Shupeng Gui Haotao Wang Chen Yu Haichuan Yang Zhangyang Wang Ji Liu MQ 86 139 0 10 Feb 2019
Improved Knowledge Distillation via Teacher Assistant Seyed Iman Mirzadeh Mehrdad Farajtabar Ang Li Nir Levine Akihiro Matsukawa H. Ghasemzadeh 125 1,092 0 09 Feb 2019
Architecture Compression A. Ashok 32 0 0 08 Feb 2019
FSNet: Compression of Deep Convolutional Neural Networks by Filter Summary Yingzhen Yang Jiahui Yu Nebojsa Jojic Jun Huan Thomas S. Huang 63 17 0 08 Feb 2019
Radial and Directional Posteriors for Bayesian Neural Networks Changyong Oh Kamil Adamczewski Mijung Park BDL 115 20 0 07 Feb 2019
Compression of Recurrent Neural Networks for Efficient Language Modeling Artem M. Grachev D. Ignatov Andrey V. Savchenko 67 39 0 06 Feb 2019
Are All Layers Created Equal? Chiyuan Zhang Samy Bengio Y. Singer 111 140 0 06 Feb 2019
Multi-Kernel Prediction Networks for Denoising of Burst Images Talmaj Marinc Vignesh Srinivasan Serhan Gül C. Hellge Wojciech Samek 112 27 0 05 Feb 2019
Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization Eldad Meller Alexander Finkelstein Uri Almog Mark Grobman MQ 81 87 0 05 Feb 2019
ROMANet: Fine-Grained Reuse-Driven Off-Chip Memory Access Management and Data Organization for Deep Neural Network Accelerators Rachmad Vidya Wicaksana Putra Muhammad Abdullah Hanif Mohamed Bennai 72 22 0 04 Feb 2019
BottleNet: A Deep Learning Architecture for Intelligent Mobile Cloud Computing Services Amir Erfan Eshratifar Amirhossein Esmaili Massoud Pedram 107 179 0 04 Feb 2019
MICIK: MIning Cross-Layer Inherent Similarity Knowledge for Deep Model Compression Jie Zhang Xiaolong Wang Dawei Li Shalini Ghosh Abhishek Kolagunda Yalin Wang 43 0 0 03 Feb 2019
Self-Binarizing Networks Fayez Lahoud R. Achanta Pablo Márquez-Neila Sabine Süsstrunk MQ 75 23 0 02 Feb 2019
Compressing Gradient Optimizers via Count-Sketches Ryan Spring Anastasios Kyrillidis Vijai Mohan Anshumali Shrivastava 60 36 0 01 Feb 2019
Towards Collaborative Intelligence Friendly Architectures for Deep Learning Amir Erfan Eshratifar Amirhossein Esmaili Massoud Pedram 83 27 0 01 Feb 2019
On Correlation of Features Extracted by Deep Neural Networks B. Ayinde T. Inanc J. Zurada 60 25 0 30 Jan 2019
Tensorized Embedding Layers for Efficient Model Compression Oleksii Hrinchuk Valentin Khrulkov L. Mirvakhabova Elena Orlova Ivan Oseledets 93 73 0 30 Jan 2019
Doubly Sparse: Sparse Mixture of Sparse Experts for Efficient Softmax Inference Shun Liao Ting Chen Tian Lin Denny Zhou Chong-Jun Wang MoE 16 2 0 30 Jan 2019
A Simple Method to Reduce Off-chip Memory Accesses on Convolutional Neural Networks Doyun Kim Kyoung-Young Kim Sangsoo Ko Sanghyuck Ha 28 5 0 28 Jan 2019
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting Ritchie Zhao Yuwei Hu Jordan Dotzel Christopher De Sa Zhiru Zhang OODD MQ 184 312 0 28 Jan 2019
Information-Theoretic Understanding of Population Risk Improvement with Model Compression Yuheng Bu Weihao Gao Shaofeng Zou Venugopal V. Veeravalli MedIm 61 15 0 27 Jan 2019
PruneTrain: Fast Neural Network Training by Dynamic Sparse Model Reconfiguration Sangkug Lym Esha Choukse Siavash Zangeneh W. Wen Sujay Sanghavi M. Erez CVBM 85 88 0 26 Jan 2019
DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression Sian Jin Sheng Di Xin Liang Jiannan Tian Dingwen Tao Franck Cappello AI4CE 76 61 0 26 Jan 2019
Really should we pruning after model be totally trained? Pruning based on a small amount of training Li Yue Zhao Weibin Shang-Te Lin VLM 29 5 0 24 Jan 2019
QGAN: Quantized Generative Adversarial Networks Peiqi Wang Dongsheng Wang Yu Ji Xinfeng Xie Haoxuan Song XuXin Liu Yongqiang Lyu Yuan Xie GAN MQ 53 32 0 24 Jan 2019
Backprop with Approximate Activations for Memory-efficient Network Training Ayan Chakrabarti Benjamin Moseley 70 38 0 23 Jan 2019
Towards Compact ConvNets via Structure-Sparsity Regularized Filter Pruning Shaohui Lin Rongrong Ji Yuchao Li Cheng Deng Xuelong Li 114 69 0 23 Jan 2019
Partition Pruning: Parallelization-Aware Pruning for Deep Neural Networks Sina Shahhosseini Ahmad Albaqsami Masoomeh Jasemi N. Bagherzadeh 22 8 0 21 Jan 2019
Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going Erwei Wang James J. Davis Ruizhe Zhao Ho-Cheung Ng Xinyu Niu Wayne Luk P. Cheung George A. Constantinides 88 59 0 21 Jan 2019
Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks Charbel Sakr Naigang Wang Chia-Yu Chen Jungwook Choi A. Agrawal Naresh R Shanbhag K. Gopalakrishnan MQ 76 34 0 19 Jan 2019
Feature Pyramid and Hierarchical Boosting Network for Pavement Crack Detection Fan Yang Lei Zhang Sijia Yu Danil Prokhorov Xue Mei Haibin Ling 98 730 0 18 Jan 2019
A Survey of the Recent Architectures of Deep Convolutional Neural Networks Asifullah Khan A. Sohail Umme Zahoora Aqsa Saeed Qureshi OOD 202 2,325 0 17 Jan 2019
CodeX: Bit-Flexible Encoding for Streaming-based FPGA Acceleration of DNNs Mohammad Samragh Mojan Javaheripi F. Koushanfar 54 11 0 17 Jan 2019
Light-weighted Saliency Detection with Distinctively Lower Memory Cost and Model Size Shanghua Xiao 54 1 0 15 Jan 2019