Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
Recurrent Convolution for Compact and Cost-Adjustable Neural Networks: An Empirical Study
Zhendong Zhang
Cheolkon Jung
31
2
0
26 Feb 2019
Saec: Similarity-Aware Embedding Compression in Recommendation Systems
Xiaorui Wu
Hong Xu
Honglin Zhang
Huaming Chen
Jian Wang
50
15
0
26 Feb 2019
Learning Implicitly Recurrent CNNs Through Parameter Sharing
Pedro H. P. Savarese
Michael Maire
96
70
0
26 Feb 2019
STFNets: Learning Sensing Signals from the Time-Frequency Perspective with Short-Time Fourier Neural Networks
Shuochao Yao
Ailing Piao
Wenjun Jiang
Yiran Zhao
Huajie Shao
...
Tianshi Wang
Shaohan Hu
Lu Su
Jiawei Han
Tarek Abdelzaher
AI4TS
77
79
0
21 Feb 2019
Deep Multi-modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges
Di Feng
Christian Haase-Schuetz
Lars Rosenbaum
Heinz Hertlein
Claudius Gläser
Fabian Duffhauss
W. Wiesbeck
Klaus C. J. Dietmayer
3DPC
192
1,014
0
21 Feb 2019
Jointly Sparse Convolutional Neural Networks in Dual Spatial-Winograd Domains
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
46
6
0
21 Feb 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
110
366
0
18 Feb 2019
Mockingbird: Defending Against Deep-Learning-Based Website Fingerprinting Attacks with Adversarial Traces
Mohammad Saidur Rahman
Mohsen Imani
Nate Mathews
M. Wright
AAML
86
81
0
18 Feb 2019
Parameter Efficient Training of Deep Convolutional Neural Networks by Dynamic Sparse Reparameterization
Hesham Mostafa
Xin Wang
129
315
0
15 Feb 2019
Superposition of many models into one
Brian Cheung
A. Terekhov
Yubei Chen
Pulkit Agrawal
Bruno A. Olshausen
MoMe
94
116
0
14 Feb 2019
MultiGrain: a unified image embedding for classes and instances
Maxim Berman
Hervé Jégou
Andrea Vedaldi
Iasonas Kokkinos
Matthijs Douze
79
112
0
14 Feb 2019
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting
Justice Amoh
K. Odame
72
18
0
13 Feb 2019
Structured Bayesian Compression for Deep models in mobile enabled devices for connected healthcare
Sijia Chen
Bin Song
Xiaojiang Du
Nadra Guizani
HAI
MedIm
26
2
0
13 Feb 2019
Fast-SCNN: Fast Semantic Segmentation Network
Rudra P. K. Poudel
Stephan Liwicki
R. Cipolla
SSeg
64
516
0
12 Feb 2019
Effective Network Compression Using Simulation-Guided Iterative Pruning
Dae-Woong Jeong
Jaehun Kim
Youngseok Kim
Tae-Ho Kim
Myungsu Chae
34
0
0
12 Feb 2019
Energy-recycling Blockchain with Proof-of-Deep-Learning
Changhao Chenli
Boyang Li
Yiyu Shi
Taeho Jung
42
57
0
11 Feb 2019
Model Compression with Adversarial Robustness: A Unified Optimization Framework
Shupeng Gui
Haotao Wang
Chen Yu
Haichuan Yang
Zhangyang Wang
Ji Liu
MQ
86
139
0
10 Feb 2019
Improved Knowledge Distillation via Teacher Assistant
Seyed Iman Mirzadeh
Mehrdad Farajtabar
Ang Li
Nir Levine
Akihiro Matsukawa
H. Ghasemzadeh
125
1,092
0
09 Feb 2019
Architecture Compression
A. Ashok
32
0
0
08 Feb 2019
FSNet: Compression of Deep Convolutional Neural Networks by Filter Summary
Yingzhen Yang
Jiahui Yu
Nebojsa Jojic
Jun Huan
Thomas S. Huang
63
17
0
08 Feb 2019
Radial and Directional Posteriors for Bayesian Neural Networks
Changyong Oh
Kamil Adamczewski
Mijung Park
BDL
115
20
0
07 Feb 2019
Compression of Recurrent Neural Networks for Efficient Language Modeling
Artem M. Grachev
D. Ignatov
Andrey V. Savchenko
67
39
0
06 Feb 2019
Are All Layers Created Equal?
Chiyuan Zhang
Samy Bengio
Y. Singer
111
140
0
06 Feb 2019
Multi-Kernel Prediction Networks for Denoising of Burst Images
Talmaj Marinc
Vignesh Srinivasan
Serhan Gül
C. Hellge
Wojciech Samek
112
27
0
05 Feb 2019
Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization
Eldad Meller
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
81
87
0
05 Feb 2019
ROMANet: Fine-Grained Reuse-Driven Off-Chip Memory Access Management and Data Organization for Deep Neural Network Accelerators
Rachmad Vidya Wicaksana Putra
Muhammad Abdullah Hanif
Mohamed Bennai
72
22
0
04 Feb 2019
BottleNet: A Deep Learning Architecture for Intelligent Mobile Cloud Computing Services
Amir Erfan Eshratifar
Amirhossein Esmaili
Massoud Pedram
107
179
0
04 Feb 2019
MICIK: MIning Cross-Layer Inherent Similarity Knowledge for Deep Model Compression
Jie Zhang
Xiaolong Wang
Dawei Li
Shalini Ghosh
Abhishek Kolagunda
Yalin Wang
43
0
0
03 Feb 2019
Self-Binarizing Networks
Fayez Lahoud
R. Achanta
Pablo Márquez-Neila
Sabine Süsstrunk
MQ
75
23
0
02 Feb 2019
Compressing Gradient Optimizers via Count-Sketches
Ryan Spring
Anastasios Kyrillidis
Vijai Mohan
Anshumali Shrivastava
60
36
0
01 Feb 2019
Towards Collaborative Intelligence Friendly Architectures for Deep Learning
Amir Erfan Eshratifar
Amirhossein Esmaili
Massoud Pedram
83
27
0
01 Feb 2019
On Correlation of Features Extracted by Deep Neural Networks
B. Ayinde
T. Inanc
J. Zurada
60
25
0
30 Jan 2019
Tensorized Embedding Layers for Efficient Model Compression
Oleksii Hrinchuk
Valentin Khrulkov
L. Mirvakhabova
Elena Orlova
Ivan Oseledets
93
73
0
30 Jan 2019
Doubly Sparse: Sparse Mixture of Sparse Experts for Efficient Softmax Inference
Shun Liao
Ting Chen
Tian Lin
Denny Zhou
Chong-Jun Wang
MoE
16
2
0
30 Jan 2019
A Simple Method to Reduce Off-chip Memory Accesses on Convolutional Neural Networks
Doyun Kim
Kyoung-Young Kim
Sangsoo Ko
Sanghyuck Ha
28
5
0
28 Jan 2019
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
184
312
0
28 Jan 2019
Information-Theoretic Understanding of Population Risk Improvement with Model Compression
Yuheng Bu
Weihao Gao
Shaofeng Zou
Venugopal V. Veeravalli
MedIm
61
15
0
27 Jan 2019
PruneTrain: Fast Neural Network Training by Dynamic Sparse Model Reconfiguration
Sangkug Lym
Esha Choukse
Siavash Zangeneh
W. Wen
Sujay Sanghavi
M. Erez
CVBM
85
88
0
26 Jan 2019
DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression
Sian Jin
Sheng Di
Xin Liang
Jiannan Tian
Dingwen Tao
Franck Cappello
AI4CE
76
61
0
26 Jan 2019
Really should we pruning after model be totally trained? Pruning based on a small amount of training
Li Yue
Zhao Weibin
Shang-Te Lin
VLM
29
5
0
24 Jan 2019
QGAN: Quantized Generative Adversarial Networks
Peiqi Wang
Dongsheng Wang
Yu Ji
Xinfeng Xie
Haoxuan Song
XuXin Liu
Yongqiang Lyu
Yuan Xie
GAN
MQ
53
32
0
24 Jan 2019
Backprop with Approximate Activations for Memory-efficient Network Training
Ayan Chakrabarti
Benjamin Moseley
70
38
0
23 Jan 2019
Towards Compact ConvNets via Structure-Sparsity Regularized Filter Pruning
Shaohui Lin
Rongrong Ji
Yuchao Li
Cheng Deng
Xuelong Li
114
69
0
23 Jan 2019
Partition Pruning: Parallelization-Aware Pruning for Deep Neural Networks
Sina Shahhosseini
Ahmad Albaqsami
Masoomeh Jasemi
N. Bagherzadeh
22
8
0
21 Jan 2019
Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going
Erwei Wang
James J. Davis
Ruizhe Zhao
Ho-Cheung Ng
Xinyu Niu
Wayne Luk
P. Cheung
George A. Constantinides
88
59
0
21 Jan 2019
Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks
Charbel Sakr
Naigang Wang
Chia-Yu Chen
Jungwook Choi
A. Agrawal
Naresh R Shanbhag
K. Gopalakrishnan
MQ
76
34
0
19 Jan 2019
Feature Pyramid and Hierarchical Boosting Network for Pavement Crack Detection
Fan Yang
Lei Zhang
Sijia Yu
Danil Prokhorov
Xue Mei
Haibin Ling
98
730
0
18 Jan 2019
A Survey of the Recent Architectures of Deep Convolutional Neural Networks
Asifullah Khan
A. Sohail
Umme Zahoora
Aqsa Saeed Qureshi
OOD
202
2,325
0
17 Jan 2019
CodeX: Bit-Flexible Encoding for Streaming-based FPGA Acceleration of DNNs
Mohammad Samragh
Mojan Javaheripi
F. Koushanfar
54
11
0
17 Jan 2019
Light-weighted Saliency Detection with Distinctively Lower Memory Cost and Model Size
Shanghua Xiao
54
1
0
15 Jan 2019