ResearchTrend.AI

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Reaching Data Confidentiality and Model Accountability on the CalTrain
Zhongshu Gu
Hani Jamjoom
D. Su
Heqing Huang
Jialong Zhang
Tengfei Ma
Dimitrios E. Pendarakis
Ian Molloy
FedML
71
16
0
07 Dec 2018
Harmonic Networks: Integrating Spectral Information into CNNs
Matej Ulicny
V. Krylov
Rozenn Dahyot
102
7
0
07 Dec 2018
Wireless Network Intelligence at the Edge
Jihong Park
S. Samarakoon
M. Bennis
Mérouane Debbah
121
521
0
07 Dec 2018
Knockoff Nets: Stealing Functionality of Black-Box Models
Tribhuvanesh Orekondy
Bernt Schiele
Mario Fritz
MLAU
118
539
0
06 Dec 2018
Trained Rank Pruning for Efficient Deep Neural Networks
Yuhui Xu
Yuxi Li
Shuai Zhang
W. Wen
Botao Wang
Y. Qi
Yiran Chen
Weiyao Lin
H. Xiong
AAML
108
42
0
06 Dec 2018
DNQ: Dynamic Network Quantization
Yuhui Xu
Shuai Zhang
Y. Qi
Jiaxian Guo
Weiyao Lin
H. Xiong
MQ
45
6
0
06 Dec 2018
Efficient and Robust Machine Learning for Real-World Systems
Franz Pernkopf
Wolfgang Roth
Matthias Zöhrer
Lukas Pfeifenberger
Günther Schindler
Holger Froening
Sebastian Tschiatschek
Robert Peharz
Matthew Mattina
Zoubin Ghahramani
OOD
39
1
0
05 Dec 2018
Training Competitive Binary Neural Networks from Scratch
Joseph Bethge
Marvin Bornstein
Adrian Loy
Haojin Yang
Christoph Meinel
MQ
92
33
0
05 Dec 2018
ECC: Platform-Independent Energy-Constrained Deep Neural Network Compression via a Bilinear Regression Model
Haichuan Yang
Yuhao Zhu
Ji Liu
111
40
0
05 Dec 2018
Deep Positron: A Deep Neural Network Using the Posit Number System
Zachariah Carmichael
Seyed Hamed Fatemi Langroudi
Char Khazanov
Jeffrey Lillie
J. Gustafson
Dhireesha Kudithipudi
MQ
78
96
0
05 Dec 2018
Pre-Defined Sparse Neural Networks with Hardware Acceleration
Sourya Dey
Kuan-Wen Huang
Peter A. Beerel
K. Chugg
116
25
0
04 Dec 2018
Split learning for health: Distributed deep learning without sharing raw patient data
Praneeth Vepakomma
O. Gupta
Tristan Swedish
Ramesh Raskar
FedML
127
714
0
03 Dec 2018
Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization
Siyuan Qiao
Zhe Lin
Jianming Zhang
Alan Yuille
65
23
0
02 Dec 2018
Accelerate CNN via Recursive Bayesian Pruning
Yuefu Zhou
Ya Zhang
Yanfeng Wang
Qi Tian
BDL
94
58
0
02 Dec 2018
MDU-Net: Multi-scale Densely Connected U-Net for biomedical image segmentation
Jiawei Zhang
Yuzhen Jin
Jilan Xu
Xiaowei Xu
Yanchun Zhang
SSeg, MedIm, AI4CE
131
119
0
02 Dec 2018
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Han Cai
Ligeng Zhu
Song Han
169
1,878
0
02 Dec 2018
DVC: An End-to-end Deep Video Compression Framework
Guo Lu
Wanli Ouyang
Dong Xu
Xiaoyun Zhang
Chunlei Cai
Zhiyong Gao
VGen
147
667
0
30 Nov 2018
Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search
Bichen Wu
Yanghan Wang
Peizhao Zhang
Yuandong Tian
Peter Vajda
Kurt Keutzer
MQ
92
274
0
30 Nov 2018
Projection Convolutional Neural Networks for 1-bit CNNs via Discrete Back Propagation
Jiaxin Gu
Ce Li
Baochang Zhang
Jiawei Han
Xianbin Cao
Jianzhuang Liu
David Doermann
3DV
250
87
0
30 Nov 2018
LEARN Codes: Inventing Low-latency Codes via Recurrent Neural Networks
Yihan Jiang
Hyeji Kim
Himanshu Asnani
Sreeram Kannan
Sewoong Oh
Pramod Viswanath
103
79
0
30 Nov 2018
On Implicit Filter Level Sparsity in Convolutional Neural Networks
Dushyant Mehta
K. Kim
Christian Theobalt
86
28
0
29 Nov 2018
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network
Sachin Mehta
Mohammad Rastegari
Linda G. Shapiro
Hannaneh Hajishirzi
VLM
101
401
0
28 Nov 2018
ShelfNet for Fast Semantic Segmentation
Juntang Zhuang
Tomoki Hayashi
Lin Gu
Shinji Watanabe
SSeg
73
4
0
27 Nov 2018
Calibrating Uncertainties in Object Localization Task
Buu Phan
Rick Salay
Krzysztof Czarnecki
Vahdat Abdelzad
Taylor Denouden
Sachin Vernekar
UQCV
68
22
0
27 Nov 2018
MobiFace: A Lightweight Deep Learning Face Recognition on Mobile Devices
C. Duong
Kha Gia Quach
Ibsa Jalata
Nghia T. Nguyen
Khoa Luu
CVBM, 3DH
99
59
0
27 Nov 2018
Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware
Natan Liss
Chaim Baskin
A. Mendelson
A. Bronstein
Raja Giryes
MQ
41
5
0
27 Nov 2018
Leveraging Filter Correlations for Deep Model Compression
Pravendra Singh
Vinay Kumar Verma
Piyush Rai
Vinay P. Namboodiri
95
65
0
26 Nov 2018
ExpandNets: Linear Over-parameterization to Train Compact Convolutional Networks
Shuxuan Guo
J. Álvarez
Mathieu Salzmann
121
80
0
26 Nov 2018
A Survey of Mobile Computing for the Visually Impaired
Martin Weiss
Margaux Luck
Roger Girgis
C. Pal
Joseph Paul Cohen
59
10
0
25 Nov 2018
Faster CryptoNets: Leveraging Sparsity for Real-World Encrypted Inference
Edward Chou
Josh Beal
Daniel Levy
Serena Yeung
Albert Haque
Li Fei-Fei
76
200
0
25 Nov 2018
TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference in Function as a Service Environments
Abdul Dakkak
Cheng-rong Li
Simon Garcia De Gonzalo
Jinjun Xiong
Wen-mei W. Hwu
50
19
0
24 Nov 2018
Joint Neural Architecture Search and Quantization
Yukang Chen
Gaofeng Meng
Qian Zhang
Xinbang Zhang
Liangchen Song
Shiming Xiang
Chunhong Pan
MQ
87
29
0
23 Nov 2018
Efficient Structured Pruning and Architecture Searching for Group Convolution
Ruizhe Zhao
Wayne Luk
114
16
0
23 Nov 2018
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Kuan-Chieh Wang
Zhijian Liu
Chengyue Wu
Ji Lin
Song Han
MQ
178
886
0
21 Nov 2018
Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs
Yifan Yang
Qijing Huang
Bichen Wu
Tianjun Zhang
Liang Ma
...
Michaela Blott
Sebastiano Fabio Schifano
K. Vissers
J. Wawrzynek
Kurt Keutzer
122
118
0
21 Nov 2018
SuperNeurons: FFT-based Gradient Sparsification in the Distributed Training of Deep Neural Networks
Linnan Wang
Wei Wu
Junyu Zhang
Hang Liu
G. Bosilca
Maurice Herlihy
Rodrigo Fonseca
GNN
55
5
0
21 Nov 2018
Graph-Adaptive Pruning for Efficient Inference of Convolutional Neural Networks
Mengdi Wang
Qing Zhang
Jun Yang
Xiaoyuan Cui
Wei Lin
GNN
55
2
0
21 Nov 2018
WEST: Word Encoded Sequence Transducers
Ehsan Variani
A. Suresh
M. Weintraub
48
9
0
20 Nov 2018
Structured Pruning for Efficient ConvNets via Incremental Regularization
Huan Wang
Qiming Zhang
Yuehai Wang
Haoji Hu
3DPC
105
45
0
20 Nov 2018
TSM: Temporal Shift Module for Efficient Video Understanding
Ji Lin
Chuang Gan
Song Han
162
1,699
0
20 Nov 2018
Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector
Pravendra Singh
Manikandan Ravikiran
Neeraj Matiyali
Vinay P. Namboodiri
82
21
0
20 Nov 2018
Stability Based Filter Pruning for Accelerating Deep CNNs
Pravendra Singh
Vinay Sameer Raja Kadi
N. Verma
Vinay P. Namboodiri
CVBM
74
26
0
20 Nov 2018
DeepZip: Lossless Data Compression using Recurrent Neural Networks
Mohit Goyal
Kedar Tatwawadi
Shubham Chandak
Idoia Ochoa
AI4CE
78
79
0
20 Nov 2018
Self-Referenced Deep Learning
Xu Lan
Xiatian Zhu
S. Gong
138
24
0
19 Nov 2018
Three Dimensional Convolutional Neural Network Pruning with Regularization-Based Method
Yu-xin Zhang
Huan Wang
Yang Luo
Lu Yu
Roland Hu
Hangguan Shan
Tony Q. S. Quek
3DPC
53
11
0
19 Nov 2018
RePr: Improved Training of Convolutional Filters
Aaditya (Adi) Prakash
J. Storer
D. Florêncio
Cha Zhang
VLM, CVBM
106
57
0
18 Nov 2018
PydMobileNet: Improved Version of MobileNets with Pyramid Depthwise Separable Convolution
Van-Thanh Hoang
K. Jo
63
3
0
17 Nov 2018
AclNet: efficient end-to-end audio classification CNN
Jonathan Huang
Juan Jose Alvarado Leanos
61
52
0
16 Nov 2018
Composite Binary Decomposition Networks
You Qiaoben
Ziyi Wang
Jianguo Li
Yinpeng Dong
Yu-Gang Jiang
Jun Zhu
MQ
26
0
0
16 Nov 2018
Detecting The Objects on The Road Using Modular Lightweight Network
Sen Cao
Yazhou Liu
P. Lasang
Shengmei Shen
ObjD
46
7
0
16 Nov 2018