ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
STEERAGE: Synthesis of Neural Networks Using Architecture Search and
  Grow-and-Prune Methods
STEERAGE: Synthesis of Neural Networks Using Architecture Search and Grow-and-Prune Methods
Shayan Hassantabar
Xiaoliang Dai
N. Jha
3DV
65
17
0
12 Dec 2019
An Improving Framework of regularization for Network Compression
An Improving Framework of regularization for Network Compression
E. Zhenqian
Weiguo Gao
AI4CE
43
0
0
11 Dec 2019
Advances and Open Problems in Federated Learning
Advances and Open Problems in Federated Learning
Peter Kairouz
H. B. McMahan
Brendan Avent
A. Bellet
M. Bennis
...
Zheng Xu
Qiang Yang
Felix X. Yu
Han Yu
Sen Zhao
FedMLAI4CE
308
6,355
0
10 Dec 2019
Magnitude and Uncertainty Pruning Criterion for Neural Networks
Magnitude and Uncertainty Pruning Criterion for Neural Networks
V. Ko
Stefan Oehmcke
Fabian Gieseke
35
3
0
10 Dec 2019
Frivolous Units: Wider Networks Are Not Really That Wide
Frivolous Units: Wider Networks Are Not Really That Wide
Stephen Casper
Xavier Boix
Vanessa D’Amario
Ling Guo
Martin Schrimpf
Kasper Vinken
Gabriel Kreiman
73
19
0
10 Dec 2019
Winning the Lottery with Continuous Sparsification
Winning the Lottery with Continuous Sparsification
Pedro H. P. Savarese
Hugo Silva
Michael Maire
116
137
0
10 Dec 2019
Explicit Group Sparse Projection with Applications to Deep Learning and
  NMF
Explicit Group Sparse Projection with Applications to Deep Learning and NMF
Riyasat Ohib
Nicolas Gillis
Niccolò Dalmasso
Sameena Shah
Vamsi K. Potluru
Sergey Plis
76
9
0
09 Dec 2019
Compressing 3DCNNs Based on Tensor Train Decomposition
Compressing 3DCNNs Based on Tensor Train Decomposition
Dingheng Wang
Guangshe Zhao
Guoqi Li
Lei Deng
Yang Wu
67
6
0
08 Dec 2019
Dynamic Convolution: Attention over Convolution Kernels
Dynamic Convolution: Attention over Convolution Kernels
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Dongdong Chen
Lu Yuan
Zicheng Liu
155
904
0
07 Dec 2019
Dynamic Convolutions: Exploiting Spatial Sparsity for Faster Inference
Dynamic Convolutions: Exploiting Spatial Sparsity for Faster Inference
Thomas Verelst
Tinne Tuytelaars
84
153
0
06 Dec 2019
Sampling-Free Learning of Bayesian Quantized Neural Networks
Sampling-Free Learning of Bayesian Quantized Neural Networks
Jiahao Su
Milan Cvitkovic
Furong Huang
BDLMQUQCV
52
7
0
06 Dec 2019
Deep Model Compression Via Two-Stage Deep Reinforcement Learning
Deep Model Compression Via Two-Stage Deep Reinforcement Learning
Huixin Zhan
Wei-Ming Lin
Yongcan Cao
57
12
0
04 Dec 2019
RTN: Reparameterized Ternary Network
RTN: Reparameterized Ternary Network
Yuhang Li
Xin Dong
Shanghang Zhang
Haoli Bai
Yuanpeng Chen
Wei Wang
MQ
72
29
0
04 Dec 2019
Learning to synthesise the ageing brain without longitudinal data
Learning to synthesise the ageing brain without longitudinal data
Tian Xia
A. Chartsias
Chengjia Wang
Sotirios A. Tsaftaris
OODMedIm
109
54
0
04 Dec 2019
Fast Intent Classification for Spoken Language Understanding
Fast Intent Classification for Spoken Language Understanding
Akshit Tyagi
V. Sharma
Rahul Gupta
L. Samson
Nan Zhuang
Zihang Wang
Bill Campbell
48
8
0
03 Dec 2019
BERT for Large-scale Video Segment Classification with Test-time
  Augmentation
BERT for Large-scale Video Segment Classification with Test-time Augmentation
Tianqi Liu
Qizhan Shao
78
4
0
02 Dec 2019
FT-ClipAct: Resilience Analysis of Deep Neural Networks and Improving
  their Fault Tolerance using Clipped Activation
FT-ClipAct: Resilience Analysis of Deep Neural Networks and Improving their Fault Tolerance using Clipped Activation
L. Hoang
Muhammad Abdullah Hanif
Mohamed Bennai
AI4CE
66
117
0
02 Dec 2019
ReD-CaNe: A Systematic Methodology for Resilience Analysis and Design of
  Capsule Networks under Approximations
ReD-CaNe: A Systematic Methodology for Resilience Analysis and Design of Capsule Networks under Approximations
Alberto Marchisio
Vojtěch Mrázek
Muhammad Abdullah Hanif
Mohamed Bennai
AAML
61
15
0
02 Dec 2019
Diversifying Inference Path Selection: Moving-Mobile-Network for
  Landmark Recognition
Diversifying Inference Path Selection: Moving-Mobile-Network for Landmark Recognition
Biao Qian
Yang Wang
Zhao Zhang
Richang Hong
Meng Wang
Ling Shao
48
12
0
01 Dec 2019
Online Knowledge Distillation with Diverse Peers
Online Knowledge Distillation with Diverse Peers
Defang Chen
Jian-Ping Mei
Can Wang
Yan Feng
Chun-Yen Chen
FedML
98
305
0
01 Dec 2019
Pruning at a Glance: Global Neural Pruning for Model Compression
Pruning at a Glance: Global Neural Pruning for Model Compression
Abdullah Salama
O. Ostapenko
T. Klein
Moin Nabi
VLM
55
12
0
30 Nov 2019
Investigations on the inference optimization techniques and their impact
  on multiple hardware platforms for Semantic Segmentation
Investigations on the inference optimization techniques and their impact on multiple hardware platforms for Semantic Segmentation
Sethu Hareesh Kolluru
SSeg
18
1
0
29 Nov 2019
Semi-Relaxed Quantization with DropBits: Training Low-Bit Neural Networks via Bit-wise Regularization
J. H. Lee
Jihun Yun
Sung Ju Hwang
Eunho Yang
MQ
49
0
0
29 Nov 2019
Data-Driven Compression of Convolutional Neural Networks
Data-Driven Compression of Convolutional Neural Networks
Ramit Pahwa
Manoj Ghuhan Arivazhagan
Ankur Garg
S. Krishnamoorthy
Rohit Saxena
Sunav Choudhary
32
3
0
28 Nov 2019
QKD: Quantization-aware Knowledge Distillation
QKD: Quantization-aware Knowledge Distillation
Jangho Kim
Yash Bhalgat
Jinwon Lee
Chirag I. Patel
Nojun Kwak
MQ
107
66
0
28 Nov 2019
Orthogonal Convolutional Neural Networks
Orthogonal Convolutional Neural Networks
Jiayun Wang
Yubei Chen
Rudrasis Chakraborty
Stella X. Yu
93
190
0
27 Nov 2019
Optimal checkpointing for heterogeneous chains: how to train deep neural
  networks with limited memory
Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory
Julien Herrmann
Olivier Beaumont
Lionel Eyraud-Dubois
J. Herrmann
Alexis Joly
Alena Shilova
BDL
83
30
0
27 Nov 2019
Survey of Attacks and Defenses on Edge-Deployed Neural Networks
Survey of Attacks and Defenses on Edge-Deployed Neural Networks
Mihailo Isakov
V. Gadepally
K. Gettings
Michel A. Kinsy
AAML
56
31
0
27 Nov 2019
GhostNet: More Features from Cheap Operations
GhostNet: More Features from Cheap Operations
Kai Han
Yunhe Wang
Qi Tian
Jianyuan Guo
Chunjing Xu
Chang Xu
131
2,724
0
27 Nov 2019
Domain-Aware Dynamic Networks
Domain-Aware Dynamic Networks
Tianyuan Zhang
Bichen Wu
Xin Wang
Joseph E. Gonzalez
Kurt Keutzer
79
6
0
26 Nov 2019
Structured Multi-Hashing for Model Compression
Structured Multi-Hashing for Model Compression
Elad Eban
Yair Movshovitz-Attias
Hao Wu
Mark Sandler
Andrew Poon
Yerlan Idelbayev
M. A. Carreira-Perpiñán
87
18
0
25 Nov 2019
Real-Time Object Tracking via Meta-Learning: Efficient Model Adaptation
  and One-Shot Channel Pruning
Real-Time Object Tracking via Meta-Learning: Efficient Model Adaptation and One-Shot Channel Pruning
Ilchae Jung
Kihyun You
Hyeonwoo Noh
Minsu Cho
Bohyung Han
82
27
0
25 Nov 2019
Rigging the Lottery: Making All Tickets Winners
Rigging the Lottery: Making All Tickets Winners
Utku Evci
Trevor Gale
Jacob Menick
Pablo Samuel Castro
Erich Elsen
252
612
0
25 Nov 2019
A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN
  Implementation
A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN Implementation
Geng Yuan
Xiaolong Ma
Sheng Lin
Hao Sun
Caiwen Ding
50
8
0
24 Nov 2019
Reinventing 2D Convolutions for 3D Images
Reinventing 2D Convolutions for 3D Images
Jiancheng Yang
Xiaoyang Huang
Yi He
Jingwei Xu
Canqian Yang
Guozheng Xu
Bingbing Ni
120
11
0
24 Nov 2019
Compressing Representations for Embedded Deep Learning
Compressing Representations for Embedded Deep Learning
Juliano S. Assine
Alan Godoy
Eduardo Valle
63
3
0
23 Nov 2019
Training Modern Deep Neural Networks for Memory-Fault Robustness
Training Modern Deep Neural Networks for Memory-Fault Robustness
G. B. Hacene
François Leduc-Primeau
Amal Ben Soussia
Vincent Gripon
F. Gagnon
55
29
0
23 Nov 2019
SparseTrain:Leveraging Dynamic Sparsity in Training DNNs on
  General-Purpose SIMD Processors
SparseTrain:Leveraging Dynamic Sparsity in Training DNNs on General-Purpose SIMD Processors
Zhangxiaowen Gong
Houxiang Ji
Christopher W. Fletcher
C. Hughes
Josep Torrellas
63
5
0
22 Nov 2019
Go From the General to the Particular: Multi-Domain Translation with
  Domain Transformation Networks
Go From the General to the Particular: Multi-Domain Translation with Domain Transformation Networks
Yong Wang
Longyue Wang
Shuming Shi
Victor O.K. Li
Zhaopeng Tu
62
25
0
22 Nov 2019
Few Shot Network Compression via Cross Distillation
Few Shot Network Compression via Cross Distillation
Haoli Bai
Jiaxiang Wu
Irwin King
Michael Lyu
FedML
105
60
0
21 Nov 2019
Band-limited Training and Inference for Convolutional Neural Networks
Band-limited Training and Inference for Convolutional Neural Networks
Adam Dziedzic
John Paparrizos
S. Krishnan
Aaron J. Elmore
Michael Franklin
75
53
0
21 Nov 2019
Localized Compression: Applying Convolutional Neural Networks to
  Compressed Images
Localized Compression: Applying Convolutional Neural Networks to Compressed Images
Christopher A. George
Bradley M. West
18
2
0
20 Nov 2019
DRNet: Dissect and Reconstruct the Convolutional Neural Network via
  Interpretable Manners
DRNet: Dissect and Reconstruct the Convolutional Neural Network via Interpretable Manners
Xiaolong Hu
Zhulin An
Chuanguang Yang
Hui Zhu
Kaiqiang Xu
Yongjun Xu
52
3
0
20 Nov 2019
CUP: Cluster Pruning for Compressing Deep Neural Networks
CUP: Cluster Pruning for Compressing Deep Neural Networks
Rahul Duggal
Cao Xiao
R. Vuduc
Jimeng Sun
3DPCVLM
53
23
0
19 Nov 2019
Deep Spiking Neural Networks for Large Vocabulary Automatic Speech
  Recognition
Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition
Jibin Wu
Emre Yilmaz
Malu Zhang
Haizhou Li
Kay Chen Tan
80
107
0
19 Nov 2019
Neural Network Pruning with Residual-Connections and Limited-Data
Neural Network Pruning with Residual-Connections and Limited-Data
Jian-Hao Luo
Jianxin Wu
95
118
0
19 Nov 2019
AddNet: Deep Neural Networks Using FPGA-Optimized Multipliers
AddNet: Deep Neural Networks Using FPGA-Optimized Multipliers
Julian Faraone
M. Kumm
M. Hardieck
P. Zipf
Xueyuan Liu
David Boland
Philip H. W. Leong
MQ
49
45
0
19 Nov 2019
DARB: A Density-Aware Regular-Block Pruning for Deep Neural Networks
DARB: A Density-Aware Regular-Block Pruning for Deep Neural Networks
Ao Ren
Tao Zhang
Yuhao Wang
Sheng Lin
Zhaoyang Han
Yen-kuang Chen
Yuan Xie
Yanzhi Wang
82
11
0
19 Nov 2019
FeCaffe: FPGA-enabled Caffe with OpenCL for Deep Learning Training and
  Inference on Intel Stratix 10
FeCaffe: FPGA-enabled Caffe with OpenCL for Deep Learning Training and Inference on Intel Stratix 10
Ke He
Bo Liu
Yu Zhang
A. Ling
Dian Gu
38
8
0
18 Nov 2019
Fine-Grained Neural Architecture Search
Fine-Grained Neural Architecture Search
Heewon Kim
Seokil Hong
Bohyung Han
Heesoo Myeong
Kyoung Mu Lee
AI4CE
65
13
0
18 Nov 2019
Previous
123...464748...686970
Next