Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,448 papers shown
IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks
Ke Sun
Mingjie Li
Dong Liu
Jingdong Wang
48
126
0
01 Jun 2018
A Highly Parallel FPGA Implementation of Sparse Neural Network Training
Sourya Dey
Diandian Chen
Zongyang Li
Souvik Kundu
Kuan-Wen Huang
K. Chugg
Peter A. Beerel
17
11
0
31 May 2018
Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks
Kang Liu
Brendan Dolan-Gavitt
S. Garg
AAML
26
1,022
0
30 May 2018
Channel Gating Neural Networks
Weizhe Hua
Yuan Zhou
Christopher De Sa
Zhiru Zhang
G. E. Suh
15
180
0
29 May 2018
A novel channel pruning method for deep neural network compression
Yiming Hu
Siyang Sun
Jianquan Li
Xingang Wang
Qingyi Gu
20
67
0
29 May 2018
Retraining-Based Iterative Weight Quantization for Deep Neural Networks
Dongsoo Lee
Byeongwook Kim
MQ
36
16
0
29 May 2018
Adaptive Network Sparsification with Dependent Variational Beta-Bernoulli Dropout
Juho Lee
Saehoon Kim
Jaehong Yoon
Haebeom Lee
Eunho Yang
Sung Ju Hwang
14
12
0
28 May 2018
Constructing Fast Network through Deconstruction of Convolution
Yunho Jeon
Junmo Kim
22
71
0
28 May 2018
Compact and Computationally Efficient Representation of Deep Neural Networks
Simon Wiedemann
K. Müller
Wojciech Samek
MQ
42
67
0
27 May 2018
Accelerating CNN inference on FPGAs: A Survey
K. Abdelouahab
Maxime Pelcat
Jocelyn Serot
F. Berry
AI4CE
30
147
0
26 May 2018
Heterogeneous Bitwidth Binarization in Convolutional Neural Networks
Josh Fromm
Shwetak N. Patel
Matthai Philipose
MQ
19
27
0
25 May 2018
Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression
Jiahao Su
Jingling Li
Bobby Bhattacharjee
Furong Huang
16
20
0
25 May 2018
Scalable Methods for 8-bit Training of Neural Networks
Ron Banner
Itay Hubara
Elad Hoffer
Daniel Soudry
MQ
54
332
0
25 May 2018
Multi-Task Zipping via Layer-wise Neuron Sharing
Xiaoxi He
Zimu Zhou
Lothar Thiele
MoMe
13
61
0
24 May 2018
Learning towards Minimum Hyperspherical Energy
Weiyang Liu
Rongmei Lin
Ziqiang Liu
Lixin Liu
Zhiding Yu
Bo Dai
Le Song
30
146
0
23 May 2018
AutoPruner: An End-to-End Trainable Filter Pruning Method for Efficient Deep Model Inference
Jian-Hao Luo
Jianxin Wu
23
207
0
23 May 2018
Approximate Random Dropout
Zhuoran Song
Ru Wang
Dongyu Ru
Hongru Huang
Zhenghao Peng
Hai Zhao
Xiaoyao Liang
Li Jiang
BDL
30
9
0
23 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
27
44
0
22 May 2018
CascadeCNN: Pushing the performance limits of quantisation
Alexandros Kouris
Stylianos I. Venieris
C. Bouganis
MQ
30
24
0
22 May 2018
Parsimonious Bayesian deep networks
Mingyuan Zhou
BDL
20
8
0
22 May 2018
AxTrain: Hardware-Oriented Neural Network Training for Approximate Inference
Xin He
Liu Ke
Wenyan Lu
Guihai Yan
Xuan Zhang
27
34
0
21 May 2018
Compression of Deep Convolutional Neural Networks under Joint Sparsity Constraints
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
11
6
0
21 May 2018
Faster Neural Network Training with Approximate Tensor Operations
Menachem Adelman
Kfir Y. Levy
Ido Hakimi
M. Silberstein
36
26
0
21 May 2018
Neural Network Compression using Transform Coding and Clustering
Thorsten Laude
Yannick Richter
Jörn Ostermann
18
4
0
18 May 2018
RotDCF: Decomposition of Convolutional Filters for Rotation-Equivariant Deep Networks
Xiuyuan Cheng
Qiang Qiu
Robert Calderbank
Guillermo Sapiro
30
43
0
17 May 2018
Object detection at 200 Frames Per Second
Rakesh Mehta
Cemalettin Öztürk
ObjD
37
61
0
16 May 2018
Lightweight Pyramid Networks for Image Deraining
Xueyang Fu
Borong Liang
Yue Huang
Xinghao Ding
John Paisley
18
323
0
16 May 2018
Hu-Fu: Hardware and Software Collaborative Attack Framework against Neural Networks
Wenshuo Li
Jincheng Yu
Xuefei Ning
Pengjun Wang
Qi Wei
Yu Wang
Huazhong Yang
AAML
39
61
0
14 May 2018
Unifying and Merging Well-trained Deep Neural Networks for Inference Stage
Yi-Min Chou
Yi-Ming Chan
Jia-Hong Lee
Chih-Yi Chiu
Chu-Song Chen
MoMe
35
34
0
14 May 2018
ContextNet: Exploring Context and Detail for Semantic Segmentation in Real-time
Rudra P. K. Poudel
Ujwal D. Bonde
Stephan Liwicki
Christopher Zach
SSeg
38
230
0
11 May 2018
GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks
Amir Yazdanbakhsh
Hajar Falahati
Philip J. Wolfe
K. Samadi
Nam Sung Kim
H. Esmaeilzadeh
30
71
0
10 May 2018
Boosting up Scene Text Detectors with Guided CNN
Xiaoyu Yue
Zhanghui Kuang
Zhaoyang Zhang
Zhenfang Chen
Pan He
Yu Qiao
Wayne Zhang
17
8
0
10 May 2018
Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks
Charles Eckert
Xiaowei Wang
Jingcheng Wang
Arun K. Subramaniyan
R. Iyer
D. Sylvester
D. Blaauw
R. Das
MQ
13
334
0
09 May 2018
Towards Accurate and High-Speed Spiking Neuromorphic Systems with Data Quantization-Aware Deep Networks
Fuqiang Liu
Chenchen Liu
16
5
0
08 May 2018
A Hierarchical Matcher using Local Classifier Chains
Lingfeng Zhang
I. Kakadiaris
9
0
0
07 May 2018
Enhancing the Regularization Effect of Weight Pruning in Artificial Neural Networks
Brian Bartoldson
Adrian Barbu
G. Erlebacher
14
5
0
04 May 2018
Power Law in Sparsified Deep Neural Networks
Lu Hou
James T. Kwok
29
3
0
04 May 2018
Pixel-wise Attentional Gating for Parsimonious Pixel Labeling
Shu Kong
Charless C. Fowlkes
57
40
0
03 May 2018
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
Chaim Baskin
Eli Schwartz
Evgenii Zheltonozhskii
Natan Liss
Raja Giryes
A. Bronstein
A. Mendelson
MQ
27
45
0
29 Apr 2018
Precise Box Score: Extract More Information from Datasets to Improve the Performance of Face Detection
Ce Qi
Xiaoping Chen
Pingyu Wang
Fei Su
CVBM
16
1
0
28 Apr 2018
Low-memory convolutional neural networks through incremental depth-first processing
Jonathan Binas
Yoshua Bengio
SupR
30
3
0
28 Apr 2018
Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip
Feiwen Zhu
Jeff Pool
M. Andersch
J. Appleyard
Fung Xie
22
29
0
26 Apr 2018
Profile-guided memory optimization for deep neural networks
Taro Sekiyama
T. Imamichi
Haruki Imai
Raymond H. Putra
39
22
0
26 Apr 2018
Accelerator-Aware Pruning for Convolutional Neural Networks
Hyeong-Ju Kang
13
88
0
26 Apr 2018
Efficient Multi-objective Neural Architecture Search via Lamarckian Evolution
T. Elsken
J. H. Metzen
Frank Hutter
131
499
0
24 Apr 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
43
401
0
24 Apr 2018
MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server
Guoxin Cui
Jun Xu
Wei Zeng
Yanyan Lan
Jiafeng Guo
Xueqi Cheng
MQ
16
13
0
22 Apr 2018
MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification on Mobile Devices
Sheng Chen
Yang Liu
Xiang Gao
Zhen Han
CVBM
3DH
30
558
0
20 Apr 2018
Minimizing Area and Energy of Deep Learning Hardware Design Using Collective Low Precision and Structured Compression
Shihui Yin
Gaurav Srivastava
S. Venkataramanaiah
C. Chakrabarti
Visar Berisha
Jae-sun Seo
25
8
0
19 Apr 2018
Pelee: A Real-Time Object Detection System on Mobile Devices
R. Wang
Xiang Li
Charles X. Ling
ObjD
30
454
0
18 Apr 2018