ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXivPDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,452 papers shown
Title
Discrimination-aware Channel Pruning for Deep Neural Networks
Discrimination-aware Channel Pruning for Deep Neural Networks
Zhuangwei Zhuang
Mingkui Tan
Bohan Zhuang
Jing Liu
Yong Guo
Qingyao Wu
Junzhou Huang
Jin-Hui Zhu
47
595
0
28 Oct 2018
Learning Sparse Neural Networks via Sensitivity-Driven Regularization
Learning Sparse Neural Networks via Sensitivity-Driven Regularization
Enzo Tartaglione
S. Lepsøy
Attilio Fiandrotti
Gianluca Francini
9
69
0
28 Oct 2018
A Miniaturized Semantic Segmentation Method for Remote Sensing Image
A Miniaturized Semantic Segmentation Method for Remote Sensing Image
Shou-Yu Chen
Guang-Sheng Chen
Wei-Peng Jing
14
1
0
27 Oct 2018
Distilling with Performance Enhanced Students
Distilling with Performance Enhanced Students
Jack Turner
Elliot J. Crowley
Valentin Radu
José Cano
Amos Storkey
Michael F. P. O'Boyle
24
3
0
24 Oct 2018
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for
  Continuous Mobile Vision
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision
Biyi Fang
Xiao Zeng
Mi Zhang
3DH
34
264
0
23 Oct 2018
Deep Neural Network inference with reduced word length
Deep Neural Network inference with reduced word length
Lukas Mauch
Binh Yang
MQ
14
0
0
23 Oct 2018
Convolutional Neural Network Pruning to Accelerate Membrane Segmentation
  in Electron Microscopy
Convolutional Neural Network Pruning to Accelerate Membrane Segmentation in Electron Microscopy
J. Roels
Jonas De Vylder
J. Aelterman
Yvan Saeys
Wilfried Philips
14
6
0
23 Oct 2018
Learning sparse transformations through backpropagation
Learning sparse transformations through backpropagation
Peter Bloem
24
0
0
22 Oct 2018
To Compress, or Not to Compress: Characterizing Deep Learning Model
  Compression for Embedded Inference
To Compress, or Not to Compress: Characterizing Deep Learning Model Compression for Embedded Inference
Qing Qin
Jie Ren
Jia-Le Yu
Ling Gao
Hai Wang
Jie Zheng
Yansong Feng
Jianbin Fang
Zheng Wang
16
21
0
21 Oct 2018
CNN inference acceleration using dictionary of centroids
CNN inference acceleration using dictionary of centroids
D.Babin
I.Mazurenko
D.Parkhomenko
A.Voloshko
MQ
14
0
0
19 Oct 2018
Real-time Neural-based Input Method
Real-time Neural-based Input Method
Jiali Yao
Raphael Shu
Xinjian Li
K. Ohtsuki
Hideki Nakayama
11
4
0
19 Oct 2018
KTAN: Knowledge Transfer Adversarial Network
KTAN: Knowledge Transfer Adversarial Network
Peiye Liu
Wu Liu
Huadong Ma
Tao Mei
Mingoo Seok
GAN
36
28
0
18 Oct 2018
Progressive Weight Pruning of Deep Neural Networks using ADMM
Progressive Weight Pruning of Deep Neural Networks using ADMM
Shaokai Ye
Tianyun Zhang
Kaiqi Zhang
Jiayu Li
Kaidi Xu
...
M. Fardad
Sijia Liu
Xiang Chen
Xinyu Lin
Yanzhi Wang
AI4CE
53
38
0
17 Oct 2018
Quantization for Rapid Deployment of Deep Neural Networks
Quantization for Rapid Deployment of Deep Neural Networks
J. Lee
Sangwon Ha
Saerom Choi
Won-Jo Lee
Seungwon Lee
MQ
14
48
0
12 Oct 2018
Rethinking the Value of Network Pruning
Rethinking the Value of Network Pruning
Zhuang Liu
Mingjie Sun
Tinghui Zhou
Gao Huang
Trevor Darrell
10
1,457
0
11 Oct 2018
A Closer Look at Structured Pruning for Neural Network Compression
A Closer Look at Structured Pruning for Neural Network Compression
Elliot J. Crowley
Jack Turner
Amos Storkey
Michael F. P. O'Boyle
3DPC
34
31
0
10 Oct 2018
Extreme Classification in Log Memory
Extreme Classification in Log Memory
Qixuan Huang
Yiqiu Wang
Tharun Medini
Anshumali Shrivastava
VLM
18
3
0
09 Oct 2018
Deep Neural Network Compression for Aircraft Collision Avoidance Systems
Deep Neural Network Compression for Aircraft Collision Avoidance Systems
Kyle D. Julian
Mykel J. Kochenderfer
Michael P. Owen
28
169
0
09 Oct 2018
Rate Distortion For Model Compression: From Theory To Practice
Rate Distortion For Model Compression: From Theory To Practice
Weihao Gao
Yu-Han Liu
Chong-Jun Wang
Sewoong Oh
30
31
0
09 Oct 2018
Light-Weight RefineNet for Real-Time Semantic Segmentation
Light-Weight RefineNet for Real-Time Semantic Segmentation
Vladimir Nekrasov
Chunhua Shen
Ian Reid
SSeg
VLM
35
147
0
08 Oct 2018
Sparse Winograd Convolutional neural networks on small-scale systolic
  arrays
Sparse Winograd Convolutional neural networks on small-scale systolic arrays
Feng Shi
Haochen Li
Yuhe Gao
Benjamin Kuschner
Song-Chun Zhu
11
15
0
03 Oct 2018
Relaxed Quantization for Discretized Neural Networks
Relaxed Quantization for Discretized Neural Networks
Christos Louizos
M. Reisser
Tijmen Blankevoort
E. Gavves
Max Welling
MQ
46
132
0
03 Oct 2018
Learning with Random Learning Rates
Learning with Random Learning Rates
Léonard Blier
Pierre Wolinski
Yann Ollivier
OOD
34
20
0
02 Oct 2018
Training compact deep learning models for video classification using
  circulant matrices
Training compact deep learning models for video classification using circulant matrices
Alexandre Araujo
Benjamin Négrevergne
Y. Chevaleyre
Jamal Atif
34
14
0
02 Oct 2018
Target Aware Network Adaptation for Efficient Representation Learning
Target Aware Network Adaptation for Efficient Representation Learning
Yang Zhong
Vladimir Li
R. Okada
A. Maki
23
6
0
02 Oct 2018
LIT: Block-wise Intermediate Representation Training for Model
  Compression
LIT: Block-wise Intermediate Representation Training for Model Compression
Animesh Koratana
Daniel Kang
Peter Bailis
Matei A. Zaharia
19
12
0
02 Oct 2018
Simultaneously Optimizing Weight and Quantizer of Ternary Neural Network
  using Truncated Gaussian Approximation
Simultaneously Optimizing Weight and Quantizer of Ternary Neural Network using Truncated Gaussian Approximation
Zhezhi He
Deliang Fan
MQ
16
67
0
02 Oct 2018
Extended Bit-Plane Compression for Convolutional Neural Network
  Accelerators
Extended Bit-Plane Compression for Convolutional Neural Network Accelerators
Lukas Cavigelli
Luca Benini
32
20
0
01 Oct 2018
ProxQuant: Quantized Neural Networks via Proximal Operators
ProxQuant: Quantized Neural Networks via Proximal Operators
Yu Bai
Yu Wang
Edo Liberty
MQ
24
117
0
01 Oct 2018
Dynamic Sparse Graph for Efficient Deep Learning
Dynamic Sparse Graph for Efficient Deep Learning
Liu Liu
Lei Deng
Xing Hu
Maohua Zhu
Guoqi Li
Yufei Ding
Yuan Xie
GNN
40
42
0
01 Oct 2018
Benchmark Analysis of Representative Deep Neural Network Architectures
Benchmark Analysis of Representative Deep Neural Network Architectures
Simone Bianco
Rémi Cadène
Luigi Celona
Paolo Napoletano
BDL
28
672
0
01 Oct 2018
Procedural Noise Adversarial Examples for Black-Box Attacks on Deep
  Convolutional Networks
Procedural Noise Adversarial Examples for Black-Box Attacks on Deep Convolutional Networks
Kenneth T. Co
Luis Muñoz-González
Sixte de Maupeou
Emil C. Lupu
AAML
24
67
0
30 Sep 2018
Minimal Random Code Learning: Getting Bits Back from Compressed Model
  Parameters
Minimal Random Code Learning: Getting Bits Back from Compressed Model Parameters
Marton Havasi
Robert Peharz
José Miguel Hernández-Lobato
20
80
0
30 Sep 2018
Mini-batch Serialization: CNN Training with Inter-layer Data Reuse
Mini-batch Serialization: CNN Training with Inter-layer Data Reuse
Sangkug Lym
Armand Behroozi
W. Wen
Ge Li
Yongkee Kwon
M. Erez
17
25
0
30 Sep 2018
To compress or not to compress: Understanding the Interactions between
  Adversarial Attacks and Neural Network Compression
To compress or not to compress: Understanding the Interactions between Adversarial Attacks and Neural Network Compression
Yiren Zhao
Ilia Shumailov
Robert D. Mullins
Ross J. Anderson
AAML
19
43
0
29 Sep 2018
Knowledge-guided Semantic Computing Network
Knowledge-guided Semantic Computing Network
Guangming Shi
Zhongqiang Zhang
Dahua Gao
Xuemei Xie
Yihao Feng
Xinrui Ma
Danhua Liu
34
8
0
29 Sep 2018
Throughput Optimizations for FPGA-based Deep Neural Network Inference
Throughput Optimizations for FPGA-based Deep Neural Network Inference
Thorbjörn Posewsky
Daniel Ziener
19
25
0
28 Sep 2018
Intelligence Beyond the Edge: Inference on Intermittent Embedded Systems
Intelligence Beyond the Edge: Inference on Intermittent Embedded Systems
Graham Gobieski
Nathan Beckmann
Brandon Lucia
22
204
0
28 Sep 2018
Deep learning systems as complex networks
Deep learning systems as complex networks
Alberto Testolin
Michele Piccolini
S. Suweis
AI4CE
BDL
GNN
13
27
0
28 Sep 2018
Learning to Train a Binary Neural Network
Learning to Train a Binary Neural Network
Joseph Bethge
Haojin Yang
Christian Bartz
Christoph Meinel
MQ
17
12
0
27 Sep 2018
Adaptive Pruning of Neural Language Models for Mobile Devices
Adaptive Pruning of Neural Language Models for Mobile Devices
Raphael Tang
Jimmy J. Lin
24
6
0
27 Sep 2018
Object Detection from Scratch with Deep Supervision
Object Detection from Scratch with Deep Supervision
Zhiqiang Shen
Zhuang Liu
Jianguo Li
Yu-Gang Jiang
Yurong Chen
Xiangyang Xue
ObjD
24
77
0
25 Sep 2018
No Multiplication? No Floating Point? No Problem! Training Networks for
  Efficient Inference
No Multiplication? No Floating Point? No Problem! Training Networks for Efficient Inference
S. Baluja
David Marwood
Michele Covell
Nick Johnston
MQ
26
8
0
24 Sep 2018
Shift-based Primitives for Efficient Convolutional Neural Networks
Shift-based Primitives for Efficient Convolutional Neural Networks
Huasong Zhong
Xianggen Liu
Yihui He
Yuchun Ma
35
20
0
22 Sep 2018
FastDeepIoT: Towards Understanding and Optimizing Neural Network
  Execution Time on Mobile and Embedded Devices
FastDeepIoT: Towards Understanding and Optimizing Neural Network Execution Time on Mobile and Embedded Devices
Shuochao Yao
Yiran Zhao
Huajie Shao
Shengzhong Liu
Dongxin Liu
Lu Su
Tarek Abdelzaher
HAI
21
132
0
19 Sep 2018
MBS: Macroblock Scaling for CNN Model Reduction
MBS: Macroblock Scaling for CNN Model Reduction
Yu-Hsun Lin
Chun-Nan Chou
Edward Y. Chang
MQ
16
4
0
18 Sep 2018
Intermediate Deep Feature Compression: the Next Battlefield of
  Intelligent Sensing
Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing
Zhuo Chen
Weisi Lin
Shiqi Wang
Ling-yu Duan
Alex C. Kot
35
16
0
17 Sep 2018
FermiNets: Learning generative machines to generate efficient neural
  networks via generative synthesis
FermiNets: Learning generative machines to generate efficient neural networks via generative synthesis
A. Wong
M. Shafiee
Brendan Chwyl
Francis Li
13
64
0
17 Sep 2018
Memristor-based Deep Convolution Neural Network: A Case Study
Memristor-based Deep Convolution Neural Network: A Case Study
Fan Zhang
Miao Hu
11
6
0
14 Sep 2018
Hardware-Aware Machine Learning: Modeling and Optimization
Hardware-Aware Machine Learning: Modeling and Optimization
Diana Marculescu
Dimitrios Stamoulis
E. Cai
28
45
0
14 Sep 2018
Previous
123...575859...686970
Next