ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXivPDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,449 papers shown
Title
Optimize Deep Convolutional Neural Network with Ternarized Weights and
  High Accuracy
Optimize Deep Convolutional Neural Network with Ternarized Weights and High Accuracy
Zhezhi He
Boqing Gong
Deliang Fan
24
22
0
20 Jul 2018
Statistical Model Compression for Small-Footprint Natural Language
  Understanding
Statistical Model Compression for Small-Footprint Natural Language Understanding
Grant P. Strimel
Kanthashree Mysore Sathyendra
Stanislav Peshterliev
32
9
0
19 Jul 2018
Defend Deep Neural Networks Against Adversarial Examples via Fixed and
  Dynamic Quantized Activation Functions
Defend Deep Neural Networks Against Adversarial Examples via Fixed and Dynamic Quantized Activation Functions
Adnan Siraj Rakin
Jinfeng Yi
Boqing Gong
Deliang Fan
AAML
MQ
24
50
0
18 Jul 2018
BRIEF: Backward Reduction of CNNs with Information Flow Analysis
BRIEF: Backward Reduction of CNNs with Information Flow Analysis
Yu-Hsun Lin
Chun-Nan Chou
Edward Y. Chang
14
0
0
16 Jul 2018
Morse Code Datasets for Machine Learning
Morse Code Datasets for Machine Learning
Sourya Dey
K. Chugg
Peter A. Beerel
18
10
0
11 Jul 2018
Make $\ell_1$ Regularization Effective in Training Sparse CNN
Make ℓ1\ell_1ℓ1​ Regularization Effective in Training Sparse CNN
Juncai He
Xiaodong Jia
Jinchao Xu
Lian Zhang
Liang Zhao
27
5
0
11 Jul 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable
  Precision LSTM Networks on FPGAs
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin
Alessandro Pappalardo
M. M. Ghaffar
Giulio Gambardella
Norbert Wehn
Michaela Blott
27
72
0
11 Jul 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for
  Visual and Speech Recognition
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
20
96
0
10 Jul 2018
Auto Deep Compression by Reinforcement Learning Based Actor-Critic
  Structure
Auto Deep Compression by Reinforcement Learning Based Actor-Critic Structure
Hamed Hakkak
OffRL
AI4CE
20
1
0
08 Jul 2018
Anytime Neural Prediction via Slicing Networks Vertically
Anytime Neural Prediction via Slicing Networks Vertically
Hankook Lee
Jinwoo Shin
AI4CE
27
16
0
07 Jul 2018
Sparse Deep Neural Network Exact Solutions
Sparse Deep Neural Network Exact Solutions
J. Kepner
V. Gadepally
Hayden Jananthan
Lauren Milechin
S. Samsi
24
14
0
06 Jul 2018
SGAD: Soft-Guided Adaptively-Dropped Neural Network
SGAD: Soft-Guided Adaptively-Dropped Neural Network
Zhisheng Wang
Fangxuan Sun
Jun Lin
Zhongfeng Wang
Bo Yuan
19
7
0
04 Jul 2018
Restructuring Batch Normalization to Accelerate CNN Training
Restructuring Batch Normalization to Accelerate CNN Training
Wonkyung Jung
Daejin Jung
and Byeongho Kim
Sunjung Lee
Wonjong Rhee
Jung Ho Ahn
26
62
0
04 Jul 2018
Confidential Inference via Ternary Model Partitioning
Confidential Inference via Ternary Model Partitioning
Zhongshu Gu
Heqing Huang
Jialong Zhang
D. Su
Hani Jamjoom
Ankita Lamba
Dimitrios E. Pendarakis
Ian Molloy
24
53
0
03 Jul 2018
Stochastic Layer-Wise Precision in Deep Neural Networks
Stochastic Layer-Wise Precision in Deep Neural Networks
Griffin Lacey
Graham W. Taylor
S. Areibi
42
18
0
03 Jul 2018
Weight-importance sparse training in keyword spotting
Weight-importance sparse training in keyword spotting
Sihao Xue
Zhenyi Ying
Fan Mo
Min Wang
Jue Sun
17
0
0
02 Jul 2018
Evenly Cascaded Convolutional Networks
Evenly Cascaded Convolutional Networks
Chengxi Ye
Chinmaya Devaraj
Michael Maynord
Cornelia Fermuller
Yiannis Aloimonos
18
7
0
02 Jul 2018
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
MQ
33
133
0
01 Jul 2018
Automatic Rank Selection for High-Speed Convolutional Neural Network
Automatic Rank Selection for High-Speed Convolutional Neural Network
Hyeji Kim
C. Kyung
27
5
0
28 Jun 2018
DeepObfuscation: Securing the Structure of Convolutional Neural Networks
  via Knowledge Distillation
DeepObfuscation: Securing the Structure of Convolutional Neural Networks via Knowledge Distillation
Hui Xu
Yuxin Su
Zirui Zhao
Yangfan Zhou
Michael R. Lyu
Irwin King
FedML
13
26
0
27 Jun 2018
Deep $k$-Means: Re-Training and Parameter Sharing with Harder Cluster
  Assignments for Compressing Deep Convolutions
Deep kkk-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions
Junru Wu
Yue Wang
Zhenyu Wu
Zhangyang Wang
Ashok Veeraraghavan
Yingyan Lin
15
115
0
24 Jun 2018
Constructing Deep Neural Networks by Bayesian Network Structure Learning
Constructing Deep Neural Networks by Bayesian Network Structure Learning
R. Y. Rohekar
Shami Nisimov
Yaniv Gurwicz
G. Koren
Gal Novik
BDL
36
26
0
24 Jun 2018
Compact Deep Neural Networks for Computationally Efficient Gesture
  Classification From Electromyography Signals
Compact Deep Neural Networks for Computationally Efficient Gesture Classification From Electromyography Signals
A. Hartwell
V. Kadirkamanathan
S. Anderson
11
17
0
22 Jun 2018
Deploying Deep Neural Networks in the Embedded Space
Deploying Deep Neural Networks in the Embedded Space
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
32
13
0
22 Jun 2018
Efficient Semantic Segmentation using Gradual Grouping
Efficient Semantic Segmentation using Gradual Grouping
Nikitha Vallurupalli
Sriharsha Annamaneni
G. Varma
C. V. Jawahar
Manu Mathew
S. Nagori
SSeg
14
12
0
22 Jun 2018
Learning K-way D-dimensional Discrete Codes for Compact Embedding
  Representations
Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations
Ting-Li Chen
Martin Renqiang Min
Yizhou Sun
26
70
0
21 Jun 2018
Quantizing deep convolutional networks for efficient inference: A
  whitepaper
Quantizing deep convolutional networks for efficient inference: A whitepaper
Raghuraman Krishnamoorthi
MQ
53
999
0
21 Jun 2018
Rethinking Machine Learning Development and Deployment for Edge Devices
Rethinking Machine Learning Development and Deployment for Edge Devices
Liangzhen Lai
Naveen Suda
19
10
0
20 Jun 2018
Edge Intelligence: On-Demand Deep Learning Model Co-Inference with
  Device-Edge Synergy
Edge Intelligence: On-Demand Deep Learning Model Co-Inference with Device-Edge Synergy
En Li
Zhi Zhou
Xu Chen
27
325
0
20 Jun 2018
Doubly Nested Network for Resource-Efficient Inference
Doubly Nested Network for Resource-Efficient Inference
Jaehong Kim
Sungeun Hong
Yongseok Choi
Jiwon Kim
21
5
0
20 Jun 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks
  per Bit?
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
35
136
0
20 Jun 2018
GroupReduce: Block-Wise Low-Rank Approximation for Neural Language Model
  Shrinking
GroupReduce: Block-Wise Low-Rank Approximation for Neural Language Model Shrinking
Patrick H. Chen
Si Si
Yang Li
Ciprian Chelba
Cho-Jui Hsieh
24
67
0
18 Jun 2018
Fast Convex Pruning of Deep Neural Networks
Fast Convex Pruning of Deep Neural Networks
Alireza Aghasi
Afshin Abdi
Justin Romberg
29
24
0
17 Jun 2018
On Machine Learning and Structure for Mobile Robots
On Machine Learning and Structure for Mobile Robots
Markus Wulfmeier
27
6
0
15 Jun 2018
Three dimensional Deep Learning approach for remote sensing image
  classification
Three dimensional Deep Learning approach for remote sensing image classification
A. Ben Hamida
A. Benoît
P. Lambert
C. Ben Amar
63
570
0
15 Jun 2018
RAPIDNN: In-Memory Deep Neural Network Acceleration Framework
RAPIDNN: In-Memory Deep Neural Network Acceleration Framework
Mohsen Imani
Mohammad Samragh
Yeseong Kim
Saransh Gupta
F. Koushanfar
Tajana Simunic
24
51
0
15 Jun 2018
Deep Learning Approximation: Zero-Shot Neural Network Speedup
Deep Learning Approximation: Zero-Shot Neural Network Speedup
Michele Pratusevich
30
0
0
15 Jun 2018
Insights on representational similarity in neural networks with
  canonical correlation
Insights on representational similarity in neural networks with canonical correlation
Ari S. Morcos
M. Raghu
Samy Bengio
DRL
32
435
0
14 Jun 2018
PCAS: Pruning Channels with Attention Statistics for Deep Network
  Compression
PCAS: Pruning Channels with Attention Statistics for Deep Network Compression
Kohei Yamamoto
K. Maeno
24
32
0
14 Jun 2018
Scalable Neural Network Compression and Pruning Using Hard Clustering
  and L1 Regularization
Scalable Neural Network Compression and Pruning Using Hard Clustering and L1 Regularization
Yibo Yang
Nicholas Ruozzi
Vibhav Gogate
21
2
0
14 Jun 2018
The streaming rollout of deep networks - towards fully model-parallel
  execution
The streaming rollout of deep networks - towards fully model-parallel execution
Volker Fischer
Jan M. Köhler
Thomas Pfeil
32
16
0
13 Jun 2018
Knowledge Distillation by On-the-Fly Native Ensemble
Knowledge Distillation by On-the-Fly Native Ensemble
Xu Lan
Xiatian Zhu
S. Gong
214
474
0
12 Jun 2018
Energy-Constrained Compression for Deep Neural Networks via Weighted
  Sparse Projection and Layer Input Masking
Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking
Haichuan Yang
Yuhao Zhu
Ji Liu
CVBM
21
36
0
12 Jun 2018
Full deep neural network training on a pruned weight budget
Full deep neural network training on a pruned weight budget
Maximilian Golub
G. Lemieux
Mieszko Lis
33
28
0
11 Jun 2018
Smallify: Learning Network Size while Training
Smallify: Learning Network Size while Training
Guillaume Leclerc
Manasi Vartak
Raul Castro Fernandez
Tim Kraska
Samuel Madden
14
13
0
10 Jun 2018
TAPAS: Tricks to Accelerate (encrypted) Prediction As a Service
TAPAS: Tricks to Accelerate (encrypted) Prediction As a Service
Amartya Sanyal
Matt J. Kusner
Adria Gascon
Varun Kanade
FedML
27
126
0
09 Jun 2018
Slalom: Fast, Verifiable and Private Execution of Neural Networks in
  Trusted Hardware
Slalom: Fast, Verifiable and Private Execution of Neural Networks in Trusted Hardware
Florian Tramèr
Dan Boneh
FedML
114
396
0
08 Jun 2018
EasyConvPooling: Random Pooling with Easy Convolution for Accelerating
  Training and Testing
EasyConvPooling: Random Pooling with Easy Convolution for Accelerating Training and Testing
Jianzhong Sheng
Chuanbo Chen
Chenchen Fu
Chun Jason Xue
30
4
0
05 Jun 2018
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance
  Benchmark
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark
Cody Coleman
Daniel Kang
Deepak Narayanan
Luigi Nardi
Tian Zhao
Jian Zhang
Peter Bailis
K. Olukotun
Christopher Ré
Matei A. Zaharia
18
117
0
04 Jun 2018
Dynamically Hierarchy Revolution: DirNet for Compressing Recurrent
  Neural Network on Mobile Devices
Dynamically Hierarchy Revolution: DirNet for Compressing Recurrent Neural Network on Mobile Devices
Jie Zhang
Xiaolong Wang
Dawei Li
Yalin Wang
6
14
0
04 Jun 2018
Previous
123...596061...676869
Next