ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXivPDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,448 papers shown
Title
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network
  Simulation Expansion and STDP Convergence Predictions
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network Simulation Expansion and STDP Convergence Predictions
Toby Lightheart
S. Grainger
Tien-Fu Lu
8
0
0
30 Aug 2017
Performance Guaranteed Network Acceleration via High-Order Residual
  Quantization
Performance Guaranteed Network Acceleration via High-Order Residual Quantization
Zefan Li
Bingbing Ni
Wenjun Zhang
Xiaokang Yang
Wen Gao
MQ
32
107
0
29 Aug 2017
CirCNN: Accelerating and Compressing Deep Neural Networks Using
  Block-CirculantWeight Matrices
CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices
Caiwen Ding
Siyu Liao
Yanzhi Wang
Zhe Li
Ning Liu
...
Yipeng Zhang
Jian Tang
Qinru Qiu
Xinyu Lin
Bo Yuan
GNN
32
259
0
29 Aug 2017
Deep Learning Sparse Ternary Projections for Compressed Sensing of
  Images
Deep Learning Sparse Ternary Projections for Compressed Sensing of Images
Duc Minh Nguyen
Evaggelia Tsiligianni
Nikos Deligiannis
13
26
0
28 Aug 2017
The Convergence of Machine Learning and Communications
The Convergence of Machine Learning and Communications
Wojciech Samek
S. Stańczak
Thomas Wiegand
AI4CE
32
29
0
28 Aug 2017
Learning Efficient Convolutional Networks through Network Slimming
Learning Efficient Convolutional Networks through Network Slimming
Zhuang Liu
Jianguo Li
Zhiqiang Shen
Gao Huang
Shoumeng Yan
Changshui Zhang
70
2,391
0
22 Aug 2017
Neural Networks Compression for Language Modeling
Neural Networks Compression for Language Modeling
Artem M. Grachev
D. Ignatov
Andrey V. Savchenko
22
30
0
20 Aug 2017
Deep Neural Network Capacity
Aosen Wang
Huan Zhou
Wenyao Xu
Xin Chen
13
4
0
16 Aug 2017
BitNet: Bit-Regularized Deep Neural Networks
BitNet: Bit-Regularized Deep Neural Networks
Aswin Raghavan
Mohamed R. Amer
S. Chai
Graham Taylor
MQ
38
10
0
16 Aug 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile
  Devices
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices
Dawei Li
Xiaolong Wang
Deguang Kong
31
97
0
16 Aug 2017
Enabling Massive Deep Neural Networks with the GraphBLAS
Enabling Massive Deep Neural Networks with the GraphBLAS
J. Kepner
Manoj Kumar
José Moreira
P. Pattnaik
M. Serrano
H. Tufo
GNN
22
33
0
09 Aug 2017
Prune the Convolutional Neural Networks with Sparse Shrink
Prune the Convolutional Neural Networks with Sparse Shrink
Xuzhao Li
Changsong Liu
CVBM
11
4
0
08 Aug 2017
Natural Language Processing with Small Feed-Forward Networks
Natural Language Processing with Small Feed-Forward Networks
Jan A. Botha
Emily Pitler
Ji Ma
A. Bakalov
Alexandru Salcianu
David J. Weiss
Ryan T. McDonald
Slav Petrov
HAI
30
38
0
01 Aug 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an
  FPGA-Based Dataflow Platform
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform
Chaim Baskin
Natan Liss
Evgenii Zheltonozhskii
A. Bronstein
A. Mendelson
GNN
MQ
45
35
0
31 Jul 2017
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional
  Network with Bayesian Optimization
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization
Frederick Tung
S. Muralidharan
Greg Mori
35
35
0
28 Jul 2017
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks
Nikolaos Passalis
Anastasios Tefas
20
70
0
25 Jul 2017
Towards Evolutional Compression
Towards Evolutional Compression
Yunhe Wang
Chang Xu
Jiayan Qiu
Chao Xu
Dacheng Tao
22
14
0
25 Jul 2017
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM
Cong Leng
Hao Li
Shenghuo Zhu
Rong Jin
MQ
38
286
0
24 Jul 2017
Neuron Pruning for Compressing Deep Networks using Maxout Architectures
Neuron Pruning for Compressing Deep Networks using Maxout Architectures
Fernando Moya Rueda
René Grzeszick
G. Fink
CVBM
22
17
0
21 Jul 2017
ThiNet: A Filter Level Pruning Method for Deep Neural Network
  Compression
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
Jian-Hao Luo
Jianxin Wu
Weiyao Lin
19
1,746
0
20 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian Sun
128
2,508
0
19 Jul 2017
Pruning Convolutional Neural Networks for Image Instance Retrieval
Pruning Convolutional Neural Networks for Image Instance Retrieval
Gaurav Manek
Jie Lin
V. Chandrasekhar
Ling-Yu Duan
Sateesh Giduthuri
Xiaoli Li
T. Poggio
30
2
0
18 Jul 2017
Fast and Accurate Image Super Resolution by Deep CNN with Skip
  Connection and Network in Network
Fast and Accurate Image Super Resolution by Deep CNN with Skip Connection and Network in Network
Jin Yamanaka
S. Kuwashima
Takio Kurita
SupR
33
213
0
18 Jul 2017
Ternary Residual Networks
Ternary Residual Networks
Abhisek Kundu
K. Banerjee
Naveen Mellempudi
Dheevatsa Mudigere
Dipankar Das
Bharat Kaul
Pradeep Dubey
34
8
0
15 Jul 2017
Interleaved Group Convolutions for Deep Neural Networks
Interleaved Group Convolutions for Deep Neural Networks
Ting Zhang
Guo-Jun Qi
Bin Xiao
Jingdong Wang
36
81
0
10 Jul 2017
An Embedded Deep Learning based Word Prediction
An Embedded Deep Learning based Word Prediction
Seunghak Yu
Nilesh Kulkarni
Haejun Lee
J. Kim
42
0
0
06 Jul 2017
Model compression as constrained optimization, with application to
  neural nets. Part I: general framework
Model compression as constrained optimization, with application to neural nets. Part I: general framework
Miguel Á. Carreira-Perpiñán
MQ
20
32
0
05 Jul 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for
  Mobile Devices
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
AI4TS
81
6,792
0
04 Jul 2017
Structured Sparse Ternary Weight Coding of Deep Neural Networks for
  Efficient Hardware Implementations
Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations
Yoonho Boo
Wonyong Sung
MQ
30
11
0
01 Jul 2017
Irregular Convolutional Neural Networks
Irregular Convolutional Neural Networks
Jiabin Ma
Wei Wang
Liang Wang
39
12
0
24 Jun 2017
Balanced Quantization: An Effective and Efficient Approach to Quantized
  Neural Networks
Balanced Quantization: An Effective and Efficient Approach to Quantized Neural Networks
Shuchang Zhou
Yuzhi Wang
He Wen
Qinyao He
Yuheng Zou
MQ
30
110
0
22 Jun 2017
MEC: Memory-efficient Convolution for Deep Neural Network
MEC: Memory-efficient Convolution for Deep Neural Network
Minsik Cho
D. Brand
24
86
0
21 Jun 2017
Using Convolutional Neural Networks in Robots with Limited Computational
  Resources: Detecting NAO Robots while Playing Soccer
Using Convolutional Neural Networks in Robots with Limited Computational Resources: Detecting NAO Robots while Playing Soccer
Nicolás Cruz
Kenzo Lobos-Tsunekawa
Javier Ruiz-del-Solar
27
35
0
20 Jun 2017
An Entropy-based Pruning Method for CNN Compression
An Entropy-based Pruning Method for CNN Compression
Jian-Hao Luo
Jianxin Wu
19
180
0
19 Jun 2017
Sobolev Training for Neural Networks
Sobolev Training for Neural Networks
Wojciech M. Czarnecki
Simon Osindero
Max Jaderberg
G. Swirszcz
Razvan Pascanu
21
242
0
15 Jun 2017
LinkNet: Exploiting Encoder Representations for Efficient Semantic
  Segmentation
LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation
Abhishek Chaurasia
Eugenio Culurciello
SSeg
18
1,367
0
14 Jun 2017
Getting deep recommenders fit: Bloom embeddings for sparse binary
  input/output networks
Getting deep recommenders fit: Bloom embeddings for sparse binary input/output networks
Joan Serrà
Alexandros Karatzoglou
33
52
0
13 Jun 2017
SEP-Nets: Small and Effective Pattern Networks
SEP-Nets: Small and Effective Pattern Networks
Zhe Li
Xiaoyu Wang
Xutao Lv
Tianbao Yang
30
12
0
13 Jun 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of
  Convolutional Neural Networks
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks
Denis A. Gudovskiy
Luca Rigazio
MQ
27
52
0
07 Jun 2017
DeepIoT: Compressing Deep Neural Network Structures for Sensing Systems
  with a Compressor-Critic Framework
DeepIoT: Compressing Deep Neural Network Structures for Sensing Systems with a Compressor-Critic Framework
Shuochao Yao
Yiran Zhao
Aston Zhang
Lu Su
Tarek Abdelzaher
31
183
0
05 Jun 2017
IDK Cascades: Fast Deep Learning by Learning not to Overthink
IDK Cascades: Fast Deep Learning by Learning not to Overthink
Xin Wang
Yujia Luo
D. Crankshaw
Alexey Tumanov
Fisher Yu
Joseph E. Gonzalez
35
107
0
03 Jun 2017
MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU
MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU
Qingqing Cao
Niranjan Balasubramanian
A. Balasubramanian
26
61
0
03 Jun 2017
Tensor Contraction Layers for Parsimonious Deep Nets
Tensor Contraction Layers for Parsimonious Deep Nets
Jean Kossaifi
Aran Khanna
Zachary Chase Lipton
Tommaso Furlanello
Anima Anandkumar
37
60
0
01 Jun 2017
Deep Mutual Learning
Deep Mutual Learning
Ying Zhang
Tao Xiang
Timothy M. Hospedales
Huchuan Lu
FedML
77
1,640
0
01 Jun 2017
Learning Time/Memory-Efficient Deep Architectures with Budgeted Super
  Networks
Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks
Tom Véniat
Ludovic Denoyer
35
21
0
31 May 2017
Computation-Performance Optimization of Convolutional Neural Networks
  with Redundant Kernel Removal
Computation-Performance Optimization of Convolutional Neural Networks with Redundant Kernel Removal
Chih-Ting Liu
Yi-Heng Wu
Yu-Sheng Lin
Shao-Yi Chien
SupR
19
5
0
30 May 2017
Iterative Machine Teaching
Iterative Machine Teaching
Weiyang Liu
Bo Dai
Ahmad Humayun
C. Tay
Chen Yu
Linda B. Smith
James M. Rehg
Le Song
34
141
0
30 May 2017
GXNOR-Net: Training deep neural networks with ternary weights and
  activations without full-precision memory under a unified discretization
  framework
GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework
Lei Deng
Peng Jiao
Jing Pei
Zhenzhi Wu
Guoqi Li
MQ
34
20
0
25 May 2017
Exploring the Regularity of Sparse Structure in Convolutional Neural
  Networks
Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Huizi Mao
Song Han
Jeff Pool
Wenshuo Li
Xingyu Liu
Yu Wang
W. Dally
27
241
0
24 May 2017
Bayesian Compression for Deep Learning
Bayesian Compression for Deep Learning
Christos Louizos
Karen Ullrich
Max Welling
UQCV
BDL
23
479
0
24 May 2017
Previous
123...6566676869
Next