1510.00149
Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,448 papers shown
Title
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network Simulation Expansion and STDP Convergence Predictions
Toby Lightheart
S. Grainger
Tien-Fu Lu
8
0
0
30 Aug 2017
Performance Guaranteed Network Acceleration via High-Order Residual Quantization
Zefan Li
Bingbing Ni
Wenjun Zhang
Xiaokang Yang
Wen Gao
MQ
32
107
0
29 Aug 2017
CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-Circulant Weight Matrices
Caiwen Ding
Siyu Liao
Yanzhi Wang
Zhe Li
Ning Liu
...
Yipeng Zhang
Jian Tang
Qinru Qiu
Xinyu Lin
Bo Yuan
GNN
32
259
0
29 Aug 2017
Deep Learning Sparse Ternary Projections for Compressed Sensing of Images
Duc Minh Nguyen
Evaggelia Tsiligianni
Nikos Deligiannis
13
26
0
28 Aug 2017
The Convergence of Machine Learning and Communications
Wojciech Samek
S. Stańczak
Thomas Wiegand
AI4CE
32
29
0
28 Aug 2017
Learning Efficient Convolutional Networks through Network Slimming
Zhuang Liu
Jianguo Li
Zhiqiang Shen
Gao Huang
Shoumeng Yan
Changshui Zhang
70
2,391
0
22 Aug 2017
Neural Networks Compression for Language Modeling
Artem M. Grachev
D. Ignatov
Andrey V. Savchenko
22
30
0
20 Aug 2017
Deep Neural Network Capacity
Aosen Wang
Huan Zhou
Wenyao Xu
Xin Chen
13
4
0
16 Aug 2017
BitNet: Bit-Regularized Deep Neural Networks
Aswin Raghavan
Mohamed R. Amer
S. Chai
Graham Taylor
MQ
38
10
0
16 Aug 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices
Dawei Li
Xiaolong Wang
Deguang Kong
31
97
0
16 Aug 2017
Enabling Massive Deep Neural Networks with the GraphBLAS
J. Kepner
Manoj Kumar
José Moreira
P. Pattnaik
M. Serrano
H. Tufo
GNN
22
33
0
09 Aug 2017
Prune the Convolutional Neural Networks with Sparse Shrink
Xuzhao Li
Changsong Liu
CVBM
11
4
0
08 Aug 2017
Natural Language Processing with Small Feed-Forward Networks
Jan A. Botha
Emily Pitler
Ji Ma
A. Bakalov
Alexandru Salcianu
David J. Weiss
Ryan T. McDonald
Slav Petrov
HAI
30
38
0
01 Aug 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform
Chaim Baskin
Natan Liss
Evgenii Zheltonozhskii
A. Bronstein
A. Mendelson
GNN
MQ
45
35
0
31 Jul 2017
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization
Frederick Tung
S. Muralidharan
Greg Mori
35
35
0
28 Jul 2017
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks
Nikolaos Passalis
Anastasios Tefas
20
70
0
25 Jul 2017
Towards Evolutional Compression
Yunhe Wang
Chang Xu
Jiayan Qiu
Chao Xu
Dacheng Tao
22
14
0
25 Jul 2017
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM
Cong Leng
Hao Li
Shenghuo Zhu
Rong Jin
MQ
38
286
0
24 Jul 2017
Neuron Pruning for Compressing Deep Networks using Maxout Architectures
Fernando Moya Rueda
René Grzeszick
G. Fink
CVBM
22
17
0
21 Jul 2017
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
Jian-Hao Luo
Jianxin Wu
Weiyao Lin
19
1,746
0
20 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian Sun
128
2,508
0
19 Jul 2017
Pruning Convolutional Neural Networks for Image Instance Retrieval
Gaurav Manek
Jie Lin
V. Chandrasekhar
Ling-Yu Duan
Sateesh Giduthuri
Xiaoli Li
T. Poggio
30
2
0
18 Jul 2017
Fast and Accurate Image Super Resolution by Deep CNN with Skip Connection and Network in Network
Jin Yamanaka
S. Kuwashima
Takio Kurita
SupR
33
213
0
18 Jul 2017
Ternary Residual Networks
Abhisek Kundu
K. Banerjee
Naveen Mellempudi
Dheevatsa Mudigere
Dipankar Das
Bharat Kaul
Pradeep Dubey
34
8
0
15 Jul 2017
Interleaved Group Convolutions for Deep Neural Networks
Ting Zhang
Guo-Jun Qi
Bin Xiao
Jingdong Wang
36
81
0
10 Jul 2017
An Embedded Deep Learning based Word Prediction
Seunghak Yu
Nilesh Kulkarni
Haejun Lee
J. Kim
42
0
0
06 Jul 2017
Model compression as constrained optimization, with application to neural nets. Part I: general framework
Miguel Á. Carreira-Perpiñán
MQ
20
32
0
05 Jul 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
AI4TS
81
6,792
0
04 Jul 2017
Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations
Yoonho Boo
Wonyong Sung
MQ
30
11
0
01 Jul 2017
Irregular Convolutional Neural Networks
Jiabin Ma
Wei Wang
Liang Wang
39
12
0
24 Jun 2017
Balanced Quantization: An Effective and Efficient Approach to Quantized Neural Networks
Shuchang Zhou
Yuzhi Wang
He Wen
Qinyao He
Yuheng Zou
MQ
30
110
0
22 Jun 2017
MEC: Memory-efficient Convolution for Deep Neural Network
Minsik Cho
D. Brand
24
86
0
21 Jun 2017
Using Convolutional Neural Networks in Robots with Limited Computational Resources: Detecting NAO Robots while Playing Soccer
Nicolás Cruz
Kenzo Lobos-Tsunekawa
Javier Ruiz-del-Solar
27
35
0
20 Jun 2017
An Entropy-based Pruning Method for CNN Compression
Jian-Hao Luo
Jianxin Wu
19
180
0
19 Jun 2017
Sobolev Training for Neural Networks
Wojciech M. Czarnecki
Simon Osindero
Max Jaderberg
G. Swirszcz
Razvan Pascanu
21
242
0
15 Jun 2017
LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation
Abhishek Chaurasia
Eugenio Culurciello
SSeg
18
1,367
0
14 Jun 2017
Getting deep recommenders fit: Bloom embeddings for sparse binary input/output networks
Joan Serrà
Alexandros Karatzoglou
33
52
0
13 Jun 2017
SEP-Nets: Small and Effective Pattern Networks
Zhe Li
Xiaoyu Wang
Xutao Lv
Tianbao Yang
30
12
0
13 Jun 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks
Denis A. Gudovskiy
Luca Rigazio
MQ
27
52
0
07 Jun 2017
DeepIoT: Compressing Deep Neural Network Structures for Sensing Systems with a Compressor-Critic Framework
Shuochao Yao
Yiran Zhao
Aston Zhang
Lu Su
Tarek Abdelzaher
31
183
0
05 Jun 2017
IDK Cascades: Fast Deep Learning by Learning not to Overthink
Xin Wang
Yujia Luo
D. Crankshaw
Alexey Tumanov
Fisher Yu
Joseph E. Gonzalez
35
107
0
03 Jun 2017
MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU
Qingqing Cao
Niranjan Balasubramanian
A. Balasubramanian
26
61
0
03 Jun 2017
Tensor Contraction Layers for Parsimonious Deep Nets
Jean Kossaifi
Aran Khanna
Zachary Chase Lipton
Tommaso Furlanello
Anima Anandkumar
37
60
0
01 Jun 2017
Deep Mutual Learning
Ying Zhang
Tao Xiang
Timothy M. Hospedales
Huchuan Lu
FedML
77
1,640
0
01 Jun 2017
Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks
Tom Véniat
Ludovic Denoyer
35
21
0
31 May 2017
Computation-Performance Optimization of Convolutional Neural Networks with Redundant Kernel Removal
Chih-Ting Liu
Yi-Heng Wu
Yu-Sheng Lin
Shao-Yi Chien
SupR
19
5
0
30 May 2017
Iterative Machine Teaching
Weiyang Liu
Bo Dai
Ahmad Humayun
C. Tay
Chen Yu
Linda B. Smith
James M. Rehg
Le Song
34
141
0
30 May 2017
GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework
Lei Deng
Peng Jiao
Jing Pei
Zhenzhi Wu
Guoqi Li
MQ
34
20
0
25 May 2017
Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Huizi Mao
Song Han
Jeff Pool
Wenshuo Li
Xingyu Liu
Yu Wang
W. Dally
27
241
0
24 May 2017
Bayesian Compression for Deep Learning
Christos Louizos
Karen Ullrich
Max Welling
UQCV
BDL
23
479
0
24 May 2017