1510.00149
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Focus: Querying Large Video Datasets with Low Latency and Low Cost
Kevin Hsieh
Ganesh Ananthanarayanan
P. Bodík
P. Bahl
Matthai Philipose
Phillip B. Gibbons
O. Mutlu
98
280
0
10 Jan 2018
D-FilterMap for Deep Convolutional Neural Networks
Yingzhen Yang
Jianchao Yang
N. Xu
Wei Han
3DV
MQ
30
1
0
05 Jan 2018
Overcoming catastrophic forgetting with hard attention to the task
Joan Serrà
Dídac Surís
M. Miron
Alexandros Karatzoglou
CLL
201
1,087
0
04 Jan 2018
Learning a Wavelet-like Auto-Encoder to Accelerate Deep Neural Networks
Tianshui Chen
Liang Lin
W. Zuo
Xiaonan Luo
Lei Zhang
64
56
0
20 Dec 2017
DeepPicar: A Low-cost Deep Neural Network-based Autonomous Car
Michael Bechtel
Elise McEllhiney
Minje Kim
H. Yun
91
103
0
19 Dec 2017
Squeezed Convolutional Variational AutoEncoder for Unsupervised Anomaly Detection in Edge Device Industrial Internet of Things
Dohyung Kim
Hyochang Yang
Minki Chung
Sungzoon Cho
DRL
57
32
0
18 Dec 2017
Automated flow for compressing convolution neural networks for efficient edge-computation with FPGA
F. Shafiq
Takato Yamada
Antonio T. Vilchez
Sakyasingha Dasgupta
MQ
46
3
0
18 Dec 2017
clcNet: Improving the Efficiency of Convolutional Neural Network using Channel Local Convolutions
Dong-Qing Zhang
60
10
0
17 Dec 2017
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
294
3,159
0
15 Dec 2017
BT-Nets: Simplifying Deep Neural Networks via Block Term Decomposition
Guangxi Li
Jinmian Ye
Haiqin Yang
Di Chen
Shuicheng Yan
Zenglin Xu
68
11
0
15 Dec 2017
FFT-Based Deep Learning Deployment in Embedded Systems
Sheng Lin
Ning Liu
M. Nazemi
Hongjia Li
Caiwen Ding
Yanzhi Wang
Massoud Pedram
62
54
0
13 Dec 2017
NestedNet: Learning Nested Sparse Structures in Deep Neural Networks
Eunwoo Kim
Chanho Ahn
Songhwai Oh
52
2
0
11 Dec 2017
AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training
Chia-Yu Chen
Jungwook Choi
D. Brand
A. Agrawal
Wei Zhang
K. Gopalakrishnan
ODL
81
174
0
07 Dec 2017
Automated Pruning for Deep Neural Network Compression
Franco Manessi
A. Rozza
Simone Bianco
Paolo Napoletano
Raimondo Schettini
94
57
0
05 Dec 2017
Learning Sparse Neural Networks through $L_0$ Regularization
Christos Louizos
Max Welling
Diederik P. Kingma
561
1,150
0
04 Dec 2017
Adaptive Quantization for Deep Neural Network
Yiren Zhou
Seyed-Mohsen Moosavi-Dezfooli
Ngai-Man Cheung
P. Frossard
MQ
102
185
0
04 Dec 2017
Homomorphic Parameter Compression for Distributed Deep Learning Training
Jaehee Jang
Byunggook Na
Sungroh Yoon
FedML
57
1
0
28 Nov 2017
WSNet: Compact and Efficient Networks Through Weight Sampling
Xiaojie Jin
Yingzhen Yang
N. Xu
Jianchao Yang
Nebojsa Jojic
Jiashi Feng
Shuicheng Yan
49
2
0
28 Nov 2017
Slim Embedding Layers for Recurrent Neural Language Models
Zhongliang Li
Raymond Kulhanek
Shaojun Wang
Yunxin Zhao
Shuang Wu
KELM
76
23
0
27 Nov 2017
SkipNet: Learning Dynamic Routing in Convolutional Networks
Xin Wang
Feng Yu
Zi-Yi Dou
Trevor Darrell
Joseph E. Gonzalez
154
640
0
26 Nov 2017
CondenseNet: An Efficient DenseNet using Learned Group Convolutions
Gao Huang
Shichen Liu
Laurens van der Maaten
Kilian Q. Weinberger
146
800
0
25 Nov 2017
Deep Expander Networks: Efficient Deep Networks from Graph Theory
Ameya Prabhu
G. Varma
A. Namboodiri
GNN
142
72
0
23 Nov 2017
BlockDrop: Dynamic Inference Paths in Residual Networks
Zuxuan Wu
Tushar Nagarajan
Abhishek Kumar
Steven J. Rennie
L. Davis
Kristen Grauman
Rogerio Feris
117
470
0
22 Nov 2017
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions
Bichen Wu
Alvin Wan
Xiangyu Yue
Peter H. Jin
Sicheng Zhao
Noah Golmant
A. Gholaminejad
Joseph E. Gonzalez
Kurt Keutzer
3DPC
121
365
0
22 Nov 2017
Evaluating Robustness of Neural Networks with Mixed Integer Programming
Vincent Tjeng
Kai Y. Xiao
Russ Tedrake
AAML
127
117
0
20 Nov 2017
Interleaver Design for Deep Neural Networks
Sourya Dey
Peter A. Beerel
K. Chugg
36
6
0
18 Nov 2017
Training Simplification and Model Simplification for Deep Learning: A Minimal Effort Back Propagation Method
Xu Sun
Xuancheng Ren
Shuming Ma
Bingzhen Wei
Wei Li
Jingjing Xu
Houfeng Wang
Yi Zhang
58
24
0
17 Nov 2017
Improved Bayesian Compression
Marco Federici
Karen Ullrich
Max Welling
UQCV
BDL
79
19
0
17 Nov 2017
Mobile Video Object Detection with Temporally-Aware Feature Maps
Mason Liu
Menglong Zhu
ObjD
89
197
0
17 Nov 2017
NISP: Pruning Networks using Neuron Importance Score Propagation
Ruichi Yu
Ang Li
Chun-Fu Chen
Jui-Hsin Lai
Vlad I. Morariu
Xintong Han
M. Gao
Ching-Yung Lin
L. Davis
78
801
0
16 Nov 2017
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy
Asit K. Mishra
Debbie Marr
FedML
93
331
0
15 Nov 2017
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler
Yu Ji
Youhui Zhang
Wenguang Chen
Yuan Xie
102
56
0
15 Nov 2017
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning
Arun Mallya
Svetlana Lazebnik
CLL
166
1,313
0
15 Nov 2017
Deep Rewiring: Training very sparse deep networks
G. Bellec
David Kappel
Wolfgang Maass
Robert Legenstein
BDL
210
281
0
14 Nov 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation
Moritz B. Milde
Daniel Neil
Alessandro Aimar
T. Delbruck
Giacomo Indiveri
MQ
78
10
0
13 Nov 2017
Weightless: Lossy Weight Encoding For Deep Neural Network Compression
Brandon Reagen
Udit Gupta
Bob Adolf
Michael Mitzenmacher
Alexander M. Rush
Gu-Yeon Wei
David Brooks
63
38
0
13 Nov 2017
CT-SRCNN: Cascade Trained and Trimmed Deep Convolutional Neural Networks for Image Super Resolution
Haoyu Ren
Mostafa El-Khamy
Jungwon Lee
SupR
59
27
0
11 Nov 2017
Learning K-way D-dimensional Discrete Code For Compact Embedding Representations
Ting Chen
Martin Renqiang Min
Yizhou Sun
80
10
0
08 Nov 2017
Revealing structure components of the retina by deep learning networks
Qianyu Yan
Zhaofei Yu
Feng Chen
Jian K. Liu
FAtt
36
7
0
08 Nov 2017
Block-Sparse Recurrent Neural Networks
Sharan Narang
Eric Undersander
G. Diamos
69
139
0
08 Nov 2017
Compression-aware Training of Deep Networks
J. Álvarez
Mathieu Salzmann
82
172
0
07 Nov 2017
Moonshine: Distilling with Cheap Convolutions
Elliot J. Crowley
Gavia Gray
Amos Storkey
89
121
0
07 Nov 2017
Interpreting Convolutional Neural Networks Through Compression
R. Abbasi-Asl
Bin Yu
FAtt
49
21
0
07 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks
Sanchari Sen
Shubham Jain
Swagath Venkataramani
A. Raghunathan
62
30
0
07 Nov 2017
Characterizing Sparse Connectivity Patterns in Neural Networks
Sourya Dey
Kuan-Wen Huang
Peter A. Beerel
K. Chugg
63
11
0
06 Nov 2017
Neural Speed Reading via Skim-RNN
Minjoon Seo
Sewon Min
Ali Farhadi
Hannaneh Hajishirzi
101
79
0
06 Nov 2017
Accelerating Training of Deep Neural Networks via Sparse Edge Processing
Sourya Dey
Yinan Shao
K. Chugg
Peter A. Beerel
70
16
0
03 Nov 2017
ReBNet: Residual Binarized Neural Network
M. Ghasemzadeh
Mohammad Samragh
F. Koushanfar
MQ
47
4
0
03 Nov 2017
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting Input and Output Sparsity
Jingyang Zhu
Jingbo Jiang
Xizi Chen
Chi-Ying Tsui
65
36
0
03 Nov 2017
Compressing Word Embeddings via Deep Compositional Code Learning
Raphael Shu
Hideki Nakayama
106
129
0
03 Nov 2017