Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,449 papers shown
Title
Optimize Deep Convolutional Neural Network with Ternarized Weights and High Accuracy
Zhezhi He
Boqing Gong
Deliang Fan
24
22
0
20 Jul 2018
Statistical Model Compression for Small-Footprint Natural Language Understanding
Grant P. Strimel
Kanthashree Mysore Sathyendra
Stanislav Peshterliev
32
9
0
19 Jul 2018
Defend Deep Neural Networks Against Adversarial Examples via Fixed and Dynamic Quantized Activation Functions
Adnan Siraj Rakin
Jinfeng Yi
Boqing Gong
Deliang Fan
AAML
MQ
24
50
0
18 Jul 2018
BRIEF: Backward Reduction of CNNs with Information Flow Analysis
Yu-Hsun Lin
Chun-Nan Chou
Edward Y. Chang
14
0
0
16 Jul 2018
Morse Code Datasets for Machine Learning
Sourya Dey
K. Chugg
Peter A. Beerel
18
10
0
11 Jul 2018
Make
ℓ
1
\ell_1
ℓ
1
Regularization Effective in Training Sparse CNN
Juncai He
Xiaodong Jia
Jinchao Xu
Lian Zhang
Liang Zhao
27
5
0
11 Jul 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin
Alessandro Pappalardo
M. M. Ghaffar
Giulio Gambardella
Norbert Wehn
Michaela Blott
27
72
0
11 Jul 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
20
96
0
10 Jul 2018
Auto Deep Compression by Reinforcement Learning Based Actor-Critic Structure
Hamed Hakkak
OffRL
AI4CE
20
1
0
08 Jul 2018
Anytime Neural Prediction via Slicing Networks Vertically
Hankook Lee
Jinwoo Shin
AI4CE
27
16
0
07 Jul 2018
Sparse Deep Neural Network Exact Solutions
J. Kepner
V. Gadepally
Hayden Jananthan
Lauren Milechin
S. Samsi
24
14
0
06 Jul 2018
SGAD: Soft-Guided Adaptively-Dropped Neural Network
Zhisheng Wang
Fangxuan Sun
Jun Lin
Zhongfeng Wang
Bo Yuan
19
7
0
04 Jul 2018
Restructuring Batch Normalization to Accelerate CNN Training
Wonkyung Jung
Daejin Jung
and Byeongho Kim
Sunjung Lee
Wonjong Rhee
Jung Ho Ahn
26
62
0
04 Jul 2018
Confidential Inference via Ternary Model Partitioning
Zhongshu Gu
Heqing Huang
Jialong Zhang
D. Su
Hani Jamjoom
Ankita Lamba
Dimitrios E. Pendarakis
Ian Molloy
24
53
0
03 Jul 2018
Stochastic Layer-Wise Precision in Deep Neural Networks
Griffin Lacey
Graham W. Taylor
S. Areibi
42
18
0
03 Jul 2018
Weight-importance sparse training in keyword spotting
Sihao Xue
Zhenyi Ying
Fan Mo
Min Wang
Jue Sun
17
0
0
02 Jul 2018
Evenly Cascaded Convolutional Networks
Chengxi Ye
Chinmaya Devaraj
Michael Maynord
Cornelia Fermuller
Yiannis Aloimonos
18
7
0
02 Jul 2018
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
MQ
33
133
0
01 Jul 2018
Automatic Rank Selection for High-Speed Convolutional Neural Network
Hyeji Kim
C. Kyung
27
5
0
28 Jun 2018
DeepObfuscation: Securing the Structure of Convolutional Neural Networks via Knowledge Distillation
Hui Xu
Yuxin Su
Zirui Zhao
Yangfan Zhou
Michael R. Lyu
Irwin King
FedML
13
26
0
27 Jun 2018
Deep
k
k
k
-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions
Junru Wu
Yue Wang
Zhenyu Wu
Zhangyang Wang
Ashok Veeraraghavan
Yingyan Lin
15
115
0
24 Jun 2018
Constructing Deep Neural Networks by Bayesian Network Structure Learning
R. Y. Rohekar
Shami Nisimov
Yaniv Gurwicz
G. Koren
Gal Novik
BDL
36
26
0
24 Jun 2018
Compact Deep Neural Networks for Computationally Efficient Gesture Classification From Electromyography Signals
A. Hartwell
V. Kadirkamanathan
S. Anderson
11
17
0
22 Jun 2018
Deploying Deep Neural Networks in the Embedded Space
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
32
13
0
22 Jun 2018
Efficient Semantic Segmentation using Gradual Grouping
Nikitha Vallurupalli
Sriharsha Annamaneni
G. Varma
C. V. Jawahar
Manu Mathew
S. Nagori
SSeg
14
12
0
22 Jun 2018
Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations
Ting-Li Chen
Martin Renqiang Min
Yizhou Sun
26
70
0
21 Jun 2018
Quantizing deep convolutional networks for efficient inference: A whitepaper
Raghuraman Krishnamoorthi
MQ
53
999
0
21 Jun 2018
Rethinking Machine Learning Development and Deployment for Edge Devices
Liangzhen Lai
Naveen Suda
19
10
0
20 Jun 2018
Edge Intelligence: On-Demand Deep Learning Model Co-Inference with Device-Edge Synergy
En Li
Zhi Zhou
Xu Chen
27
325
0
20 Jun 2018
Doubly Nested Network for Resource-Efficient Inference
Jaehong Kim
Sungeun Hong
Yongseok Choi
Jiwon Kim
21
5
0
20 Jun 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
35
136
0
20 Jun 2018
GroupReduce: Block-Wise Low-Rank Approximation for Neural Language Model Shrinking
Patrick H. Chen
Si Si
Yang Li
Ciprian Chelba
Cho-Jui Hsieh
24
67
0
18 Jun 2018
Fast Convex Pruning of Deep Neural Networks
Alireza Aghasi
Afshin Abdi
Justin Romberg
29
24
0
17 Jun 2018
On Machine Learning and Structure for Mobile Robots
Markus Wulfmeier
27
6
0
15 Jun 2018
Three dimensional Deep Learning approach for remote sensing image classification
A. Ben Hamida
A. Benoît
P. Lambert
C. Ben Amar
63
570
0
15 Jun 2018
RAPIDNN: In-Memory Deep Neural Network Acceleration Framework
Mohsen Imani
Mohammad Samragh
Yeseong Kim
Saransh Gupta
F. Koushanfar
Tajana Simunic
24
51
0
15 Jun 2018
Deep Learning Approximation: Zero-Shot Neural Network Speedup
Michele Pratusevich
30
0
0
15 Jun 2018
Insights on representational similarity in neural networks with canonical correlation
Ari S. Morcos
M. Raghu
Samy Bengio
DRL
32
435
0
14 Jun 2018
PCAS: Pruning Channels with Attention Statistics for Deep Network Compression
Kohei Yamamoto
K. Maeno
24
32
0
14 Jun 2018
Scalable Neural Network Compression and Pruning Using Hard Clustering and L1 Regularization
Yibo Yang
Nicholas Ruozzi
Vibhav Gogate
21
2
0
14 Jun 2018
The streaming rollout of deep networks - towards fully model-parallel execution
Volker Fischer
Jan M. Köhler
Thomas Pfeil
32
16
0
13 Jun 2018
Knowledge Distillation by On-the-Fly Native Ensemble
Xu Lan
Xiatian Zhu
S. Gong
214
474
0
12 Jun 2018
Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking
Haichuan Yang
Yuhao Zhu
Ji Liu
CVBM
21
36
0
12 Jun 2018
Full deep neural network training on a pruned weight budget
Maximilian Golub
G. Lemieux
Mieszko Lis
33
28
0
11 Jun 2018
Smallify: Learning Network Size while Training
Guillaume Leclerc
Manasi Vartak
Raul Castro Fernandez
Tim Kraska
Samuel Madden
14
13
0
10 Jun 2018
TAPAS: Tricks to Accelerate (encrypted) Prediction As a Service
Amartya Sanyal
Matt J. Kusner
Adria Gascon
Varun Kanade
FedML
27
126
0
09 Jun 2018
Slalom: Fast, Verifiable and Private Execution of Neural Networks in Trusted Hardware
Florian Tramèr
Dan Boneh
FedML
114
396
0
08 Jun 2018
EasyConvPooling: Random Pooling with Easy Convolution for Accelerating Training and Testing
Jianzhong Sheng
Chuanbo Chen
Chenchen Fu
Chun Jason Xue
30
4
0
05 Jun 2018
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark
Cody Coleman
Daniel Kang
Deepak Narayanan
Luigi Nardi
Tian Zhao
Jian Zhang
Peter Bailis
K. Olukotun
Christopher Ré
Matei A. Zaharia
18
117
0
04 Jun 2018
Dynamically Hierarchy Revolution: DirNet for Compressing Recurrent Neural Network on Mobile Devices
Jie Zhang
Xiaolong Wang
Dawei Li
Yalin Wang
6
14
0
04 Jun 2018
Previous
1
2
3
...
59
60
61
...
67
68
69
Next