Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
Recurrent Convolution for Compact and Cost-Adjustable Neural Networks: An Empirical Study
Zhendong Zhang
Cheolkon Jung
31
2
0
26 Feb 2019
Saec: Similarity-Aware Embedding Compression in Recommendation Systems
Xiaorui Wu
Hong Xu
Honglin Zhang
Huaming Chen
Jian Wang
50
15
0
26 Feb 2019
Learning Implicitly Recurrent CNNs Through Parameter Sharing
Pedro H. P. Savarese
Michael Maire
96
70
0
26 Feb 2019
STFNets: Learning Sensing Signals from the Time-Frequency Perspective with Short-Time Fourier Neural Networks
Shuochao Yao
Ailing Piao
Wenjun Jiang
Yiran Zhao
Huajie Shao
...
Tianshi Wang
Shaohan Hu
Lu Su
Jiawei Han
Tarek Abdelzaher
AI4TS
77
79
0
21 Feb 2019
Deep Multi-modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges
Di Feng
Christian Haase-Schuetz
Lars Rosenbaum
Heinz Hertlein
Claudius Gläser
Fabian Duffhauss
W. Wiesbeck
Klaus C. J. Dietmayer
3DPC
192
1,014
0
21 Feb 2019
Jointly Sparse Convolutional Neural Networks in Dual Spatial-Winograd Domains
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
46
6
0
21 Feb 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
110
366
0
18 Feb 2019
Mockingbird: Defending Against Deep-Learning-Based Website Fingerprinting Attacks with Adversarial Traces
Mohammad Saidur Rahman
Mohsen Imani
Nate Mathews
M. Wright
AAML
86
81
0
18 Feb 2019
Parameter Efficient Training of Deep Convolutional Neural Networks by Dynamic Sparse Reparameterization
Hesham Mostafa
Xin Wang
129
315
0
15 Feb 2019
Superposition of many models into one
Brian Cheung
A. Terekhov
Yubei Chen
Pulkit Agrawal
Bruno A. Olshausen
MoMe
94
116
0
14 Feb 2019
MultiGrain: a unified image embedding for classes and instances
Maxim Berman
Hervé Jégou
Andrea Vedaldi
Iasonas Kokkinos
Matthijs Douze
79
112
0
14 Feb 2019
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting
Justice Amoh
K. Odame
72
18
0
13 Feb 2019
Structured Bayesian Compression for Deep models in mobile enabled devices for connected healthcare
Sijia Chen
Bin Song
Xiaojiang Du
Nadra Guizani
HAI
MedIm
26
2
0
13 Feb 2019
Fast-SCNN: Fast Semantic Segmentation Network
Rudra P. K. Poudel
Stephan Liwicki
R. Cipolla
SSeg
64
516
0
12 Feb 2019
Effective Network Compression Using Simulation-Guided Iterative Pruning
Dae-Woong Jeong
Jaehun Kim
Youngseok Kim
Tae-Ho Kim
Myungsu Chae
34
0
0
12 Feb 2019
Energy-recycling Blockchain with Proof-of-Deep-Learning
Changhao Chenli
Boyang Li
Yiyu Shi
Taeho Jung
42
57
0
11 Feb 2019
Model Compression with Adversarial Robustness: A Unified Optimization Framework
Shupeng Gui
Haotao Wang
Chen Yu
Haichuan Yang
Zhangyang Wang
Ji Liu
MQ
86
139
0
10 Feb 2019
Improved Knowledge Distillation via Teacher Assistant
Seyed Iman Mirzadeh
Mehrdad Farajtabar
Ang Li
Nir Levine
Akihiro Matsukawa
H. Ghasemzadeh
125
1,092
0
09 Feb 2019
Architecture Compression
A. Ashok
32
0
0
08 Feb 2019
FSNet: Compression of Deep Convolutional Neural Networks by Filter Summary
Yingzhen Yang
Jiahui Yu
Nebojsa Jojic
Jun Huan
Thomas S. Huang
63
17
0
08 Feb 2019
Radial and Directional Posteriors for Bayesian Neural Networks
Changyong Oh
Kamil Adamczewski
Mijung Park
BDL
115
20
0
07 Feb 2019
Compression of Recurrent Neural Networks for Efficient Language Modeling
Artem M. Grachev
D. Ignatov
Andrey V. Savchenko
67
39
0
06 Feb 2019
Are All Layers Created Equal?
Chiyuan Zhang
Samy Bengio
Y. Singer
111
140
0
06 Feb 2019
Multi-Kernel Prediction Networks for Denoising of Burst Images
Talmaj Marinc
Vignesh Srinivasan
Serhan Gül
C. Hellge
Wojciech Samek
112
27
0
05 Feb 2019
Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization
Eldad Meller
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
81
87
0
05 Feb 2019
ROMANet: Fine-Grained Reuse-Driven Off-Chip Memory Access Management and Data Organization for Deep Neural Network Accelerators
Rachmad Vidya Wicaksana Putra
Muhammad Abdullah Hanif
Mohamed Bennai
72
22
0
04 Feb 2019
BottleNet: A Deep Learning Architecture for Intelligent Mobile Cloud Computing Services
Amir Erfan Eshratifar
Amirhossein Esmaili
Massoud Pedram
107
179
0
04 Feb 2019
MICIK: MIning Cross-Layer Inherent Similarity Knowledge for Deep Model Compression
Jie Zhang
Xiaolong Wang
Dawei Li
Shalini Ghosh
Abhishek Kolagunda
Yalin Wang
43
0
0
03 Feb 2019
Self-Binarizing Networks
Fayez Lahoud
R. Achanta
Pablo Márquez-Neila
Sabine Süsstrunk
MQ
75
23
0
02 Feb 2019
Compressing Gradient Optimizers via Count-Sketches
Ryan Spring
Anastasios Kyrillidis
Vijai Mohan
Anshumali Shrivastava
60
36
0
01 Feb 2019
Towards Collaborative Intelligence Friendly Architectures for Deep Learning
Amir Erfan Eshratifar
Amirhossein Esmaili
Massoud Pedram
83
27
0
01 Feb 2019
On Correlation of Features Extracted by Deep Neural Networks
B. Ayinde
T. Inanc
J. Zurada
60
25
0
30 Jan 2019
Tensorized Embedding Layers for Efficient Model Compression
Oleksii Hrinchuk
Valentin Khrulkov
L. Mirvakhabova
Elena Orlova
Ivan Oseledets
93
73
0
30 Jan 2019
Doubly Sparse: Sparse Mixture of Sparse Experts for Efficient Softmax Inference
Shun Liao
Ting Chen
Tian Lin
Denny Zhou
Chong-Jun Wang
MoE
16
2
0
30 Jan 2019
A Simple Method to Reduce Off-chip Memory Accesses on Convolutional Neural Networks
Doyun Kim
Kyoung-Young Kim
Sangsoo Ko
Sanghyuck Ha
28
5
0
28 Jan 2019
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
184
312
0
28 Jan 2019
Information-Theoretic Understanding of Population Risk Improvement with Model Compression
Yuheng Bu
Weihao Gao
Shaofeng Zou
Venugopal V. Veeravalli
MedIm
61
15
0
27 Jan 2019
PruneTrain: Fast Neural Network Training by Dynamic Sparse Model Reconfiguration
Sangkug Lym
Esha Choukse
Siavash Zangeneh
W. Wen
Sujay Sanghavi
M. Erez
CVBM
85
88
0
26 Jan 2019
DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression
Sian Jin
Sheng Di
Xin Liang
Jiannan Tian
Dingwen Tao
Franck Cappello
AI4CE
76
61
0
26 Jan 2019
Really should we pruning after model be totally trained? Pruning based on a small amount of training
Li Yue
Zhao Weibin
Shang-Te Lin
VLM
29
5
0
24 Jan 2019
QGAN: Quantized Generative Adversarial Networks
Peiqi Wang
Dongsheng Wang
Yu Ji
Xinfeng Xie
Haoxuan Song
XuXin Liu
Yongqiang Lyu
Yuan Xie
GAN
MQ
53
32
0
24 Jan 2019
Backprop with Approximate Activations for Memory-efficient Network Training
Ayan Chakrabarti
Benjamin Moseley
70
38
0
23 Jan 2019
Towards Compact ConvNets via Structure-Sparsity Regularized Filter Pruning
Shaohui Lin
Rongrong Ji
Yuchao Li
Cheng Deng
Xuelong Li
114
69
0
23 Jan 2019
Partition Pruning: Parallelization-Aware Pruning for Deep Neural Networks
Sina Shahhosseini
Ahmad Albaqsami
Masoomeh Jasemi
N. Bagherzadeh
22
8
0
21 Jan 2019
Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going
Erwei Wang
James J. Davis
Ruizhe Zhao
Ho-Cheung Ng
Xinyu Niu
Wayne Luk
P. Cheung
George A. Constantinides
88
59
0
21 Jan 2019
Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks
Charbel Sakr
Naigang Wang
Chia-Yu Chen
Jungwook Choi
A. Agrawal
Naresh R Shanbhag
K. Gopalakrishnan
MQ
76
34
0
19 Jan 2019
Feature Pyramid and Hierarchical Boosting Network for Pavement Crack Detection
Fan Yang
Lei Zhang
Sijia Yu
Danil Prokhorov
Xue Mei
Haibin Ling
98
730
0
18 Jan 2019
A Survey of the Recent Architectures of Deep Convolutional Neural Networks
Asifullah Khan
A. Sohail
Umme Zahoora
Aqsa Saeed Qureshi
OOD
202
2,325
0
17 Jan 2019
CodeX: Bit-Flexible Encoding for Streaming-based FPGA Acceleration of DNNs
Mohammad Samragh
Mojan Javaheripi
F. Koushanfar
54
11
0
17 Jan 2019
Light-weighted Saliency Detection with Distinctively Lower Memory Cost and Model Size
Shanghua Xiao
54
1
0
15 Jan 2019
Previous
1
2
3
...
54
55
56
...
68
69
70
Next