Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
48 / 3,448 papers shown
Title
Learning Structured Sparsity in Deep Neural Networks
W. Wen
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
49
2,323
0
12 Aug 2016
Faster CNNs with Direct Sparse Convolutions and Guided Pruning
Jongsoo Park
Sheng Li
W. Wen
P. T. P. Tang
Hai Helen Li
Yiran Chen
Pradeep Dubey
39
182
0
04 Aug 2016
Local Feature Detectors, Descriptors, and Image Representations: A Survey
Yusuke Uchida
40
13
0
28 Jul 2016
Training Skinny Deep Neural Networks with Iterative Hard Thresholding Methods
Xiaojie Jin
Xiao-Tong Yuan
Jiashi Feng
Shuicheng Yan
16
78
0
19 Jul 2016
On the efficient representation and execution of deep acoustic models
R. Álvarez
Rohit Prabhavalkar
A. Bakhtin
MQ
27
55
0
15 Jul 2016
DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Song Han
Jeff Pool
Sharan Narang
Huizi Mao
Enhao Gong
...
Peter Vajda
Manohar Paluri
J. Tran
Bryan Catanzaro
W. Dally
CVBM
27
83
0
15 Jul 2016
Accelerating Eulerian Fluid Simulation With Convolutional Networks
Jonathan Tompson
Kristofer Schlachter
Pablo Sprechmann
Ken Perlin
58
530
0
13 Jul 2016
Network Trimming: A Data-Driven Neuron Pruning Approach towards Efficient Deep Architectures
Hengyuan Hu
Rui Peng
Yu-Wing Tai
Chi-Keung Tang
32
881
0
12 Jul 2016
Intra-layer Nonuniform Quantization for Deep Convolutional Neural Network
Fangxuan Sun
Jun Lin
Zhongfeng Wang
MQ
20
3
0
10 Jul 2016
Overcoming Challenges in Fixed Point Training of Deep Convolutional Networks
D. Lin
S. Talathi
34
45
0
08 Jul 2016
Compression of Neural Machine Translation Models via Pruning
A. See
Minh-Thang Luong
Christopher D. Manning
MedIm
VLM
29
221
0
29 Jun 2016
Sequence-Level Knowledge Distillation
Yoon Kim
Alexander M. Rush
47
1,101
0
25 Jun 2016
Precise neural network computation with imprecise analog devices
Jonathan Binas
Daniel Neil
Giacomo Indiveri
Shih-Chii Liu
Michael Pfeiffer
31
11
0
23 Jun 2016
DropNeuron: Simplifying the Structure of Deep Neural Networks
W. Pan
Hao Dong
Yike Guo
24
35
0
23 Jun 2016
CNNLab: a Novel Parallel Framework for Neural Networks using GPU and FPGA-a Practical Study with Trade-off Analysis
Maohua Zhu
L. Liu
Chao Wang
Yuan Xie
GNN
24
20
0
20 Jun 2016
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients
Shuchang Zhou
Yuxin Wu
Zekun Ni
Xinyu Zhou
He Wen
Yuheng Zou
MQ
44
2,074
0
20 Jun 2016
Deep Learning with Darwin: Evolutionary Synthesis of Deep Neural Networks
M. Shafiee
A. Mishra
A. Wong
32
44
0
14 Jun 2016
Structured Convolution Matrices for Energy-efficient Deep learning
R. Appuswamy
T. Nayak
John V. Arthur
S. K. Esser
P. Merolla
J. McKinstry
T. Melano
M. Flickner
D. Modha
38
11
0
08 Jun 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
235
2,059
0
07 Jun 2016
Learning Natural Language Inference using Bidirectional LSTM model and Inner-Attention
Yang Liu
Chengjie Sun
Mehdi Alizadeh
Xiaolong Wang
35
6
0
30 May 2016
An Analysis of Deep Neural Network Models for Practical Applications
A. Canziani
Adam Paszke
Eugenio Culurciello
19
1,165
0
24 May 2016
Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations
Behnam Neyshabur
Yuhuai Wu
Ruslan Salakhutdinov
Nathan Srebro
AI4CE
ODL
30
30
0
23 May 2016
Learning Sensor Multiplexing Design through Back-propagation
Ayan Chakrabarti
SSL
34
126
0
23 May 2016
Functional Hashing for Compressing Neural Networks
Lei Shi
Shikun Feng
Zhifan Zhu
27
4
0
20 May 2016
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks
Philipp Gysel
29
127
0
20 May 2016
Reducing the Model Order of Deep Neural Networks Using Information Theory
Ming Tu
Visar Berisha
Yu Cao
Jae-sun Seo
19
23
0
16 May 2016
Ternary Weight Networks
Fengfu Li
Bin Liu
Xiaoxing Wang
Bo Zhang
Junchi Yan
MQ
43
521
0
16 May 2016
ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks using Angle Sensitive Pixels
H. G. Chen
Suren Jayasuriya
Jiyue Yang
J. Stephen
S. Sivaramakrishnan
Ashok Veeraraghavan
A. Molnar
26
66
0
11 May 2016
Hardware-oriented Approximation of Convolutional Neural Networks
Philipp Gysel
Mohammad Motamedi
S. Ghiasi
41
310
0
11 Apr 2016
Training Constrained Deconvolutional Networks for Road Scene Semantic Segmentation
G. Ros
Simon Stent
P. Alcantarilla
Tomoki Watanabe
21
55
0
06 Apr 2016
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
Mohammad Rastegari
Vicente Ordonez
Joseph Redmon
Ali Farhadi
MQ
75
4,332
0
16 Mar 2016
Convolutional Neural Networks using Logarithmic Data Representation
Daisuke Miyashita
Edward H. Lee
B. Murmann
MQ
30
425
0
03 Mar 2016
vDNN: Virtualized Deep Neural Networks for Scalable, Memory-Efficient Neural Network Design
Minsoo Rhu
N. Gimelshein
Jason Clemons
A. Zulfiqar
S. Keckler
GNN
14
32
0
25 Feb 2016
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
F. Iandola
Song Han
Matthew W. Moskewicz
Khalid Ashraf
W. Dally
Kurt Keutzer
82
7,427
0
24 Feb 2016
Binarized Neural Networks
Itay Hubara
Daniel Soudry
Ran El-Yaniv
MQ
58
1,350
0
08 Feb 2016
EIE: Efficient Inference Engine on Compressed Deep Neural Network
Song Han
Xingyu Liu
Huizi Mao
Jing Pu
A. Pedram
M. Horowitz
W. Dally
48
2,448
0
04 Feb 2016
Relief R-CNN : Utilizing Convolutional Features for Fast Object Detection
Guiying Li
Junlong Liu
Chunhui Jiang
Liangpeng Zhang
Minlong Lin
Ke Tang
ObjD
29
7
0
25 Jan 2016
Structured Pruning of Deep Convolutional Neural Networks
S. Anwar
Kyuyeon Hwang
Wonyong Sung
32
743
0
29 Dec 2015
Recent Advances in Convolutional Neural Networks
Jiuxiang Gu
Zhenhua Wang
Jason Kuen
Lianyang Ma
Amir Shahroudy
...
Xingxing Wang
Li Wang
Gang Wang
Jianfei Cai
Tsuhan Chen
37
5,150
0
22 Dec 2015
Quantized Convolutional Neural Networks for Mobile Devices
Jiaxiang Wu
Cong Leng
Yuhang Wang
Qinghao Hu
Jian Cheng
MQ
35
1,158
0
21 Dec 2015
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications
Yong-Deok Kim
Eunhyeok Park
S. Yoo
Taelim Choi
Lu Yang
Dongjun Shin
40
891
0
20 Nov 2015
Resiliency of Deep Neural Networks under Quantization
Wonyong Sung
Sungho Shin
Kyuyeon Hwang
MQ
20
157
0
20 Nov 2015
Blending LSTMs into CNNs
Krzysztof J. Geras
Abdel-rahman Mohamed
R. Caruana
G. Urban
Shengjie Wang
Ozlem Aslan
Matthai Philipose
Matthew Richardson
Charles Sutton
27
60
0
19 Nov 2015
Fixed Point Quantization of Deep Convolutional Networks
D. Lin
S. Talathi
V. Annapureddy
MQ
44
811
0
19 Nov 2015
Adjustable Bounded Rectifiers: Towards Deep Binary Representations
Zhirong Wu
Dahua Lin
Xiaoou Tang
MQ
19
14
0
19 Nov 2015
ACDC: A Structured Efficient Linear Layer
Marcin Moczulski
Misha Denil
J. Appleyard
Nando de Freitas
38
98
0
18 Nov 2015
FireCaffe: near-linear acceleration of deep neural network training on compute clusters
F. Iandola
Khalid Ashraf
Matthew W. Moskewicz
Kurt Keutzer
33
302
0
31 Oct 2015
Learning both Weights and Connections for Efficient Neural Networks
Song Han
Jeff Pool
J. Tran
W. Dally
CVBM
31
6,585
0
08 Jun 2015
Previous
1
2
3
...
67
68
69