ResearchTrend.AI
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,448 papers shown
Hardware for Machine Learning: Challenges and Opportunities
Vivienne Sze
Yu-hsin Chen
Joel S. Emer
Amr Suleiman
Zhengdong Zhang
22
77
0
22 Dec 2016
Wide-Slice Residual Networks for Food Recognition
N. Martinel
G. Foresti
C. Micheloni
36
200
0
20 Dec 2016
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
26
18
0
20 Dec 2016
Quantization and Training of Low Bit-Width Convolutional Neural Networks for Object Detection
Penghang Yin
Shuai Zhang
Y. Qi
Jack Xin
MQ
62
41
0
19 Dec 2016
Delta Networks for Optimized Recurrent Network Computation
Daniel Neil
Junhaeng Lee
T. Delbruck
Shih-Chii Liu
36
66
0
16 Dec 2016
FastText.zip: Compressing text classification models
Armand Joulin
Edouard Grave
Piotr Bojanowski
Matthijs Douze
Hervé Jégou
Tomas Mikolov
MQ
25
1,192
0
12 Dec 2016
Learning in the Machine: Random Backpropagation and the Deep Learning Channel
Pierre Baldi
Peter Sadowski
Zhiqin Lu
AAML
18
16
0
08 Dec 2016
Filter sharing: Efficient learning of parameters for volumetric convolutions
Rahul Venkataramani
S. Thiruvenkadam
Prasad Sudhakar
Hariharan Ravishankar
V. Vaidya
3DPC
MedIm
32
0
0
08 Dec 2016
Spatially Adaptive Computation Time for Residual Networks
Michael Figurnov
Maxwell D. Collins
Yukun Zhu
Li Zhang
Jonathan Huang
Dmitry Vetrov
Ruslan Salakhutdinov
23
346
0
07 Dec 2016
Towards the Limit of Network Quantization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
22
191
0
05 Dec 2016
Trained Ternary Quantization
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
MQ
92
1,035
0
04 Dec 2016
FINN: A Framework for Fast, Scalable Binarized Neural Network Inference
Yaman Umuroglu
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
Magnus Jahre
K. Vissers
MQ
53
983
0
01 Dec 2016
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Song Han
Junlong Kang
Huizi Mao
Yiming Hu
Xin Li
...
Hong Luo
Song Yao
Yu Wang
Huazhong Yang
W. Dally
34
627
0
01 Dec 2016
Effective Quantization Methods for Recurrent Neural Networks
Qinyao He
He Wen
Shuchang Zhou
Yuxin Wu
Cong Yao
Xinyu Zhou
Yuheng Zou
MQ
32
75
0
30 Nov 2016
Deep Cuboid Detection: Beyond 2D Bounding Boxes
Debidatta Dwibedi
Tomasz Malisiewicz
Vijay Badrinarayanan
Andrew Rabinovich
32
18
0
30 Nov 2016
Capacity and Trainability in Recurrent Neural Networks
Jasmine Collins
Jascha Narain Sohl-Dickstein
David Sussillo
35
203
0
29 Nov 2016
LCNN: Lookup-based Convolutional Neural Network
Hessam Bagherinezhad
Mohammad Rastegari
Ali Farhadi
13
89
0
20 Nov 2016
Fast Video Classification via Adaptive Cascading of Deep Models
Haichen Shen
Seungyeop Han
Matthai Philipose
Arvind Krishnamurthy
34
78
0
20 Nov 2016
Quantized neural network design under weight capacity constraint
Sungho Shin
Kyuyeon Hwang
Wonyong Sung
MQ
29
2
0
19 Nov 2016
ModelHub: Towards Unified Data and Lifecycle Management for Deep Learning
Hui Miao
Ang Li
L. Davis
Amol Deshpande
VLM
MU
29
128
0
18 Nov 2016
GaDei: On Scale-up Training As A Service For Deep Learning
Wei Zhang
Minwei Feng
Yunhui Zheng
Yufei Ren
Yandong Wang
...
Peng Liu
Bing Xiang
Li Zhang
Bowen Zhou
Fei-Yue Wang
ALM
32
10
0
18 Nov 2016
The ZipML Framework for Training Models with End-to-End Low Precision: The Cans, the Cannots, and a Little Bit of Deep Learning
Hantian Zhang
Jerry Li
Kaan Kara
Dan Alistarh
Ji Liu
Ce Zhang
MQ
23
20
0
16 Nov 2016
Fully-adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification
Y. Lu
Abhishek Kumar
Shuangfei Zhai
Yu Cheng
T. Javidi
Rogerio Feris
3DH
21
384
0
16 Nov 2016
Designing Energy-Efficient Convolutional Neural Networks using Energy-Aware Pruning
Tien-Ju Yang
Yu-hsin Chen
Vivienne Sze
3DV
36
738
0
16 Nov 2016
Ultimate tensorization: compressing convolutional and FC layers alike
T. Garipov
D. Podoprikhin
Alexander Novikov
Dmitry Vetrov
39
190
0
10 Nov 2016
Hierarchical compositional feature learning
Miguel Lazaro-Gredilla
Yi Liu
D. Phoenix
Dileep George
BDL
OCL
29
12
0
07 Nov 2016
Loss-aware Binarization of Deep Networks
Lu Hou
Quanming Yao
James T. Kwok
MQ
32
220
0
05 Nov 2016
Alternating Direction Method of Multipliers for Sparse Convolutional Neural Networks
Farkhondeh Kiaee
Christian Gagné
Mahdieh Abbasi
13
23
0
05 Nov 2016
Sparsely-Connected Neural Networks: Towards Efficient VLSI Implementation of Deep Neural Networks
A. Ardakani
C. Condo
W. Gross
33
40
0
04 Nov 2016
Deep Model Compression: Distilling Knowledge from Noisy Teachers
Bharat Bhusan Sau
V. Balasubramanian
29
181
0
30 Oct 2016
Compact Deep Convolutional Neural Networks With Coarse Pruning
S. Anwar
Wonyong Sung
3DPC
23
55
0
30 Oct 2016
Generalized Haar Filter based Deep Networks for Real-Time Object Detection in Traffic Scene
Keyu Lu
Jian Li
X. An
Hangen He
14
1
0
30 Oct 2016
Bit-pragmatic Deep Neural Network Computing
Jorge Albericio
Patrick Judd
A. Delmas
Sayeh Sharify
Andreas Moshovos
MQ
34
239
0
20 Oct 2016
QSGD: Communication-Efficient SGD via Gradient Quantization and Encoding
Dan Alistarh
Demjan Grubic
Jerry Li
Ryota Tomioka
Milan Vojnović
MQ
42
426
0
07 Oct 2016
Scalable Machine Translation in Memory Constrained Environments
Paul Baltescu
19
0
0
06 Oct 2016
Accelerating Deep Convolutional Networks using low-precision and sparsity
Ganesh Venkatesh
Eriko Nurvitadhi
Debbie Marr
36
135
0
02 Oct 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhifeng Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
718
6,750
0
26 Sep 2016
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
Itay Hubara
Matthieu Courbariaux
Daniel Soudry
Ran El-Yaniv
Yoshua Bengio
MQ
54
1,846
0
22 Sep 2016
A scalable convolutional neural network for task-specified scenarios via knowledge distillation
Mengnan Shi
F. Qin
QiXiang Ye
Zhenjun Han
Jianbin Jiao
21
5
0
19 Sep 2016
Reduced Memory Region Based Deep Convolutional Neural Network Detection
Denis Tomè
Luca Bondi
Emanuele Plebani
L. Baroffio
D. Pau
Stefano Tubaro
30
11
0
08 Sep 2016
Evolutionary Synthesis of Deep Neural Networks via Synaptic Cluster-driven Genetic Encoding
M. Shafiee
A. Wong
26
23
0
06 Sep 2016
Ternary Neural Networks for Resource-Efficient AI Applications
Hande Alemdar
V. Leroy
Adrien Prost-Boucle
F. Pétrot
24
204
0
01 Sep 2016
Pruning Filters for Efficient ConvNets
Hao Li
Asim Kadav
Igor Durdanovic
H. Samet
H. Graf
3DPC
105
3,660
0
31 Aug 2016
Low Complexity Multiply Accumulate Unit for Weight-Sharing Convolutional Neural Networks
James Garland
David Gregg
MQ
11
28
0
30 Aug 2016
Scalable Compression of Deep Neural Networks
Xing Wang
Jie Liang
21
4
0
26 Aug 2016
Local Binary Convolutional Neural Networks
Felix Juefei Xu
Vishnu Boddeti
Marios Savvides
MQ
32
251
0
22 Aug 2016
Lets keep it simple, Using simple architectures to outperform deeper and more complex architectures
S. H. HasanPour
Mohammad Rouhani
Mohsen Fayyaz
Mohammad Sabokrou
23
119
0
22 Aug 2016
Dynamic Network Surgery for Efficient DNNs
Yiwen Guo
Anbang Yao
Yurong Chen
18
1,053
0
16 Aug 2016
Design of Efficient Convolutional Layers using Single Intra-channel Convolution, Topological Subdivisioning and Spatial "Bottleneck" Structure
Min Wang
Baoyuan Liu
H. Foroosh
27
51
0
15 Aug 2016
SGDR: Stochastic Gradient Descent with Warm Restarts
I. Loshchilov
Frank Hutter
ODL
107
7,965
0
13 Aug 2016