ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
QUENN: QUantization Engine for low-power Neural Networks
QUENN: QUantization Engine for low-power Neural Networks
Miguel de Prado
Maurizio Denna
Luca Benini
Nuria Pazos
MQ
84
14
0
14 Nov 2018
YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU
  Computers
YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers
J. Pedoeem
Rachel Huang
ObjD
72
507
0
14 Nov 2018
Iteratively Training Look-Up Tables for Network Quantization
Iteratively Training Look-Up Tables for Network Quantization
Fabien Cardinaux
Stefan Uhlich
K. Yoshiyama
J. A. García
Stephen Tiedemann
Thomas Kemp
Akira Nakamura
MQ
56
1
0
13 Nov 2018
Intelligent Drone Swarm for Search and Rescue Operations at Sea
Intelligent Drone Swarm for Search and Rescue Operations at Sea
Vincenzo Lomonaco
A. Trotta
M. Ziosi
Juan de Dios Yáñez Ávila
Natalia Díaz Rodríguez
38
25
0
13 Nov 2018
Private Model Compression via Knowledge Distillation
Private Model Compression via Knowledge Distillation
Ji Wang
Weidong Bao
Lichao Sun
Xiaomin Zhu
Bokai Cao
Philip S. Yu
FedML
83
120
0
13 Nov 2018
Generalized Ternary Connect: End-to-End Learning and Compression of
  Multiplication-Free Deep Neural Networks
Generalized Ternary Connect: End-to-End Learning and Compression of Multiplication-Free Deep Neural Networks
Samyak Parajuli
Aswin Raghavan
S. Chai
52
7
0
12 Nov 2018
Sequence-Level Knowledge Distillation for Model Compression of
  Attention-based Sequence-to-Sequence Speech Recognition
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Raden Muáz Muním
Nakamasa Inoue
Koichi Shinoda
64
26
0
12 Nov 2018
Fast On-the-fly Retraining-free Sparsification of Convolutional Neural
  Networks
Fast On-the-fly Retraining-free Sparsification of Convolutional Neural Networks
Amir H. Ashouri
T. Abdelrahman
Alwyn Dos Remedios
MQ
104
12
0
10 Nov 2018
A First Look at Deep Learning Apps on Smartphones
A First Look at Deep Learning Apps on Smartphones
Mengwei Xu
Jiawei Liu
Yuanqiang Liu
F. Lin
Yunxin Liu
Xuanzhe Liu
HAI
91
183
0
08 Nov 2018
Packing Sparse Convolutional Neural Networks for Efficient Systolic
  Array Implementations: Column Combining Under Joint Optimization
Packing Sparse Convolutional Neural Networks for Efficient Systolic Array Implementations: Column Combining Under Joint Optimization
H. T. Kung
Bradley McDanel
Shanghang Zhang
98
135
0
07 Nov 2018
Training Domain Specific Models for Energy-Efficient Object Detection
Training Domain Specific Models for Energy-Efficient Object Detection
Kentaro Yoshioka
Edward Lee
Roummel F. Marcia
ObjD
10
0
0
06 Nov 2018
Revealing Fine Structures of the Retinal Receptive Field by Deep
  Learning Networks
Revealing Fine Structures of the Retinal Receptive Field by Deep Learning Networks
Hui Li
Yajing Zheng
Shanshan Jia
Yichen Zhang
Zhaofei Yu
Feng Chen
Yonghong Tian
Tiejun Huang
Jian K. Liu
98
23
0
06 Nov 2018
A Unified Framework of DNN Weight Pruning and Weight
  Clustering/Quantization Using ADMM
A Unified Framework of DNN Weight Pruning and Weight Clustering/Quantization Using ADMM
David Cortes
Tianyun Zhang
Kaiqi Zhang
Jiayu Li
Jiaming Xie
Yun Liang
Sijia Liu
Xinyu Lin
Yanzhi Wang
MQ
58
45
0
05 Nov 2018
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural
  Networks
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
146
68
0
05 Nov 2018
Dynamic Representations Toward Efficient Inference on Deep Neural
  Networks by Decision Gates
Dynamic Representations Toward Efficient Inference on Deep Neural Networks by Decision Gates
Mohammad Saeed Shafiee
M. Shafiee
A. Wong
AI4CE
18
4
0
05 Nov 2018
Bi-Real Net: Binarizing Deep Network Towards Real-Network Performance
Bi-Real Net: Binarizing Deep Network Towards Real-Network Performance
Zechun Liu
Wenhan Luo
Baoyuan Wu
Xin Yang
Wen Liu
K. Cheng
MQ
80
96
0
04 Nov 2018
A Batched Scalable Multi-Objective Bayesian Optimization Algorithm
A Batched Scalable Multi-Objective Bayesian Optimization Algorithm
Xi Lin
Hui-Ling Zhen
Zhenhua Li
Qingfu Zhang
Sam Kwong
51
11
0
04 Nov 2018
ReXCam: Resource-Efficient, Cross-Camera Video Analytics at Scale
ReXCam: Resource-Efficient, Cross-Camera Video Analytics at Scale
Samvit Jain
Xun Zhang
Yuhao Zhou
Ganesh Ananthanarayanan
Junchen Jiang
Yuanchao Shu
Joseph E. Gonzalez
HAI
119
9
0
03 Nov 2018
Online Embedding Compression for Text Classification using Low Rank
  Matrix Factorization
Online Embedding Compression for Text Classification using Low Rank Matrix Factorization
Anish Acharya
Rahul Goel
A. Metallinou
Inderjit Dhillon
102
62
0
01 Nov 2018
Filter Pruning via Geometric Median for Deep Convolutional Neural
  Networks Acceleration
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration
Yang He
Ping Liu
Ziwei Wang
Zhilan Hu
Yi Yang
AAML3DPC
122
1,052
0
01 Nov 2018
Balanced Sparsity for Efficient DNN Inference on GPU
Balanced Sparsity for Efficient DNN Inference on GPU
Zhuliang Yao
Shijie Cao
Wencong Xiao
Chen Zhang
Lanshun Nie
85
93
0
01 Nov 2018
Low-Precision Random Fourier Features for Memory-Constrained Kernel
  Approximation
Low-Precision Random Fourier Features for Memory-Constrained Kernel Approximation
Jian Zhang
Avner May
Tri Dao
Christopher Ré
80
29
0
31 Oct 2018
Convolutional Neural Network Quantization using Generalized Gamma
  Distribution
Convolutional Neural Network Quantization using Generalized Gamma Distribution
Doyun Kim
H. Yim
Sanghyuck Ha
Changgwun Lee
Inyup Kang
MQ
40
4
0
31 Oct 2018
SplineNets: Continuous Neural Decision Graphs
SplineNets: Continuous Neural Decision Graphs
Cem Keskin
Shahram Izadi
51
11
0
31 Oct 2018
Low-Rank Embedding of Kernels in Convolutional Neural Networks under
  Random Shuffling
Low-Rank Embedding of Kernels in Convolutional Neural Networks under Random Shuffling
Chong Li
Zhun Sun
Jinshi Yu
Ming Hou
Qibin Zhao
32
5
0
31 Oct 2018
JavaScript Convolutional Neural Networks for Keyword Spotting in the
  Browser: An Experimental Analysis
JavaScript Convolutional Neural Networks for Keyword Spotting in the Browser: An Experimental Analysis
Jaejun Lee
Raphael Tang
Jimmy J. Lin
21
2
0
30 Oct 2018
DeepTwist: Learning Model Compression via Occasional Weight Distortion
DeepTwist: Learning Model Compression via Occasional Weight Distortion
Dongsoo Lee
Parichay Kapoor
Byeongwook Kim
67
19
0
30 Oct 2018
Demystifying Neural Network Filter Pruning
Demystifying Neural Network Filter Pruning
Zhuwei Qin
Fuxun Yu
Adam Lesnikowski
Xiang Chen
67
5
0
29 Oct 2018
Low-complexity Recurrent Neural Network-based Polar Decoder with Weight
  Quantization Mechanism
Low-complexity Recurrent Neural Network-based Polar Decoder with Weight Quantization Mechanism
Chieh-Fang Teng
Chengyang Wu
A. K. Ho
A. Wu
55
58
0
29 Oct 2018
Discrimination-aware Channel Pruning for Deep Neural Networks
Discrimination-aware Channel Pruning for Deep Neural Networks
Zhuangwei Zhuang
Mingkui Tan
Bohan Zhuang
Jing Liu
Yong Guo
Qingyao Wu
Junzhou Huang
Jin-Hui Zhu
166
601
0
28 Oct 2018
Learning Sparse Neural Networks via Sensitivity-Driven Regularization
Learning Sparse Neural Networks via Sensitivity-Driven Regularization
Enzo Tartaglione
S. Lepsøy
Attilio Fiandrotti
Gianluca Francini
64
71
0
28 Oct 2018
A Miniaturized Semantic Segmentation Method for Remote Sensing Image
A Miniaturized Semantic Segmentation Method for Remote Sensing Image
Shou-Yu Chen
Guang-Sheng Chen
Wei-Peng Jing
29
1
0
27 Oct 2018
Distilling with Performance Enhanced Students
Distilling with Performance Enhanced Students
Jack Turner
Elliot J. Crowley
Valentin Radu
José Cano
Amos Storkey
Michael F. P. O'Boyle
38
3
0
24 Oct 2018
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for
  Continuous Mobile Vision
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision
Biyi Fang
Xiao Zeng
Mi Zhang
3DH
96
270
0
23 Oct 2018
Deep Neural Network inference with reduced word length
Deep Neural Network inference with reduced word length
Lukas Mauch
Binh Yang
MQ
26
0
0
23 Oct 2018
Convolutional Neural Network Pruning to Accelerate Membrane Segmentation
  in Electron Microscopy
Convolutional Neural Network Pruning to Accelerate Membrane Segmentation in Electron Microscopy
J. Roels
Jonas De Vylder
J. Aelterman
Yvan Saeys
Wilfried Philips
18
6
0
23 Oct 2018
Learning sparse transformations through backpropagation
Learning sparse transformations through backpropagation
Peter Bloem
43
0
0
22 Oct 2018
To Compress, or Not to Compress: Characterizing Deep Learning Model
  Compression for Embedded Inference
To Compress, or Not to Compress: Characterizing Deep Learning Model Compression for Embedded Inference
Qing Qin
Jie Ren
Jia-Le Yu
Ling Gao
Hai Wang
Jie Zheng
Yansong Feng
Jianbin Fang
Zheng Wang
47
24
0
21 Oct 2018
CNN inference acceleration using dictionary of centroids
CNN inference acceleration using dictionary of centroids
D.Babin
I.Mazurenko
D.Parkhomenko
A.Voloshko
MQ
24
0
0
19 Oct 2018
Real-time Neural-based Input Method
Real-time Neural-based Input Method
Jiali Yao
Raphael Shu
Xinjian Li
K. Ohtsuki
Hideki Nakayama
30
4
0
19 Oct 2018
KTAN: Knowledge Transfer Adversarial Network
KTAN: Knowledge Transfer Adversarial Network
Peiye Liu
Wu Liu
Huadong Ma
Tao Mei
Mingoo Seok
GAN
91
28
0
18 Oct 2018
Progressive Weight Pruning of Deep Neural Networks using ADMM
Progressive Weight Pruning of Deep Neural Networks using ADMM
Shaokai Ye
Tianyun Zhang
Kaiqi Zhang
Jiayu Li
Kaidi Xu
...
M. Fardad
Sijia Liu
Xiang Chen
Xinyu Lin
Yanzhi Wang
AI4CE
120
38
0
17 Oct 2018
Quantization for Rapid Deployment of Deep Neural Networks
Quantization for Rapid Deployment of Deep Neural Networks
J. Lee
Sangwon Ha
Saerom Choi
Won-Jo Lee
Seungwon Lee
MQ
80
49
0
12 Oct 2018
Rethinking the Value of Network Pruning
Rethinking the Value of Network Pruning
Zhuang Liu
Mingjie Sun
Tinghui Zhou
Gao Huang
Trevor Darrell
100
1,480
0
11 Oct 2018
A Closer Look at Structured Pruning for Neural Network Compression
A Closer Look at Structured Pruning for Neural Network Compression
Elliot J. Crowley
Jack Turner
Amos Storkey
Michael F. P. O'Boyle
3DPC
80
31
0
10 Oct 2018
Extreme Classification in Log Memory
Extreme Classification in Log Memory
Qixuan Huang
Yiqiu Wang
Tharun Medini
Anshumali Shrivastava
VLM
70
3
0
09 Oct 2018
Deep Neural Network Compression for Aircraft Collision Avoidance Systems
Deep Neural Network Compression for Aircraft Collision Avoidance Systems
Kyle D. Julian
Mykel J. Kochenderfer
Michael P. Owen
63
173
0
09 Oct 2018
Rate Distortion For Model Compression: From Theory To Practice
Rate Distortion For Model Compression: From Theory To Practice
Weihao Gao
Yu-Han Liu
Chong-Jun Wang
Sewoong Oh
100
31
0
09 Oct 2018
Light-Weight RefineNet for Real-Time Semantic Segmentation
Light-Weight RefineNet for Real-Time Semantic Segmentation
Vladimir Nekrasov
Chunhua Shen
Ian Reid
SSegVLM
93
148
0
08 Oct 2018
Sparse Winograd Convolutional neural networks on small-scale systolic
  arrays
Sparse Winograd Convolutional neural networks on small-scale systolic arrays
Feng Shi
Haochen Li
Yuhe Gao
Benjamin Kuschner
Song-Chun Zhu
55
15
0
03 Oct 2018
Previous
123...575859...686970
Next