ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.09282
  4. Cited By
A Survey of Model Compression and Acceleration for Deep Neural Networks

A Survey of Model Compression and Acceleration for Deep Neural Networks

23 October 2017
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
ArXivPDFHTML

Papers citing "A Survey of Model Compression and Acceleration for Deep Neural Networks"

50 / 130 papers shown
Title
CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded
  Systems
CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded Systems
Priyank Kalgaonkar
M. El-Sharkawy
3DH
17
5
0
01 Dec 2021
Nonlinear Tensor Ring Network
Nonlinear Tensor Ring Network
Xiao Peng Li
Qi Liu
Hayden Kwok-Hay So
19
0
0
12 Nov 2021
Gabor filter incorporated CNN for compression
Gabor filter incorporated CNN for compression
Akihiro Imamura
N. Arizumi
CVBM
25
2
0
29 Oct 2021
Model based Multi-agent Reinforcement Learning with Tensor
  Decompositions
Model based Multi-agent Reinforcement Learning with Tensor Decompositions
Pascal R. van der Vaart
Anuj Mahajan
Shimon Whiteson
AI4CE
31
8
0
27 Oct 2021
Towards Mixed-Precision Quantization of Neural Networks via Constrained
  Optimization
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
Weihan Chen
Peisong Wang
Jian Cheng
MQ
42
61
0
13 Oct 2021
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting
  and Output Merging
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
25
15
0
30 Sep 2021
Beyond Distillation: Task-level Mixture-of-Experts for Efficient
  Inference
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Sneha Kudugunta
Yanping Huang
Ankur Bapna
M. Krikun
Dmitry Lepikhin
Minh-Thang Luong
Orhan Firat
MoE
119
106
0
24 Sep 2021
New Perspective on Progressive GANs Distillation for One-class Novelty Detection
Zhiwei Zhang
Yu Dong
Hanyu Peng
Shifeng Chen
29
0
0
15 Sep 2021
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Amin Banitalebi-Dehkordi
Naveen Vedula
J. Pei
Fei Xia
Lanjun Wang
Yong Zhang
22
89
0
30 Aug 2021
Compact representations of convolutional neural networks via weight
  pruning and quantization
Compact representations of convolutional neural networks via weight pruning and quantization
Giosuè Cataldo Marinò
A. Petrini
D. Malchiodi
Marco Frasca
MQ
21
4
0
28 Aug 2021
FOVEA: Foveated Image Magnification for Autonomous Navigation
FOVEA: Foveated Image Magnification for Autonomous Navigation
Chittesh Thavamani
Mengtian Li
N. Cebron
Deva Ramanan
33
32
0
27 Aug 2021
Efficient training of lightweight neural networks using Online
  Self-Acquired Knowledge Distillation
Efficient training of lightweight neural networks using Online Self-Acquired Knowledge Distillation
Maria Tzelepi
Anastasios Tefas
11
6
0
26 Aug 2021
A Survey on GAN Acceleration Using Memory Compression Technique
A Survey on GAN Acceleration Using Memory Compression Technique
Dina Tantawy
Mohamed Zahran
A. Wassal
36
8
0
14 Aug 2021
Developing a Compressed Object Detection Model based on YOLOv4 for
  Deployment on Embedded GPU Platform of Autonomous System
Developing a Compressed Object Detection Model based on YOLOv4 for Deployment on Embedded GPU Platform of Autonomous System
Issac Sim
Junho Lim
Young-Wan Jang
Jihwan You
Seontaek Oh
Young-Keun Kim
21
7
0
01 Aug 2021
Data-Driven Low-Rank Neural Network Compression
Data-Driven Low-Rank Neural Network Compression
D. Papadimitriou
Swayambhoo Jain
BDL
19
3
0
13 Jul 2021
Modality specific U-Net variants for biomedical image segmentation: A
  survey
Modality specific U-Net variants for biomedical image segmentation: A survey
Narinder Singh Punn
Sonali Agarwal
SSeg
29
144
0
09 Jul 2021
JIZHI: A Fast and Cost-Effective Model-As-A-Service System for Web-Scale
  Online Inference at Baidu
JIZHI: A Fast and Cost-Effective Model-As-A-Service System for Web-Scale Online Inference at Baidu
Hao Liu
Qian Gao
Jiang Li
X. Liao
Hao Xiong
...
Guobao Yang
Zhiwei Zha
Daxiang Dong
Dejing Dou
Haoyi Xiong
VLM
22
22
0
03 Jun 2021
Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Anuj Mahajan
Mikayel Samvelyan
Lei Mao
Viktor Makoviychuk
Animesh Garg
Jean Kossaifi
Shimon Whiteson
Yuke Zhu
Anima Anandkumar
26
32
0
31 May 2021
Dynamic-Deep: Tune ECG Task Performance and Optimize Compression in IoT
  Architectures
Dynamic-Deep: Tune ECG Task Performance and Optimize Compression in IoT Architectures
Eli Brosh
Elad Wasserstein
Anat Bremler Barr
14
0
0
30 May 2021
Deep Spiking Convolutional Neural Network for Single Object Localization
  Based On Deep Continuous Local Learning
Deep Spiking Convolutional Neural Network for Single Object Localization Based On Deep Continuous Local Learning
Sami Barchid
José Mennesson
Chaabane Djéraba
31
9
0
12 May 2021
"BNN - BN = ?": Training Binary Neural Networks without Batch
  Normalization
"BNN - BN = ?": Training Binary Neural Networks without Batch Normalization
Tianlong Chen
Zhenyu (Allen) Zhang
Xu Ouyang
Zechun Liu
Zhiqiang Shen
Zhangyang Wang
MQ
37
36
0
16 Apr 2021
Efficient Video Compression via Content-Adaptive Super-Resolution
Efficient Video Compression via Content-Adaptive Super-Resolution
Mehrdad Khani Shirkoohi
Vibhaalakshmi Sivaraman
Mohammad Alizadeh
SupR
28
49
0
06 Apr 2021
hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power
  Machine Learning Devices
hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices
F. Fahim
B. Hawks
C. Herwig
J. Hirschauer
S. Jindariani
...
J. Ngadiuba
Miaoyuan Liu
Duc Hoang
E. Kreinar
Zhenbin Wu
24
129
0
09 Mar 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
135
674
0
24 Jan 2021
A Comprehensive Survey on Hardware-Aware Neural Architecture Search
A Comprehensive Survey on Hardware-Aware Neural Architecture Search
Hadjer Benmeziane
Kaoutar El Maghraoui
Hamza Ouarnoughi
Smail Niar
Martin Wistuba
Naigang Wang
34
96
0
22 Jan 2021
Activation Density based Mixed-Precision Quantization for Energy
  Efficient Neural Networks
Activation Density based Mixed-Precision Quantization for Energy Efficient Neural Networks
Karina Vasquez
Yeshwanth Venkatesha
Abhiroop Bhattacharjee
Abhishek Moitra
Priyadarshini Panda
MQ
37
15
0
12 Jan 2021
Shallow-UWnet : Compressed Model for Underwater Image Enhancement
Shallow-UWnet : Compressed Model for Underwater Image Enhancement
Ankita Rajaram Naik
Apurva Swarnakar
Kartik Mittal
44
155
0
06 Jan 2021
Are We Ready For Learned Cardinality Estimation?
Are We Ready For Learned Cardinality Estimation?
Xiaoying Wang
Changbo Qu
Weiyuan Wu
Jiannan Wang
Qingqing Zhou
37
113
0
12 Dec 2020
Machine Learning for Cataract Classification and Grading on Ophthalmic
  Imaging Modalities: A Survey
Machine Learning for Cataract Classification and Grading on Ophthalmic Imaging Modalities: A Survey
Xiaoqin Zhang
Yan Hu
Zunjie Xiao
Jiansheng Fang
Risa Higashita
Jiang-Dong Liu
48
41
0
09 Dec 2020
DiffPrune: Neural Network Pruning with Deterministic Approximate Binary
  Gates and $L_0$ Regularization
DiffPrune: Neural Network Pruning with Deterministic Approximate Binary Gates and L0L_0L0​ Regularization
Yaniv Shulman
46
3
0
07 Dec 2020
Benchmarking Inference Performance of Deep Learning Models on Analog
  Devices
Benchmarking Inference Performance of Deep Learning Models on Analog Devices
Omobayode Fagbohungbe
Lijun Qian
19
7
0
24 Nov 2020
Exploring Energy-Accuracy Tradeoffs in AI Hardware
Exploring Energy-Accuracy Tradeoffs in AI Hardware
Cory E. Merkel
24
1
0
17 Nov 2020
Unsupervised Intrusion Detection System for Unmanned Aerial Vehicle with
  Less Labeling Effort
Unsupervised Intrusion Detection System for Unmanned Aerial Vehicle with Less Labeling Effort
Kyung Ho Park
Eunji Park
H. Kim
13
13
0
01 Nov 2020
$μ$NAS: Constrained Neural Architecture Search for Microcontrollers
μμμNAS: Constrained Neural Architecture Search for Microcontrollers
Edgar Liberis
L. Dudziak
Nicholas D. Lane
BDL
15
103
0
27 Oct 2020
Block-term Tensor Neural Networks
Block-term Tensor Neural Networks
Jinmian Ye
Guangxi Li
Di Chen
Haiqin Yang
Shandian Zhe
Zenglin Xu
24
30
0
10 Oct 2020
A Survey on Large-scale Machine Learning
A Survey on Large-scale Machine Learning
Meng Wang
Weijie Fu
Xiangnan He
Shijie Hao
Xindong Wu
14
109
0
10 Aug 2020
A Unified Framework for Shot Type Classification Based on Subject
  Centric Lens
A Unified Framework for Shot Type Classification Based on Subject Centric Lens
Anyi Rao
Jiaze Wang
Linning Xu
Xuekun Jiang
Qingqiu Huang
Bolei Zhou
Dahua Lin
18
60
0
08 Aug 2020
Vehicle Attribute Recognition by Appearance: Computer Vision Methods for
  Vehicle Type, Make and Model Classification
Vehicle Attribute Recognition by Appearance: Computer Vision Methods for Vehicle Type, Make and Model Classification
Xingyang Ni
H. Huttunen
CVBM
16
20
0
29 Jun 2020
DeepAbstract: Neural Network Abstraction for Accelerating Verification
DeepAbstract: Neural Network Abstraction for Accelerating Verification
P. Ashok
Vahid Hashemi
Jan Křetínský
S. Mohr
17
49
0
24 Jun 2020
Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization
  is Sufficient
Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization is Sufficient
Ankit Pensia
Shashank Rajput
Alliot Nagle
Harit Vishwakarma
Dimitris Papailiopoulos
19
102
0
14 Jun 2020
Real-Time Video Inference on Edge Devices via Adaptive Model Streaming
Real-Time Video Inference on Edge Devices via Adaptive Model Streaming
Mehrdad Khani Shirkoohi
Pouya Hamadanian
Arash Nasr-Esfahany
Mohammad Alizadeh
26
44
0
11 Jun 2020
Autonomous Driving with Deep Learning: A Survey of State-of-Art
  Technologies
Autonomous Driving with Deep Learning: A Survey of State-of-Art Technologies
Yu Huang
Yue Chen
3DPC
49
83
0
10 Jun 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based
  Quantized DNNs
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs
Yongkweon Jeon
Baeseong Park
S. Kwon
Byeongwook Kim
Jeongin Yun
Dongsoo Lee
MQ
25
30
0
20 May 2020
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy
  Efficient Inference
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi Zadeh
Isak Edo
Omar Mohamed Awad
Andreas Moshovos
MQ
22
183
0
08 May 2020
PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal
  Matrices
PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices
Chunhua Deng
Siyu Liao
Yi Xie
Keshab K. Parhi
Xuehai Qian
Bo Yuan
30
93
0
23 Apr 2020
Binary Neural Networks: A Survey
Binary Neural Networks: A Survey
Haotong Qin
Ruihao Gong
Xianglong Liu
Xiao Bai
Jingkuan Song
N. Sebe
MQ
50
457
0
31 Mar 2020
Squeezed Deep 6DoF Object Detection Using Knowledge Distillation
Squeezed Deep 6DoF Object Detection Using Knowledge Distillation
H. Felix
Walber M. Rodrigues
David Macêdo
Francisco Simões
Adriano Oliveira
Veronica Teichrieb
Cleber Zanchettin
3DPC
14
9
0
30 Mar 2020
Progressive Graph Convolutional Networks for Semi-Supervised Node
  Classification
Progressive Graph Convolutional Networks for Semi-Supervised Node Classification
Negar Heidari
Alexandros Iosifidis
GNN
16
14
0
27 Mar 2020
Pre-trained Models for Natural Language Processing: A Survey
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
243
1,450
0
18 Mar 2020
A Survey on Contextual Embeddings
A Survey on Contextual Embeddings
Qi Liu
Matt J. Kusner
Phil Blunsom
225
146
0
16 Mar 2020
Previous
123
Next