Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.06038
Cited By
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
9 May 2024
Xue Geng
Zhe Wang
Chunyun Chen
Qing Xu
Kaixin Xu
Chao Jin
Manas Gupta
Xulei Yang
Zhenghua Chen
M. Aly
Jie Lin
Min-man Wu
Xiaoli Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks"
50 / 103 papers shown
Title
Importance Estimation for Neural Network Pruning
Pavlo Molchanov
Arun Mallya
Stephen Tyree
I. Frosio
Jan Kautz
3DPC
78
879
0
25 Jun 2019
Data-Free Quantization Through Weight Equalization and Bias Correction
Markus Nagel
M. V. Baalen
Tijmen Blankevoort
Max Welling
MQ
55
509
0
11 Jun 2019
Distilling Object Detectors with Fine-grained Feature Imitation
Tao Wang
Li-xin Yuan
Xiaopeng Zhang
Jiashi Feng
ObjD
51
381
0
09 Jun 2019
Discovering Neural Wirings
Mitchell Wortsman
Ali Farhadi
Mohammad Rastegari
AI4CE
96
121
0
03 Jun 2019
Dimensionality compression and expansion in Deep Neural Networks
Stefano Recanatesi
M. Farrell
Madhu S. Advani
Timothy Moore
Guillaume Lajoie
E. Shea-Brown
49
73
0
02 Jun 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DV
MedIm
131
18,058
0
28 May 2019
Zero-Shot Knowledge Distillation in Deep Networks
Gaurav Kumar Nayak
Konda Reddy Mopuri
Vaisakh Shaj
R. Venkatesh Babu
Anirban Chakraborty
73
245
0
20 May 2019
DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression
Simon Wiedemann
H. Kirchhoffer
Stefan Matlage
Paul Haase
Arturo Marbán
...
Ahmed Osman
D. Marpe
H. Schwarz
Thomas Wiegand
Wojciech Samek
MQ
46
21
0
15 May 2019
Encrypted Speech Recognition using Deep Polynomial Networks
Shi-Xiong Zhang
Jiawei Liu
Dong Yu
52
26
0
11 May 2019
Relational Knowledge Distillation
Wonpyo Park
Dongju Kim
Yan Lu
Minsu Cho
63
1,405
0
10 Apr 2019
Adaptive NMS: Refining Pedestrian Detection in a Crowd
Songtao Liu
Di Huang
Yunhong Wang
46
284
0
07 Apr 2019
Correlation Congruence for Knowledge Distillation
Baoyun Peng
Xiao Jin
Jiaheng Liu
Shunfeng Zhou
Yichao Wu
Yu Liu
Dongsheng Li
Zhaoning Zhang
86
510
0
03 Apr 2019
Towards Optimal Structured CNN Pruning via Generative Adversarial Learning
Shaohui Lin
Rongrong Ji
Chenqian Yan
Baochang Zhang
Liujuan Cao
QiXiang Ye
Feiyue Huang
David Doermann
CVBM
53
507
0
22 Mar 2019
Learned Step Size Quantization
S. K. Esser
J. McKinstry
Deepika Bablani
R. Appuswamy
D. Modha
MQ
71
798
0
21 Feb 2019
Slimmable Neural Networks
Jiahui Yu
L. Yang
N. Xu
Jianchao Yang
Thomas Huang
71
552
0
21 Dec 2018
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Han Cai
Ligeng Zhu
Song Han
93
1,867
0
02 Dec 2018
Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search
Bichen Wu
Yanghan Wang
Peizhao Zhang
Yuandong Tian
Peter Vajda
Kurt Keutzer
MQ
64
273
0
30 Nov 2018
Faster CryptoNets: Leveraging Sparsity for Real-World Encrypted Inference
Edward Chou
Josh Beal
Daniel Levy
Serena Yeung
Albert Haque
Li Fei-Fei
45
198
0
25 Nov 2018
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Kuan-Chieh Wang
Zhijian Liu
Chengyue Wu
Ji Lin
Song Han
MQ
115
880
0
21 Nov 2018
Learning to Screen for Fast Softmax Inference on Large Vocabulary Neural Networks
Patrick H. Chen
Si Si
Sanjiv Kumar
Yang Li
Cho-Jui Hsieh
42
21
0
29 Oct 2018
SNIP: Single-shot Network Pruning based on Connection Sensitivity
Namhoon Lee
Thalaiyasingam Ajanthan
Philip Torr
VLM
243
1,196
0
04 Oct 2018
Bounding Box Regression with Uncertainty for Accurate Object Detection
Yihui He
Chenchen Zhu
Jianren Wang
Marios Savvides
Xinming Zhang
ObjD
74
468
0
23 Sep 2018
Design Flow of Accelerating Hybrid Extremely Low Bit-width Neural Network in Embedded FPGA
Junsong Wang
Qiuwen Lou
Xiaofan Zhang
Chao Zhu
Yonghua Lin
Deming Chen
MQ
55
93
0
31 Jul 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
114
3,004
0
31 Jul 2018
Acquisition of Localization Confidence for Accurate Object Detection
Borui Jiang
Ruixuan Luo
Jiayuan Mao
Tete Xiao
Yuning Jiang
ObjD
48
850
0
30 Jul 2018
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Dongqing Zhang
Jiaolong Yang
Dongqiangzi Ye
G. Hua
MQ
59
703
0
26 Jul 2018
Coreset-Based Neural Network Compression
Abhimanyu Dubey
Moitreya Chatterjee
Narendra Ahuja
57
81
0
25 Jul 2018
DARTS: Differentiable Architecture Search
Hanxiao Liu
Karen Simonyan
Yiming Yang
185
4,345
0
24 Jun 2018
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
Jonathan Frankle
Michael Carbin
219
3,457
0
09 Mar 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li Li
Song Han
86
1,348
0
10 Feb 2018
Efficient Neural Architecture Search via Parameter Sharing
Hieu H. Pham
M. Guan
Barret Zoph
Quoc V. Le
J. Dean
107
2,761
0
09 Feb 2018
Relation Networks for Object Detection
Han Hu
Jiayuan Gu
Zheng Zhang
Jifeng Dai
Yichen Wei
ObjD
102
1,221
0
30 Nov 2017
Improving Object Localization with Fitness NMS and Bounded IoU Loss
Lachlan Tychsen-Smith
L. Petersson
58
176
0
01 Nov 2017
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu
Suyog Gupta
185
1,275
0
05 Oct 2017
Deep Mutual Learning
Ying Zhang
Tao Xiang
Timothy M. Hospedales
Huchuan Lu
FedML
143
1,650
0
01 Jun 2017
Learning non-maximum suppression
J. Hosang
Rodrigo Benenson
Bernt Schiele
ObjD
54
519
0
08 May 2017
Soft-NMS -- Improving Object Detection With One Line of Code
Navaneeth Bodla
Bharat Singh
Rama Chellappa
L. Davis
ObjD
83
1,786
0
14 Apr 2017
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
Vivienne Sze
Yu-hsin Chen
Tien-Ju Yang
J. Emer
AAML
3DV
113
3,013
0
27 Mar 2017
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
382
1,050
0
10 Feb 2017
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer
Sergey Zagoruyko
N. Komodakis
113
2,569
0
12 Dec 2016
Challenges and Opportunities in Edge Computing
Blesson Varghese
Nan Wang
Sakil Barbhuiya
Peter Kilpatrick
Dimitrios S. Nikolopoulos
62
461
0
07 Sep 2016
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
PINN
3DV
729
36,708
0
25 Aug 2016
Ternary Weight Networks
Fengfu Li
Bin Liu
Xiaoxing Wang
Bo Zhang
Junchi Yan
MQ
68
525
0
16 May 2016
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
Mohammad Rastegari
Vicente Ordonez
Joseph Redmon
Ali Farhadi
MQ
159
4,350
0
16 Mar 2016
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
F. Iandola
Song Han
Matthew W. Moskewicz
Khalid Ashraf
W. Dally
Kurt Keutzer
139
7,465
0
24 Feb 2016
EIE: Efficient Inference Engine on Compressed Deep Neural Network
Song Han
Xingyu Liu
Huizi Mao
Jing Pu
A. Pedram
M. Horowitz
W. Dally
118
2,455
0
04 Feb 2016
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han
Huizi Mao
W. Dally
3DGS
245
8,821
0
01 Oct 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
VLM
296
18,587
0
06 Feb 2015
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Nicolas Vasilache
Jeff Johnson
Michaël Mathieu
Soumith Chintala
Serkan Piantino
Yann LeCun
50
347
0
24 Dec 2014
FitNets: Hints for Thin Deep Nets
Adriana Romero
Nicolas Ballas
Samira Ebrahimi Kahou
Antoine Chassang
C. Gatta
Yoshua Bengio
FedML
288
3,870
0
19 Dec 2014
Previous
1
2
3
Next