Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.05128
Cited By
Designing Energy-Efficient Convolutional Neural Networks using Energy-Aware Pruning
16 November 2016
Tien-Ju Yang
Yu-hsin Chen
Vivienne Sze
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Designing Energy-Efficient Convolutional Neural Networks using Energy-Aware Pruning"
50 / 122 papers shown
Title
Activation Density based Mixed-Precision Quantization for Energy Efficient Neural Networks
Karina Vasquez
Yeshwanth Venkatesha
Abhiroop Bhattacharjee
Abhishek Moitra
Priyadarshini Panda
MQ
48
15
0
12 Jan 2021
I-BERT: Integer-only BERT Quantization
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
107
345
0
05 Jan 2021
FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training
Y. Fu
Haoran You
Yang Zhao
Yue Wang
Chaojian Li
K. Gopalakrishnan
Zhangyang Wang
Yingyan Lin
MQ
38
32
0
24 Dec 2020
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead
Maurizio Capra
Beatrice Bussolino
Alberto Marchisio
Guido Masera
Maurizio Martina
Mohamed Bennai
BDL
64
140
0
21 Dec 2020
Enabling Retrain-free Deep Neural Network Pruning using Surrogate Lagrangian Relaxation
Deniz Gurevin
Shangli Zhou
Lynn Pepin
Bingbing Li
Mikhail A. Bragin
Caiwen Ding
Fei Miao
26
3
0
18 Dec 2020
Bringing AI To Edge: From Deep Learning's Perspective
Di Liu
Hao Kong
Xiangzhong Luo
Weichen Liu
Ravi Subramaniam
57
117
0
25 Nov 2020
Third ArchEdge Workshop: Exploring the Design Space of Efficient Deep Neural Networks
Fuxun Yu
Dimitrios Stamoulis
Di Wang
Dimitrios Lymberopoulos
Xiang Chen
3DV
27
1
0
22 Nov 2020
GECKO: Reconciling Privacy, Accuracy and Efficiency in Embedded Deep Learning
Vasisht Duddu
A. Boutet
Virat Shejwalkar
GNN
24
4
0
02 Oct 2020
Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training
Dingqing Yang
Amin Ghasemazar
X. Ren
Maximilian Golub
G. Lemieux
Mieszko Lis
22
48
0
23 Sep 2020
Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise Sparsity
Cong Guo
B. Hsueh
Jingwen Leng
Yuxian Qiu
Yue Guan
Zehuan Wang
Xiaoying Jia
Xipeng Li
Minyi Guo
Yuhao Zhu
35
83
0
29 Aug 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
64
82
0
02 Jul 2020
The Ramifications of Making Deep Neural Networks Compact
N. Jha
Sparsh Mittal
Govardhan Mattela
18
14
0
26 Jun 2020
Principal Component Networks: Parameter Reduction Early in Training
R. Waleffe
Theodoros Rekatsinas
3DPC
19
9
0
23 Jun 2020
AdaDeep: A Usage-Driven, Automated Deep Model Compression Framework for Enabling Ubiquitous Intelligent Mobiles
Sicong Liu
Junzhao Du
Kaiming Nan
Zimu Zhou
Zhangyang Wang
Yingyan Lin
32
30
0
08 Jun 2020
Effective and Efficient Computation with Multiple-timescale Spiking Recurrent Neural Networks
Bojian Yin
Federico Corradi
Sander M. Bohté
22
99
0
24 May 2020
AOWS: Adaptive and optimal network width search with latency constraints
Maxim Berman
Leonid Pischulin
N. Xu
Matthew B. Blaschko
Gérard Medioni
41
29
0
21 May 2020
Pruning Algorithms to Accelerate Convolutional Neural Networks for Edge Applications: A Survey
Jiayi Liu
S. Tripathi
Unmesh Kurup
Mohak Shah
3DPC
MedIm
30
52
0
08 May 2020
TIMELY: Pushing Data Movements and Interfaces in PIM Accelerators Towards Local and in Time Domain
Weitao Li
Pengfei Xu
Yang Zhao
Haitong Li
Yuan Xie
Yingyan Lin
17
69
0
03 May 2020
Deploying Image Deblurring across Mobile Devices: A Perspective of Quality and Latency
Cheng-Ming Chiang
Yu-Wen Tseng
Yu-Syuan Xu
Hsien-Kai Kuo
Yi-Min Tsai
...
Chia-Lin Yu
B. Shen
Kloze Kao
Chia-Ming Cheng
Hung-Jen Chen
35
22
0
27 Apr 2020
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions
Alvin Wan
Xiaoliang Dai
Peizhao Zhang
Zijian He
Yuandong Tian
...
Matthew Yu
Tao Xu
Kan Chen
Peter Vajda
Joseph E. Gonzalez
24
289
0
12 Apr 2020
Energy Predictive Models for Convolutional Neural Networks on Mobile Platforms
Crefeda Faviola Rodrigues
Graham D. Riley
M. Luján
HAI
9
2
0
10 Apr 2020
Comparing Rewinding and Fine-tuning in Neural Network Pruning
Alex Renda
Jonathan Frankle
Michael Carbin
237
383
0
05 Mar 2020
Sparse Optimization for Green Edge AI Inference
Xiangyu Yang
Sheng Hua
Yuanming Shi
Hao Wang
Jun Zhang
Khaled B. Letaief
29
14
0
24 Feb 2020
Proving the Lottery Ticket Hypothesis: Pruning is All You Need
Eran Malach
Gilad Yehudai
Shai Shalev-Shwartz
Ohad Shamir
64
272
0
03 Feb 2020
PreVIous: A Methodology for Prediction of Visual Inference Performance on IoT Devices
Delia Velasco-Montero
Jorge Fernández-Berni
R. Carmona-Galán
Á. Rodríguez-Vázquez
30
21
0
13 Dec 2019
Optimizing the energy consumption of spiking neural networks for neuromorphic applications
M. Sorbaro
Li-Yu Daisy Liu
Massimo Bortone
Sadique Sheik
26
67
0
03 Dec 2019
On-Device Machine Learning: An Algorithms and Learning Theory Perspective
Sauptik Dhar
Junyao Guo
Jiayi Liu
S. Tripathi
Unmesh Kurup
Mohak Shah
33
141
0
02 Nov 2019
Stochastic Channel-Based Federated Learning for Medical Data Privacy Preserving
Rulin Shao
Hongyu Hè
Hui Liu
Dianbo Liu
FedML
OOD
25
13
0
23 Oct 2019
Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks
Yihui He
Jianing Qian
Jianren Wang
Cindy X. Le
Congrui Hetang
Qi Lyu
Wenping Wang
Tianwei Yue
53
11
0
21 Oct 2019
SPEC2: SPECtral SParsE CNN Accelerator on FPGAs
Yue Niu
Hanqing Zeng
Ajitesh Srivastava
Kartik Lakhotia
Rajgopal Kannan
Yanzhi Wang
Viktor Prasanna
MQ
21
8
0
16 Oct 2019
EDEN: Enabling Energy-Efficient, High-Performance Deep Neural Network Inference Using Approximate DRAM
Skanda Koppula
Lois Orosa
A. G. Yaglikçi
Roknoddin Azizi
Taha Shahroodi
Konstantinos Kanellopoulos
O. Mutlu
32
105
0
12 Oct 2019
Energy-Aware Neural Architecture Optimization with Fast Splitting Steepest Descent
Dilin Wang
Meng Li
Lemeng Wu
Vikas Chandra
Qiang Liu
46
20
0
07 Oct 2019
How does topology influence gradient propagation and model performance of deep networks with DenseNet-type skip connections?
Kartikeya Bhardwaj
Guihong Li
R. Marculescu
38
1
0
02 Oct 2019
Global Sparse Momentum SGD for Pruning Very Deep Neural Networks
Xiaohan Ding
Guiguang Ding
Xiangxin Zhou
Yuchen Guo
Jungong Han
Ji Liu
20
162
0
27 Sep 2019
Energy-Efficient Processing and Robust Wireless Cooperative Transmission for Edge Inference
Kai Yang
Yuanming Shi
Wei Yu
Z. Ding
18
42
0
29 Jul 2019
Memory- and Communication-Aware Model Compression for Distributed Deep Learning Inference on IoT
Kartikeya Bhardwaj
Chingyi Lin
A. L. Sartor
R. Marculescu
GNN
29
51
0
26 Jul 2019
Similarity-Preserving Knowledge Distillation
Frederick Tung
Greg Mori
48
961
0
23 Jul 2019
SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers
Igor Fedorov
Ryan P. Adams
Matthew Mattina
P. Whatmough
23
166
0
28 May 2019
Edge Intelligence: Paving the Last Mile of Artificial Intelligence with Edge Computing
Zhi Zhou
Xu Chen
En Li
Liekang Zeng
Ke Luo
Junshan Zhang
44
1,421
0
24 May 2019
An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object Detection
Youngwan Lee
Joong-won Hwang
Sangrok Lee
Yuseok Bae
Jongyoul Park
PINN
ObjD
24
358
0
22 Apr 2019
Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM
Shaokai Ye
Xiaoyu Feng
Tianyun Zhang
Xiaolong Ma
Sheng Lin
...
Jian Tang
M. Fardad
Xinyu Lin
Yongpan Liu
Yanzhi Wang
MQ
38
38
0
23 Mar 2019
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM
Hongxu Yin
Guoyang Chen
Yingmin Li
Shuai Che
Weifeng Zhang
N. Jha
36
10
0
30 Jan 2019
Trading-off Accuracy and Energy of Deep Inference on Embedded Systems: A Co-Design Approach
Nitthilan Kanappan Jayakodi
Anwesha Chatterjee
Wonje Choi
J. Doppa
P. Pande
19
27
0
29 Jan 2019
Towards Compact ConvNets via Structure-Sparsity Regularized Filter Pruning
Shaohui Lin
Rongrong Ji
Yuchao Li
Cheng Deng
Xuelong Li
41
70
0
23 Jan 2019
Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks
Xue Geng
Jie Fu
Bin Zhao
Jie Lin
M. Aly
C. Pal
V. Chandrasekhar
MQ
24
5
0
04 Jan 2019
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Ao Ren
Tianyun Zhang
Shaokai Ye
Jiayu Li
Wenyao Xu
Xuehai Qian
Xinyu Lin
Yanzhi Wang
MQ
40
161
0
31 Dec 2018
ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation
Xiaoliang Dai
Peizhao Zhang
Bichen Wu
Hongxu Yin
Fei Sun
...
Yiming Wu
Yangqing Jia
Peter Vajda
M. Uyttendaele
N. Jha
33
272
0
21 Dec 2018
SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks
M. Park
Xiaofang Xu
C. Brick
MQ
27
8
0
20 Dec 2018
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression
Yuchao Li
Shaohui Lin
Baochang Zhang
Jianzhuang Liu
David Doermann
Yongjian Wu
Feiyue Huang
Rongrong Ji
43
130
0
11 Dec 2018
Wireless Network Intelligence at the Edge
Jihong Park
S. Samarakoon
M. Bennis
Mérouane Debbah
25
518
0
07 Dec 2018
Previous
1
2
3
Next