arXiv:1708.04485
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
23 May 2017
A. Parashar
Minsoo Rhu
Anurag Mukkara
A. Puglielli
Rangharajan Venkatesan
Brucek Khailany
J. Emer
S. Keckler
W. Dally
Papers citing
"SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks"
50 / 121 papers shown
Title
Mitigating Edge Machine Learning Inference Bottlenecks: An Empirical Study on Accelerating Google Edge Models
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
27
28
0
01 Mar 2021
A Survey of Machine Learning for Computer Architecture and Systems
Nan Wu
Yuan Xie
AI4TS
AI4CE
20
145
0
16 Feb 2021
Activation Density based Mixed-Precision Quantization for Energy Efficient Neural Networks
Karina Vasquez
Yeshwanth Venkatesha
Abhiroop Bhattacharjee
Abhishek Moitra
Priyadarshini Panda
MQ
43
15
0
12 Jan 2021
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead
Maurizio Capra
Beatrice Bussolino
Alberto Marchisio
Guido Masera
Maurizio Martina
Muhammad Shafique
BDL
59
140
0
21 Dec 2020
FantastIC4: A Hardware-Software Co-Design Approach for Efficiently Running 4bit-Compact Multilayer Perceptrons
Simon Wiedemann
Suhas Shivapakash
P. Wiedemann
Daniel Becking
Wojciech Samek
F. Gerfers
Thomas Wiegand
MQ
23
7
0
17 Dec 2020
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Hanrui Wang
Zhekai Zhang
Song Han
43
380
0
17 Dec 2020
Robustness and Transferability of Universal Attacks on Compressed Models
Alberto G. Matachana
Kenneth T. Co
Luis Muñoz-González
David Martínez
Emil C. Lupu
AAML
29
10
0
10 Dec 2020
Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression
Cody Blakeney
Xiaomin Li
Yan Yan
Ziliang Zong
48
39
0
05 Dec 2020
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference
Yujeong Choi
Yunseong Kim
Minsoo Rhu
24
66
0
25 Oct 2020
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training
Youngeun Kwon
Yunjae Lee
Minsoo Rhu
27
40
0
25 Oct 2020
FPRaker: A Processing Element For Accelerating Neural Network Training
Omar Mohamed Awad
Mostafa Mahmoud
Isak Edo Vivancos
Ali Hadi Zadeh
Ciaran Bannon
Anand Jayarajan
Gennady Pekhimenko
Andreas Moshovos
25
15
0
15 Oct 2020
Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training
Dingqing Yang
Amin Ghasemazar
X. Ren
Maximilian Golub
G. Lemieux
Mieszko Lis
22
48
0
23 Sep 2020
Layer-specific Optimization for Mixed Data Flow with Mixed Precision in FPGA Design for CNN-based Object Detectors
Duy-Thanh Nguyen
Hyun Kim
Hyuk-Jae Lee
MQ
25
59
0
03 Sep 2020
Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise Sparsity
Cong Guo
B. Hsueh
Jingwen Leng
Yuxian Qiu
Yue Guan
Zehuan Wang
Xiaoying Jia
Xipeng Li
M. Guo
Yuhao Zhu
35
83
0
29 Aug 2020
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity Edge Devices
Parth Mannan
A. Samajdar
T. Krishna
31
2
0
27 Aug 2020
Self-Supervised GAN Compression
Chong Yu
Jeff Pool
9
9
0
03 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
59
82
0
02 Jul 2020
Generative Sparse Detection Networks for 3D Single-shot Object Detection
JunYoung Gwak
Chris Choy
Silvio Savarese
3DPC
30
101
0
22 Jun 2020
Sponge Examples: Energy-Latency Attacks on Neural Networks
Ilia Shumailov
Yiren Zhao
Daniel Bates
Nicolas Papernot
Robert D. Mullins
Ross J. Anderson
SILM
19
127
0
05 Jun 2020
SmartExchange: Trading Higher-cost Memory Storage/Access for Lower-cost Computation
Yang Katie Zhao
Xiaohan Chen
Yue Wang
Chaojian Li
Haoran You
Y. Fu
Yuan Xie
Zhangyang Wang
Yingyan Lin
MQ
40
43
0
07 May 2020
Computation on Sparse Neural Networks: an Inspiration for Future Hardware
Fei Sun
Minghai Qin
Tianyun Zhang
Liu Liu
Yen-kuang Chen
Yuan Xie
34
7
0
24 Apr 2020
PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices
Chunhua Deng
Siyu Liao
Yi Xie
Keshab K. Parhi
Xuehai Qian
Bo Yuan
38
93
0
23 Apr 2020
A Survey on Impact of Transient Faults on BNN Inference Accelerators
N. Khoshavi
Connor Broyles
Yu Bi
27
8
0
10 Apr 2020
Dithered backprop: A sparse and quantized backpropagation algorithm for more efficient deep neural network training
Simon Wiedemann
Temesgen Mehari
Kevin Kepp
Wojciech Samek
27
18
0
09 Apr 2020
Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications
Chinthaka Gamanayake
Lahiru Jayasinghe
Benny Kai Kiat Ng
Chau Yuen
VLM
23
45
0
05 Mar 2020
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices
Byung Hoon Ahn
Jinwon Lee
J. Lin
Hsin-Pai Cheng
Jilei Hou
H. Esmaeilzadeh
76
55
0
04 Mar 2020
A^3: Accelerating Attention Mechanisms in Neural Networks with Approximation
Tae Jun Ham
Sungjun Jung
Seonghak Kim
Young H. Oh
Yeonhong Park
...
Jung-Hun Park
Sanghee Lee
Kyoung Park
Jae W. Lee
D. Jeong
24
213
0
22 Feb 2020
Co-Exploration of Neural Architectures and Heterogeneous ASIC Accelerator Designs Targeting Multiple Tasks
Lei Yang
Zheyu Yan
Meng Li
Hyoukjun Kwon
Liangzhen Lai
T. Krishna
Vikas Chandra
Weiwen Jiang
Yiyu Shi
32
114
0
10 Feb 2020
Activation Density driven Energy-Efficient Pruning in Training
Timothy Foldy-Porto
Yeshwanth Venkatesha
Priyadarshini Panda
8
4
0
07 Feb 2020
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
Xiaolong Ma
Wei Niu
Tianyun Zhang
Sijia Liu
Sheng Lin
...
Xiang Chen
Jian Tang
Kaisheng Ma
Bin Ren
Yanzhi Wang
35
27
0
20 Jan 2020
DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference
Udit Gupta
Samuel Hsia
V. Saraph
Xiaodong Wang
Brandon Reagen
Gu-Yeon Wei
Hsien-Hsin S. Lee
David Brooks
Carole-Jean Wu
GNN
36
188
0
08 Jan 2020
Sparse Weight Activation Training
Md Aamir Raihan
Tor M. Aamodt
34
73
0
07 Jan 2020
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Wei Niu
Xiaolong Ma
Sheng Lin
Shihao Wang
Xuehai Qian
X. Lin
Yanzhi Wang
Bin Ren
MQ
35
227
0
01 Jan 2020
TOCO: A Framework for Compressing Neural Network Models Based on Tolerance Analysis
Soroosh Khoram
J. Li
16
1
0
18 Dec 2019
SPEC2: SPECtral SParsE CNN Accelerator on FPGAs
Yue Niu
Hanqing Zeng
Ajitesh Srivastava
Kartik Lakhotia
Rajgopal Kannan
Yanzhi Wang
Viktor Prasanna
MQ
16
8
0
16 Oct 2019
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices
Xiaolong Ma
Fu-Ming Guo
Wei Niu
Xue Lin
Jian Tang
Kaisheng Ma
Bin Ren
Yanzhi Wang
CVBM
27
173
0
06 Sep 2019
Machine Learning at the Network Edge: A Survey
M. G. Sarwar Murshed
Chris Murphy
Daqing Hou
Nazar Khan
Ganesh Ananthanarayanan
Faraz Hussain
38
378
0
31 Jul 2019
AI Enabling Technologies: A Survey
V. Gadepally
Justin A. Goodwin
J. Kepner
Albert Reuther
Hayley Reynolds
S. Samsi
Jonathan Su
David Martinez
27
24
0
08 May 2019
Octree guided CNN with Spherical Kernels for 3D Point Clouds
Huan Lei
Naveed Akhtar
Ajmal Mian
3DPC
19
141
0
28 Feb 2019
FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning
P. Whatmough
Chuteng Zhou
Patrick Hansen
S. Venkataramanaiah
Jae-sun Seo
Matthew Mattina
15
57
0
27 Feb 2019
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Ao Ren
Tianyun Zhang
Shaokai Ye
Jiayu Li
Wenyao Xu
Xuehai Qian
X. Lin
Yanzhi Wang
MQ
34
161
0
31 Dec 2018
Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Jongsoo Park
Maxim Naumov
Protonu Basu
Summer Deng
Aravind Kalaiah
...
Lin Qiao
Vijay Rao
Nadav Rotem
S. Yoo
M. Smelyanskiy
FedML
GNN
BDL
20
186
0
24 Nov 2018
Analyzing Machine Learning Workloads Using a Detailed GPU Simulator
Jonathan Lew
Deval Shah
Suchita Pati
Shaylin Cattell
Mengchi Zhang
...
Christopher Ng
Negar Goli
Matthew D. Sinclair
Timothy G. Rogers
Tor M. Aamodt
29
65
0
18 Nov 2018
Tetris: Re-architecting Convolutional Neural Network Computation for Machine Learning Accelerators
Hang Lu
Xin Wei
Ning Lin
Guihai Yan
Xiaowei Li
MQ
20
31
0
14 Nov 2018
SCALE-Sim: Systolic CNN Accelerator Simulator
A. Samajdar
Yuhao Zhu
P. Whatmough
Matthew Mattina
Tushar Krishna
30
137
0
16 Oct 2018
Morph: Flexible Acceleration for 3D CNN-based Video Understanding
Kartik Hegde
R. Agrawal
Yulun Yao
Christopher W. Fletcher
30
71
0
16 Oct 2018
Dynamic Channel Pruning: Feature Boosting and Suppression
Xitong Gao
Yiren Zhao
L. Dudziak
Robert D. Mullins
Chengzhong Xu
42
311
0
12 Oct 2018
Mini-batch Serialization: CNN Training with Inter-layer Data Reuse
Sangkug Lym
Armand Behroozi
W. Wen
Ge Li
Yongkee Kwon
M. Erez
12
25
0
30 Sep 2018
To compress or not to compress: Understanding the Interactions between Adversarial Attacks and Neural Network Compression
Yiren Zhao
Ilia Shumailov
Robert D. Mullins
Ross J. Anderson
AAML
11
43
0
29 Sep 2018
Interstellar: Using Halide's Scheduling Language to Analyze DNN Accelerators
Xuan S. Yang
Mingyu Gao
Qiaoyi Liu
Jeff Setter
Jing Pu
...
Kaidi Cao
Heonjae Ha
Priyanka Raina
Christos Kozyrakis
M. Horowitz
24
226
0
10 Sep 2018