arXiv:1610.00324
Accelerating Deep Convolutional Networks using low-precision and sparsity
2 October 2016
Ganesh Venkatesh
Eriko Nurvitadhi
Debbie Marr
Papers citing
"Accelerating Deep Convolutional Networks using low-precision and sparsity"
18 / 18 papers shown
TinyMLOps: Operational Challenges for Widespread Edge AI Adoption
Sam Leroux
Pieter Simoens
Meelis Lootus
Kartik Thakore
Akshay Sharma
37
16
0
21 Mar 2022
Energy awareness in low precision neural networks
Nurit Spingarn-Eliezer
Ron Banner
Elad Hoffer
Hilla Ben-Yaacov
T. Michaeli
41
0
0
06 Feb 2022
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
150
676
0
24 Jan 2021
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference
Yujeong Choi
Yunseong Kim
Minsoo Rhu
24
66
0
25 Oct 2020
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Shauharda Khadka
Estelle Aflalo
Mattias Marder
Avrech Ben-David
Santiago Miret
Shie Mannor
Tamir Hazan
Hanlin Tang
Somdeb Majumdar
GNN
29
11
0
14 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
64
82
0
02 Jul 2020
Unrolling Ternary Neural Networks
Stephen Tridgell
M. Kumm
M. Hardieck
David Boland
Duncan J. M. Moss
P. Zipf
Philip H. W. Leong
27
26
0
09 Sep 2019
Rethinking Arithmetic for Deep Neural Networks
George A. Constantinides
34
4
0
07 May 2019
Evolutionary Cell Aided Design for Neural Network Architectures
Philip Colangelo
Oren Segal
Alexander Speicher
M. Margala
11
3
0
06 Mar 2019
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
MQ
33
133
0
01 Jul 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine
Renzo Andri
Lukas Cavigelli
D. Rossi
Luca Benini
MQ
24
19
0
05 Mar 2018
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning
Hyeontaek Lim
D. Andersen
M. Kaminsky
21
70
0
21 Feb 2018
VIBNN: Hardware Acceleration of Bayesian Neural Networks
R. Cai
Ao Ren
Ning Liu
Caiwen Ding
Luhao Wang
Xuehai Qian
Massoud Pedram
Yanzhi Wang
BDL
46
87
0
02 Feb 2018
WRPN: Wide Reduced-Precision Networks
Asit K. Mishra
Eriko Nurvitadhi
Jeffrey J. Cook
Debbie Marr
MQ
39
266
0
04 Sep 2017
BitNet: Bit-Regularized Deep Neural Networks
Aswin Raghavan
Mohamed R. Amer
S. Chai
Graham Taylor
MQ
38
10
0
16 Aug 2017
Bayesian Compression for Deep Learning
Christos Louizos
Karen Ullrich
Max Welling
UQCV
BDL
23
479
0
24 May 2017
Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations
Liangzhen Lai
Naveen Suda
Vikas Chandra
MQ
33
85
0
08 Mar 2017
Mixed Low-precision Deep Learning Inference using Dynamic Fixed Point
Naveen Mellempudi
Abhisek Kundu
Dipankar Das
Dheevatsa Mudigere
Bharat Kaul
MQ
35
30
0
31 Jan 2017