Accelerating Deep Convolutional Networks using low-precision and sparsity

2 October 2016

Papers citing "Accelerating Deep Convolutional Networks using low-precision and sparsity"

18 / 18 papers shown

Title
TinyMLOps: Operational Challenges for Widespread Edge AI Adoption Sam Leroux Pieter Simoens Meelis Lootus Kartik Thakore Akshay Sharma 37 16 0 21 Mar 2022
Energy awareness in low precision neural networks Nurit Spingarn-Eliezer Ron Banner Elad Hoffer Hilla Ben-Yaacov T. Michaeli 41 0 0 06 Feb 2022
Pruning and Quantization for Deep Neural Network Acceleration: A Survey Tailin Liang C. Glossner Lei Wang Shaobo Shi Xiaotong Zhang MQ 150 676 0 24 Jan 2021
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference Yujeong Choi Yunseong Kim Minsoo Rhu 24 66 0 25 Oct 2020
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning Shauharda Khadka Estelle Aflalo Mattias Marder Avrech Ben-David Santiago Miret Shie Mannor Tamir Hazan Hanlin Tang Somdeb Majumdar GNN 29 11 0 14 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights Shail Dave Riyadh Baghdadi Tony Nowatzki Sasikanth Avancha Aviral Shrivastava Baoxin Li 64 82 0 02 Jul 2020
Unrolling Ternary Neural Networks Stephen Tridgell M. Kumm M. Hardieck David Boland Duncan J. M. Moss P. Zipf Philip H. W. Leong 27 26 0 09 Sep 2019
Rethinking Arithmetic for Deep Neural Networks George A. Constantinides 34 4 0 07 May 2019
Evolutionary Cell Aided Design for Neural Network Architectures Philip Colangelo Oren Segal Alexander Speicher M. Margala 11 3 0 06 Mar 2019
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks Julian Faraone Nicholas J. Fraser Michaela Blott Philip H. W. Leong MQ 33 133 0 01 Jul 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine Renzo Andri Lukas Cavigelli D. Rossi Luca Benini MQ 24 19 0 05 Mar 2018
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning Hyeontaek Lim D. Andersen M. Kaminsky 21 70 0 21 Feb 2018
VIBNN: Hardware Acceleration of Bayesian Neural Networks R. Cai Ao Ren Ning Liu Caiwen Ding Luhao Wang Xuehai Qian Massoud Pedram Yanzhi Wang BDL 46 87 0 02 Feb 2018
WRPN: Wide Reduced-Precision Networks Asit K. Mishra Eriko Nurvitadhi Jeffrey J. Cook Debbie Marr MQ 39 266 0 04 Sep 2017
BitNet: Bit-Regularized Deep Neural Networks Aswin Raghavan Mohamed R. Amer S. Chai Graham Taylor MQ 38 10 0 16 Aug 2017
Bayesian Compression for Deep Learning Christos Louizos Karen Ullrich Max Welling UQCV BDL 23 479 0 24 May 2017
Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations Liangzhen Lai Naveen Suda Vikas Chandra MQ 33 85 0 08 Mar 2017
Mixed Low-precision Deep Learning Inference using Dynamic Fixed Point Naveen Mellempudi Abhisek Kundu Dipankar Das Dheevatsa Mudigere Bharat Kaul MQ 35 30 0 31 Jan 2017