Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference

15 December 2017

Papers citing "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"

50 / 1,298 papers shown

Anchor-based Plain Net for Mobile Image Super-Resolution Zongcai Du Jie Liu Jie Tang Gangshan Wu SupR MQ 61 52 0 20 May 2021
BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer Haoping Bai Mengsi Cao Ping Huang Jiulong Shan MQ 83 34 0 19 May 2021
Fast and Accurate Camera Scene Detection on Smartphones Angeline Pouget Sidharth Ramesh Maximilian Giang Ramithan Chandrapalan Toni Tanner Moritz Prussing Radu Timofte Andrey D. Ignatov 3DH 57 5 0 17 May 2021
Fast and Accurate Quantized Camera Scene Detection on Smartphones, Mobile AI 2021 Challenge: Report Andrey D. Ignatov Grigory Malivenko Radu Timofte Sheng Chen Xin Xia ... K. Lyda L. Khojoyan Abhishek Thanki Sayak Paul Shahid Siddiqui MQ 90 20 0 17 May 2021
Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report Andrey D. Ignatov Grigory Malivenko D. Plowman Samarth Shukla Radu Timofte ... Tianpeng Feng Yang Liu Chuannan Sheng Jian Yin Fausto T. Benavide MDE 74 36 0 17 May 2021
Real-Time Video Super-Resolution on Smartphones with Deep Learning, Mobile AI 2021 Challenge: Report Andrey D. Ignatov Andrés Romero Heewon Kim Radu Timofte C. Ho ... Xiumei Wang Jiaming Guo Xueyi Zhou Hao Jia Youliang Yan SupR 73 54 0 17 May 2021
Real-Time Quantized Image Super-Resolution on Mobile NPUs, Mobile AI 2021 Challenge: Report Andrey D. Ignatov Radu Timofte Maurizio Denna Abdelrazak Younes A. Lek ... Kun Zeng Peirong Li Zhi-Hao Liu Shiqi Xue Shengpeng Wang SupR MQ 64 60 0 17 May 2021
Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report Andrey D. Ignatov Kim Byeoung-su Radu Timofte Angeline Pouget Fenglong Song ... Lei Lei Chaoyu Feng L. Huang Z. Lei Feifei Chen 68 30 0 17 May 2021
Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021 Challenge: Report Andrey D. Ignatov Cheng-Ming Chiang Hsien-Kai Kuo Anastasia Sycheva Radu Timofte ... K. Upla Kiran Raja Raghavendra Ramachandra Christoph Busch Etienne de Stoutz 86 48 0 17 May 2021
Texture Generation with Neural Cellular Automata A. Mordvintsev Eyvind Niklasson E. Randazzo 47 9 0 15 May 2021
Lightweight Compression of Intermediate Neural Network Features for Collaborative Intelligence R. Cohen Hyomin Choi Ivan V. Bajić 54 24 0 15 May 2021
High-Performance FPGA-based Accelerator for Bayesian Neural Networks Hongxiang Fan Martin Ferianc Miguel R. D. Rodrigues Hongyu Zhou Xinyu Niu Wayne Luk BDL 58 23 0 12 May 2021
Agatha: Smart Contract for DNN Computation Zihan Zheng Peichen Xie Xian Zhang Shuo Chen Yang Chen Xiaobing Guo Guangzhong Sun Guangyu Sun Lidong Zhou GNN 56 12 0 11 May 2021
In-Hindsight Quantization Range Estimation for Quantized Training Marios Fournarakis Markus Nagel MQ 49 10 0 10 May 2021
KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation Mengqi Xue Mingli Song Xinchao Wang Ying Chen Xingen Wang Xiuming Zhang 55 10 0 10 May 2021
Pareto-Optimal Quantized ResNet Is Mostly 4-bit AmirAli Abdolrashidi Lisa Wang Shivani Agrawal J. Malmaud Oleg Rybakov Chas Leichner Lukasz Lew MQ 71 36 0 07 May 2021
Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression Baeseong Park S. Kwon Daehwan Oh Byeongwook Kim Dongsoo Lee 63 4 0 05 May 2021
Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization Byeongwook Kim Dongsoo Lee Yeonju Ro Yongkweon Jeon S. Kwon Baeseong Park Daehwan Oh MQ 53 1 0 05 May 2021
Stealthy Backdoors as Compression Artifacts Yulong Tian Fnu Suya Fengyuan Xu David Evans 94 22 0 30 Apr 2021
AttendSeg: A Tiny Attention Condenser Neural Network for Semantic Segmentation on the Edge Xiaoyue Wen M. Famouri Andrew Hryniowski Alexander Wong SSeg 60 7 0 29 Apr 2021
Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety Sebastian Houben Stephanie Abrecht Maram Akila Andreas Bär Felix Brockherde ... Serin Varghese Michael Weber Sebastian J. Wirkert Tim Wirtz Matthias Woehrle AAML 130 58 0 29 Apr 2021
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training Jianfei Chen Lianmin Zheng Z. Yao Dequan Wang Ion Stoica Michael W. Mahoney Joseph E. Gonzalez MQ 77 75 0 29 Apr 2021
An optical neural network using less than 1 photon per multiplication Tianyu Wang Shifan Ma Logan G. Wright Tatsuhiro Onodera Brian C. Richard Peter L. McMahon 105 185 0 27 Apr 2021
HAO: Hardware-aware neural Architecture Optimization for Efficient Inference Zhen Dong Yizhao Gao Qijing Huang J. Wawrzynek Hayden Kwok-Hay So Kurt Keutzer 79 37 0 26 Apr 2021
Quantization of Deep Neural Networks for Accurate Edge Computing Wentao Chen Hailong Qiu Zhuang Jian Chutong Zhang Yu Hu Qing Lu Tianchen Wang Yiyu Shi Meiping Huang Xiaowei Xu 96 24 0 25 Apr 2021
Piggyback GAN: Efficient Lifelong Learning for Image Conditioned Generation Mengyao Zhai Lei Chen Jiawei He Megha Nawhal Frederick Tung Greg Mori CLL 67 29 0 24 Apr 2021
Do All MobileNets Quantize Poorly? Gaining Insights into the Effect of Quantization on Depthwise Separable Convolutional Networks Through the Eyes of Multi-scale Distributional Dynamics S. Yun Alexander Wong MQ 84 27 0 24 Apr 2021
Measuring what Really Matters: Optimizing Neural Networks for TinyML Lennart Heim Andreas Biri Zhongnan Qu Lothar Thiele 84 30 0 21 Apr 2021
DynO: Dynamic Onloading of Deep Neural Networks from Cloud to Device Mario Almeida Stefanos Laskaridis Stylianos I. Venieris Ilias Leontiadis Nicholas D. Lane 75 37 0 20 Apr 2021
Distilling Knowledge via Knowledge Review Pengguang Chen Shu Liu Hengshuang Zhao Jiaya Jia 220 450 0 19 Apr 2021
Filtering Empty Camera Trap Images in Embedded Systems Fagner Cunha E. M. Santos R. Barreto J. Colonna 73 14 0 18 Apr 2021
Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators David Stutz Nandhini Chandramoorthy Matthias Hein Bernt Schiele AAML MQ 70 18 0 16 Apr 2021
All-You-Can-Fit 8-Bit Flexible Floating-Point Format for Accurate and Memory-Efficient Inference of Deep Neural Networks Cheng-Wei Huang Tim-Wei Chen Juinn-Dar Huang MQ 36 6 0 15 Apr 2021
Annealing Knowledge Distillation A. Jafari Mehdi Rezagholizadeh Pranav Sharma A. Ghodsi 98 79 0 14 Apr 2021
Combined Depth Space based Architecture Search For Person Re-identification Hanjun Li Gaojie Wu Weishi Zheng 3DPC 83 107 0 09 Apr 2021
Content-Aware GAN Compression Yuchen Liu Zhixin Shu Yijun Li Zhe Lin Federico Perazzi S. Kung GAN 73 59 0 06 Apr 2021
TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT H. F. Langroudi Vedant Karia Tej Pandit Dhireesha Kudithipudi MQ 57 10 0 06 Apr 2021
Faster Convolution Inference Through Using Pre-Calculated Lookup Tables Grigor Gatchev V. Mollov VLM 39 0 0 04 Apr 2021
Inference of Recyclable Objects with Convolutional Neural Networks Jaime Caballero Francisco Vergara Randal Miranda José Serracín HAI 18 3 0 02 Apr 2021
Anytime Dense Prediction with Confidence Adaptivity Zhuang Liu Zhiqiu Xu H. Wang Trevor Darrell Evan Shelhamer 76 20 0 01 Apr 2021
Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer Phuoc Pham J. Abraham Jaeyong Chung MQ 81 13 0 01 Apr 2021
Bit-Mixer: Mixed-precision networks with runtime bit-width selection Adrian Bulat Georgios Tzimiropoulos MQ 77 27 0 31 Mar 2021
Integer-only Zero-shot Quantization for Efficient Speech Recognition Sehoon Kim A. Gholami Z. Yao Nicholas Lee Patrick Wang Aniruddha Nrusimha Bohan Zhai Tianren Gao Michael W. Mahoney Kurt Keutzer MQ 101 25 0 31 Mar 2021
Slimmable Compressive Autoencoders for Practical Neural Image Compression Feiyu Yang Luis Herranz Yongmei Cheng M. Mozerov 64 66 0 29 Mar 2021
Zero-shot Adversarial Quantization Yuang Liu Wei Zhang Jun Wang MQ 114 79 0 29 Mar 2021
Automated Backend-Aware Post-Training Quantization Ziheng Jiang Animesh Jain An Liu Josh Fromm Chengqian Ma Tianqi Chen Luis Ceze MQ 79 2 0 27 Mar 2021
A Practical Survey on Faster and Lighter Transformers Quentin Fournier G. Caron Daniel Aloise 137 105 0 26 Mar 2021
RCT: Resource Constrained Training for Edge AI Tian Huang Yaoyu Zhang Ming Yan Qiufeng Wang Rick Siow Mong Goh 82 8 0 26 Mar 2021
Distilling a Powerful Student Model via Online Knowledge Distillation Shaojie Li Mingbao Lin Yan Wang Yongjian Wu Yonghong Tian Ling Shao Rongrong Ji FedML 117 49 0 26 Mar 2021
Dynamic Domain Adaptation for Efficient Inference Shuang Li Jinming Zhang Wen-hui Ma Chi Harold Liu Wei Li 66 13 0 26 Mar 2021