Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference

15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
    MQ

Papers citing "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"

50 / 1,298 papers shown
Title
Integer-arithmetic-only Certified Robustness for Quantized Neural Networks
Haowen Lin
Jian Lou
Li Xiong
Cyrus Shahabi
MQ AAML
54
13
0
21 Aug 2021
Quantization Backdoors to Deep Learning Commercial Frameworks
Hua Ma
Huming Qiu
Yansong Gao
Zhi-Li Zhang
A. Abuadbba
Minhui Xue
Anmin Fu
Jiliang Zhang
S. Al-Sarawi
Derek Abbott
MQ
124
21
0
20 Aug 2021
An Information Theory-inspired Strategy for Automatic Network Pruning
Xiawu Zheng
Yuexiao Ma
Teng Xi
Gang Zhang
Errui Ding
Yuchao Li
Jie Chen
Yonghong Tian
Rongrong Ji
207
13
0
19 Aug 2021
A Survey on GAN Acceleration Using Memory Compression Technique
Dina Tantawy
Mohamed Zahran
A. Wassal
86
8
0
14 Aug 2021
FOX-NAS: Fast, On-device and Explainable Neural Architecture Search
Chia-Hsiang Liu
Yu-Shin Han
Yuan-Yao Sung
Yi Lee
Hung-Yueh Chiang
Kai-Chiang Wu
61
12
0
14 Aug 2021
Pruning vs XNOR-Net: A Comprehensive Study of Deep Learning for Audio Classification on Edge-devices
Md Mohaimenuzzaman
Christoph Bergmeir
B. Meyer
61
21
0
13 Aug 2021
Audio Spectral Enhancement: Leveraging Autoencoders for Low Latency Reconstruction of Long, Lossy Audio Sequences
Darshan Deshpande
H. Abichandani
23
0
0
08 Aug 2021
Tiny Neural Models for Seq2Seq
A. Kandoor
38
0
0
07 Aug 2021
Developing a Compressed Object Detection Model based on YOLOv4 for Deployment on Embedded GPU Platform of Autonomous System
Issac Sim
Junho Lim
Young-Wan Jang
Jihwan You
Seontaek Oh
Young-Keun Kim
53
7
0
01 Aug 2021
Adaptive Precision Training (AdaPT): A dynamic fixed point quantized training approach for DNNs
Lorenz Kummer
Kevin Sidak
Tabea Reichmann
Wilfried Gansterer
MQ
70
6
0
28 Jul 2021
Dynamic Neural Network Architectural and Topological Adaptation and Related Methods -- A Survey
Lorenz Kummer
AI4CE
73
0
0
28 Jul 2021
Improving Variational Autoencoder based Out-of-Distribution Detection for Embedded Real-time Applications
Yeli Feng
Daniel Jun Xian Ng
Arvind Easwaran
OODD
84
18
0
25 Jul 2021
Bias Loss for Mobile Neural Networks
L. Abrahamyan
Valentin Ziatchin
Yiming Chen
Nikos Deligiannis
45
14
0
23 Jul 2021
Positive/Negative Approximate Multipliers for DNN Accelerators
Ourania Spantidi
Georgios Zervakis
Iraklis Anagnostopoulos
H. Amrouch
J. Henkel
45
19
0
20 Jul 2021
Double Similarity Distillation for Semantic Image Segmentation
Yingchao Feng
Xian Sun
Wenhui Diao
Jihao Li
Xin Gao
48
63
0
19 Jul 2021
A High-Performance Adaptive Quantization Approach for Edge CNN Applications
Hsu-Hsun Chin
R. Tsay
Hsin-I Wu
MQ
52
5
0
18 Jul 2021
Dynamic Transformer for Efficient Machine Translation on Embedded Devices
Hishan Parry
Lei Xun
Amin Sabet
Jia Bi
Jonathon S. Hare
G. Merrett
57
7
0
17 Jul 2021
Training for temporal sparsity in deep neural networks, application in video processing
Amirreza Yousefzadeh
Manolis Sifalakis
67
3
0
15 Jul 2021
Joint Matrix Decomposition for Deep Convolutional Neural Networks Compression
Shaowu Chen
Jihao Zhou
Weize Sun
Lei Huang
45
21
0
09 Jul 2021
Weight Reparametrization for Budget-Aware Network Pruning
Robin Dupont
H. Sahbi
Guillaume Michel
43
1
0
08 Jul 2021
Differentiable Architecture Pruning for Transfer Learning
Nicolo Colombo
Yang Gao
62
2
0
07 Jul 2021
Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation
Zhiwei Hao
Jianyuan Guo
Ding Jia
Kai Han
Yehui Tang
Chao Zhang
Dacheng Tao
Yunhe Wang
ViT
140
73
0
03 Jul 2021
Pool of Experts: Realtime Querying Specialized Knowledge in Massive Neural Networks
Hakbin Kim
Dong-Wan Choi
46
2
0
03 Jul 2021
A Review on Edge Analytics: Issues, Challenges, Opportunities, Promises, Future Directions, and Applications
Sabuzima Nayak
Ripon Patgiri
Lilapati Waikhom
Arif Ahmed
57
44
0
01 Jul 2021
Progressive Joint Low-light Enhancement and Noise Removal for Raw Images
Yucheng Lu
Seung‐Won Jung
96
32
0
28 Jun 2021
Real-Time Multi-View 3D Human Pose Estimation using Semantic Feedback to Smart Edge Sensors
S. Bultmann
Sven Behnke
3DH
64
30
0
28 Jun 2021
Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL
Roberto Fernandes
Walber M. Rodrigues
Edna N. S. Barros
ObjD
45
7
0
28 Jun 2021
LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update
Jiawei Zhao
Steve Dai
Rangharajan Venkatesan
Brian Zimmer
Mustafa Ali
Xuan Li
Brucek Khailany
B. Dally
Anima Anandkumar
MQ
85
14
0
26 Jun 2021
Quantization Aware Training, ERNIE and Kurtosis Regularizer: a short empirical study
A. Zanetti
MQ
10
0
0
24 Jun 2021
Self-Supervised Monocular Depth Estimation of Untextured Indoor Rotated Scenes
Benjamin Keltjens
T. V. Dijk
Guido de Croon
MDE
63
3
0
24 Jun 2021
Information Bottleneck: Exact Analysis of (Quantized) Neural Networks
S. Lorenzen
Christian Igel
M. Nielsen
MQ
64
18
0
24 Jun 2021
Boggart: Towards General-Purpose Acceleration of Retrospective Video Analytics
Neil Agarwal
Ravi Netravali
95
15
0
21 Jun 2021
How to Reach Real-Time AI on Consumer Devices? Solutions for Programmable and Custom Architectures
Stylianos I. Venieris
Ioannis Panopoulos
Ilias Leontiadis
I. Venieris
84
6
0
21 Jun 2021
CompConv: A Compact Convolution Module for Efficient Feature Learning
Chen Zhang
Yinghao Xu
Yujun Shen
VLM SSL
51
10
0
19 Jun 2021
Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration
Qigong Sun
Xiufang Li
Fanhua Shang
Hongying Liu
Kan Yang
L. Jiao
Zhouchen Lin
MQ
54
0
0
18 Jun 2021
Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Gaurav Menghani
VLM MedIm
110
391
0
16 Jun 2021
Development of Quantized DNN Library for Exact Hardware Emulation
M. Kiyama
Motoki Amagasaki
M. Iida
MQ
32
0
0
15 Jun 2021
A White Paper on Neural Network Quantization
Markus Nagel
Marios Fournarakis
Rana Ali Amjad
Yelysei Bondarenko
M. V. Baalen
Tijmen Blankevoort
MQ
149
553
0
15 Jun 2021
FastICARL: Fast Incremental Classifier and Representation Learning with Efficient Budget Allocation in Audio Sensing Applications
Young D. Kwon
Jagmohan Chauhan
Cecilia Mascolo
HAI
66
16
0
14 Jun 2021
Sparse PointPillars: Maintaining and Exploiting Input Sparsity to Improve Runtime on Embedded Systems
Kyle Vedder
Eric Eaton
3DPC
50
13
0
12 Jun 2021
DECORE: Deep Compression with Reinforcement Learning
Manoj Alwani
Yang Wang
Vashisht Madhavan
AI4CE
75
44
0
11 Jun 2021
Knowledge distillation: A good teacher is patient and consistent
Lucas Beyer
Xiaohua Zhai
Amelie Royer
L. Markeeva
Rohan Anil
Alexander Kolesnikov
VLM
112
302
0
09 Jun 2021
OODIn: An Optimised On-Device Inference Framework for Heterogeneous Mobile Devices
Stylianos I. Venieris
Ioannis Panopoulos
I. Venieris
94
14
0
08 Jun 2021
Dynamic Resolution Network
Mingjian Zhu
Kai Han
Enhua Wu
Qiulin Zhang
Ying Nie
Zhenzhong Lan
Yunhe Wang
OOD
108
52
0
05 Jun 2021
Advances in Classifying the Stages of Diabetic Retinopathy Using Convolutional Neural Networks in Low Memory Edge Devices
A. Paul
59
5
0
03 Jun 2021
Towards a Federated Learning Framework for Heterogeneous Devices of Internet of Things
Huanle Zhang
Jeonghoon Kim
FedML
16
1
0
31 May 2021
Integer-Only Neural Network Quantization Scheme Based on Shift-Batch-Normalization
Qingyu Guo
Yuan Wang
Xiaoxin Cui
MQ
25
2
0
28 May 2021
Quantization and Deployment of Deep Neural Networks on Microcontrollers
Pierre-Emmanuel Novac
G. B. Hacene
Alain Pegatoquet
Benoit Miramond
Vincent Gripon
MQ
68
124
0
27 May 2021
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Zhaoxia Deng
Jongsoo Park
P. T. P. Tang
Haixin Liu
...
S. Nadathur
Changkyu Kim
Maxim Naumov
S. Naghshineh
M. Smelyanskiy
59
11
0
26 May 2021
Towards Compact CNNs via Collaborative Compression
Yuchao Li
Shaohui Lin
Jianzhuang Liu
QiXiang Ye
Mengdi Wang
Yong Li
Fan Yang
Jincheng Ma
Qi Tian
Rongrong Ji
3DV
68
88
0
24 May 2021