arXiv:1712.05877
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
Papers citing
"Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"
50 / 1,298 papers shown
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Sai Qian Zhang
Thierry Tambe
Nestor Cuevas
Gu-Yeon Wei
David Brooks
56
4
0
04 May 2023
Stable and low-precision training for large-scale vision-language models
Mitchell Wortsman
Tim Dettmers
Luke Zettlemoyer
Ari S. Morcos
Ali Farhadi
Ludwig Schmidt
MQ
MLLM
VLM
144
44
0
25 Apr 2023
Improving Post-Training Quantization on Object Detection with Task Loss-Guided Lp Metric
Lin Niu
Jia-Wen Liu
Zhihang Yuan
Dawei Yang
Xinggang Wang
Wenyu Liu
MQ
61
2
0
19 Apr 2023
Big-Little Adaptive Neural Networks on Low-Power Near-Subthreshold Processors
Zichao Shen
Neil Howard
J. Núñez-Yáñez
85
2
0
19 Apr 2023
Neural Network Quantisation for Faster Homomorphic Encryption
Wouter Legiest
Jan-Pieter D'Anvers
Furkan Turan
Michiel Van Beirendonck
Ingrid Verbauwhede
MQ
52
6
0
19 Apr 2023
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Xiuying Wei
Yunchen Zhang
Yuhang Li
Xiangguo Zhang
Ruihao Gong
Jian Ren
Zhengang Li
MQ
78
36
0
18 Apr 2023
Evil from Within: Machine Learning Backdoors through Hardware Trojans
Alexander Warnecke
Julian Speith
Janka Möller
Konrad Rieck
C. Paar
AAML
211
3
0
17 Apr 2023
Learning-based Spatial and Angular Information Separation for Light Field Compression
Jinglei Shi
Yihong Xu
C. Guillemot
55
0
0
13 Apr 2023
Efficient Deep Learning Models for Privacy-preserving People Counting on Low-resolution Infrared Arrays
Chen Xie
Francesco Daghero
Yukai Chen
Marco Castellano
Luca Gandolfi
A. Calimera
Enrico Macii
Massimo Poncino
Daniele Jahier Pagliari
69
6
0
12 Apr 2023
Scale-Space Hypernetworks for Efficient Biomedical Imaging
Jose Javier Gonzalez Ortiz
John Guttag
Adrian Dalca
57
0
0
11 Apr 2023
Learning to Detect Touches on Cluttered Tables
Norberto A. Goussies
Kenji Hata
Shruthi Prabhakara
Abhishek Amit
Tony Aubé
...
Brett Rampata
Carlos Sobrinho
George Sung
Natalie Zauhar
Palash Nandy
32
2
0
10 Apr 2023
Are Visual Recognition Models Robust to Image Compression?
Joao Maria Janeiro
Stanislav Frolov
Alaaeldin El-Nouby
Jakob Verbeek
VLM
55
4
0
10 Apr 2023
EnforceSNN: Enabling Resilient and Energy-Efficient Spiking Neural Network Inference considering Approximate DRAMs for Embedded Systems
Rachmad Vidya Wicaksana Putra
Muhammad Abdullah Hanif
Mohamed Bennai
62
11
0
08 Apr 2023
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks
Cheng Gong
Ye Lu
Surong Dai
Deng Qian
Chenkun Du
Tao Li
MQ
57
0
0
07 Apr 2023
HNeRV: A Hybrid Neural Representation for Videos
Hao Chen
M. Gwilliam
Ser-Nam Lim
Abhinav Shrivastava
73
77
1
05 Apr 2023
MadEye: Boosting Live Video Analytics Accuracy with Adaptive Camera Configurations
M. Wong
M. Ramanujam
Guha Balakrishnan
Ravi Netravali
103
5
0
04 Apr 2023
Optimizing data-flow in Binary Neural Networks
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
73
6
0
03 Apr 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
214
48
0
30 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
ViT
157
98
0
27 Mar 2023
Mathematical Challenges in Deep Learning
V. Nia
Guojun Zhang
I. Kobyzev
Michael R. Metel
Xinlin Li
...
S. Hemati
M. Asgharian
Linglong Kong
Wulong Liu
Boxing Chen
AI4CE
VLM
72
1
0
24 Mar 2023
PowerPruning: Selecting Weights and Activations for Power-Efficient Neural Network Acceleration
Richard Petri
Grace Li Zhang
Yiran Chen
Ulf Schlichtmann
Bing Li
29
6
0
24 Mar 2023
Hard Sample Matters a Lot in Zero-Shot Quantization
Huantong Li
Xiangmiao Wu
Fanbing Lv
Daihai Liao
Thomas H. Li
Yonggang Zhang
Bo Han
Mingkui Tan
MQ
80
21
0
24 Mar 2023
Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT Systems
Jemin Lee
Yongin Kwon
Sihyeong Park
Misun Yu
Jeman Park
Hwanjun Song
ViT
MQ
83
6
0
22 Mar 2023
Fighting over-fitting with quantization for learning deep neural networks on noisy labels
Gauthier Tallec
Edouard Yvinec
Arnaud Dapogny
Kévin Bailly
NoLa
31
1
0
21 Mar 2023
Unit Scaling: Out-of-the-Box Low-Precision Training
Charlie Blake
Douglas Orr
Carlo Luschi
MQ
69
7
0
20 Mar 2023
Evaluation of Convolution Primitives for Embedded Neural Networks on 32-bit Microcontrollers
Baptiste Nguyen
Pierre-Alain Moëllic
Sylvain Blayac
20
2
0
19 Mar 2023
SmartBERT: A Promotion of Dynamic Early Exiting Mechanism for Accelerating BERT Inference
Boren Hu
Yun Zhu
Jiacheng Li
Siliang Tang
58
9
0
16 Mar 2023
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference
Li Zhang
Xudong Wang
Jiahang Xu
Quanlu Zhang
Yujing Wang
Yuqing Yang
Ningxin Zheng
Ting Cao
Mao Yang
MQ
57
3
0
15 Mar 2023
Bag of Tricks with Quantized Convolutional Neural Networks for image classification
Jie Hu
Mengze Zeng
Enhua Wu
MQ
57
2
0
13 Mar 2023
Adaptive Data-Free Quantization
Biao Qian
Yang Wang
Richang Hong
Meng Wang
MQ
105
38
0
13 Mar 2023
Scalable Object Detection on Embedded Devices Using Weight Pruning and Singular Value Decomposition
D. Ham
Jaeyeop Jeong
June-Kyoo Park
Raehyeon Jeong
S. Jeon
Hyeongjun Jeon
Ye-Eun Lim
29
0
0
05 Mar 2023
Fixed-point quantization aware training for on-device keyword-spotting
Sashank Macha
Om Oza
Alex Escott
Francesco Calivá
Robert M. Armitano
S. Cheekatmalla
S. Parthasarathi
Yuzong Liu
MQ
47
4
0
04 Mar 2023
Adversarial Attacks on Machine Learning in Embedded and IoT Platforms
Christian Westbrook
S. Pasricha
AAML
71
3
0
03 Mar 2023
Boosting Distributed Full-graph GNN Training with Asynchronous One-bit Communication
Mengdie Zhang
Qi Hu
Peng Sun
Yonggang Wen
Tianwei Zhang
GNN
69
6
0
02 Mar 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
163
106
0
27 Feb 2023
DyBit: Dynamic Bit-Precision Numbers for Efficient Quantized Neural Network Inference
Jiajun Zhou
Jiajun Wu
Yizhao Gao
Yuhao Ding
Chaofan Tao
Yue Liu
Fengbin Tu
Kwang-Ting Cheng
Hayden Kwok-Hay So
Ngai Wong
MQ
71
7
0
24 Feb 2023
Quantized Low-Rank Multivariate Regression with Random Dithering
Junren Chen
Yueqi Wang
Michael Kwok-Po Ng
86
6
0
22 Feb 2023
Optical Transformers
Maxwell G. Anderson
Shifan Ma
Tianyu Wang
Logan G. Wright
Peter L. McMahon
47
23
0
20 Feb 2023
Rethinking Data-Free Quantization as a Zero-Sum Game
Biao Qian
Yang Wang
Richang Hong
Meng Wang
MQ
70
18
0
19 Feb 2023
Moby: Empowering 2D Models for Efficient Point Cloud Analytics on the Edge
Jingzong Li
Yik Hong Cai
Libin Liu
Yushun Mao
Chun Jason Xue
Hongchang Xu
56
4
0
18 Feb 2023
A Comprehensive Review and a Taxonomy of Edge Machine Learning: Requirements, Paradigms, and Techniques
Wenbin Li
Hakim Hacid
Ebtesam Almazrouei
Merouane Debbah
91
13
0
16 Feb 2023
Towards Optimal Compression: Joint Pruning and Quantization
Ben Zandonati
Glenn Bucagu
Adrian Alan Pol
M. Pierini
Olya Sirkin
Tal Kopetz
MQ
90
3
0
15 Feb 2023
Privacy-Preserving Tree-Based Inference with TFHE
Jordan Fréry
Andrei Stoian
Roman Bredehoft
Luis Montero
Celia Kherfallah
Benoît Chevallier-Mames
Arthur Meyre
41
5
0
13 Feb 2023
Deep Neural Networks for Encrypted Inference with TFHE
Andrei Stoian
Jordan Fréry
Roman Bredehoft
Luis Montero
Celia Kherfallah
Benoît Chevallier-Mames
FedML
77
23
0
13 Feb 2023
A Practical Mixed Precision Algorithm for Post-Training Quantization
N. Pandey
Markus Nagel
M. V. Baalen
Yin-Ruey Huang
Chirag I. Patel
Tijmen Blankevoort
MQ
64
22
0
10 Feb 2023
Offsite-Tuning: Transfer Learning without Full Model
Guangxuan Xiao
Ji Lin
Song Han
90
76
0
09 Feb 2023
Data Quality-aware Mixed-precision Quantization via Hybrid Reinforcement Learning
Yingchun Wang
Jingcai Guo
Song Guo
Weizhan Zhang
MQ
75
21
0
09 Feb 2023
ZipLM: Inference-Aware Structured Pruning of Language Models
Eldar Kurtic
Elias Frantar
Dan Alistarh
MQ
101
26
0
07 Feb 2023
LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup
Xiaohu Tang
Yang Wang
Ting Cao
Li Zhang
Qi Chen
Deng Cai
Yunxin Liu
Mao Yang
89
18
0
07 Feb 2023
Training with Mixed-Precision Floating-Point Assignments
Wonyeol Lee
Rahul Sharma
A. Aiken
MQ
45
3
0
31 Jan 2023