Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.03044
Cited By
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
10 February 2017
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights"
50 / 464 papers shown
Title
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
Yang He
Guoliang Kang
Xuanyi Dong
Yanwei Fu
Yi Yang
AAML
VLM
25
954
0
21 Aug 2018
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)
Y. Hsu
Yu-Chen Lin
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
MQ
17
15
0
17 Aug 2018
CBinfer: Exploiting Frame-to-Frame Locality for Faster Convolutional Network Inference on Video Streams
Lukas Cavigelli
Luca Benini
19
26
0
15 Aug 2018
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
27
230
0
13 Aug 2018
Training Compact Neural Networks with Binary Weights and Low Precision Activations
Bohan Zhuang
Chunhua Shen
Ian Reid
MQ
13
14
0
08 Aug 2018
Parallax: Sparsity-aware Data Parallel Training of Deep Neural Networks
Soojeong Kim
Gyeong-In Yu
Hojin Park
Sungwoo Cho
Eunji Jeong
Hyeonmin Ha
Sanha Lee
Joo Seong Jeong
Byung-Gon Chun
15
73
0
08 Aug 2018
Bi-Real Net: Enhancing the Performance of 1-bit CNNs With Improved Representational Capability and Advanced Training Algorithm
Zechun Liu
Baoyuan Wu
Wenhan Luo
Xin Yang
Wei Liu
K. Cheng
MQ
30
550
0
01 Aug 2018
DFTerNet: Towards 2-bit Dynamic Fusion Networks for Accurate Human Activity Recognition
Zhan Yang
Osolo Ian Raymond
Chengyuan Zhang
Ying Wan
J. Long
CVBM
34
36
0
31 Jul 2018
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Dongqing Zhang
Jiaolong Yang
Dongqiangzi Ye
G. Hua
MQ
9
696
0
26 Jul 2018
XNOR Neural Engine: a Hardware Accelerator IP for 21.6 fJ/op Binary Neural Network Inference
Francesco Conti
Pasquale Davide Schiavone
Luca Benini
24
108
0
09 Jul 2018
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
MQ
22
133
0
01 Jul 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
22
135
0
20 Jun 2018
Deep Learning Approximation: Zero-Shot Neural Network Speedup
Michele Pratusevich
14
0
0
15 Jun 2018
Early Seizure Detection with an Energy-Efficient Convolutional Neural Network on an Implantable Microcontroller
Maria Hügle
S. Heller
Manuel Watter
Manuel Blum
F. Manzouri
M. Dümpelmann
A. Schulze-Bonhage
P. Woias
Joschka Boedecker
11
41
0
12 Jun 2018
Full deep neural network training on a pruned weight budget
Maximilian Golub
G. Lemieux
Mieszko Lis
28
28
0
11 Jun 2018
IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks
Ke Sun
Mingjie Li
Dong Liu
Jingdong Wang
37
126
0
01 Jun 2018
Approximate Random Dropout
Zhuoran Song
Ru Wang
Dongyu Ru
Hongru Huang
Zhenghao Peng
Hai Zhao
Xiaoyao Liang
Li Jiang
BDL
20
9
0
23 May 2018
CascadeCNN: Pushing the performance limits of quantisation
Alexandros Kouris
Stylianos I. Venieris
C. Bouganis
MQ
22
24
0
22 May 2018
Quantization Mimic: Towards Very Tiny CNN for Object Detection
Yi Wei
Xinyu Pan
Hongwei Qin
Wanli Ouyang
Junjie Yan
ObjD
22
88
0
06 May 2018
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
Chaim Baskin
Eli Schwartz
Evgenii Zheltonozhskii
Natan Liss
Raja Giryes
A. Bronstein
A. Mendelson
MQ
14
45
0
29 Apr 2018
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition
Kartik Hegde
Jiyong Yu
R. Agrawal
Mengjia Yan
Michael Pellauer
Christopher W. Fletcher
14
165
0
18 Apr 2018
Training a Binary Weight Object Detector by Knowledge Transfer for Autonomous Driving
Jiaolong Xu
Peng Wang
Hengzhang Yang
Antonio M. López
MQ
27
23
0
17 Apr 2018
IGCV
2
2
2
: Interleaved Structured Sparse Convolutional Neural Networks
Guotian Xie
Jingdong Wang
Ting Zhang
Jianhuang Lai
Richang Hong
Guo-Jun Qi
16
104
0
17 Apr 2018
Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds
Cenk Baykal
Lucas Liebenwein
Igor Gilitschenski
Dan Feldman
Daniela Rus
17
79
0
15 Apr 2018
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Ameya Prabhu
Vishal Batchu
Rohit Gajawada
Sri Aurobindo Munagala
A. Namboodiri
MQ
25
18
0
11 Apr 2018
Distribution-Aware Binarization of Neural Networks for Sketch Recognition
Ameya Prabhu
Vishal Batchu
Sri Aurobindo Munagala
Rohit Gajawada
A. Namboodiri
MQ
16
5
0
09 Apr 2018
Adversarial Network Compression
Vasileios Belagiannis
Azade Farshad
Fabio Galasso
GAN
AAML
14
58
0
28 Mar 2018
Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions
Zheng Qin
Zhaoning Zhang
Dongsheng Li
Yiming Zhang
Yuxing Peng
17
28
0
27 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
13
184
0
15 Mar 2018
Quantization of Fully Convolutional Networks for Accurate Biomedical Image Segmentation
Xiaowei Xu
Q. Lu
Yu Hu
Lin Yang
X. S. Hu
Danny Chen
Yiyu Shi
MedIm
21
85
0
13 Mar 2018
Deep Neural Network Compression with Single and Multiple Level Quantization
Yuhui Xu
Yongzhuang Wang
Aojun Zhou
Weiyao Lin
H. Xiong
MQ
20
114
0
06 Mar 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine
Renzo Andri
Lukas Cavigelli
D. Rossi
Luca Benini
MQ
19
19
0
05 Mar 2018
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence
Jinglan Liu
Jiaxin Zhang
Yukun Ding
Xiaowei Xu
Meng Jiang
Yiyu Shi
33
4
0
26 Feb 2018
Training wide residual networks for deployment using a single bit for each weight
Mark D Mcdonnell
MQ
27
71
0
23 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
24
389
0
13 Feb 2018
On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks
Yukun Ding
Jinglan Liu
Jinjun Xiong
Yiyu Shi
MQ
29
21
0
10 Feb 2018
From Hashing to CNNs: Training BinaryWeight Networks via Hashing
Qinghao Hu
Peisong Wang
Jian Cheng
MQ
24
98
0
08 Feb 2018
Universal Deep Neural Network Compression
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
83
85
0
07 Feb 2018
Recent Advances in Efficient Computation of Deep Convolutional Neural Networks
Jian Cheng
Peisong Wang
Gang Li
Qinghao Hu
Hanqing Lu
27
3
0
03 Feb 2018
Recovering from Random Pruning: On the Plasticity of Deep Convolutional Neural Networks
Deepak Mittal
S. Bhardwaj
Mitesh M. Khapra
Balaraman Ravindran
VLM
28
65
0
31 Jan 2018
BinaryRelax: A Relaxation Approach For Training Deep Neural Networks With Quantized Weights
Penghang Yin
Shuai Zhang
J. Lyu
Stanley Osher
Y. Qi
Jack Xin
MQ
22
78
0
19 Jan 2018
SBNet: Sparse Blocks Network for Fast Inference
Mengye Ren
A. Pokrovsky
Binh Yang
R. Urtasun
25
179
0
07 Jan 2018
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
54
3,047
0
15 Dec 2017
Deep Learning for Real-Time Crime Forecasting and its Ternarization
Bao Wang
Penghang Yin
Andrea L. Bertozzi
P. Brantingham
Stanley J. Osher
Jack Xin
AI4TS
38
82
0
23 Nov 2017
Mobile Video Object Detection with Temporally-Aware Feature Maps
Mason Liu
Menglong Zhu
ObjD
18
196
0
17 Nov 2017
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy
Asit K. Mishra
Debbie Marr
FedML
27
330
0
15 Nov 2017
Compressing Word Embeddings via Deep Compositional Code Learning
Raphael Shu
Hideki Nakayama
29
129
0
03 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
31
231
0
01 Nov 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
D. Loroch
Norbert Wehn
Franz-Josef Pfreundt
J. Keuper
MQ
25
23
0
13 Oct 2017
Flexible Network Binarization with Layer-wise Priority
He Wang
Yi Tian Xu
Bingbing Ni
Hongteng Xu
MQ
23
10
0
13 Sep 2017
Previous
1
2
3
...
10
8
9
Next