1511.06393
Cited By
Fixed Point Quantization of Deep Convolutional Networks
19 November 2015
D. Lin
S. Talathi
V. Annapureddy
MQ
Papers citing
"Fixed Point Quantization of Deep Convolutional Networks"
50 / 124 papers shown
Reduced-Order Neural Network Synthesis with Robustness Guarantees
R. Drummond
M. Turner
S. Duncan
19
9
0
18 Feb 2021
Dynamic Precision Analog Computing for Neural Networks
Sahaj Garg
Joe Lou
Anirudh Jain
Mitchell Nahmias
45
33
0
12 Feb 2021
Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms
Rishabh Goyal
Joaquin Vanschoren
V. V. Acht
S. Nijssen
MQ
30
23
0
03 Feb 2021
Sound Event Detection with Binary Neural Networks on Tightly Power-Constrained IoT Devices
G. Cerutti
Renzo Andri
Lukas Cavigelli
Michele Magno
Elisabetta Farella
Luca Benini
MQ
21
37
0
12 Jan 2021
Direct Quantization for Training Highly Accurate Low Bit-width Deep Neural Networks
Ziquan Liu
Wuguannan Yao
Qiao Li
Antoni B. Chan
MQ
27
9
0
26 Dec 2020
DAQ: Channel-Wise Distribution-Aware Quantization for Deep Image Super-Resolution Networks
Chee Hong
Heewon Kim
Sungyong Baik
Junghun Oh
Kyoung Mu Lee
OOD
SupR
MQ
24
41
0
21 Dec 2020
Design and Analysis of Uplink and Downlink Communications for Federated Learning
Sihui Zheng
Cong Shen
Xiang Chen
39
140
0
07 Dec 2020
An SMT-Based Approach for Verifying Binarized Neural Networks
Guy Amir
Haoze Wu
Clark W. Barrett
Guy Katz
19
58
0
05 Nov 2020
Rotated Binary Neural Network
Mingbao Lin
Rongrong Ji
Zi-Han Xu
Baochang Zhang
Yan Wang
Yongjian Wu
Feiyue Huang
Chia-Wen Lin
MQ
19
129
0
28 Sep 2020
Softmax Tempering for Training Neural Machine Translation Models
Raj Dabre
Atsushi Fujita
28
11
0
20 Sep 2020
DualDE: Dually Distilling Knowledge Graph Embedding for Faster and Cheaper Reasoning
Yushan Zhu
Wen Zhang
Mingyang Chen
Hui Chen
Xu-Xin Cheng
Wei Zhang
Huajun Chen
22
15
0
13 Sep 2020
Layer-specific Optimization for Mixed Data Flow with Mixed Precision in FPGA Design for CNN-based Object Detectors
Duy-Thanh Nguyen
Hyun Kim
Hyuk-Jae Lee
MQ
25
59
0
03 Sep 2020
A transprecision floating-point cluster for efficient near-sensor data analytics
Fabio Montagna
Stefan Mach
Simone Benatti
Angelo Garofalo
G. Ottavi
Luca Benini
D. Rossi
Giuseppe Tagliavini
6
12
0
27 Aug 2020
Term Revealing: Furthering Quantization at Run Time on Quantized DNNs
H. T. Kung
Bradley McDanel
S. Zhang
MQ
21
9
0
13 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
59
82
0
02 Jul 2020
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
Itay Hubara
Yury Nahshan
Y. Hanani
Ron Banner
Daniel Soudry
MQ
35
122
0
14 Jun 2020
Real-Time Video Inference on Edge Devices via Adaptive Model Streaming
Mehrdad Khani Shirkoohi
Pouya Hamadanian
Arash Nasr-Esfahany
Mohammad Alizadeh
26
44
0
11 Jun 2020
An Overview of Neural Network Compression
James O'Neill
AI4CE
45
98
0
05 Jun 2020
Quantized Neural Networks: Characterization and Holistic Optimization
Yoonho Boo
Sungho Shin
Wonyong Sung
MQ
48
8
0
31 May 2020
PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices
Chunhua Deng
Siyu Liao
Yi Xie
Keshab K. Parhi
Xuehai Qian
Bo Yuan
38
93
0
23 Apr 2020
Learned Threshold Pruning
K. Azarian
Yash Bhalgat
Jinwon Lee
Tijmen Blankevoort
MQ
28
38
0
28 Feb 2020
BinaryDuo: Reducing Gradient Mismatch in Binary Activation Network by Coupling Binary Activations
Hyungjun Kim
Kyungsu Kim
Jinseok Kim
Jae-Joon Kim
MQ
27
47
0
16 Feb 2020
Taurus: A Data Plane Architecture for Per-Packet ML
Tushar Swamy
Alexander Rucker
M. Shahbaz
Ishan Gaur
K. Olukotun
21
82
0
12 Feb 2020
Compact recurrent neural networks for acoustic event detection on low-energy low-complexity platforms
G. Cerutti
Rahul Prasad
Alessio Brutti
Elisabetta Farella
21
47
0
29 Jan 2020
Convolutional-Recurrent Neural Networks on Low-Power Wearable Platforms for Cardiac Arrhythmia Detection
Antonino Faraone
R. Delgado-Gonzalo
19
24
0
08 Jan 2020
Resource-Efficient Neural Networks for Embedded Systems
Wolfgang Roth
Günther Schindler
Lukas Pfeifenberger
Robert Peharz
Sebastian Tschiatschek
Holger Fröning
Franz Pernkopf
Zoubin Ghahramani
34
47
0
07 Jan 2020
Sparse Weight Activation Training
Md Aamir Raihan
Tor M. Aamodt
34
73
0
07 Jan 2020
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Wei Niu
Xiaolong Ma
Sheng Lin
Shihao Wang
Xuehai Qian
X. Lin
Yanzhi Wang
Bin Ren
MQ
35
226
0
01 Jan 2020
Adaptive Loss-aware Quantization for Multi-bit Networks
Zhongnan Qu
Zimu Zhou
Yun Cheng
Lothar Thiele
MQ
36
53
0
18 Dec 2019
QKD: Quantization-aware Knowledge Distillation
Jangho Kim
Yash Bhalgat
Jinwon Lee
Chirag I. Patel
Nojun Kwak
MQ
21
63
0
28 Nov 2019
CoopNet: Cooperative Convolutional Neural Network for Low-Power MCUs
Luca Mocerino
A. Calimera
MQ
16
9
0
19 Nov 2019
Loss Aware Post-training Quantization
Yury Nahshan
Brian Chmiel
Chaim Baskin
Evgenii Zheltonozhskii
Ron Banner
A. Bronstein
A. Mendelson
MQ
37
163
0
17 Nov 2019
S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search
Zhihang Yuan
Bingzhe Wu
Zheng Liang
Shiwan Zhao
Weichen Bi
Guangyu Sun
27
30
0
16 Nov 2019
XNOR-Net++: Improved Binary Neural Networks
Adrian Bulat
Georgios Tzimiropoulos
MQ
39
200
0
30 Sep 2019
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs
Caiwen Ding
Shuo Wang
Ning Liu
Kaidi Xu
Yanzhi Wang
Yun Liang
MQ
18
89
0
29 Sep 2019
Point-Voxel CNN for Efficient 3D Deep Learning
Zhijian Liu
Haotian Tang
Yujun Lin
Song Han
3DPC
67
660
0
08 Jul 2019
Memory-Driven Mixed Low Precision Quantization For Enabling Deep Network Inference On Microcontrollers
Manuele Rusci
Alessandro Capotondi
Luca Benini
MQ
21
74
0
30 May 2019
Encrypted Speech Recognition using Deep Polynomial Networks
Shi-Xiong Zhang
Jiawei Liu
Dong Yu
24
25
0
11 May 2019
Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM
Sheng Lin
Xiaolong Ma
Shaokai Ye
Geng Yuan
Kaisheng Ma
Yanzhi Wang
MQ
25
10
0
02 May 2019
Knowledge Distillation For Recurrent Neural Network Language Modeling With Trust Regularization
Yangyang Shi
M. Hwang
X. Lei
Haoyu Sheng
26
25
0
08 Apr 2019
Progressive Stochastic Binarization of Deep Networks
David Hartmann
Michael Wand
MQ
17
1
0
03 Apr 2019
Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM
Shaokai Ye
Xiaoyu Feng
Tianyun Zhang
Xiaolong Ma
Sheng Lin
...
Jian Tang
M. Fardad
X. Lin
Yongpan Liu
Yanzhi Wang
MQ
38
38
0
23 Mar 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
33
355
0
18 Feb 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
27
97
0
15 Feb 2019
Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization
Eldad Meller
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
24
85
0
05 Feb 2019
Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks
Charbel Sakr
Naigang Wang
Chia-Yu Chen
Jungwook Choi
A. Agrawal
Naresh R Shanbhag
K. Gopalakrishnan
MQ
30
34
0
19 Jan 2019
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Ao Ren
Tianyun Zhang
Shaokai Ye
Jiayu Li
Wenyao Xu
Xuehai Qian
X. Lin
Yanzhi Wang
MQ
34
161
0
31 Dec 2018
E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Zhe Li
Caiwen Ding
Siyue Wang
Wujie Wen
Youwei Zhuo
...
Qinru Qiu
Wenyao Xu
X. Lin
Xuehai Qian
Yanzhi Wang
MQ
12
64
0
12 Dec 2018
QUENN: QUantization Engine for low-power Neural Networks
Miguel de Prado
Maurizio Denna
Luca Benini
Nuria Pazos
MQ
35
14
0
14 Nov 2018
Convolutional Neural Network Quantization using Generalized Gamma Distribution
Doyun Kim
H. Yim
Sanghyuck Ha
Changgwun Lee
Inyup Kang
MQ
24
4
0
31 Oct 2018