1811.08886
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
21 November 2018
Kuan Wang
Zhijian Liu
Yujun Lin
Ji Lin
Song Han
MQ
Papers citing "HAQ: Hardware-Aware Automated Quantization with Mixed Precision"
35 / 435 papers shown
Towards Efficient Training for Neural Network Quantization
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
19
42
0
21 Dec 2019
AdaBits: Neural Network Quantization with Adaptive Bit-Widths
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
16
123
0
20 Dec 2019
Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion
Hongxu Yin
Pavlo Molchanov
Zhizhong Li
J. Álvarez
Arun Mallya
Derek Hoiem
N. Jha
Jan Kautz
28
552
0
18 Dec 2019
Dynamic Convolution: Attention over Convolution Kernels
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Dongdong Chen
Lu Yuan
Zicheng Liu
33
869
0
07 Dec 2019
Deep Model Compression Via Two-Stage Deep Reinforcement Learning
Huixin Zhan
Wei-Ming Lin
Yongcan Cao
10
12
0
04 Dec 2019
Semi-Relaxed Quantization with DropBits: Training Low-Bit Neural Networks via Bit-wise Regularization
J. H. Lee
Jihun Yun
Sung Ju Hwang
Eunho Yang
MQ
15
0
0
29 Nov 2019
QKD: Quantization-aware Knowledge Distillation
Jangho Kim
Yash Bhalgat
Jinwon Lee
Chirag I. Patel
Nojun Kwak
MQ
21
63
0
28 Nov 2019
Domain-Aware Dynamic Networks
Tianyuan Zhang
Bichen Wu
Xin Wang
Joseph E. Gonzalez
Kurt Keutzer
30
6
0
26 Nov 2019
Any-Precision Deep Neural Networks
Haichao Yu
Haoxiang Li
Humphrey Shi
Thomas S. Huang
G. Hua
MQ
23
63
0
17 Nov 2019
Ternary MobileNets via Per-Layer Hybrid Filter Banks
Dibakar Gope
Jesse G. Beu
Urmish Thakker
Matthew Mattina
MQ
32
15
0
04 Nov 2019
Comprehensive SNN Compression Using ADMM Optimization and Activity Regularization
Lei Deng
Yujie Wu
Yifan Hu
Ling Liang
Guoqi Li
Xing Hu
Yufei Ding
Peng Li
Yuan Xie
30
80
0
03 Nov 2019
Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers
Xishan Zhang
Shaoli Liu
Rui Zhang
Chang-Shu Liu
Di Huang
...
Jiaming Guo
Yu Kang
Qi Guo
Zidong Du
Yunji Chen
MQ
21
6
0
01 Nov 2019
Training DNN IoT Applications for Deployment On Analog NVM Crossbars
F. García-Redondo
Shidhartha Das
G. Rosendale
19
5
0
30 Oct 2019
Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks
Yihui He
Jianing Qian
Jianren Wang
Cindy X. Le
Congrui Hetang
Qi Lyu
Wenping Wang
Tianwei Yue
48
11
0
21 Oct 2019
Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-based Approach
Haichuan Yang
Shupeng Gui
Yuhao Zhu
Ji Liu
MQ
20
5
0
14 Oct 2019
Forward and Backward Information Retention for Accurate Binary Neural Networks
Haotong Qin
Ruihao Gong
Xianglong Liu
Mingzhu Shen
Ziran Wei
F. Yu
Jingkuan Song
MQ
133
324
0
24 Sep 2019
Structured Binary Neural Networks for Image Recognition
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Peng Chen
Lingqiao Liu
Ian Reid
MQ
22
17
0
22 Sep 2019
PULP-NN: Accelerating Quantized Neural Networks on Parallel Ultra-Low-Power RISC-V Processors
Angelo Garofalo
Manuele Rusci
Francesco Conti
D. Rossi
Luca Benini
MQ
11
134
0
29 Aug 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
Ruihao Gong
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
32
446
0
14 Aug 2019
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
33
3
0
05 Aug 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
Pierre Stock
Armand Joulin
Rémi Gribonval
Benjamin Graham
Hervé Jégou
MQ
37
149
0
12 Jul 2019
Point-Voxel CNN for Efficient 3D Deep Learning
Zhijian Liu
Haotian Tang
Yujun Lin
Song Han
3DPC
67
660
0
08 Jul 2019
Hardware/Software Co-Exploration of Neural Architectures
Weiwen Jiang
Lei Yang
E. Sha
Qingfeng Zhuge
Shouzhen Gu
Sakyasingha Dasgupta
Yiyu Shi
Jiaxi Hu
18
129
0
06 Jul 2019
Memory-Driven Mixed Low Precision Quantization For Enabling Deep Network Inference On Microcontrollers
Manuele Rusci
Alessandro Capotondi
Luca Benini
MQ
21
74
0
30 May 2019
Instant Quantization of Neural Networks using Monte Carlo Methods
Gonçalo Mordido
Matthijs Van Keirsbilck
A. Keller
MQ
27
9
0
29 May 2019
Approximate LSTMs for Time-Constrained Inference: Enabling Fast Reaction in Self-Driving Cars
Alexandros Kouris
Stylianos I. Venieris
Michail Rizakis
C. Bouganis
AI4TS
14
12
0
02 May 2019
Low-Memory Neural Network Training: A Technical Report
N. Sohoni
Christopher R. Aberger
Megan Leszczynski
Jian Zhang
Christopher Ré
17
99
0
24 Apr 2019
Design Automation for Efficient Deep Learning Computing
Song Han
Han Cai
Ligeng Zhu
Ji Lin
Kuan Wang
Zhijian Liu
Yujun Lin
15
20
0
24 Apr 2019
Resource Constrained Neural Network Architecture Search: Will a Submodularity Assumption Help?
Yunyang Xiong
Ronak R. Mehta
Vikas Singh
41
33
0
08 Apr 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
24
97
0
15 Feb 2019
TSM: Temporal Shift Module for Efficient Video Understanding
Ji Lin
Chuang Gan
Song Han
47
1,671
0
20 Nov 2018
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
55
68
0
05 Nov 2018
NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications
Tien-Ju Yang
Andrew G. Howard
Bo Chen
Xiao Zhang
Alec Go
Mark Sandler
Vivienne Sze
Hartwig Adam
90
515
0
09 Apr 2018
A Survey of Model Compression and Acceleration for Deep Neural Networks
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
40
1,087
0
23 Oct 2017
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
271
5,327
0
05 Nov 2016