Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.08886
Cited By
v1
v2
v3 (latest)
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
21 November 2018
Kuan-Chieh Wang
Zhijian Liu
Chengyue Wu
Ji Lin
Song Han
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"HAQ: Hardware-Aware Automated Quantization with Mixed Precision"
50 / 436 papers shown
Title
Standing on the Shoulders of Giants: Hardware and Neural Architecture Co-Search with Hot Start
Weiwen Jiang
Lei Yang
Sakyasingha Dasgupta
Jiaxi Hu
Yiyu Shi
68
59
0
17 Jul 2020
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Shauharda Khadka
Estelle Aflalo
Mattias Marder
Avrech Ben-David
Santiago Miret
Shie Mannor
Tamir Hazan
Hanlin Tang
Somdeb Majumdar
GNN
71
11
0
14 Jul 2020
AUSN: Approximately Uniform Quantization by Adaptively Superimposing Non-uniform Distribution for Deep Neural Networks
Fangxin Liu
Wenbo Zhao
Yanzhi Wang
Changzhi Dai
Li Jiang
MQ
58
3
0
08 Jul 2020
FracBits: Mixed Precision Quantization via Fractional Bit-Widths
Linjie Yang
Qing Jin
MQ
101
74
0
04 Jul 2020
Bit Error Robustness for Energy-Efficient DNN Accelerators
David Stutz
Nandhini Chandramoorthy
Matthias Hein
Bernt Schiele
MQ
52
1
0
24 Jun 2020
Automatic heterogeneous quantization of deep neural networks for low-latency inference on the edge for particle detectors
C. Coelho
Aki Kuusela
Shane Li
Zhuang Hao
T. Aarrestad
Vladimir Loncar
J. Ngadiuba
M. Pierini
Adrian Alan Pol
S. Summers
MQ
102
179
0
15 Jun 2020
Automated Design Space Exploration for optimised Deployment of DNN on Arm Cortex-A CPUs
Miguel de Prado
Andrew Mundy
Rabia Saeed
Maurizo Denna
Nuria Pazos
Luca Benini
69
11
0
09 Jun 2020
EDCompress: Energy-Aware Model Compression for Dataflows
Zhehui Wang
Yaoyu Zhang
Qiufeng Wang
Rick Siow Mong Goh
52
2
0
08 Jun 2020
Novel Adaptive Binary Search Strategy-First Hybrid Pyramid- and Clustering-Based CNN Filter Pruning Method without Parameters Setting
K. Chung
Yu-Lun Chang
Bo-Wei Tsai
27
0
0
08 Jun 2020
Conditional Neural Architecture Search
Sheng-Chun Kao
Arun Ramamurthy
Reed Williams
T. Krishna
28
0
0
06 Jun 2020
Generative Design of Hardware-aware DNNs
Sheng-Chun Kao
Arun Ramamurthy
T. Krishna
MQ
37
2
0
06 Jun 2020
Machine Learning Systems for Intelligent Services in the IoT: A Survey
Wiebke Toussaint
Aaron Yi Ding
LRM
77
0
0
29 May 2020
A Feature-map Discriminant Perspective for Pruning Deep Neural Networks
Zejiang Hou
S. Kung
35
5
0
28 May 2020
Accelerating Neural Network Inference by Overflow Aware Quantization
Hongwei Xie
Shuo Zhang
Huanghao Ding
Yafei Song
Baitao Shao
Conggang Hu
Lingyi Cai
Mingyang Li
MQ
18
0
0
27 May 2020
VecQ: Minimal Loss DNN Model Compression With Vectorized Weight Quantization
Cheng Gong
Yao Chen
Ye Lu
Tao Li
Cong Hao
Deming Chen
MQ
51
45
0
18 May 2020
Bayesian Bits: Unifying Quantization and Pruning
M. V. Baalen
Christos Louizos
Markus Nagel
Rana Ali Amjad
Ying Wang
Tijmen Blankevoort
Max Welling
MQ
95
116
0
14 May 2020
Data-Free Network Quantization With Adversarial Knowledge Distillation
Yoojin Choi
Jihwan P. Choi
Mostafa El-Khamy
Jungwon Lee
MQ
76
121
0
08 May 2020
Lite Transformer with Long-Short Range Attention
Zhanghao Wu
Zhijian Liu
Ji Lin
Chengyue Wu
Song Han
62
323
0
24 Apr 2020
Automatic low-bit hybrid quantization of neural networks through meta learning
Tao Wang
Junsong Wang
Chang Xu
Chao Xue
MQ
23
2
0
24 Apr 2020
Intermittent Inference with Nonuniformly Compressed Multi-Exit Neural Network for Energy Harvesting Powered Devices
Yawen Wu
Zhepeng Wang
Zhenge Jia
Yiyu Shi
Jiaxi Hu
86
54
0
23 Apr 2020
LSQ+: Improving low-bit quantization through learnable offsets and better initialization
Yash Bhalgat
Jinwon Lee
Markus Nagel
Tijmen Blankevoort
Nojun Kwak
MQ
67
223
0
20 Apr 2020
Learning Low-rank Deep Neural Networks via Singular Vector Orthogonality Regularization and Singular Value Sparsification
Huanrui Yang
Minxue Tang
W. Wen
Feng Yan
Daniel Hu
Ang Li
H. Li
Yiran Chen
66
65
0
20 Apr 2020
Efficient Synthesis of Compact Deep Neural Networks
Wenhan Xia
Hongxu Yin
N. Jha
57
3
0
18 Apr 2020
Rethinking Differentiable Search for Mixed-Precision Neural Networks
Zhaowei Cai
Nuno Vasconcelos
MQ
49
126
0
13 Apr 2020
DarkneTZ: Towards Model Privacy at the Edge using Trusted Execution Environments
Fan Mo
Ali Shahin Shamsabadi
Kleomenis Katevas
Soteris Demetriou
Ilias Leontiadis
Andrea Cavallaro
Hamed Haddadi
FedML
68
183
0
12 Apr 2020
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions
Alvin Wan
Xiaoliang Dai
Peizhao Zhang
Zijian He
Yuandong Tian
...
Matthew Yu
Tao Xu
Kan Chen
Peter Vajda
Joseph E. Gonzalez
84
294
0
12 Apr 2020
GeneCAI: Genetic Evolution for Acquiring Compact AI
Mojan Javaheripi
Mohammad Samragh
T. Javidi
F. Koushanfar
74
9
0
08 Apr 2020
CNN2Gate: Toward Designing a General Framework for Implementation of Convolutional Neural Networks on FPGA
Alireza Ghaffari
Yvon Savaria
31
9
0
06 Apr 2020
GAN Compression: Efficient Architectures for Interactive Conditional GANs
Zhekai Zhang
Ji Lin
Yaoyao Ding
Zhijian Liu
Jun-Yan Zhu
Song Han
GAN
84
2
0
19 Mar 2020
Group Sparsity: The Hinge Between Filter Pruning and Decomposition for Network Compression
Yawei Li
Shuhang Gu
Christoph Mayer
Luc Van Gool
Radu Timofte
225
192
0
19 Mar 2020
Efficient Bitwidth Search for Practical Mixed Precision Neural Network
Yuhang Li
Wei Wang
Haoli Bai
Ruihao Gong
Xin Dong
F. Yu
MQ
54
21
0
17 Mar 2020
Benchmarking TinyML Systems: Challenges and Direction
Colby R. Banbury
Vijay Janapa Reddi
Max Lam
William Fu
A. Fazel
...
Jae-sun Seo
Jeff Sieracki
Urmish Thakker
Marian Verhelst
Poonam Yadav
174
238
0
10 Mar 2020
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices
Byung Hoon Ahn
Jinwon Lee
J. Lin
Hsin-Pai Cheng
Jilei Hou
H. Esmaeilzadeh
115
55
0
04 Mar 2020
WaveQ: Gradient-Based Deep Quantization of Neural Networks through Sinusoidal Adaptive Regularization
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
T. Elgindi
Charles-Alban Deledalle
H. Esmaeilzadeh
MQ
60
10
0
29 Feb 2020
Learning in the Frequency Domain
Kai Xu
Minghai Qin
Fei Sun
Yuhao Wang
Yen-kuang Chen
Fengbo Ren
130
409
0
27 Feb 2020
RNNPool: Efficient Non-linear Pooling for RAM Constrained Inference
Oindrila Saha
Aditya Kusupati
H. Simhadri
Manik Varma
Prateek Jain
85
56
0
27 Feb 2020
Searching for Winograd-aware Quantized Networks
Javier Fernandez-Marques
P. Whatmough
Andrew Mundy
Matthew Mattina
MQ
76
40
0
25 Feb 2020
Exploring the Connection Between Binary and Spiking Neural Networks
Sen Lu
Abhronil Sengupta
MQ
80
103
0
24 Feb 2020
Post-training Quantization with Multiple Points: Mixed Precision without Mixed Precision
Xingchao Liu
Mao Ye
Dengyong Zhou
Qiang Liu
MQ
92
42
0
20 Feb 2020
Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations
Yichi Zhang
Ritchie Zhao
Weizhe Hua
N. Xu
G. E. Suh
Zhiru Zhang
MQ
145
27
0
17 Feb 2020
Learning Architectures for Binary Networks
Dahyun Kim
Kunal Pratap Singh
Jonghyun Choi
MQ
88
44
0
17 Feb 2020
BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization
Milovs Nikolić
G. B. Hacene
Ciaran Bannon
Alberto Delmas Lascorz
Matthieu Courbariaux
Yoshua Bengio
Vincent Gripon
Andreas Moshovos
MQ
66
25
0
08 Feb 2020
Switchable Precision Neural Networks
Luis Guerra
Bohan Zhuang
Ian Reid
Tom Drummond
MQ
57
20
0
07 Feb 2020
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
Byung Hoon Ahn
Prannoy Pilligundla
Amir Yazdanbakhsh
H. Esmaeilzadeh
ODL
113
82
0
23 Jan 2020
Channel Pruning via Automatic Structure Search
Mingbao Lin
Rongrong Ji
Yuxin Zhang
Baochang Zhang
Yongjian Wu
Yonghong Tian
130
247
0
23 Jan 2020
Filter Sketch for Network Pruning
Mingbao Lin
Liujuan Cao
Shaojie Li
QiXiang Ye
Yonghong Tian
Jianzhuang Liu
Q. Tian
Rongrong Ji
CLIP
3DPC
171
82
0
23 Jan 2020
Functional Error Correction for Robust Neural Networks
Kunping Huang
P. Siegel
Anxiao
Anxiao Jiang
31
25
0
12 Jan 2020
Least squares binary quantization of neural networks
Hadi Pouransari
Zhucheng Tu
Oncel Tuzel
MQ
82
32
0
09 Jan 2020
Resource-Efficient Neural Networks for Embedded Systems
Wolfgang Roth
Günther Schindler
Lukas Pfeifenberger
Robert Peharz
Sebastian Tschiatschek
Holger Fröning
Franz Pernkopf
Zoubin Ghahramani
84
51
0
07 Jan 2020
RPR: Random Partition Relaxation for Training; Binary and Ternary Weight Neural Networks
Lukas Cavigelli
Luca Benini
MQ
58
9
0
04 Jan 2020
Previous
1
2
3
4
5
6
7
8
9
Next