Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1512.06473
Cited By
Quantized Convolutional Neural Networks for Mobile Devices
21 December 2015
Jiaxiang Wu
Cong Leng
Yuhang Wang
Qinghao Hu
Jian Cheng
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Quantized Convolutional Neural Networks for Mobile Devices"
50 / 178 papers shown
Title
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing
En Li
Liekang Zeng
Zhi Zhou
Xu Chen
12
616
0
04 Oct 2019
On the Efficacy of Knowledge Distillation
Ligang He
Rui Mao
57
600
0
03 Oct 2019
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs
Caiwen Ding
Shuo Wang
Ning Liu
Kaidi Xu
Yanzhi Wang
Yun Liang
MQ
24
89
0
29 Sep 2019
Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation
Xiaolong Ma
Geng Yuan
Sheng Lin
Caiwen Ding
Fuxun Yu
Tao Liu
Wujie Wen
Xiang Chen
Yanzhi Wang
MQ
18
45
0
27 Aug 2019
Recent Advances in Deep Learning for Object Detection
Xiongwei Wu
Doyen Sahoo
Guosheng Lin
VLM
ObjD
35
800
0
10 Aug 2019
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
33
3
0
05 Aug 2019
SlimYOLOv3: Narrower, Faster and Better for Real-Time UAV Applications
Pengyi Zhang
Yunxin Zhong
Xiaoqiong Li
26
198
0
25 Jul 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
Pierre Stock
Armand Joulin
Rémi Gribonval
Benjamin Graham
Hervé Jégou
MQ
37
149
0
12 Jul 2019
GAN-Knowledge Distillation for one-stage Object Detection
Wanwei Wang
Jin ke Yu Fan Zong
ObjD
22
28
0
20 Jun 2019
ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
Zhuo Chen
Jiyuan Zhang
Ruizhou Ding
Diana Marculescu
13
12
0
19 Jun 2019
Deep Learning-Based Decoding of Constrained Sequence Codes
Congzhe Cao
Duanshun Li
Ivan J. Fair
3DV
AI4TS
13
19
0
13 Jun 2019
BasisConv: A method for compressed representation and learning in CNNs
M. Tayyab
Abhijit Mahalanobis
3DPC
SSL
24
6
0
11 Jun 2019
Distilling Object Detectors with Fine-grained Feature Imitation
Tao Wang
Li-xin Yuan
Xiaopeng Zhang
Jiashi Feng
ObjD
13
378
0
09 Jun 2019
DiCENet: Dimension-wise Convolutions for Efficient Networks
Sachin Mehta
Hannaneh Hajishirzi
Mohammad Rastegari
36
43
0
08 Jun 2019
Butterfly Transform: An Efficient FFT Based Neural Architecture Design
Keivan Alizadeh-Vahid
Anish K. Prabhu
Ali Farhadi
Mohammad Rastegari
32
50
0
05 Jun 2019
Efficient 8-Bit Quantization of Transformer Neural Machine Language Translation Model
Aishwarya Bhandare
Vamsi Sripathi
Deepthi Karkada
Vivek V. Menon
Sun Choi
Kushal Datta
V. Saletore
MQ
30
131
0
03 Jun 2019
MobiVSR: A Visual Speech Recognition Solution for Mobile Devices
Nilay Shrivastava
Astitwa Saxena
Yaman Kumar Singla
Preeti Kaur
Debanjan Mahata
R. Shah
27
3
0
10 May 2019
Searching for MobileNetV3
Andrew G. Howard
Mark Sandler
Grace Chu
Liang-Chieh Chen
Bo Chen
...
Yukun Zhu
Ruoming Pang
Vijay Vasudevan
Quoc V. Le
Hartwig Adam
97
6,623
0
06 May 2019
Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM
Sheng Lin
Xiaolong Ma
Shaokai Ye
Geng Yuan
Kaisheng Ma
Yanzhi Wang
MQ
30
10
0
02 May 2019
T-Net: Parametrizing Fully Convolutional Nets with a Single High-Order Tensor
Jean Kossaifi
Adrian Bulat
Georgios Tzimiropoulos
Maja Pantic
22
67
0
04 Apr 2019
Resource Efficient 3D Convolutional Neural Networks
Okan Kopuklu
Neslihan Köse
Ahmet Gunduz
Gerhard Rigoll
24
186
0
04 Apr 2019
Correlation Congruence for Knowledge Distillation
Baoyun Peng
Xiao Jin
Jiaheng Liu
Shunfeng Zhou
Yichao Wu
Yu Liu
Dongsheng Li
Zhaoning Zhang
63
507
0
03 Apr 2019
Looking Fast and Slow: Memory-Guided Mobile Video Object Detection
Mason Liu
Menglong Zhu
Marie White
Yinxiao Li
Dmitry Kalenichenko
23
83
0
25 Mar 2019
Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM
Shaokai Ye
Xiaoyu Feng
Tianyun Zhang
Xiaolong Ma
Sheng Lin
...
Jian Tang
M. Fardad
X. Lin
Yongpan Liu
Yanzhi Wang
MQ
38
38
0
23 Mar 2019
High-Throughput CNN Inference on Embedded ARM big.LITTLE Multi-Core Processors
Siqi Wang
Gayathri Ananthanarayanan
Yifan Zeng
Neeraj Goel
A. Pathania
T. Mitra
19
118
0
14 Mar 2019
Partial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search
Xuzhao Li
Yiming Zhou
Zheng Pan
Jiashi Feng
3DV
27
158
0
09 Mar 2019
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting
Justice Amoh
K. Odame
32
17
0
13 Feb 2019
Optimally Scheduling CNN Convolutions for Efficient Memory Access
Arthur Stoutchinin
Francesco Conti
Luca Benini
38
43
0
04 Feb 2019
Convolutional Neural Networks with Layer Reuse
Okan Kopuklu
M. Babaee
S. Hörmann
Gerhard Rigoll
21
17
0
28 Jan 2019
Towards Compact ConvNets via Structure-Sparsity Regularized Filter Pruning
Shaohui Lin
Rongrong Ji
Yuchao Li
Cheng Deng
Xuelong Li
32
70
0
23 Jan 2019
A Survey of the Recent Architectures of Deep Convolutional Neural Networks
Asifullah Khan
A. Sohail
Umme Zahoora
Aqsa Saeed Qureshi
OOD
67
2,273
0
17 Jan 2019
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Ao Ren
Tianyun Zhang
Shaokai Ye
Jiayu Li
Wenyao Xu
Xuehai Qian
X. Lin
Yanzhi Wang
MQ
40
161
0
31 Dec 2018
E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Zhe Li
Caiwen Ding
Siyue Wang
Wujie Wen
Youwei Zhuo
...
Qinru Qiu
Wenyao Xu
X. Lin
Xuehai Qian
Yanzhi Wang
MQ
14
64
0
12 Dec 2018
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression
Yuchao Li
Shaohui Lin
Baochang Zhang
Jianzhuang Liu
David Doermann
Yongjian Wu
Feiyue Huang
Rongrong Ji
43
130
0
11 Dec 2018
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network
Sachin Mehta
Mohammad Rastegari
Linda G. Shapiro
Hannaneh Hajishirzi
VLM
29
393
0
28 Nov 2018
TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep LearningInference in Function as a Service Environments
Abdul Dakkak
Cheng-rong Li
Simon Garcia De Gonzalo
Jinjun Xiong
Wen-mei W. Hwu
21
19
0
24 Nov 2018
Analyzing Machine Learning Workloads Using a Detailed GPU Simulator
Jonathan Lew
Deval Shah
Suchita Pati
Shaylin Cattell
Mengchi Zhang
...
Christopher Ng
Negar Goli
Matthew D. Sinclair
Timothy G. Rogers
Tor M. Aamodt
29
65
0
18 Nov 2018
ShuffleDet: Real-Time Vehicle Detection Network in On-board Embedded UAV Imagery
S. Azimi
19
35
0
15 Nov 2018
A First Look at Deep Learning Apps on Smartphones
Mengwei Xu
Jiawei Liu
Yuanqiang Liu
F. Lin
Yunxin Liu
Xuanzhe Liu
HAI
33
177
0
08 Nov 2018
Progressive Weight Pruning of Deep Neural Networks using ADMM
Shaokai Ye
Tianyun Zhang
Kaiqi Zhang
Jiayu Li
Kaidi Xu
...
M. Fardad
Sijia Liu
Xiang Chen
X. Lin
Yanzhi Wang
AI4CE
37
38
0
17 Oct 2018
Shift-based Primitives for Efficient Convolutional Neural Networks
Huasong Zhong
Xianggen Liu
Yihui He
Yuchun Ma
35
20
0
22 Sep 2018
Deep Learning Towards Mobile Applications
Ji Wang
Bokai Cao
Philip S. Yu
Lichao Sun
Weidong Bao
Xiaomin Zhu
HAI
32
98
0
10 Sep 2018
An Adaptive Locally Connected Neuron Model: Focusing Neuron
F. Boray Tek
27
5
0
31 Aug 2018
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
34
232
0
13 Aug 2018
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Dongqing Zhang
Jiaolong Yang
Dongqiangzi Ye
G. Hua
MQ
23
697
0
26 Jul 2018
Filter Distillation for Network Compression
Xavier Suau
Luca Zappella
N. Apostoloff
24
38
0
20 Jul 2018
Edge Intelligence: On-Demand Deep Learning Model Co-Inference with Device-Edge Synergy
En Li
Zhi Zhou
Xu Chen
24
325
0
20 Jun 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
30
135
0
20 Jun 2018
GroupReduce: Block-Wise Low-Rank Approximation for Neural Language Model Shrinking
Patrick H. Chen
Si Si
Yang Li
Ciprian Chelba
Cho-Jui Hsieh
24
67
0
18 Jun 2018
Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking
Haichuan Yang
Yuhao Zhu
Ji Liu
CVBM
19
36
0
12 Jun 2018
Previous
1
2
3
4
Next