1609.07061
Cited By
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
22 September 2016
Itay Hubara
Matthieu Courbariaux
Daniel Soudry
Ran El-Yaniv
Yoshua Bengio
MQ
Papers citing
"Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations"
50 / 526 papers shown
Title
GroupReduce: Block-Wise Low-Rank Approximation for Neural Language Model Shrinking
Patrick H. Chen
Si Si
Yang Li
Ciprian Chelba
Cho-Jui Hsieh
67
70
0
18 Jun 2018
RAPIDNN: In-Memory Deep Neural Network Acceleration Framework
Mohsen Imani
Mohammad Samragh
Yeseong Kim
Saransh Gupta
F. Koushanfar
Tajana Simunic
68
51
0
15 Jun 2018
Resource-Efficient Neural Architect
Yanqi Zhou
S. Ebrahimi
Sercan O. Arik
Haonan Yu
Hairong Liu
G. Diamos
67
64
0
12 Jun 2018
Full deep neural network training on a pruned weight budget
Maximilian Golub
G. Lemieux
Mieszko Lis
83
28
0
11 Jun 2018
TAPAS: Tricks to Accelerate (encrypted) Prediction As a Service
Amartya Sanyal
Matt J. Kusner
Adria Gascon
Varun Kanade
FedML
74
127
0
09 Jun 2018
MPDCompress - Matrix Permutation Decomposition Algorithm for Deep Neural Network Compression
Lazar Supic
R. Naous
Ranko Sredojevic
Aleksandra Faust
Vladimir M. Stojanović
119
4
0
30 May 2018
Adding New Tasks to a Single Network with Weight Transformations using Binary Masks
Massimiliano Mancini
Elisa Ricci
Barbara Caputo
Samuel Rota Buló
91
52
0
28 May 2018
Accelerating CNN inference on FPGAs: A Survey
K. Abdelouahab
Maxime Pelcat
Jocelyn Serot
F. Berry
AI4CE
48
149
0
26 May 2018
Heterogeneous Bitwidth Binarization in Convolutional Neural Networks
Josh Fromm
Shwetak N. Patel
Matthai Philipose
MQ
79
27
0
25 May 2018
Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression
Jiahao Su
Jingling Li
Bobby Bhattacharjee
Furong Huang
55
20
0
25 May 2018
Scalable Methods for 8-bit Training of Neural Networks
Ron Banner
Itay Hubara
Elad Hoffer
Daniel Soudry
MQ
86
339
0
25 May 2018
Laplacian Networks: Bounding Indicator Function Smoothness for Neural Network Robustness
Carlos Lassance
Vincent Gripon
Antonio Ortega
AAML
80
16
0
24 May 2018
CascadeCNN: Pushing the performance limits of quantisation
Alexandros Kouris
Stylianos I. Venieris
C. Bouganis
MQ
51
24
0
22 May 2018
Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit
Seyed Hamed Fatemi Langroudi
Tej Pandit
Dhireesha Kudithipudi
MQ
57
41
0
22 May 2018
Faster Neural Network Training with Approximate Tensor Operations
Menachem Adelman
Kfir Y. Levy
Ido Hakimi
M. Silberstein
81
27
0
21 May 2018
DEEPEYE: A Compact and Accurate Video Comprehension at Terminal Devices Compressed with Quantization and Tensorization
Yuan Cheng
Guangya Li
Hai-Bao Chen
S. Tan
Hao Yu
22
3
0
21 May 2018
PACT: Parameterized Clipping Activation for Quantized Neural Networks
Jungwook Choi
Zhuo Wang
Swagath Venkataramani
P. Chuang
Vijayalakshmi Srinivasan
K. Gopalakrishnan
MQ
80
957
0
16 May 2018
A 64mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones
Daniele Palossi
Antonio Loquercio
Francesco Conti
Eric Flamand
Davide Scaramuzza
Luca Benini
251
159
0
04 May 2018
Dynamic Sentence Sampling for Efficient Training of Neural Machine Translation
Rui Wang
Masao Utiyama
Eiichiro Sumita
126
28
0
01 May 2018
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
Chaim Baskin
Eli Schwartz
Evgenii Zheltonozhskii
Natan Liss
Raja Giryes
A. Bronstein
A. Mendelson
MQ
81
45
0
29 Apr 2018
Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Liyuan Liu
Xiang Ren
Jingbo Shang
Jian-wei Peng
Jiawei Han
83
44
0
20 Apr 2018
Value-aware Quantization for Training and Inference of Neural Networks
Eunhyeok Park
S. Yoo
Peter Vajda
MQ
66
163
0
20 Apr 2018
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Ameya Prabhu
Vishal Batchu
Rohit Gajawada
Sri Aurobindo Munagala
A. Namboodiri
MQ
54
18
0
11 Apr 2018
Distribution-Aware Binarization of Neural Networks for Sketch Recognition
Ameya Prabhu
Vishal Batchu
Sri Aurobindo Munagala
Rohit Gajawada
A. Namboodiri
MQ
97
5
0
09 Apr 2018
SqueezeNext: Hardware-Aware Neural Network Design
A. Gholami
K. Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter H. Jin
Sicheng Zhao
Kurt Keutzer
65
299
0
23 Mar 2018
A neural network memory prefetcher using semantic locality
L. Peled
U. Weiser
Yoav Etsion
48
43
0
19 Mar 2018
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
Sachin Mehta
Mohammad Rastegari
A. Caspi
Linda G. Shapiro
Hannaneh Hajishirzi
SSeg
133
783
0
19 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
71
185
0
15 Mar 2018
Quantization of Fully Convolutional Networks for Accurate Biomedical Image Segmentation
Xiaowei Xu
Q. Lu
Yu Hu
Lin Yang
X. S. Hu
Danny Chen
Yiyu Shi
MedIm
84
85
0
13 Mar 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
75
709
0
26 Feb 2018
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence
Jinglan Liu
Jiaxin Zhang
Yukun Ding
Xiaowei Xu
Meng Jiang
Yiyu Shi
76
4
0
26 Feb 2018
The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks
Nicholas Carlini
Chang-rui Liu
Ulfar Erlingsson
Jernej Kos
Basel Alomair
177
1,150
0
22 Feb 2018
Model compression via distillation and quantization
A. Polino
Razvan Pascanu
Dan Alistarh
MQ
88
733
0
15 Feb 2018
On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks
Yukun Ding
Jinglan Liu
Jinjun Xiong
Yiyu Shi
MQ
117
21
0
10 Feb 2018
Universal Deep Neural Network Compression
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
141
88
0
07 Feb 2018
Mixed Precision Training of Convolutional Neural Networks using Integer Operations
Dipankar Das
Naveen Mellempudi
Dheevatsa Mudigere
Dhiraj D. Kalamkar
Sasikanth Avancha
...
J. Corbal
N. Shustrov
R. Dubtsov
Evarist Fomenko
V. Pirogov
MQ
74
154
0
03 Feb 2018
Alternating Multi-bit Quantization for Recurrent Neural Networks
Chen Xu
Jianqiang Yao
Zhouchen Lin
Wenwu Ou
Yuanbin Cao
Zhirong Wang
H. Zha
MQ
86
116
0
01 Feb 2018
Toward Scalable Verification for Safety-Critical Deep Networks
L. Kuper
Guy Katz
Justin Emile Gottschlich
Kyle D. Julian
Clark W. Barrett
Mykel Kochenderfer
109
40
0
18 Jan 2018
Fix your classifier: the marginal value of training the last weight layer
Elad Hoffer
Itay Hubara
Daniel Soudry
156
102
0
14 Jan 2018
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
170
3,151
0
15 Dec 2017
StrassenNets: Deep Learning with a Multiplication Budget
Michael Tschannen
Aran Khanna
Anima Anandkumar
52
30
0
11 Dec 2017
In-Place Activated BatchNorm for Memory-Optimized Training of DNNs
Samuel Rota Buló
Lorenzo Porzi
Peter Kontschieder
117
357
0
07 Dec 2017
Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks
Hardik Sharma
Jongse Park
Naveen Suda
Liangzhen Lai
Benson Chau
Joo-Young Kim
Vikas Chandra
H. Esmaeilzadeh
MQ
64
493
0
05 Dec 2017
Towards Accurate Binary Convolutional Neural Network
Xiaofan Lin
Cong Zhao
Wei Pan
MQ
100
649
0
30 Nov 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation
Moritz B. Milde
Daniel Neil
Alessandro Aimar
T. Delbruck
Giacomo Indiveri
MQ
73
10
0
13 Nov 2017
Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks
Urs Koster
T. Webb
Xin Eric Wang
Marcel Nassar
Arjun K. Bansal
...
Luke Hornof
A. Khosrowshahi
Carey Kloss
Ruby J. Pai
N. Rao
MQ
57
262
0
06 Nov 2017
Efficient Inferencing of Compressed Deep Neural Networks
Dharma Teja Vooturi
Saurabh Goyal
Anamitra R. Choudhury
Yogish Sabharwal
Ashish Verma
40
6
0
01 Nov 2017
Minimum Energy Quantized Neural Networks
Bert Moons
Koen Goetschalckx
Nick Van Berckelaer
Marian Verhelst
MQ
82
123
0
01 Nov 2017
The Implicit Bias of Gradient Descent on Separable Data
Daniel Soudry
Elad Hoffer
Mor Shpigel Nacson
Suriya Gunasekar
Nathan Srebro
208
924
0
27 Oct 2017
Mixed Precision Training
Paulius Micikevicius
Sharan Narang
Jonah Alben
G. Diamos
Erich Elsen
...
Boris Ginsburg
Michael Houston
Oleksii Kuchaiev
Ganesh Venkatesh
Hao Wu
183
1,809
0
10 Oct 2017