Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1612.01064
Cited By
Trained Ternary Quantization
4 December 2016
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trained Ternary Quantization"
50 / 509 papers shown
Title
MLPerf Training Benchmark
Arya D. McCarthy
Christine Cheng
Cody Coleman
Greg Diamos
Paulius Micikevicius
...
Carole-Jean Wu
Lingjie Xu
Masafumi Yamazaki
C. Young
Matei A. Zaharia
47
307
0
02 Oct 2019
AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference
Thierry Tambe
En-Yu Yang
Zishen Wan
Yuntian Deng
Vijay Janapa Reddi
Alexander M. Rush
David Brooks
Gu-Yeon Wei
MQ
19
21
0
29 Sep 2019
Additive Powers-of-Two Quantization: An Efficient Non-uniform Discretization for Neural Networks
Yuhang Li
Xin Dong
Wei Wang
MQ
31
255
0
28 Sep 2019
Impact of Low-bitwidth Quantization on the Adversarial Robustness for Embedded Neural Networks
Rémi Bernhard
Pierre-Alain Moëllic
J. Dutertre
AAML
MQ
29
18
0
27 Sep 2019
Adaptive Binary-Ternary Quantization
Ryan Razani
Grégoire Morin
V. Nia
Eyyub Sari
MQ
16
13
0
26 Sep 2019
Accurate and Compact Convolutional Neural Networks with Trained Binarization
Zhe Xu
R. Cheung
MQ
27
54
0
25 Sep 2019
FALCON: Lightweight and Accurate Convolution
Jun-Gi Jang
Chun Quan
Hyun Dong Lee
U. Kang
6
1
0
25 Sep 2019
Forward and Backward Information Retention for Accurate Binary Neural Networks
Haotong Qin
Ruihao Gong
Xianglong Liu
Mingzhu Shen
Ziran Wei
F. Yu
Jingkuan Song
MQ
133
324
0
24 Sep 2019
Structured Binary Neural Networks for Image Recognition
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Peng Chen
Lingqiao Liu
Ian Reid
MQ
27
17
0
22 Sep 2019
TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks
Shubham Jain
S. Gupta
A. Raghunathan
MQ
35
37
0
15 Sep 2019
A Camera That CNNs: Towards Embedded Neural Networks on Pixel Processor Arrays
Laurie Bose
Jianing Chen
S. Carey
Piotr Dudek
W. Mayol-Cuevas
16
37
0
12 Sep 2019
Knowledge Distillation for End-to-End Person Search
Bharti Munjal
Fabio Galasso
S. Amin
FedML
48
15
0
03 Sep 2019
High Performance Scalable FPGA Accelerator for Deep Neural Networks
Sudarshan Srinivasan
Pradeep Janedula
Saurabh Dhoble
Sasikanth Avancha
Dipankar Das
Naveen Mellempudi
Bharat Daga
M. Langhammer
Gregg Baeckler
Bharat Kaul
13
3
0
29 Aug 2019
Accelerating Large-Scale Inference with Anisotropic Vector Quantization
Ruiqi Guo
Philip Sun
Xiang Wu
Quan Geng
David Simcha
Felix Chern
Sanjiv Kumar
MQ
21
7
0
27 Aug 2019
Once-for-All: Train One Network and Specialize it for Efficient Deployment
Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
OOD
51
1,262
0
26 Aug 2019
Learning Filter Basis for Convolutional Neural Network Compression
Yawei Li
Shuhang Gu
Luc Van Gool
Radu Timofte
SupR
17
5
0
23 Aug 2019
Incremental Binarization On Recurrent Neural Networks For Single-Channel Source Separation
Sunwoo Kim
Mrinmoy Maity
Minje Kim
MQ
11
14
0
23 Aug 2019
RBCN: Rectified Binary Convolutional Networks for Enhancing the Performance of 1-bit DCNNs
Chunlei Liu
Wenrui Ding
Xin Xia
Yuan Hu
Baochang Zhang
Jianzhuang Liu
Bohan Zhuang
G. Guo
MQ
19
25
0
21 Aug 2019
Efficient Deep Neural Networks
Bichen Wu
28
12
0
20 Aug 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
Ruihao Gong
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
32
447
0
14 Aug 2019
Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations
Bohan Zhuang
Jing Liu
Mingkui Tan
Lingqiao Liu
Ian Reid
Chunhua Shen
MQ
29
45
0
10 Aug 2019
Efficient Inference of CNNs via Channel Pruning
Boyu Zhang
A. Davoodi
Y. Hu
CVBM
19
6
0
08 Aug 2019
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
33
3
0
05 Aug 2019
Tuning Algorithms and Generators for Efficient Edge Inference
R. Naous
Lazar Supic
Yoonhwan Kang
Ranko Seradejovic
Anish Singhani
Vladimir M. Stojanović
14
2
0
31 Jul 2019
MoBiNet: A Mobile Binary Network for Image Classification
Hai T. Phan
Dang T. Huynh
Yihui He
Marios Savvides
Zhiqiang Shen
MQ
36
49
0
29 Jul 2019
DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks
Simon Wiedemann
H. Kirchhoffer
Stefan Matlage
Paul Haase
Arturo Marbán
...
Ahmed Osman
D. Marpe
H. Schwarz
Thomas Wiegand
Wojciech Samek
54
93
0
27 Jul 2019
Learning Multimodal Fixed-Point Weights using Gradient Descent
Lukas Enderich
Fabian Timm
Lars Rosenbaum
Wolfram Burgard
MQ
17
9
0
16 Jul 2019
Light Multi-segment Activation for Model Compression
Zhenhui Xu
Guolin Ke
Jia Zhang
Jiang Bian
Tie-Yan Liu
19
2
0
16 Jul 2019
Bringing Giant Neural Networks Down to Earth with Unlabeled Data
Yehui Tang
Shan You
Chang Xu
Boxin Shi
Chao Xu
24
11
0
13 Jul 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
Pierre Stock
Armand Joulin
Rémi Gribonval
Benjamin Graham
Hervé Jégou
MQ
42
149
0
12 Jul 2019
A Targeted Acceleration and Compression Framework for Low bit Neural Networks
Biao Qian
Yang Wang
MQ
35
0
0
09 Jul 2019
Weight Normalization based Quantization for Deep Neural Network Compression
Wenhong Cai
Wu-Jun Li
24
14
0
01 Jul 2019
Improving Branch Prediction By Modeling Global History with Convolutional Neural Networks
Stephen J. Tarsa
Chit-Kwan Lin
Gokce Keskin
G. Chinya
Hong Wang
21
26
0
20 Jun 2019
Back to Simplicity: How to Train Accurate BNNs from Scratch?
Joseph Bethge
Haojin Yang
Marvin Bornstein
Christoph Meinel
AAML
MQ
27
58
0
19 Jun 2019
ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
Zhuo Chen
Jiyuan Zhang
Ruizhou Ding
Diana Marculescu
13
12
0
19 Jun 2019
Divide and Conquer: Leveraging Intermediate Feature Representations for Quantized Training of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Alex Cloninger
H. Esmaeilzadeh
MQ
26
8
0
14 Jun 2019
Visual Wake Words Dataset
Aakanksha Chowdhery
Pete Warden
Jonathon Shlens
Andrew G. Howard
Rocky Rhodes
VLM
18
99
0
12 Jun 2019
Run-Time Efficient RNN Compression for Inference on Edge Devices
Urmish Thakker
Jesse G. Beu
Dibakar Gope
Ganesh S. Dasika
Matthew Mattina
19
18
0
12 Jun 2019
Table-Based Neural Units: Fully Quantizing Networks for Multiply-Free Inference
Michele Covell
David Marwood
S. Baluja
Nick Johnston
MQ
19
7
0
11 Jun 2019
Compressing RNNs for IoT devices by 15-38x using Kronecker Products
Urmish Thakker
Jesse G. Beu
Dibakar Gope
Chu Zhou
Igor Fedorov
Ganesh S. Dasika
Matthew Mattina
27
36
0
07 Jun 2019
Multi-Precision Quantized Neural Networks via Encoding Decomposition of -1 and +1
Qigong Sun
Fanhua Shang
Kan Yang
Xiufang Li
Yan Ren
L. Jiao
MQ
46
12
0
31 May 2019
DeepShift: Towards Multiplication-Less Neural Networks
Mostafa Elhoushi
Zihao Chen
F. Shafiq
Ye Tian
Joey Yiwei Li
MQ
38
97
0
30 May 2019
Instant Quantization of Neural Networks using Monte Carlo Methods
Gonçalo Mordido
Matthijs Van Keirsbilck
A. Keller
MQ
27
9
0
29 May 2019
Harnessing Slow Dynamics in Neuromorphic Computation
Tianlin Liu
24
0
0
28 May 2019
Incremental Learning Using a Grow-and-Prune Paradigm with Efficient Neural Networks
Xiaoliang Dai
Hongxu Yin
N. Jha
33
31
0
27 May 2019
HadaNets: Flexible Quantization Strategies for Neural Networks
Yash Akhauri
MQ
21
7
0
26 May 2019
Structured Compression by Weight Encryption for Unstructured Pruning and Quantization
S. Kwon
Dongsoo Lee
Byeongwook Kim
Parichay Kapoor
Baeseong Park
Gu-Yeon Wei
MQ
35
48
0
24 May 2019
EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis
Chaoqi Wang
Roger C. Grosse
Sanja Fidler
Guodong Zhang
29
121
0
15 May 2019
SinReQ: Generalized Sinusoidal Regularization for Low-Bitwidth Deep Quantized Training
Ahmed T. Elthakeb
Prannoy Pilligundla
H. Esmaeilzadeh
MQ
31
9
0
04 May 2019
Full-stack Optimization for Accelerating CNNs with FPGA Validation
Bradley McDanel
Shanghang Zhang
H. T. Kung
Xin Dong
MQ
25
2
0
01 May 2019
Previous
1
2
3
...
10
11
6
7
8
9
Next