ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.01064
  4. Cited By
Trained Ternary Quantization
v1v2v3 (latest)

Trained Ternary Quantization

4 December 2016
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
    MQ
ArXiv (abs)PDFHTML

Papers citing "Trained Ternary Quantization"

50 / 508 papers shown
Title
AdaptivFloat: A Floating-point based Data Type for Resilient Deep
  Learning Inference
AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference
Thierry Tambe
En-Yu Yang
Zishen Wan
Yuntian Deng
Vijay Janapa Reddi
Alexander M. Rush
David Brooks
Gu-Yeon Wei
MQ
58
21
0
29 Sep 2019
Additive Powers-of-Two Quantization: An Efficient Non-uniform
  Discretization for Neural Networks
Additive Powers-of-Two Quantization: An Efficient Non-uniform Discretization for Neural Networks
Yuhang Li
Xin Dong
Wei Wang
MQ
73
259
0
28 Sep 2019
Impact of Low-bitwidth Quantization on the Adversarial Robustness for
  Embedded Neural Networks
Impact of Low-bitwidth Quantization on the Adversarial Robustness for Embedded Neural Networks
Rémi Bernhard
Pierre-Alain Moëllic
J. Dutertre
AAMLMQ
89
18
0
27 Sep 2019
Adaptive Binary-Ternary Quantization
Adaptive Binary-Ternary Quantization
Ryan Razani
Grégoire Morin
V. Nia
Eyyub Sari
MQ
52
14
0
26 Sep 2019
Accurate and Compact Convolutional Neural Networks with Trained
  Binarization
Accurate and Compact Convolutional Neural Networks with Trained Binarization
Zhe Xu
R. Cheung
MQ
58
54
0
25 Sep 2019
FALCON: Lightweight and Accurate Convolution
FALCON: Lightweight and Accurate Convolution
Jun-Gi Jang
Chun Quan
Hyun Dong Lee
U. Kang
13
1
0
25 Sep 2019
Forward and Backward Information Retention for Accurate Binary Neural
  Networks
Forward and Backward Information Retention for Accurate Binary Neural Networks
Haotong Qin
Ruihao Gong
Xianglong Liu
Mingzhu Shen
Ziran Wei
F. Yu
Jingkuan Song
MQ
220
332
0
24 Sep 2019
Structured Binary Neural Networks for Image Recognition
Structured Binary Neural Networks for Image Recognition
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Peng Chen
Lingqiao Liu
Ian Reid
MQ
135
19
0
22 Sep 2019
TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks
TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks
Shubham Jain
S. Gupta
A. Raghunathan
MQ
58
38
0
15 Sep 2019
A Camera That CNNs: Towards Embedded Neural Networks on Pixel Processor
  Arrays
A Camera That CNNs: Towards Embedded Neural Networks on Pixel Processor Arrays
Laurie Bose
Jianing Chen
S. Carey
Piotr Dudek
W. Mayol-Cuevas
54
37
0
12 Sep 2019
Knowledge Distillation for End-to-End Person Search
Knowledge Distillation for End-to-End Person Search
Bharti Munjal
Fabio Galasso
S. Amin
FedML
119
17
0
03 Sep 2019
High Performance Scalable FPGA Accelerator for Deep Neural Networks
High Performance Scalable FPGA Accelerator for Deep Neural Networks
Sudarshan Srinivasan
Pradeep Janedula
Saurabh Dhoble
Sasikanth Avancha
Dipankar Das
Naveen Mellempudi
Bharat Daga
M. Langhammer
Gregg Baeckler
Bharat Kaul
18
3
0
29 Aug 2019
Accelerating Large-Scale Inference with Anisotropic Vector Quantization
Accelerating Large-Scale Inference with Anisotropic Vector Quantization
Ruiqi Guo
Philip Sun
Xiang Wu
Quan Geng
David Simcha
Felix Chern
Sanjiv Kumar
MQ
88
7
0
27 Aug 2019
Once-for-All: Train One Network and Specialize it for Efficient
  Deployment
Once-for-All: Train One Network and Specialize it for Efficient Deployment
Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
OOD
169
1,287
0
26 Aug 2019
Learning Filter Basis for Convolutional Neural Network Compression
Learning Filter Basis for Convolutional Neural Network Compression
Yawei Li
Shuhang Gu
Luc Van Gool
Radu Timofte
SupR
72
99
0
23 Aug 2019
Incremental Binarization On Recurrent Neural Networks For Single-Channel
  Source Separation
Incremental Binarization On Recurrent Neural Networks For Single-Channel Source Separation
Sunwoo Kim
Mrinmoy Maity
Minje Kim
MQ
49
15
0
23 Aug 2019
RBCN: Rectified Binary Convolutional Networks for Enhancing the
  Performance of 1-bit DCNNs
RBCN: Rectified Binary Convolutional Networks for Enhancing the Performance of 1-bit DCNNs
Chunlei Liu
Wenrui Ding
Xin Xia
Yuan Hu
Baochang Zhang
Jianzhuang Liu
Bohan Zhuang
G. Guo
MQ
67
26
0
21 Aug 2019
Efficient Deep Neural Networks
Efficient Deep Neural Networks
Bichen Wu
61
12
0
20 Aug 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit
  Neural Networks
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
Ruihao Gong
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
102
460
0
14 Aug 2019
Effective Training of Convolutional Neural Networks with Low-bitwidth
  Weights and Activations
Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations
Bohan Zhuang
Jing Liu
Mingkui Tan
Lingqiao Liu
Ian Reid
Chunhua Shen
MQ
99
46
0
10 Aug 2019
Efficient Inference of CNNs via Channel Pruning
Efficient Inference of CNNs via Channel Pruning
Boyu Zhang
A. Davoodi
Y. Hu
CVBM
23
6
0
08 Aug 2019
GDRQ: Group-based Distribution Reshaping for Quantization
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
57
3
0
05 Aug 2019
Tuning Algorithms and Generators for Efficient Edge Inference
Tuning Algorithms and Generators for Efficient Edge Inference
R. Naous
Lazar Supic
Yoonhwan Kang
Ranko Seradejovic
Anish Singhani
Vladimir M. Stojanović
16
2
0
31 Jul 2019
MoBiNet: A Mobile Binary Network for Image Classification
MoBiNet: A Mobile Binary Network for Image Classification
Hai T. Phan
Dang T. Huynh
Yihui He
Marios Savvides
Zhiqiang Shen
MQ
83
49
0
29 Jul 2019
DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks
DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks
Simon Wiedemann
H. Kirchhoffer
Stefan Matlage
Paul Haase
Arturo Marbán
...
Ahmed Osman
D. Marpe
H. Schwarz
Thomas Wiegand
Wojciech Samek
111
97
0
27 Jul 2019
Learning Multimodal Fixed-Point Weights using Gradient Descent
Learning Multimodal Fixed-Point Weights using Gradient Descent
Lukas Enderich
Fabian Timm
Lars Rosenbaum
Wolfram Burgard
MQ
53
9
0
16 Jul 2019
Light Multi-segment Activation for Model Compression
Light Multi-segment Activation for Model Compression
Zhenhui Xu
Guolin Ke
Jia Zhang
Jiang Bian
Tie-Yan Liu
28
2
0
16 Jul 2019
Bringing Giant Neural Networks Down to Earth with Unlabeled Data
Bringing Giant Neural Networks Down to Earth with Unlabeled Data
Yehui Tang
Shan You
Chang Xu
Boxin Shi
Chao Xu
85
11
0
13 Jul 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
Pierre Stock
Armand Joulin
Rémi Gribonval
Benjamin Graham
Hervé Jégou
MQ
120
149
0
12 Jul 2019
A Targeted Acceleration and Compression Framework for Low bit Neural
  Networks
A Targeted Acceleration and Compression Framework for Low bit Neural Networks
Biao Qian
Yang Wang
MQ
66
0
0
09 Jul 2019
Weight Normalization based Quantization for Deep Neural Network
  Compression
Weight Normalization based Quantization for Deep Neural Network Compression
Wenhong Cai
Wu-Jun Li
48
14
0
01 Jul 2019
Improving Branch Prediction By Modeling Global History with
  Convolutional Neural Networks
Improving Branch Prediction By Modeling Global History with Convolutional Neural Networks
Stephen J. Tarsa
Chit-Kwan Lin
Gokce Keskin
G. Chinya
Hong Wang
48
27
0
20 Jun 2019
Back to Simplicity: How to Train Accurate BNNs from Scratch?
Back to Simplicity: How to Train Accurate BNNs from Scratch?
Joseph Bethge
Haojin Yang
Marvin Bornstein
Christoph Meinel
AAMLMQ
64
58
0
19 Jun 2019
ViP: Virtual Pooling for Accelerating CNN-based Image Classification and
  Object Detection
ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
Zhuo Chen
Jiyuan Zhang
Ruizhou Ding
Diana Marculescu
39
12
0
19 Jun 2019
Divide and Conquer: Leveraging Intermediate Feature Representations for
  Quantized Training of Neural Networks
Divide and Conquer: Leveraging Intermediate Feature Representations for Quantized Training of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Alex Cloninger
H. Esmaeilzadeh
MQ
53
8
0
14 Jun 2019
Visual Wake Words Dataset
Visual Wake Words Dataset
Aakanksha Chowdhery
Pete Warden
Jonathon Shlens
Andrew G. Howard
Rocky Rhodes
VLM
88
102
0
12 Jun 2019
Run-Time Efficient RNN Compression for Inference on Edge Devices
Run-Time Efficient RNN Compression for Inference on Edge Devices
Urmish Thakker
Jesse G. Beu
Dibakar Gope
Ganesh S. Dasika
Matthew Mattina
72
19
0
12 Jun 2019
Table-Based Neural Units: Fully Quantizing Networks for Multiply-Free
  Inference
Table-Based Neural Units: Fully Quantizing Networks for Multiply-Free Inference
Michele Covell
David Marwood
S. Baluja
Nick Johnston
MQ
41
7
0
11 Jun 2019
Compressing RNNs for IoT devices by 15-38x using Kronecker Products
Compressing RNNs for IoT devices by 15-38x using Kronecker Products
Urmish Thakker
Jesse G. Beu
Dibakar Gope
Chu Zhou
Igor Fedorov
Ganesh S. Dasika
Matthew Mattina
106
36
0
07 Jun 2019
Multi-Precision Quantized Neural Networks via Encoding Decomposition of
  -1 and +1
Multi-Precision Quantized Neural Networks via Encoding Decomposition of -1 and +1
Qigong Sun
Fanhua Shang
Kan Yang
Xiufang Li
Yan Ren
L. Jiao
MQ
70
12
0
31 May 2019
DeepShift: Towards Multiplication-Less Neural Networks
DeepShift: Towards Multiplication-Less Neural Networks
Mostafa Elhoushi
Zihao Chen
F. Shafiq
Ye Tian
Joey Yiwei Li
MQ
131
102
0
30 May 2019
Instant Quantization of Neural Networks using Monte Carlo Methods
Instant Quantization of Neural Networks using Monte Carlo Methods
Gonçalo Mordido
Matthijs Van Keirsbilck
A. Keller
MQ
39
9
0
29 May 2019
Harnessing Slow Dynamics in Neuromorphic Computation
Harnessing Slow Dynamics in Neuromorphic Computation
Tianlin Liu
38
0
0
28 May 2019
Incremental Learning Using a Grow-and-Prune Paradigm with Efficient
  Neural Networks
Incremental Learning Using a Grow-and-Prune Paradigm with Efficient Neural Networks
Xiaoliang Dai
Hongxu Yin
N. Jha
88
32
0
27 May 2019
HadaNets: Flexible Quantization Strategies for Neural Networks
HadaNets: Flexible Quantization Strategies for Neural Networks
Yash Akhauri
MQ
38
7
0
26 May 2019
Structured Compression by Weight Encryption for Unstructured Pruning and
  Quantization
Structured Compression by Weight Encryption for Unstructured Pruning and Quantization
S. Kwon
Dongsoo Lee
Byeongwook Kim
Parichay Kapoor
Baeseong Park
Gu-Yeon Wei
MQ
74
51
0
24 May 2019
EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis
EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis
Chaoqi Wang
Roger C. Grosse
Sanja Fidler
Guodong Zhang
80
124
0
15 May 2019
SinReQ: Generalized Sinusoidal Regularization for Low-Bitwidth Deep
  Quantized Training
SinReQ: Generalized Sinusoidal Regularization for Low-Bitwidth Deep Quantized Training
Ahmed T. Elthakeb
Prannoy Pilligundla
H. Esmaeilzadeh
MQ
68
9
0
04 May 2019
Full-stack Optimization for Accelerating CNNs with FPGA Validation
Full-stack Optimization for Accelerating CNNs with FPGA Validation
Bradley McDanel
Shanghang Zhang
H. T. Kung
Xin Dong
MQ
29
2
0
01 May 2019
HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Zhen Dong
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
97
529
0
29 Apr 2019
Previous
123...10116789
Next