ResearchTrend.AI
Trained Ternary Quantization

4 December 2016
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
    MQ

Papers citing "Trained Ternary Quantization"

50 / 508 papers shown
Discrimination-aware Channel Pruning for Deep Neural Networks
Zhuangwei Zhuang
Mingkui Tan
Bohan Zhuang
Jing Liu
Yong Guo
Qingyao Wu
Junzhou Huang
Jin-Hui Zhu
162
601
0
28 Oct 2018
Progressive Weight Pruning of Deep Neural Networks using ADMM
Shaokai Ye
Tianyun Zhang
Kaiqi Zhang
Jiayu Li
Kaidi Xu
...
M. Fardad
Sijia Liu
Xiang Chen
Xinyu Lin
Yanzhi Wang
AI4CE
113
38
0
17 Oct 2018
Training Deep Neural Network in Limited Precision
Hyunsun Park
J. Lee
Youngmin Oh
Sangwon Ha
Seungwon Lee
26
9
0
12 Oct 2018
Towards Fast and Energy-Efficient Binarized Neural Network Inference on FPGA
Cheng Fu
Shilin Zhu
Hao Su
Ching-En Lee
Jishen Zhao
MQ
72
32
0
04 Oct 2018
Relaxed Quantization for Discretized Neural Networks
Christos Louizos
M. Reisser
Tijmen Blankevoort
E. Gavves
Max Welling
MQ
103
132
0
03 Oct 2018
LIT: Block-wise Intermediate Representation Training for Model Compression
Animesh Koratana
Daniel Kang
Peter Bailis
Matei A. Zaharia
50
12
0
02 Oct 2018
Simultaneously Optimizing Weight and Quantizer of Ternary Neural Network using Truncated Gaussian Approximation
Zhezhi He
Deliang Fan
MQ
73
67
0
02 Oct 2018
ProxQuant: Quantized Neural Networks via Proximal Operators
Yu Bai
Yu Wang
Edo Liberty
MQ
98
118
0
01 Oct 2018
NICE: Noise Injection and Clamping Estimation for Neural Network Quantization
Chaim Baskin
Natan Liss
Yoav Chai
Evgenii Zheltonozhskii
Eli Schwartz
Raja Giryes
A. Mendelson
A. Bronstein
MQ
99
62
0
29 Sep 2018
Learning Recurrent Binary/Ternary Weights
A. Ardakani
Zhengyun Ji
S. C. Smithson
B. Meyer
W. Gross
MQ
67
28
0
28 Sep 2018
Scalar Arithmetic Multiple Data: Customizable Precision for Deep Neural Networks
Andrew Anderson
David Gregg
MQ
23
1
0
27 Sep 2018
Characterising Across-Stack Optimisations for Deep Convolutional Neural Networks
Jack Turner
José Cano
Valentin Radu
Elliot J. Crowley
Michael F. P. O'Boyle
Amos Storkey
43
40
0
19 Sep 2018
DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing
Xiaofei Xie
Lei Ma
Felix Juefei Xu
Hongxu Chen
Minhui Xue
Yue Liu
Yang Liu
Jianjun Zhao
Jianxiong Yin
Simon See
116
41
0
04 Sep 2018
Learning Sparse Low-Precision Neural Networks With Learnable Regularization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
66
31
0
01 Sep 2018
Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks
Yang He
Xuanyi Dong
Guoliang Kang
Yanwei Fu
C. Yan
Yi Yang
118
135
0
22 Aug 2018
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
Yang He
Guoliang Kang
Xuanyi Dong
Yanwei Fu
Yi Yang
AAML VLM
101
968
0
21 Aug 2018
Learning to Quantize Deep Networks by Optimizing Quantization Intervals with Task Loss
S. Jung
Changyong Son
Seohyung Lee
JinWoo Son
Youngjun Kwak
Jae-Joon Han
Sung Ju Hwang
Changkyu Choi
MQ
82
376
0
17 Aug 2018
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
117
236
0
13 Aug 2018
Training Compact Neural Networks with Binary Weights and Low Precision Activations
Bohan Zhuang
Chunhua Shen
Ian Reid
MQ
53
14
0
08 Aug 2018
Design Flow of Accelerating Hybrid Extremely Low Bit-width Neural Network in Embedded FPGA
Junsong Wang
Qiuwen Lou
Xiaofan Zhang
Chao Zhu
Yonghua Lin
Deming Chen
MQ
94
93
0
31 Jul 2018
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Dongqing Zhang
Jiaolong Yang
Dongqiangzi Ye
G. Hua
MQ
77
704
0
26 Jul 2018
Optimize Deep Convolutional Neural Network with Ternarized Weights and High Accuracy
Zhezhi He
Boqing Gong
Deliang Fan
36
22
0
20 Jul 2018
Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)
Jungwook Choi
P. Chuang
Zhuo Wang
Swagath Venkataramani
Vijayalakshmi Srinivasan
K. Gopalakrishnan
MQ
73
76
0
17 Jul 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin
Alessandro Pappalardo
M. M. Ghaffar
Giulio Gambardella
Norbert Wehn
Michaela Blott
122
72
0
11 Jul 2018
Auto Deep Compression by Reinforcement Learning Based Actor-Critic Structure
Hamed Hakkak
OffRL AI4CE
93
1
0
08 Jul 2018
Stochastic Layer-Wise Precision in Deep Neural Networks
Griffin Lacey
Graham W. Taylor
S. Areibi
86
18
0
03 Jul 2018
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
MQ
102
133
0
01 Jul 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
108
137
0
20 Jun 2018
Exploration of Low Numeric Precision Deep Learning Inference Using Intel FPGAs
Philip Colangelo
Nasibeh Nasiri
Asit K. Mishra
Eriko Nurvitadhi
M. Margala
Kevin Nealis
MQ
35
1
0
12 Jun 2018
TAPAS: Tricks to Accelerate (encrypted) Prediction As a Service
Amartya Sanyal
Matt J. Kusner
Adria Gascon
Varun Kanade
FedML
74
127
0
09 Jun 2018
IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks
Ke Sun
Mingjie Li
Dong Liu
Jingdong Wang
124
126
0
01 Jun 2018
MPDCompress - Matrix Permutation Decomposition Algorithm for Deep Neural Network Compression
Lazar Supic
R. Naous
Ranko Sredojevic
Aleksandra Faust
Vladimir M. Stojanović
124
4
0
30 May 2018
Retraining-Based Iterative Weight Quantization for Deep Neural Networks
Dongsoo Lee
Byeongwook Kim
MQ
79
16
0
29 May 2018
Accelerating CNN inference on FPGAs: A Survey
K. Abdelouahab
Maxime Pelcat
Jocelyn Serot
F. Berry
AI4CE
48
149
0
26 May 2018
Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression
Jiahao Su
Jingling Li
Bobby Bhattacharjee
Furong Huang
70
20
0
25 May 2018
DEEPEYE: A Compact and Accurate Video Comprehension at Terminal Devices Compressed with Quantization and Tensorization
Yuan Cheng
Guangya Li
Hai-Bao Chen
S. Tan
Hao Yu
22
3
0
21 May 2018
PACT: Parameterized Clipping Activation for Quantized Neural Networks
Jungwook Choi
Zhuo Wang
Swagath Venkataramani
P. Chuang
Vijayalakshmi Srinivasan
K. Gopalakrishnan
MQ
103
958
0
16 May 2018
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
Chaim Baskin
Eli Schwartz
Evgenii Zheltonozhskii
Natan Liss
Raja Giryes
A. Bronstein
A. Mendelson
MQ
90
45
0
29 Apr 2018
Low-memory convolutional neural networks through incremental depth-first processing
Jonathan Binas
Yoshua Bengio
SupR
39
3
0
28 Apr 2018
Value-aware Quantization for Training and Inference of Neural Networks
Eunhyeok Park
S. Yoo
Peter Vajda
MQ
69
163
0
20 Apr 2018
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition
Kartik Hegde
Jiyong Yu
R. Agrawal
Mengjia Yan
Michael Pellauer
Christopher W. Fletcher
74
166
0
18 Apr 2018
IGCV2: Interleaved Structured Sparse Convolutional Neural Networks
Guotian Xie
Jingdong Wang
Ting Zhang
Jianhuang Lai
Richang Hong
Guo-Jun Qi
115
106
0
17 Apr 2018
Fast inference of deep neural networks in FPGAs for particle physics
Javier Mauricio Duarte
Song Han
Philip C. Harris
S. Jindariani
E. Kreinar
...
J. Ngadiuba
M. Pierini
R. Rivera
N. Tran
Zhenbin Wu
AI4CE
164
396
0
16 Apr 2018
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Ameya Prabhu
Vishal Batchu
Rohit Gajawada
Sri Aurobindo Munagala
A. Namboodiri
MQ
55
18
0
11 Apr 2018
Distribution-Aware Binarization of Neural Networks for Sketch Recognition
Ameya Prabhu
Vishal Batchu
Sri Aurobindo Munagala
Rohit Gajawada
A. Namboodiri
MQ
104
5
0
09 Apr 2018
Training DNNs with Hybrid Block Floating Point
M. Drumond
Tao R. Lin
Martin Jaggi
Babak Falsafi
74
98
0
04 Apr 2018
Adversarial Network Compression
Vasileios Belagiannis
Azade Farshad
Fabio Galasso
GAN AAML
63
58
0
28 Mar 2018
SqueezeNext: Hardware-Aware Neural Network Design
A. Gholami
K. Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter H. Jin
Sicheng Zhao
Kurt Keutzer
67
300
0
23 Mar 2018
EVA²: Exploiting Temporal Redundancy in Live Computer Vision
Mark Buckler
Philip Bedoukian
Suren Jayasuriya
Adrian Sampson
126
79
0
16 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
71
185
0
15 Mar 2018