Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1605.04711
Cited By
Ternary Weight Networks
16 May 2016
Fengfu Li
Bin Liu
Xiaoxing Wang
Bo-Wen Zhang
Junchi Yan
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Ternary Weight Networks"
50 / 207 papers shown
Title
The Ecological Footprint of Neural Machine Translation Systems
D. Shterionov
Eva Vanmassenhove
32
3
0
04 Feb 2022
Signing the Supermask: Keep, Hide, Invert
Nils Koster
O. Grothe
Achim Rettinger
28
10
0
31 Jan 2022
TerViT: An Efficient Ternary Vision Transformer
Sheng Xu
Yanjing Li
Teli Ma
Bo-Wen Zeng
Baochang Zhang
Peng Gao
Jinhu Lv
ViT
23
11
0
20 Jan 2022
Resource-Efficient Deep Learning: A Survey on Model-, Arithmetic-, and Implementation-Level Techniques
JunKyu Lee
L. Mukhanov
A. S. Molahosseini
U. Minhas
Yang Hua
Jesus Martinez del Rincon
K. Dichev
Cheol-Ho Hong
Hans Vandierendonck
38
29
0
30 Dec 2021
BMPQ: Bit-Gradient Sensitivity Driven Mixed-Precision Quantization of DNNs from Scratch
Souvik Kundu
Shikai Wang
Qirui Sun
P. Beerel
Massoud Pedram
MQ
26
18
0
24 Dec 2021
Elastic-Link for Binarized Neural Network
Jie Hu
Ziheng Wu
Vince Tan
Zhilin Lu
Mengze Zeng
Enhua Wu
MQ
30
6
0
19 Dec 2021
Neural Network Quantization for Efficient Inference: A Survey
Olivia Weng
MQ
20
22
0
08 Dec 2021
Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech Recognition
Junhao Xu
Jianwei Yu
Shoukang Hu
Xunying Liu
Helen Meng
MQ
22
13
0
29 Nov 2021
Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition
Junhao Xu
Jianwei Yu
Xunying Liu
Helen Meng
MQ
28
10
0
29 Nov 2021
Mixed Precision of Quantization of Transformer Language Models for Speech Recognition
Junhao Xu
Shoukang Hu
Jianwei Yu
Xunying Liu
Helen M. Meng
MQ
40
15
0
29 Nov 2021
Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression
Yuezhou Sun
Wenlong Zhao
Lijun Zhang
Xiao Liu
Hui Guan
Matei A. Zaharia
26
0
0
19 Nov 2021
Iterative Training: Finding Binary Weight Deep Neural Networks with Layer Binarization
Cheng-Chou Lan
MQ
19
0
0
13 Nov 2021
Variability-Aware Training and Self-Tuning of Highly Quantized DNNs for Analog PIM
Zihao Deng
Michael Orshansky
MQ
37
6
0
11 Nov 2021
ILMPQ : An Intra-Layer Multi-Precision Deep Neural Network Quantization framework for FPGA
Sung-En Chang
Yanyu Li
Mengshu Sun
Yanzhi Wang
Xue Lin
MQ
6
1
0
30 Oct 2021
RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise Mixed Schemes and Multiple Precisions
Sung-En Chang
Yanyu Li
Mengshu Sun
Weiwen Jiang
Sijia Liu
Yanzhi Wang
Xue Lin
MQ
8
10
0
30 Oct 2021
Demystifying and Generalizing BinaryConnect
Abhishek Sharma
Yaoliang Yu
Eyyub Sari
Mahdi Zolnouri
V. Nia
MQ
22
8
0
25 Oct 2021
BNAS v2: Learning Architectures for Binary Networks with Empirical Improvements
Dahyun Kim
Kunal Pratap Singh
Jonghyun Choi
MQ
43
7
0
16 Oct 2021
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
Weihan Chen
Peisong Wang
Jian Cheng
MQ
42
61
0
13 Oct 2021
CBP: Backpropagation with constraint on weight precision using a pseudo-Lagrange multiplier method
Guhyun Kim
D. Jeong
MQ
42
2
0
06 Oct 2021
Communication-Efficient Federated Learning with Binary Neural Networks
YuZhi Yang
Zhaoyang Zhang
Qianqian Yang
FedML
24
31
0
05 Oct 2021
Towards Efficient Post-training Quantization of Pre-trained Language Models
Haoli Bai
Lu Hou
Lifeng Shang
Xin Jiang
Irwin King
M. Lyu
MQ
76
47
0
30 Sep 2021
Prune Your Model Before Distill It
Jinhyuk Park
Albert No
VLM
43
27
0
30 Sep 2021
Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition
Marawan Gamal Abdel Hameed
Marzieh S. Tahaei
A. Mosleh
V. Nia
39
25
0
29 Sep 2021
Distribution-sensitive Information Retention for Accurate Binary Neural Network
Haotong Qin
Xiangguo Zhang
Ruihao Gong
Yifu Ding
Yi Xu
Xianglong Liu
MQ
17
84
0
25 Sep 2021
Cluster-Promoting Quantization with Bit-Drop for Minimizing Network Quantization Loss
J. H. Lee
Jihun Yun
Sung Ju Hwang
Eunho Yang
MQ
20
14
0
05 Sep 2021
Distance-aware Quantization
Dohyung Kim
Junghyup Lee
Bumsub Ham
MQ
15
28
0
16 Aug 2021
Static analysis of ReLU neural networks with tropical polyhedra
Eric Goubault
Sébastien Palumby
S. Putot
Louis Rustenholz
S. Sankaranarayanan
15
7
0
30 Jul 2021
Adaptive Precision Training (AdaPT): A dynamic fixed point quantized training approach for DNNs
Lorenz Kummer
Kevin Sidak
Tabea Reichmann
Wilfried Gansterer
MQ
19
5
0
28 Jul 2021
Pruning Ternary Quantization
Danyang Liu
Xiangshan Chen
Jie Fu
Chen-li Ma
Xue Liu
MQ
39
0
0
23 Jul 2021
A High-Performance Adaptive Quantization Approach for Edge CNN Applications
Hsu-Hsun Chin
R. Tsay
Hsin-I Wu
MQ
14
5
0
18 Jul 2021
Model compression as constrained optimization, with application to neural nets. Part V: combining compressions
Miguel Á. Carreira-Perpiñán
Yerlan Idelbayev
25
6
0
09 Jul 2021
S
3
S^3
S
3
: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks
Xinlin Li
Bang Liu
Yaoliang Yu
Wulong Liu
Chunjing Xu
V. Nia
MQ
32
5
0
07 Jul 2021
Content-Aware Convolutional Neural Networks
Yong Guo
Yaofo Chen
Mingkui Tan
Kui Jia
Jian Chen
Jingdong Wang
36
8
0
30 Jun 2021
Reward-Based 1-bit Compressed Federated Distillation on Blockchain
Leon Witt
Usama Zafar
KuoYeh Shen
Felix Sattler
Dan Li
Wojciech Samek
FedML
26
4
0
27 Jun 2021
An Empirical Investigation into Deep and Shallow Rule Learning
Florian Beck
Johannes Furnkranz
NAI
18
7
0
18 Jun 2021
Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration
Qigong Sun
Xiufang Li
Fanhua Shang
Hongying Liu
Kan Yang
L. Jiao
Zhouchen Lin
MQ
31
0
0
18 Jun 2021
Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Gaurav Menghani
VLM
MedIm
23
365
0
16 Jun 2021
Efficient Micro-Structured Weight Unification and Pruning for Neural Network Compression
Sheng Lin
Wei Jiang
Wei Wang
Kaidi Xu
Yanzhi Wang
Shan Liu
Songnan Li
13
1
0
15 Jun 2021
Is In-Domain Data Really Needed? A Pilot Study on Cross-Domain Calibration for Network Quantization
Haichao Yu
Linjie Yang
Humphrey Shi
OOD
MQ
24
5
0
16 May 2021
3U-EdgeAI: Ultra-Low Memory Training, Ultra-Low BitwidthQuantization, and Ultra-Low Latency Acceleration
Yao Chen
Cole Hawkins
Kaiqi Zhang
Zheng-Wei Zhang
Cong Hao
18
8
0
11 May 2021
Binarized Weight Error Networks With a Transition Regularization Term
Savas Ozkan
G. Akar
MQ
26
1
0
09 May 2021
3D Scene Compression through Entropy Penalized Neural Representation Functions
Thomas Bird
Johannes Ballé
Saurabh Singh
P. Chou
35
30
0
26 Apr 2021
Quantization of Deep Neural Networks for Accurate Edge Computing
Wentao Chen
Hailong Qiu
Zhuang Jian
Chutong Zhang
Yu Hu
Qing Lu
Tianchen Wang
Yiyu Shi
Meiping Huang
Xiaowe Xu
44
21
0
25 Apr 2021
Differentiable Model Compression via Pseudo Quantization Noise
Alexandre Défossez
Yossi Adi
Gabriel Synnaeve
DiffM
MQ
15
47
0
20 Apr 2021
Towards End-to-End Neural Face Authentication in the Wild -- Quantifying and Compensating for Directional Lighting Effects
Viktor Varkarakis
Wang Yao
Peter Corcoran
CVBM
21
0
0
08 Apr 2021
NullaNet Tiny: Ultra-low-latency DNN Inference Through Fixed-function Combinational Logic
M. Nazemi
A. Fayyazi
Amirhossein Esmaili
Atharva Khare
Soheil Nazar Shahsavani
Massoud Pedram
22
13
0
07 Apr 2021
GPU Domain Specialization via Composable On-Package Architecture
Yaosheng Fu
Evgeny Bolotin
Niladrish Chatterjee
D. Nellans
S. Keckler
12
12
0
05 Apr 2021
Network Quantization with Element-wise Gradient Scaling
Junghyup Lee
Dohyung Kim
Bumsub Ham
MQ
10
115
0
02 Apr 2021
Charged particle tracking via edge-classifying interaction networks
G. Dezoort
S. Thais
Javier Mauricio Duarte
Vesal Razavimaleki
M. Atkinson
I. Ojalvo
Mark S. Neubauer
P. Elmer
25
46
0
30 Mar 2021
RCT: Resource Constrained Training for Edge AI
Tian Huang
Tao Luo
Ming Yan
Qiufeng Wang
Rick Siow Mong Goh
25
8
0
26 Mar 2021
Previous
1
2
3
4
5
Next