ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.05033
  4. Cited By
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit
  Neural Networks

Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks

14 August 2019
Ruihao Gong
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
    MQ
ArXivPDFHTML

Papers citing "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"

49 / 99 papers shown
Title
Bimodal Distributed Binarized Neural Networks
Bimodal Distributed Binarized Neural Networks
T. Rozen
Moshe Kimhi
Brian Chmiel
A. Mendelson
Chaim Baskin
MQ
47
4
0
05 Apr 2022
Standard Deviation-Based Quantization for Deep Neural Networks
Standard Deviation-Based Quantization for Deep Neural Networks
Amir Ardakani
A. Ardakani
B. Meyer
J. Clark
W. Gross
MQ
52
1
0
24 Feb 2022
PokeBNN: A Binary Pursuit of Lightweight Accuracy
PokeBNN: A Binary Pursuit of Lightweight Accuracy
Yichi Zhang
Zhiru Zhang
Lukasz Lew
MQ
35
57
0
30 Nov 2021
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via
  Generalized Straight-Through Estimation
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation
Zechun Liu
Kwang-Ting Cheng
Dong Huang
Eric P. Xing
Zhiqiang Shen
MQ
25
103
0
29 Nov 2021
Sharpness-aware Quantization for Deep Neural Networks
Sharpness-aware Quantization for Deep Neural Networks
Jing Liu
Jianfei Cai
Bohan Zhuang
MQ
27
24
0
24 Nov 2021
IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for
  Zero-Shot Network Quantization
IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization
Mingliang Xu
Mingbao Lin
Gongrui Nan
Jianzhuang Liu
Baochang Zhang
Yonghong Tian
Rongrong Ji
MQ
46
71
0
17 Nov 2021
Arch-Net: Model Distillation for Architecture Agnostic Model Deployment
Arch-Net: Model Distillation for Architecture Agnostic Model Deployment
Weixin Xu
Zipeng Feng
Shuangkang Fang
Song Yuan
Yi Yang
Shuchang Zhou
MQ
27
1
0
01 Nov 2021
Federated Learning via Plurality Vote
Federated Learning via Plurality Vote
Kai Yue
Richeng Jin
Chau-Wai Wong
H. Dai
FedML
24
8
0
06 Oct 2021
8-bit Optimizers via Block-wise Quantization
8-bit Optimizers via Block-wise Quantization
Tim Dettmers
M. Lewis
Sam Shleifer
Luke Zettlemoyer
MQ
34
270
0
06 Oct 2021
CBP: Backpropagation with constraint on weight precision using a
  pseudo-Lagrange multiplier method
CBP: Backpropagation with constraint on weight precision using a pseudo-Lagrange multiplier method
Guhyun Kim
D. Jeong
MQ
47
2
0
06 Oct 2021
Elastic Significant Bit Quantization and Acceleration for Deep Neural
  Networks
Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks
Cheng Gong
Ye Lu
Kunpeng Xie
Zongming Jin
Tao Li
Yanzhi Wang
MQ
27
7
0
08 Sep 2021
AdvDrop: Adversarial Attack to DNNs by Dropping Information
AdvDrop: Adversarial Attack to DNNs by Dropping Information
Ranjie Duan
YueFeng Chen
Dantong Niu
Yun Yang
•. A. K. Qin
Yuan He
AAML
24
89
0
20 Aug 2021
QuPeD: Quantized Personalization via Distillation with Applications to
  Federated Learning
QuPeD: Quantized Personalization via Distillation with Applications to Federated Learning
Kaan Ozkara
Navjot Singh
Deepesh Data
Suhas Diggavi
FedML
MQ
24
56
0
29 Jul 2021
Differentiable Model Compression via Pseudo Quantization Noise
Differentiable Model Compression via Pseudo Quantization Noise
Alexandre Défossez
Yossi Adi
Gabriel Synnaeve
DiffM
MQ
18
47
0
20 Apr 2021
Training Multi-bit Quantized and Binarized Networks with A Learnable
  Symmetric Quantizer
Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer
Phuoc Pham
J. Abraham
Jaeyong Chung
MQ
37
11
0
01 Apr 2021
Invertible Image Signal Processing
Invertible Image Signal Processing
Yazhou Xing
Zian Qian
Qifeng Chen
26
112
0
28 Mar 2021
ReCU: Reviving the Dead Weights in Binary Neural Networks
ReCU: Reviving the Dead Weights in Binary Neural Networks
Zihan Xu
Mingbao Lin
Jianzhuang Liu
Jie Chen
Ling Shao
Yue Gao
Yonghong Tian
Rongrong Ji
MQ
24
81
0
23 Mar 2021
Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural
  Networks by Pruning A Randomly Weighted Network
Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network
James Diffenderfer
B. Kailkhura
MQ
35
75
0
17 Mar 2021
Learnable Companding Quantization for Accurate Low-bit Neural Networks
Learnable Companding Quantization for Accurate Low-bit Neural Networks
Kohei Yamamoto
MQ
36
63
0
12 Mar 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
150
674
0
24 Jan 2021
Direct Quantization for Training Highly Accurate Low Bit-width Deep
  Neural Networks
Direct Quantization for Training Highly Accurate Low Bit-width Deep Neural Networks
Ziquan Liu
Wuguannan Yao
Qiao Li
Antoni B. Chan
MQ
22
9
0
26 Dec 2020
FantastIC4: A Hardware-Software Co-Design Approach for Efficiently
  Running 4bit-Compact Multilayer Perceptrons
FantastIC4: A Hardware-Software Co-Design Approach for Efficiently Running 4bit-Compact Multilayer Perceptrons
Simon Wiedemann
Suhas Shivapakash
P. Wiedemann
Daniel Becking
Wojciech Samek
F. Gerfers
Thomas Wiegand
MQ
23
7
0
17 Dec 2020
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization
  Framework
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Sung-En Chang
Yanyu Li
Mengshu Sun
Runbin Shi
Hayden Kwok-Hay So
Xuehai Qian
Yanzhi Wang
Xue Lin
MQ
20
82
0
08 Dec 2020
A Tiny CNN Architecture for Medical Face Mask Detection for
  Resource-Constrained Endpoints
A Tiny CNN Architecture for Medical Face Mask Detection for Resource-Constrained Endpoints
P. Mohan
A. Paul
Abhay Chirania
CVBM
21
48
0
30 Nov 2020
Training Binary Neural Networks through Learning with Noisy Supervision
Training Binary Neural Networks through Learning with Noisy Supervision
Kai Han
Yunhe Wang
Yixing Xu
Chunjing Xu
Enhua Wu
Chang Xu
MQ
15
55
0
10 Oct 2020
Once Quantization-Aware Training: High Performance Extremely Low-bit
  Architecture Search
Once Quantization-Aware Training: High Performance Extremely Low-bit Architecture Search
Mingzhu Shen
Feng Liang
Ruihao Gong
Yuhang Li
Chuming Li
Chen Lin
F. Yu
Junjie Yan
Wanli Ouyang
MQ
28
36
0
09 Oct 2020
High-Capacity Expert Binary Networks
High-Capacity Expert Binary Networks
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
MQ
27
57
0
07 Oct 2020
Rotated Binary Neural Network
Rotated Binary Neural Network
Mingbao Lin
Rongrong Ji
Zi-Han Xu
Baochang Zhang
Yan Wang
Yongjian Wu
Feiyue Huang
Chia-Wen Lin
MQ
19
129
0
28 Sep 2020
MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network
  Quantization Framework
MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework
Sung-En Chang
Yanyu Li
Mengshu Sun
Weiwen Jiang
Runbin Shi
Xue Lin
Yanzhi Wang
MQ
27
7
0
16 Sep 2020
DualDE: Dually Distilling Knowledge Graph Embedding for Faster and
  Cheaper Reasoning
DualDE: Dually Distilling Knowledge Graph Embedding for Faster and Cheaper Reasoning
Yushan Zhu
Wen Zhang
Mingyang Chen
Hui Chen
Xu-Xin Cheng
Wei Zhang
Huajun Chen Zhejiang University
22
27
0
13 Sep 2020
Transform Quantization for CNN (Convolutional Neural Network)
  Compression
Transform Quantization for CNN (Convolutional Neural Network) Compression
Sean I. Young
Wang Zhe
David S. Taubman
B. Girod
MQ
29
69
0
02 Sep 2020
High Throughput Matrix-Matrix Multiplication between Asymmetric
  Bit-Width Operands
High Throughput Matrix-Matrix Multiplication between Asymmetric Bit-Width Operands
Dibakar Gope
Jesse G. Beu
Matthew Mattina
20
4
0
03 Aug 2020
An Overview of Neural Network Compression
An Overview of Neural Network Compression
James OÑeill
AI4CE
45
98
0
05 Jun 2020
Quantized Neural Networks: Characterization and Holistic Optimization
Quantized Neural Networks: Characterization and Holistic Optimization
Yoonho Boo
Sungho Shin
Wonyong Sung
MQ
42
8
0
31 May 2020
Effective and Efficient Computation with Multiple-timescale Spiking
  Recurrent Neural Networks
Effective and Efficient Computation with Multiple-timescale Spiking Recurrent Neural Networks
Bojian Yin
Federico Corradi
Sander M. Bohté
20
99
0
24 May 2020
Hierarchical Feature Embedding for Attribute Recognition
Hierarchical Feature Embedding for Attribute Recognition
Jie Yang
Jiarou Fan
Yiru Wang
Yige Wang
Weihao Gan
Lin Liu
Wei Wu
CVBM
27
42
0
23 May 2020
SmartExchange: Trading Higher-cost Memory Storage/Access for Lower-cost
  Computation
SmartExchange: Trading Higher-cost Memory Storage/Access for Lower-cost Computation
Yang Katie Zhao
Xiaohan Chen
Yue Wang
Chaojian Li
Haoran You
Y. Fu
Yuan Xie
Zhangyang Wang
Yingyan Lin
MQ
32
43
0
07 May 2020
Binary Neural Networks: A Survey
Binary Neural Networks: A Survey
Haotong Qin
Ruihao Gong
Xianglong Liu
Xiao Bai
Jingkuan Song
N. Sebe
MQ
50
457
0
31 Mar 2020
Training Binary Neural Networks with Real-to-Binary Convolutions
Training Binary Neural Networks with Real-to-Binary Convolutions
Brais Martínez
Jing Yang
Adrian Bulat
Georgios Tzimiropoulos
MQ
17
226
0
25 Mar 2020
Efficient Bitwidth Search for Practical Mixed Precision Neural Network
Efficient Bitwidth Search for Practical Mixed Precision Neural Network
Yuhang Li
Wei Wang
Haoli Bai
Ruihao Gong
Xin Dong
F. Yu
MQ
15
20
0
17 Mar 2020
BinaryDuo: Reducing Gradient Mismatch in Binary Activation Network by
  Coupling Binary Activations
BinaryDuo: Reducing Gradient Mismatch in Binary Activation Network by Coupling Binary Activations
Hyungjun Kim
Kyungsu Kim
Jinseok Kim
Jae-Joon Kim
MQ
24
47
0
16 Feb 2020
Towards Unified INT8 Training for Convolutional Neural Network
Towards Unified INT8 Training for Convolutional Neural Network
Feng Zhu
Ruihao Gong
F. Yu
Xianglong Liu
Yanfei Wang
Zhelong Li
Xiuqi Yang
Junjie Yan
MQ
35
150
0
29 Dec 2019
Towards Efficient Training for Neural Network Quantization
Towards Efficient Training for Neural Network Quantization
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
13
42
0
21 Dec 2019
Adaptive Loss-aware Quantization for Multi-bit Networks
Adaptive Loss-aware Quantization for Multi-bit Networks
Zhongnan Qu
Zimu Zhou
Yun Cheng
Lothar Thiele
MQ
33
53
0
18 Dec 2019
QKD: Quantization-aware Knowledge Distillation
QKD: Quantization-aware Knowledge Distillation
Jangho Kim
Yash Bhalgat
Jinwon Lee
Chirag I. Patel
Nojun Kwak
MQ
18
63
0
28 Nov 2019
Loss Aware Post-training Quantization
Loss Aware Post-training Quantization
Yury Nahshan
Brian Chmiel
Chaim Baskin
Evgenii Zheltonozhskii
Ron Banner
A. Bronstein
A. Mendelson
MQ
28
163
0
17 Nov 2019
Divide and Conquer: Leveraging Intermediate Feature Representations for
  Quantized Training of Neural Networks
Divide and Conquer: Leveraging Intermediate Feature Representations for Quantized Training of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Alex Cloninger
H. Esmaeilzadeh
MQ
23
8
0
14 Jun 2019
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural
  Networks
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
55
68
0
05 Nov 2018
Incremental Network Quantization: Towards Lossless CNNs with
  Low-Precision Weights
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
337
1,049
0
10 Feb 2017
Previous
12