ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05668
  4. Cited By
Model compression via distillation and quantization

Model compression via distillation and quantization

15 February 2018
A. Polino
Razvan Pascanu
Dan Alistarh
    MQ
ArXivPDFHTML

Papers citing "Model compression via distillation and quantization"

21 / 171 papers shown
Title
Focused Quantization for Sparse CNNs
Focused Quantization for Sparse CNNs
Yiren Zhao
Xitong Gao
Daniel Bates
Robert D. Mullins
Chengzhong Xu
MQ
23
26
0
07 Mar 2019
Copying Machine Learning Classifiers
Copying Machine Learning Classifiers
Irene Unceta
Jordi Nin
O. Pujol
14
18
0
05 Mar 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
33
356
0
18 Feb 2019
Distillation Strategies for Proximal Policy Optimization
Distillation Strategies for Proximal Policy Optimization
Sam Green
C. Vineyard
Ç. Koç
27
8
0
23 Jan 2019
DSConv: Efficient Convolution Operator
DSConv: Efficient Convolution Operator
Marcelo Gennari
Roger Fawcett
V. Prisacariu
MQ
32
62
0
07 Jan 2019
Efficient non-uniform quantizer for quantized neural network targeting
  reconfigurable hardware
Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware
Natan Liss
Chaim Baskin
A. Mendelson
A. Bronstein
Raja Giryes
MQ
29
5
0
27 Nov 2018
Joint Neural Architecture Search and Quantization
Joint Neural Architecture Search and Quantization
Yukang Chen
Gaofeng Meng
Qian Zhang
Xinbang Zhang
Liangchen Song
Shiming Xiang
Chunhong Pan
MQ
30
29
0
23 Nov 2018
Structured Binary Neural Networks for Accurate Image Classification and
  Semantic Segmentation
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
27
152
0
22 Nov 2018
Online Embedding Compression for Text Classification using Low Rank
  Matrix Factorization
Online Embedding Compression for Text Classification using Low Rank Matrix Factorization
Anish Acharya
Rahul Goel
A. Metallinou
Inderjit Dhillon
25
58
0
01 Nov 2018
Relaxed Quantization for Discretized Neural Networks
Relaxed Quantization for Discretized Neural Networks
Christos Louizos
M. Reisser
Tijmen Blankevoort
E. Gavves
Max Welling
MQ
41
132
0
03 Oct 2018
Learning to Quantize Deep Networks by Optimizing Quantization Intervals
  with Task Loss
Learning to Quantize Deep Networks by Optimizing Quantization Intervals with Task Loss
S. Jung
Changyong Son
Seohyung Lee
JinWoo Son
Youngjun Kwak
Jae-Joon Han
Sung Ju Hwang
Changkyu Choi
MQ
25
373
0
17 Aug 2018
A Survey on Methods and Theories of Quantized Neural Networks
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
34
232
0
13 Aug 2018
Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)
Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)
Jungwook Choi
P. Chuang
Zhuo Wang
Swagath Venkataramani
Vijayalakshmi Srinivasan
K. Gopalakrishnan
MQ
19
75
0
17 Jul 2018
Quantizing deep convolutional networks for efficient inference: A
  whitepaper
Quantizing deep convolutional networks for efficient inference: A whitepaper
Raghuraman Krishnamoorthi
MQ
50
997
0
21 Jun 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks
  per Bit?
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
32
135
0
20 Jun 2018
Retraining-Based Iterative Weight Quantization for Deep Neural Networks
Retraining-Based Iterative Weight Quantization for Deep Neural Networks
Dongsoo Lee
Byeongwook Kim
MQ
36
16
0
29 May 2018
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural
  Networks
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
Chaim Baskin
Eli Schwartz
Evgenii Zheltonozhskii
Natan Liss
Raja Giryes
A. Bronstein
A. Mendelson
MQ
22
45
0
29 Apr 2018
Value-aware Quantization for Training and Inference of Neural Networks
Value-aware Quantization for Training and Inference of Neural Networks
Eunhyeok Park
S. Yoo
Peter Vajda
MQ
14
158
0
20 Apr 2018
OpenNMT: Open-Source Toolkit for Neural Machine Translation
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
273
1,896
0
10 Jan 2017
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp
  Minima
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
312
2,896
0
15 Sep 2016
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
223
7,930
0
17 Aug 2015
Previous
1234