ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.10176
  4. Cited By
Effective Quantization Methods for Recurrent Neural Networks

Effective Quantization Methods for Recurrent Neural Networks

30 November 2016
Qinyao He
He Wen
Shuchang Zhou
Yuxin Wu
Cong Yao
Xinyu Zhou
Yuheng Zou
    MQ
ArXivPDFHTML

Papers citing "Effective Quantization Methods for Recurrent Neural Networks"

22 / 22 papers shown
Title
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs
Armand Foucault
Franck Mamalet
François Malgouyres
MQ
87
0
0
28 Jan 2025
Histogram-Equalized Quantization for logic-gated Residual Neural Networks
Histogram-Equalized Quantization for logic-gated Residual Neural Networks
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
46
2
0
10 Jan 2025
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural
  Networks
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks
Cheng Gong
Ye Lu
Surong Dai
Deng Qian
Chenkun Du
Tao Li
MQ
32
0
0
07 Apr 2023
Training Integer-Only Deep Recurrent Neural Networks
Training Integer-Only Deep Recurrent Neural Networks
V. Nia
Eyyub Sari
Vanessa Courville
M. Asgharian
MQ
53
2
0
22 Dec 2022
Vau da muntanialas: Energy-efficient multi-die scalable acceleration of
  RNN inference
Vau da muntanialas: Energy-efficient multi-die scalable acceleration of RNN inference
G. Paulin
Francesco Conti
Lukas Cavigelli
Luca Benini
29
8
0
14 Feb 2022
iRNN: Integer-only Recurrent Neural Network
iRNN: Integer-only Recurrent Neural Network
Eyyub Sari
Vanessa Courville
V. Nia
MQ
56
4
0
20 Sep 2021
4-bit Quantization of LSTM-based Speech Recognition Models
4-bit Quantization of LSTM-based Speech Recognition Models
A. Fasoli
Chia-Yu Chen
Mauricio Serrano
Xiao Sun
Naigang Wang
...
Xiaodong Cui
Brian Kingsbury
Wei Zhang
Zoltán Tüske
K. Gopalakrishnan
MQ
26
21
0
27 Aug 2021
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network
  Quantization
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization
Huanrui Yang
Lin Duan
Yiran Chen
Hai Helen Li
MQ
21
64
0
20 Feb 2021
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization
  Framework
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Sung-En Chang
Yanyu Li
Mengshu Sun
Runbin Shi
Hayden Kwok-Hay So
Xuehai Qian
Yanzhi Wang
Xue Lin
MQ
26
83
0
08 Dec 2020
Compression of Deep Learning Models for Text: A Survey
Compression of Deep Learning Models for Text: A Survey
Manish Gupta
Puneet Agrawal
VLM
MedIm
AI4CE
22
115
0
12 Aug 2020
MuBiNN: Multi-Level Binarized Recurrent Neural Network for EEG signal
  Classification
MuBiNN: Multi-Level Binarized Recurrent Neural Network for EEG signal Classification
Seyed Ahmad Mirsalari
Sima Sinaei
M. Salehi
Masoud Daneshtalab
MQ
16
5
0
19 Apr 2020
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM
  Networks
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks
Théodore Bluche
Maël Primet
Thibault Gisselbrecht
ObjD
MQ
28
24
0
25 Feb 2020
Post-Training 4-bit Quantization on Embedding Tables
Post-Training 4-bit Quantization on Embedding Tables
Hui Guan
Andrey Malevich
Jiyan Yang
Jongsoo Park
Hector Yuen
MQ
19
32
0
05 Nov 2019
Fully Quantized Transformer for Machine Translation
Fully Quantized Transformer for Machine Translation
Gabriele Prato
Ella Charlaix
Mehdi Rezagholizadeh
MQ
13
68
0
17 Oct 2019
TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks
TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks
Shubham Jain
S. Gupta
A. Raghunathan
MQ
32
37
0
15 Sep 2019
Compressing RNNs for IoT devices by 15-38x using Kronecker Products
Compressing RNNs for IoT devices by 15-38x using Kronecker Products
Urmish Thakker
Jesse G. Beu
Dibakar Gope
Chu Zhou
Igor Fedorov
Ganesh S. Dasika
Matthew Mattina
27
36
0
07 Jun 2019
Precision Highway for Ultra Low-Precision Quantization
Precision Highway for Ultra Low-Precision Quantization
Eunhyeok Park
Dongyoung Kim
S. Yoo
Peter Vajda
MQ
AI4TS
21
12
0
24 Dec 2018
Joint Neural Architecture Search and Quantization
Joint Neural Architecture Search and Quantization
Yukang Chen
Gaofeng Meng
Qian Zhang
Xinbang Zhang
Liangchen Song
Shiming Xiang
Chunhong Pan
MQ
30
29
0
23 Nov 2018
A Survey on Methods and Theories of Quantized Neural Networks
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
34
232
0
13 Aug 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable
  Precision LSTM Networks on FPGAs
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin
Alessandro Pappalardo
M. M. Ghaffar
Giulio Gambardella
Norbert Wehn
Michaela Blott
19
72
0
11 Jul 2018
Low Precision RNNs: Quantizing RNNs Without Losing Accuracy
Low Precision RNNs: Quantizing RNNs Without Losing Accuracy
Supriya Kapur
Asit K. Mishra
Debbie Marr
MQ
32
26
0
20 Oct 2017
Mixed Precision Training
Mixed Precision Training
Paulius Micikevicius
Sharan Narang
Jonah Alben
G. Diamos
Erich Elsen
...
Boris Ginsburg
Michael Houston
Oleksii Kuchaiev
Ganesh Venkatesh
Hao Wu
90
1,767
0
10 Oct 2017
1