ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.00694
  4. Cited By
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

1 December 2016
Song Han
Junlong Kang
Huizi Mao
Yiming Hu
Xin Li
Yubin Li
Dongliang Xie
Hong Luo
Song Yao
Yu Wang
Huazhong Yang
W. Dally
ArXivPDFHTML

Papers citing "ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA"

50 / 74 papers shown
Title
Event-Based Eye Tracking. 2025 Event-based Vision Workshop
Event-Based Eye Tracking. 2025 Event-based Vision Workshop
Qinyu Chen
Chang Gao
Min Liu
Daniele Perrone
Yan Ru Pei
...
Hoang M. Truong
Vinh-Thuan Ly
Huy G. Tran
Thuan-Phat Nguyen
Tram T. Doan
46
1
0
25 Apr 2025
Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey
Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey
Haoyang Wang
Ruishan Guo
Pengtao Ma
Ciyu Ruan
Xinyu Luo
Wenhua Ding
Tianyang Zhong
Jingao Xu
Yunhao Liu
Xinlei Chen
55
0
0
29 Mar 2025
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
45
0
0
01 Nov 2024
DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm$^2$ Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-Distortion
DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm2^22 Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-Distortion
Ang Li
Haolin Wu
Yizhuo Wu
Qinyu Chen
Leo C. N. de Vreede
Chang Gao
24
0
0
15 Oct 2024
Learning-Based Heavy Hitters and Flow Frequency Estimation in Streams
Learning-Based Heavy Hitters and Flow Frequency Estimation in Streams
Rana Shahout
Michael Mitzenmacher
25
2
0
24 Jun 2024
Graph Expansion in Pruned Recurrent Neural Network Layers Preserve
  Performance
Graph Expansion in Pruned Recurrent Neural Network Layers Preserve Performance
Suryam Arnav Kalra
Arindam Biswas
Pabitra Mitra
Biswajit Basu
GNN
46
0
0
17 Mar 2024
STen: Productive and Efficient Sparsity in PyTorch
STen: Productive and Efficient Sparsity in PyTorch
Andrei Ivanov
Nikoli Dryden
Tal Ben-Nun
Saleh Ashkboos
Torsten Hoefler
36
4
0
15 Apr 2023
Workload-Balanced Pruning for Sparse Spiking Neural Networks
Workload-Balanced Pruning for Sparse Spiking Neural Networks
Ruokai Yin
Youngeun Kim
Yuhang Li
Abhishek Moitra
Nitin Satpute
Anna Hambitzer
Priyadarshini Panda
37
19
0
13 Feb 2023
Algorithm and Hardware Co-Design of Energy-Efficient LSTM Networks for
  Video Recognition with Hierarchical Tucker Tensor Decomposition
Algorithm and Hardware Co-Design of Energy-Efficient LSTM Networks for Video Recognition with Hierarchical Tucker Tensor Decomposition
Yu Gong
Miao Yin
Lingyi Huang
Chunhua Deng
Yang Sui
Bo Yuan
24
6
0
05 Dec 2022
Towards Stable Co-saliency Detection and Object Co-segmentation
Towards Stable Co-saliency Detection and Object Co-segmentation
Bo Li
Lv Tang
Senyun Kuang
Mofei Song
Shouhong Ding
36
14
0
25 Sep 2022
Accelerating Neural Network Inference with Processing-in-DRAM: From the
  Edge to the Cloud
Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud
Geraldo F. Oliveira
Juan Gómez Luna
Saugata Ghose
Amirali Boroumand
O. Mutlu
29
24
0
19 Sep 2022
Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space
  Exploration Tool for FPGA High-Level Synthesis
Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space Exploration Tool for FPGA High-Level Synthesis
Mang Yu
Sitao Huang
Deming Chen
21
9
0
03 Jul 2022
Enabling All In-Edge Deep Learning: A Literature Review
Enabling All In-Edge Deep Learning: A Literature Review
Praveen Joshi
Mohammed Hasanuzzaman
Chandra Thapa
Haithem Afli
T. Scully
48
22
0
07 Apr 2022
Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core
  Aware Weight Pruning
Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning
Guyue Huang
Haoran Li
Minghai Qin
Fei Sun
Yufei Din
Yuan Xie
38
18
0
09 Mar 2022
Vau da muntanialas: Energy-efficient multi-die scalable acceleration of
  RNN inference
Vau da muntanialas: Energy-efficient multi-die scalable acceleration of RNN inference
G. Paulin
Francesco Conti
Lukas Cavigelli
Luca Benini
26
8
0
14 Feb 2022
Mixture-of-Rookies: Saving DNN Computations by Predicting ReLU Outputs
Mixture-of-Rookies: Saving DNN Computations by Predicting ReLU Outputs
D. Pinto
J. Arnau
Antonio González
39
1
0
10 Feb 2022
Google Neural Network Models for Edge Devices: Analyzing and Mitigating
  Machine Learning Inference Bottlenecks
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
25
82
0
29 Sep 2021
Efficient Non-linear Calculators
Efficient Non-linear Calculators
Adedamola Wuraola
N. Patel
14
0
0
26 Sep 2021
Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting
  Spatio-Temporal Sparsity
Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-Temporal Sparsity
Chang Gao
T. Delbruck
Shih-Chii Liu
21
44
0
04 Aug 2021
S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN
  Acceleration
S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN Acceleration
Zhi-Gang Liu
P. Whatmough
Yuhao Zhu
Matthew Mattina
MQ
19
75
0
16 Jul 2021
Trustworthy AI: A Computational Perspective
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
104
196
0
12 Jul 2021
Accelerating Recurrent Neural Networks for Gravitational Wave
  Experiments
Accelerating Recurrent Neural Networks for Gravitational Wave Experiments
Zhiqiang Que
Erwei Wang
Umar Marikar
Eric A. Moreno
J. Ngadiuba
...
Vladimir Loncar
S. Summers
M. Pierini
P. Cheung
Wayne Luk
13
25
0
26 Jun 2021
Dual-side Sparse Tensor Core
Dual-side Sparse Tensor Core
Yang-Feng Wang
Chen Zhang
Zhiqiang Xie
Cong Guo
Yunxin Liu
Jingwen Leng
25
75
0
20 May 2021
Enabling Design Methodologies and Future Trends for Edge AI:
  Specialization and Co-design
Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design
Cong Hao
Jordan Dotzel
Jinjun Xiong
Luca Benini
Zhiru Zhang
Deming Chen
58
34
0
25 Mar 2021
Compacting Deep Neural Networks for Internet of Things: Methods and
  Applications
Compacting Deep Neural Networks for Internet of Things: Methods and Applications
Ke Zhang
Hanbo Ying
Hongning Dai
Lin Li
Yuangyuang Peng
Keyi Guo
Hongfang Yu
21
38
0
20 Mar 2021
BRDS: An FPGA-based LSTM Accelerator with Row-Balanced Dual-Ratio
  Sparsification
BRDS: An FPGA-based LSTM Accelerator with Row-Balanced Dual-Ratio Sparsification
Seyed Abolfazl Ghasemzadeh
E. Tavakoli
M. Kamal
A. Afzali-Kusha
Massoud Pedram
24
13
0
07 Jan 2021
Parallel Blockwise Knowledge Distillation for Deep Neural Network
  Compression
Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression
Cody Blakeney
Xiaomin Li
Yan Yan
Ziliang Zong
53
40
0
05 Dec 2020
Bringing AI To Edge: From Deep Learning's Perspective
Bringing AI To Edge: From Deep Learning's Perspective
Di Liu
Hao Kong
Xiangzhong Luo
Weichen Liu
Ravi Subramaniam
52
116
0
25 Nov 2020
Auto Graph Encoder-Decoder for Neural Network Pruning
Auto Graph Encoder-Decoder for Neural Network Pruning
Sixing Yu
Arya Mazaheri
Ali Jannesari
GNN
27
38
0
25 Nov 2020
TUTOR: Training Neural Networks Using Decision Rules as Model Priors
TUTOR: Training Neural Networks Using Decision Rules as Model Priors
Shayan Hassantabar
Prerit Terway
N. Jha
36
10
0
12 Oct 2020
CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors
  and Efficient Neural Networks
CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors and Efficient Neural Networks
Shayan Hassantabar
Novati Stefano
Vishweshwar Ghanakota
A. Ferrari
G. Nicola
R. Bruno
I. Marino
Kenza Hamidouche
N. Jha
18
69
0
20 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML
  Models: A Survey and Insights
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
64
82
0
02 Jul 2020
CSB-RNN: A Faster-than-Realtime RNN Acceleration Framework with
  Compressed Structured Blocks
CSB-RNN: A Faster-than-Realtime RNN Acceleration Framework with Compressed Structured Blocks
Runbin Shi
Peiyan Dong
Tong Geng
Yuhao Ding
Xiaolong Ma
Hayden Kwok-Hay So
Martin C. Herbordt
Ang Li
Yanzhi Wang
MQ
18
13
0
11 May 2020
FlexSA: Flexible Systolic Array Architecture for Efficient Pruned DNN
  Model Training
FlexSA: Flexible Systolic Array Architecture for Efficient Pruned DNN Model Training
Sangkug Lym
M. Erez
21
25
0
27 Apr 2020
Computation on Sparse Neural Networks: an Inspiration for Future
  Hardware
Computation on Sparse Neural Networks: an Inspiration for Future Hardware
Fei Sun
Minghai Qin
Tianyun Zhang
Liu Liu
Yen-kuang Chen
Yuan Xie
37
7
0
24 Apr 2020
CoCoPIE: Making Mobile AI Sweet As PIE --Compression-Compilation
  Co-Design Goes a Long Way
CoCoPIE: Making Mobile AI Sweet As PIE --Compression-Compilation Co-Design Goes a Long Way
Shaoshan Liu
Bin Ren
Xipeng Shen
Yanzhi Wang
17
18
0
14 Mar 2020
Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision
  Applications
Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications
Chinthaka Gamanayake
Lahiru Jayasinghe
Benny Kai Kiat Ng
Chau Yuen
VLM
23
45
0
05 Mar 2020
A$^3$: Accelerating Attention Mechanisms in Neural Networks with
  Approximation
A3^33: Accelerating Attention Mechanisms in Neural Networks with Approximation
Tae Jun Ham
Sungjun Jung
Seonghak Kim
Young H. Oh
Yeonhong Park
...
Jung-Hun Park
Sanghee Lee
Kyoung Park
Jae W. Lee
D. Jeong
24
214
0
22 Feb 2020
Taurus: A Data Plane Architecture for Per-Packet ML
Taurus: A Data Plane Architecture for Per-Packet ML
Tushar Swamy
Alexander Rucker
M. Shahbaz
Ishan Gaur
K. Olukotun
23
82
0
12 Feb 2020
Activation Density driven Energy-Efficient Pruning in Training
Activation Density driven Energy-Efficient Pruning in Training
Timothy Foldy-Porto
Yeshwanth Venkatesha
Priyadarshini Panda
10
4
0
07 Feb 2020
BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted
  Regularization Method
BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method
Xiaolong Ma
Zechao Li
Yifan Gong
Tianyun Zhang
Wei Niu
...
Pu Zhao
Jian Tang
X. Lin
Bin Ren
Yanzhi Wang
28
14
0
23 Jan 2020
NAIS: Neural Architecture and Implementation Search and its Applications
  in Autonomous Driving
NAIS: Neural Architecture and Implementation Search and its Applications in Autonomous Driving
Cong Hao
Yao Chen
Xinheng Liu
A. Sarwari
Daryl Sew
...
Dongdong Fu
Jinjun Xiong
Wen-mei W. Hwu
Junli Gu
Deming Chen
3DV
22
21
0
18 Nov 2019
ELSA: A Throughput-Optimized Design of an LSTM Accelerator for
  Energy-Constrained Devices
ELSA: A Throughput-Optimized Design of an LSTM Accelerator for Energy-Constrained Devices
E. Azari
S. Vrudhula
12
5
0
19 Oct 2019
DiabDeep: Pervasive Diabetes Diagnosis based on Wearable Medical Sensors
  and Efficient Neural Networks
DiabDeep: Pervasive Diabetes Diagnosis based on Wearable Medical Sensors and Efficient Neural Networks
Hongxu Yin
Bilal Mukadam
Xiaoliang Dai
N. Jha
34
47
0
11 Oct 2019
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object
  Detection on FPGAs
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs
Caiwen Ding
Shuo Wang
Ning Liu
Kaidi Xu
Yanzhi Wang
Yun Liang
MQ
24
89
0
29 Sep 2019
A Data-Center FPGA Acceleration Platform for Convolutional Neural
  Networks
A Data-Center FPGA Acceleration Platform for Convolutional Neural Networks
Xiaoyu Yu
Yuwei Wang
Jie Miao
Ephrem Wu
Heng Zhang
Yu Meng
Bo Zhang
Biao Min
Dewei Chen
Jianlin Gao
33
21
0
17 Sep 2019
Convergence of Edge Computing and Deep Learning: A Comprehensive Survey
Convergence of Edge Computing and Deep Learning: A Comprehensive Survey
Xiaofei Wang
Yiwen Han
Victor C. M. Leung
Dusit Niyato
Xueqiang Yan
Xu Chen
17
977
0
19 Jul 2019
On improving deep learning generalization with adaptive sparse
  connectivity
On improving deep learning generalization with adaptive sparse connectivity
Shiwei Liu
Decebal Constantin Mocanu
Mykola Pechenizkiy
ODL
20
7
0
27 Jun 2019
OpenEI: An Open Framework for Edge Intelligence
OpenEI: An Open Framework for Edge Intelligence
Xingzhou Zhang
Yifan Wang
Sidi Lu
Liangkai Liu
Lanyu Xu
Weisong Shi
29
101
0
05 Jun 2019
Approximate LSTMs for Time-Constrained Inference: Enabling Fast Reaction
  in Self-Driving Cars
Approximate LSTMs for Time-Constrained Inference: Enabling Fast Reaction in Self-Driving Cars
Alexandros Kouris
Stylianos I. Venieris
Michail Rizakis
C. Bouganis
AI4TS
19
12
0
02 May 2019
12
Next