EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding

29 July 2015
Yajie Miao
M. Gowayyed
Florian Metze

Papers citing "EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding"

50 / 264 papers shown
Streaming parallel transducer beam search with fast-slow cascaded encoders
Jay Mahadeokar
Yangyang Shi
Ke Li
Duc Le
Jiedan Zhu
Vikas Chandra
Ozlem Kalinli
M. Seltzer
71
16
0
29 Mar 2022
Dynamic Latency for CTC-Based Streaming Automatic Speech Recognition With Emformer
J. Sun
Guiping Zhong
Dinghao Zhou
Baoxiang Li
108
0
0
29 Mar 2022
Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition
J. Sun
Guiping Zhong
Dinghao Zhou
Baoxiang Li
Yiran Zhong
63
7
0
29 Mar 2022
Saving RNN Computations with a Neuron-Level Fuzzy Memoization Scheme
Franyell Silfa
J. Arnau
Antonio González
34
1
0
14 Feb 2022
Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding
Peter Sullivan
Toshiko Shibano
Muhammad Abdul-Mageed
78
11
0
10 Feb 2022
Star Temporal Classification: Sequence Classification with Partially Labeled Data
Vineel Pratap
Awni Y. Hannun
Gabriel Synnaeve
R. Collobert
56
8
0
28 Jan 2022
LiteLSTM Architecture for Deep Recurrent Neural Networks
Nelly Elsayed
Zag ElSayed
Anthony Maida
72
5
0
27 Jan 2022
Large-Scale Inventory Optimization: A Recurrent-Neural-Networks-Inspired Simulation Approach
T. Wan
L. Hong
26
11
0
15 Jan 2022
A Survey on Adversarial Attacks for Malware Analysis
Kshitiz Aryal
Maanak Gupta
Mahmoud Abdelsalam
AAML
106
53
0
16 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
170
379
0
02 Nov 2021
Speech Emotion Recognition Using Quaternion Convolutional Neural Networks
Aneesh Muppidi
Martin H. Radfar
62
47
0
31 Oct 2021
Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition
Chak-Fai Li
Francis Keith
William Hartmann
M. Snover
SSL
23
2
0
29 Oct 2021
A Unified Speaker Adaptation Approach for ASR
Yingzhu Zhao
Chongjia Ni
C. Leung
Shafiq Joty
Chng Eng Siong
B. Ma
CLL
112
9
0
16 Oct 2021
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
136
231
0
07 Oct 2021
CTC Variations Through New WFST Topologies
A. Laptev
Somshubra Majumdar
Boris Ginsburg
89
22
0
06 Oct 2021
Differentiable Allophone Graphs for Language-Universal Speech Recognition
Brian Yan
Siddharth Dalmia
David R. Mortensen
Florian Metze
Shinji Watanabe
63
11
0
24 Jul 2021
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning
Tomohiro Tanaka
Ryo Masumura
Mana Ihori
Akihiko Takashima
Shota Orihashi
Naoki Makishima
43
4
0
07 Jul 2021
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition
Tomohiro Tanaka
Ryo Masumura
Mana Ihori
Akihiko Takashima
Takafumi Moriya
Takanori Ashihara
Shota Orihashi
Naoki Makishima
42
7
0
04 Jul 2021
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis
Shammur A. Chowdhury
Nadir Durrani
Ahmed M. Ali
118
16
0
01 Jul 2021
Multi-mode Transformer Transducer with Stochastic Future Context
Kwangyoun Kim
Felix Wu
Prashant Sridhar
Kyu Jeong Han
Shinji Watanabe
71
10
0
17 Jun 2021
Why does CTC result in peaky behavior?
Albert Zeyer
Ralf Schlüter
Hermann Ney
64
46
0
31 May 2021
On Addressing Practical Challenges for RNN-Transducer
Rui Zhao
Jian Xue
Jinyu Li
Wenning Wei
Lei He
Jiawei Liu
72
32
0
27 Apr 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition
Zhichao Wang
Wenwen Yang
Pan Zhou
Wei Chen
RALM
66
18
0
08 Apr 2021
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Yangyang Shi
Varun K. Nagaraja
Chunyang Wu
Jay Mahadeokar
Duc Le
...
Ching-Feng Yeh
Julian Chan
Christian Fuegen
Ozlem Kalinli
M. Seltzer
57
15
0
05 Apr 2021
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting
Ye Yuan
Xinshuo Weng
Yanglan Ou
Kris Kitani
AI4TS
114
461
0
25 Mar 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
Shuai Zhang
RALM
91
52
0
15 Feb 2021
Intermediate Loss Regularization for CTC-based Speech Recognition
Jaesong Lee
Shinji Watanabe
151
140
0
05 Feb 2021
Fine-tuning Handwriting Recognition systems with Temporal Dropout
Edgard Chammas
C. Mokbel
24
3
0
31 Jan 2021
Arabic aspect based sentiment analysis using bidirectional GRU based models
Mohammed Mustafa
T. H. Soliman
A. Taloba
Mohammed Fawzi Seedik
53
82
0
23 Jan 2021
Tiny Transducer: A Highly-efficient Speech Recognition Model on Edge Devices
Yuekai Zhang
Sining Sun
Long Ma
95
29
0
18 Jan 2021
DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Shaoshi Ling
Yuzong Liu
75
107
0
11 Dec 2020
End to End ASR System with Automatic Punctuation Insertion
Yushi Guan
3DV
34
5
0
03 Dec 2020
Disentangling Homophemes in Lip Reading using Perplexity Analysis
Souheil Fenghour
Daqing Chen
Kun Guo
Perry Xiao
41
3
0
28 Nov 2020
Streaming end-to-end multi-talker speech recognition
Liang Lu
Naoyuki Kanda
Jinyu Li
Jiawei Liu
75
44
0
26 Nov 2020
STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning
Prakamya Mishra
42
0
0
23 Nov 2020
WaDeNet: Wavelet Decomposition based CNN for Speech Processing
P. Suresh
Abhijith Ragav
121
0
0
11 Nov 2020
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies
Alexander H. Liu
Yu-An Chung
James R. Glass
SSL
92
88
0
01 Nov 2020
Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages
Anuj Diwan
Preethi Jyothi
39
5
0
19 Oct 2020
E-BATCH: Energy-Efficient and High-Throughput RNN Batching
Franyell Silfa
J. Arnau
Antonio González
40
12
0
22 Sep 2020
Orientation-aware Vehicle Re-identification with Semantics-guided Part Attention Network
Tsai-Shien Chen
Chih-Ting Liu
Chih-Wei Wu
Shao-Yi Chien
3DPC
261
85
0
26 Aug 2020
Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview
P. Bell
Joachim Fainberg
Ondřej Klejch
Jinyu Li
Steve Renals
P. Swietojanski
124
78
0
14 Aug 2020
Modular End-to-end Automatic Speech Recognition Framework for Acoustic-to-word Model
Qi Liu
Zhehuai Chen
Hao Li
Mingkun Huang
Yizhou Lu
Kai Yu
59
7
0
31 Jul 2020
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Jinyu Li
Rui Zhao
Zhong Meng
Yanqing Liu
Wenning Wei
...
V. Mazalov
Zhenghao Wang
Lei He
Sheng Zhao
Jiawei Liu
90
109
0
30 Jul 2020
Fully Convolutional Networks for Continuous Sign Language Recognition
Ka Leong Cheng
Zhaoyang Yang
Qifeng Chen
Yu-Wing Tai
SLR
86
148
0
24 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
114
85
0
02 Jul 2020
Streaming Transformer ASR with Blockwise Synchronous Beam Search
E. Tsunoo
Yosuke Kashiwagi
Shinji Watanabe
148
11
0
25 Jun 2020
A Heuristically Self-Organised Linguistic Attribute Deep Learning in Edge Computing For IoT Intelligence
Hongmei He
Zhenhuan Zhu
16
1
0
08 Jun 2020
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition
Jinyu Li
Yu-Huan Wu
Yashesh Gaur
Chengyi Wang
Rui Zhao
Shujie Liu
73
137
0
28 May 2020
CAT: A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency
Keyu An
Hongyu Xiang
Zhijian Ou
66
20
0
27 May 2020
A systematic comparison of grapheme-based vs. phoneme-based label units for encoder-decoder-attention models
Mohammad Zeineldeen
Albert Zeyer
Wei Zhou
T. Ng
Ralf Schlüter
Hermann Ney
69
2
0
19 May 2020