ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.08240
  4. Cited By
EESEN: End-to-End Speech Recognition using Deep RNN Models and
  WFST-based Decoding

EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding

29 July 2015
Yajie Miao
M. Gowayyed
Florian Metze
ArXivPDFHTML

Papers citing "EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding"

50 / 264 papers shown
Title
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory
  Prediction
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun Yu
Xiao Ma
Jiawei Ren
Haiyu Zhao
Shuai Yi
26
459
0
18 May 2020
Text Recognition in the Wild: A Survey
Text Recognition in the Wild: A Survey
Xiaoxue Chen
Lianwen Jin
Yuanzhi Zhu
Canjie Luo
Tianwei Wang
3DV
27
102
0
07 May 2020
Comparing SNNs and RNNs on Neuromorphic Vision Datasets: Similarities
  and Differences
Comparing SNNs and RNNs on Neuromorphic Vision Datasets: Similarities and Differences
Weihua He
Yujie Wu
Lei Deng
Guoqi Li
Haoyu Wang
Yang Tian
Wei Ding
Wenhui Wang
Yuan Xie
88
125
0
02 May 2020
Exploring Pre-training with Alignments for RNN Transducer based
  End-to-End Speech Recognition
Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Hu Hu
Rui Zhao
Jinyu Li
Liang Lu
Jiawei Liu
19
27
0
01 May 2020
Homophone-based Label Smoothing in End-to-End Automatic Speech
  Recognition
Homophone-based Label Smoothing in End-to-End Automatic Speech Recognition
Y. Zheng
Xianjie Yang
Xuyong Dang
6
5
0
07 Apr 2020
High-Accuracy and Low-Latency Speech Recognition with Two-Head
  Contextual Layer Trajectory LSTM Model
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model
Jinyu Li
Rui Zhao
Eric Sun
J. H. M. Wong
Amit Das
Zhong Meng
Jiawei Liu
VLM
24
24
0
17 Mar 2020
Towards Real-time Mispronunciation Detection in Kids' Speech
Towards Real-time Mispronunciation Detection in Kids' Speech
Peter William VanHarn Plantinga
Eric Fosler-Lussier
13
9
0
03 Mar 2020
Towards Zero-shot Learning for Automatic Phonemic Transcription
Towards Zero-shot Learning for Automatic Phonemic Transcription
Xinjian Li
Siddharth Dalmia
David R. Mortensen
Juncheng Li
A. Black
Florian Metze
6
29
0
26 Feb 2020
Attentional Speech Recognition Models Misbehave on Out-of-domain
  Utterances
Attentional Speech Recognition Models Misbehave on Out-of-domain Utterances
Phillip Keung
Wei Niu
Y. Lu
Julian Salazar
Vikas Bhardwaj
22
9
0
12 Feb 2020
Gated Graph Recurrent Neural Networks
Gated Graph Recurrent Neural Networks
Luana Ruiz
Fernando Gama
Alejandro Ribeiro
GNN
26
139
0
03 Feb 2020
Unsupervised Pre-training of Bidirectional Speech Encoders via Masked
  Reconstruction
Unsupervised Pre-training of Bidirectional Speech Encoders via Masked Reconstruction
Weiran Wang
Qingming Tang
Karen Livescu
SSL
4
98
0
28 Jan 2020
Data Techniques For Online End-to-end Speech Recognition
Data Techniques For Online End-to-end Speech Recognition
Yang Chen
Weiran Wang
I-Fan Chen
Chao Wang
11
4
0
24 Jan 2020
Semi-supervised ASR by End-to-end Self-training
Semi-supervised ASR by End-to-end Self-training
Yang Chen
Weiran Wang
Chao Wang
14
53
0
24 Jan 2020
Deep Contextualized Acoustic Representations For Semi-Supervised Speech
  Recognition
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Shaoshi Ling
Yuzong Liu
Julian Salazar
Katrin Kirchhoff
SSL
16
139
0
03 Dec 2019
CAT: CRF-based ASR Toolkit
CAT: CRF-based ASR Toolkit
Keyu An
Hongyu Xiang
Zhijian Ou
6
7
0
20 Nov 2019
Boosting LSTM Performance Through Dynamic Precision Selection
Boosting LSTM Performance Through Dynamic Precision Selection
Franyell Silfa
J. Arnau
Antonio González
MQ
13
5
0
07 Nov 2019
SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural
  Network
SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural Network
R. Yazdani
Olatunji Ruwase
Minjia Zhang
Yuxiong He
J. Arnau
Antonio González
19
4
0
04 Nov 2019
Towards Online End-to-end Transformer Automatic Speech Recognition
Towards Online End-to-end Transformer Automatic Speech Recognition
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
22
32
0
25 Oct 2019
A practical two-stage training strategy for multi-stream end-to-end
  speech recognition
A practical two-stage training strategy for multi-stream end-to-end speech recognition
Ruizhi Li
Gregory Sell
Xiaofei Wang
Shinji Watanabe
H. Hermansky
14
7
0
23 Oct 2019
End-to-End Speech Recognition: A review for the French Language
End-to-End Speech Recognition: A review for the French Language
Florian Boyer
Jean-Luc Rouas
AI4TS
17
10
0
18 Oct 2019
Transformer ASR with Contextual Block Processing
Transformer ASR with Contextual Block Processing
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
59
64
0
16 Oct 2019
Convolutional Neural Networks for Speech Controlled Prosthetic Hands
Convolutional Neural Networks for Speech Controlled Prosthetic Hands
Yichao Fu
Runlong Su
BDL
16
5
0
03 Oct 2019
End-to-End Code-Switching ASR for Low-Resourced Language Pairs
End-to-End Code-Switching ASR for Low-Resourced Language Pairs
Xianghu Yue
Grandee Lee
Emre Yilmaz
Fang Deng
Haizhou Li
9
30
0
27 Sep 2019
Improving RNN Transducer Modeling for End-to-End Speech Recognition
Improving RNN Transducer Modeling for End-to-End Speech Recognition
Jinyu Li
Rui Zhao
Hu Hu
Jiawei Liu
16
170
0
26 Sep 2019
DARTS: Dialectal Arabic Transcription System
DARTS: Dialectal Arabic Transcription System
Sameer Khurana
Ahmed M. Ali
James R. Glass
14
11
0
26 Sep 2019
Improving OOV Detection and Resolution with External Language Models in
  Acoustic-to-Word ASR
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR
Hirofumi Inaguma
Masato Mimura
S. Sakai
Tatsuya Kawahara
23
5
0
22 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Yiming Wang
Tongfei Chen
Hainan Xu
Shuoyang Ding
Hang Lv
Yiwen Shao
Nanyun Peng
Lei Xie
Shinji Watanabe
Sanjeev Khudanpur
VLM
19
73
0
18 Sep 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and
  Transfer Learning
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Pavel Denisov
Ngoc Thang Vu
12
27
0
13 Aug 2019
Exploiting semi-supervised training through a dropout regularization in
  end-to-end speech recognition
Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition
S. Dey
P. Motlícek
Trung H. Bui
Franck Dernoncourt
6
13
0
08 Aug 2019
SANTLR: Speech Annotation Toolkit for Low Resource Languages
SANTLR: Speech Annotation Toolkit for Low Resource Languages
Xinjian Li
Zhong Zhou
Siddharth Dalmia
A. Black
Florian Metze
13
5
0
02 Aug 2019
Multilingual Speech Recognition with Corpus Relatedness Sampling
Multilingual Speech Recognition with Corpus Relatedness Sampling
Xinjian Li
Siddharth Dalmia
A. Black
Florian Metze
14
17
0
02 Aug 2019
Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE)
  for Speech Feature Enhancement
Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement
Alzahra Badi
Sangwook Park
D. Han
Hanseok Ko
8
6
0
26 Jul 2019
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic
  Speech Recognition
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
Yonatan Belinkov
Ahmed M. Ali
James R. Glass
28
32
0
09 Jul 2019
Gated Embeddings in End-to-End Speech Recognition for
  Conversational-Context Fusion
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Suyoun Kim
Siddharth Dalmia
Florian Metze
10
23
0
27 Jun 2019
End-to-End ASR for Code-switched Hindi-English Speech
End-to-End ASR for Code-switched Hindi-English Speech
B. M. L. Srivastava
Basil Abraham
Sunayana Sitaram
Rupeshkumar Mehta
P. Jyothi
11
2
0
22 Jun 2019
Multimodal Abstractive Summarization for How2 Videos
Multimodal Abstractive Summarization for How2 Videos
Shruti Palaskar
Jindrich Libovický
Spandana Gella
Florian Metze
16
95
0
19 Jun 2019
Multi-Stream End-to-End Speech Recognition
Multi-Stream End-to-End Speech Recognition
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Shinji Watanabe
Takaaki Hori
H. Hermansky
14
20
0
17 Jun 2019
Deep Learning-Based Automatic Downbeat Tracking: A Brief Review
Deep Learning-Based Automatic Downbeat Tracking: A Brief Review
Bijue Jia
Jiancheng Lv
Dayiheng Liu
27
28
0
10 Jun 2019
Reinforcement Learning and Adaptive Sampling for Optimized DNN
  Compilation
Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation
Byung Hoon Ahn
Prannoy Pilligundla
H. Esmaeilzadeh
13
20
0
30 May 2019
Sampling from Stochastic Finite Automata with Applications to CTC
  Decoding
Sampling from Stochastic Finite Automata with Applications to CTC Decoding
Martin Jansche
Alexander Gutkin
6
2
0
21 May 2019
Acoustic-to-Word Models with Conversational Context Information
Acoustic-to-Word Models with Conversational Context Information
Suyoun Kim
Florian Metze
22
7
0
21 May 2019
End-to-end Adaptation with Backpropagation through WFST for On-device
  Speech Recognition System
End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
E. Tsunoo
Yosuke Kashiwagi
S. Asakawa
Toshiyuki Kumakura
16
4
0
17 May 2019
A Hardware-Oriented and Memory-Efficient Method for CTC Decoding
A Hardware-Oriented and Memory-Efficient Method for CTC Decoding
Siyuan Lu
Jinming Lu
Jun Lin
Zhongfeng Wang
11
5
0
08 May 2019
Reinterpreting CTC training as iterative fitting
Reinterpreting CTC training as iterative fitting
Hongzhu Li
Weiqiang Wang
14
1
0
24 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and
  Knowledge Distillation
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
16
46
0
17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint
  Embedding and Clustering
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
20
25
0
16 Apr 2019
Performance Monitoring for End-to-End Speech Recognition
Performance Monitoring for End-to-End Speech Recognition
Ruizhi Li
Gregory Sell
H. Hermansky
10
2
0
09 Apr 2019
Learning Shared Encoding Representation for End-to-End Speech
  Recognition Models
Learning Shared Encoding Representation for End-to-End Speech Recognition Models
T. Nguyen
Sebastian Stüker
A. Waibel
14
2
0
31 Mar 2019
Attention-Augmented End-to-End Multi-Task Learning for Emotion
  Prediction from Speech
Attention-Augmented End-to-End Multi-Task Learning for Emotion Prediction from Speech
Zixing Zhang
Bingwen Wu
Bjoern Schuller
11
83
0
29 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End
  Speech Recognition
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
14
15
0
27 Mar 2019
Previous
123456
Next