1507.08240
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding
29 July 2015
Yajie Miao
M. Gowayyed
Florian Metze
Papers citing "EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding"
50 / 264 papers shown
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun Yu
Xiao Ma
Jiawei Ren
Haiyu Zhao
Shuai Yi
26
459
0
18 May 2020
Text Recognition in the Wild: A Survey
Xiaoxue Chen
Lianwen Jin
Yuanzhi Zhu
Canjie Luo
Tianwei Wang
3DV
27
102
0
07 May 2020
Comparing SNNs and RNNs on Neuromorphic Vision Datasets: Similarities and Differences
Weihua He
Yujie Wu
Lei Deng
Guoqi Li
Haoyu Wang
Yang Tian
Wei Ding
Wenhui Wang
Yuan Xie
88
125
0
02 May 2020
Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Hu Hu
Rui Zhao
Jinyu Li
Liang Lu
Jiawei Liu
19
27
0
01 May 2020
Homophone-based Label Smoothing in End-to-End Automatic Speech Recognition
Y. Zheng
Xianjie Yang
Xuyong Dang
6
5
0
07 Apr 2020
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model
Jinyu Li
Rui Zhao
Eric Sun
J. H. M. Wong
Amit Das
Zhong Meng
Jiawei Liu
VLM
24
24
0
17 Mar 2020
Towards Real-time Mispronunciation Detection in Kids' Speech
Peter William VanHarn Plantinga
Eric Fosler-Lussier
13
9
0
03 Mar 2020
Towards Zero-shot Learning for Automatic Phonemic Transcription
Xinjian Li
Siddharth Dalmia
David R. Mortensen
Juncheng Li
A. Black
Florian Metze
6
29
0
26 Feb 2020
Attentional Speech Recognition Models Misbehave on Out-of-domain Utterances
Phillip Keung
Wei Niu
Y. Lu
Julian Salazar
Vikas Bhardwaj
22
9
0
12 Feb 2020
Gated Graph Recurrent Neural Networks
Luana Ruiz
Fernando Gama
Alejandro Ribeiro
GNN
26
139
0
03 Feb 2020
Unsupervised Pre-training of Bidirectional Speech Encoders via Masked Reconstruction
Weiran Wang
Qingming Tang
Karen Livescu
SSL
4
98
0
28 Jan 2020
Data Techniques For Online End-to-end Speech Recognition
Yang Chen
Weiran Wang
I-Fan Chen
Chao Wang
11
4
0
24 Jan 2020
Semi-supervised ASR by End-to-end Self-training
Yang Chen
Weiran Wang
Chao Wang
14
53
0
24 Jan 2020
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Shaoshi Ling
Yuzong Liu
Julian Salazar
Katrin Kirchhoff
SSL
16
139
0
03 Dec 2019
CAT: CRF-based ASR Toolkit
Keyu An
Hongyu Xiang
Zhijian Ou
6
7
0
20 Nov 2019
Boosting LSTM Performance Through Dynamic Precision Selection
Franyell Silfa
J. Arnau
Antonio González
MQ
13
5
0
07 Nov 2019
SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural Network
R. Yazdani
Olatunji Ruwase
Minjia Zhang
Yuxiong He
J. Arnau
Antonio González
19
4
0
04 Nov 2019
Towards Online End-to-end Transformer Automatic Speech Recognition
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
22
32
0
25 Oct 2019
A practical two-stage training strategy for multi-stream end-to-end speech recognition
Ruizhi Li
Gregory Sell
Xiaofei Wang
Shinji Watanabe
H. Hermansky
14
7
0
23 Oct 2019
End-to-End Speech Recognition: A review for the French Language
Florian Boyer
Jean-Luc Rouas
AI4TS
17
10
0
18 Oct 2019
Transformer ASR with Contextual Block Processing
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
59
64
0
16 Oct 2019
Convolutional Neural Networks for Speech Controlled Prosthetic Hands
Yichao Fu
Runlong Su
BDL
16
5
0
03 Oct 2019
End-to-End Code-Switching ASR for Low-Resourced Language Pairs
Xianghu Yue
Grandee Lee
Emre Yilmaz
Fang Deng
Haizhou Li
9
30
0
27 Sep 2019
Improving RNN Transducer Modeling for End-to-End Speech Recognition
Jinyu Li
Rui Zhao
Hu Hu
Jiawei Liu
16
170
0
26 Sep 2019
DARTS: Dialectal Arabic Transcription System
Sameer Khurana
Ahmed M. Ali
James R. Glass
14
11
0
26 Sep 2019
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR
Hirofumi Inaguma
Masato Mimura
S. Sakai
Tatsuya Kawahara
23
5
0
22 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Yiming Wang
Tongfei Chen
Hainan Xu
Shuoyang Ding
Hang Lv
Yiwen Shao
Nanyun Peng
Lei Xie
Shinji Watanabe
Sanjeev Khudanpur
VLM
19
73
0
18 Sep 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Pavel Denisov
Ngoc Thang Vu
12
27
0
13 Aug 2019
Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition
S. Dey
P. Motlícek
Trung H. Bui
Franck Dernoncourt
6
13
0
08 Aug 2019
SANTLR: Speech Annotation Toolkit for Low Resource Languages
Xinjian Li
Zhong Zhou
Siddharth Dalmia
A. Black
Florian Metze
13
5
0
02 Aug 2019
Multilingual Speech Recognition with Corpus Relatedness Sampling
Xinjian Li
Siddharth Dalmia
A. Black
Florian Metze
14
17
0
02 Aug 2019
Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement
Alzahra Badi
Sangwook Park
D. Han
Hanseok Ko
8
6
0
26 Jul 2019
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
Yonatan Belinkov
Ahmed M. Ali
James R. Glass
28
32
0
09 Jul 2019
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Suyoun Kim
Siddharth Dalmia
Florian Metze
10
23
0
27 Jun 2019
End-to-End ASR for Code-switched Hindi-English Speech
B. M. L. Srivastava
Basil Abraham
Sunayana Sitaram
Rupeshkumar Mehta
P. Jyothi
11
2
0
22 Jun 2019
Multimodal Abstractive Summarization for How2 Videos
Shruti Palaskar
Jindrich Libovický
Spandana Gella
Florian Metze
16
95
0
19 Jun 2019
Multi-Stream End-to-End Speech Recognition
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Shinji Watanabe
Takaaki Hori
H. Hermansky
14
20
0
17 Jun 2019
Deep Learning-Based Automatic Downbeat Tracking: A Brief Review
Bijue Jia
Jiancheng Lv
Dayiheng Liu
27
28
0
10 Jun 2019
Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation
Byung Hoon Ahn
Prannoy Pilligundla
H. Esmaeilzadeh
13
20
0
30 May 2019
Sampling from Stochastic Finite Automata with Applications to CTC Decoding
Martin Jansche
Alexander Gutkin
6
2
0
21 May 2019
Acoustic-to-Word Models with Conversational Context Information
Suyoun Kim
Florian Metze
22
7
0
21 May 2019
End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
E. Tsunoo
Yosuke Kashiwagi
S. Asakawa
Toshiyuki Kumakura
16
4
0
17 May 2019
A Hardware-Oriented and Memory-Efficient Method for CTC Decoding
Siyuan Lu
Jinming Lu
Jun Lin
Zhongfeng Wang
11
5
0
08 May 2019
Reinterpreting CTC training as iterative fitting
Hongzhu Li
Weiqiang Wang
14
1
0
24 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
16
46
0
17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
20
25
0
16 Apr 2019
Performance Monitoring for End-to-End Speech Recognition
Ruizhi Li
Gregory Sell
H. Hermansky
10
2
0
09 Apr 2019
Learning Shared Encoding Representation for End-to-End Speech Recognition Models
T. Nguyen
Sebastian Stüker
A. Waibel
14
2
0
31 Mar 2019
Attention-Augmented End-to-End Multi-Task Learning for Emotion Prediction from Speech
Zixing Zhang
Bingwen Wu
Bjoern Schuller
11
83
0
29 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
14
15
0
27 Mar 2019