v1v2v3 (latest)

EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding

29 July 2015

Papers citing "EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding"

50 / 264 papers shown

Title
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction Cunjun Yu Xiao Ma Jiawei Ren Haiyu Zhao Shuai Yi 90 475 0 18 May 2020
Text Recognition in the Wild: A Survey Xiaoxue Chen Lianwen Jin Yuanzhi Zhu Canjie Luo Tianwei Wang 3DV 128 105 0 07 May 2020
Comparing SNNs and RNNs on Neuromorphic Vision Datasets: Similarities and Differences Weihua He Yujie Wu Lei Deng Guoqi Li Haoyu Wang Yang Tian Wei Ding Wenhui Wang Yuan Xie 135 130 0 02 May 2020
Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition Hu Hu Rui Zhao Jinyu Li Liang Lu Jiawei Liu 65 27 0 01 May 2020
Homophone-based Label Smoothing in End-to-End Automatic Speech Recognition Y. Zheng Xianjie Yang Xuyong Dang 20 5 0 07 Apr 2020
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model Jinyu Li Rui Zhao Eric Sun J. H. M. Wong Amit Das Zhong Meng Jiawei Liu VLM 70 25 0 17 Mar 2020
Towards Real-time Mispronunciation Detection in Kids' Speech Peter William VanHarn Plantinga Eric Fosler-Lussier 42 9 0 03 Mar 2020
Towards Zero-shot Learning for Automatic Phonemic Transcription Xinjian Li Siddharth Dalmia David R. Mortensen Juncheng Li A. Black Florian Metze 63 30 0 26 Feb 2020
Attentional Speech Recognition Models Misbehave on Out-of-domain Utterances Phillip Keung Wei Niu Y. Lu Julian Salazar Vikas Bhardwaj 72 9 0 12 Feb 2020
Gated Graph Recurrent Neural Networks Luana Ruiz Fernando Gama Alejandro Ribeiro GNN 118 146 0 03 Feb 2020
Unsupervised Pre-training of Bidirectional Speech Encoders via Masked Reconstruction Weiran Wang Qingming Tang Karen Livescu SSL 84 98 0 28 Jan 2020
Data Techniques For Online End-to-end Speech Recognition Yang Chen Weiran Wang I-Fan Chen Chao Wang 35 4 0 24 Jan 2020
Semi-supervised ASR by End-to-end Self-training Yang Chen Weiran Wang Chao Wang 72 53 0 24 Jan 2020
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition Shaoshi Ling Yuzong Liu Julian Salazar Katrin Kirchhoff SSL 86 139 0 03 Dec 2019
CAT: CRF-based ASR Toolkit Keyu An Hongyu Xiang Zhijian Ou 34 7 0 20 Nov 2019
Boosting LSTM Performance Through Dynamic Precision Selection Franyell Silfa J. Arnau Antonio González MQ 28 5 0 07 Nov 2019
SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural Network R. Yazdani Olatunji Ruwase Minjia Zhang Yuxiong He J. Arnau Antonio González 55 5 0 04 Nov 2019
Towards Online End-to-end Transformer Automatic Speech Recognition E. Tsunoo Yosuke Kashiwagi Toshiyuki Kumakura Shinji Watanabe 84 32 0 25 Oct 2019
A practical two-stage training strategy for multi-stream end-to-end speech recognition Ruizhi Li Gregory Sell Xiaofei Wang Shinji Watanabe H. Hermansky 45 7 0 23 Oct 2019
End-to-End Speech Recognition: A review for the French Language Florian Boyer Jean-Luc Rouas AI4TS 66 10 0 18 Oct 2019
Transformer ASR with Contextual Block Processing E. Tsunoo Yosuke Kashiwagi Toshiyuki Kumakura Shinji Watanabe 113 64 0 16 Oct 2019
Convolutional Neural Networks for Speech Controlled Prosthetic Hands Yichao Fu Runlong Su BDL 57 5 0 03 Oct 2019
End-to-End Code-Switching ASR for Low-Resourced Language Pairs Xianghu Yue Grandee Lee Emre Yilmaz Fang Deng Haizhou Li 65 31 0 27 Sep 2019
Improving RNN Transducer Modeling for End-to-End Speech Recognition Jinyu Li Rui Zhao Hu Hu Jiawei Liu 79 170 0 26 Sep 2019
DARTS: Dialectal Arabic Transcription System Sameer Khurana Ahmed M. Ali James R. Glass 56 11 0 26 Sep 2019
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR Hirofumi Inaguma Masato Mimura S. Sakai Tatsuya Kawahara 45 5 0 22 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit Yiming Wang Tongfei Chen Hainan Xu Shuoyang Ding Hang Lv Yiwen Shao Nanyun Peng Lei Xie Shinji Watanabe Sanjeev Khudanpur VLM 96 73 0 18 Sep 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning Pavel Denisov Ngoc Thang Vu 57 27 0 13 Aug 2019
Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition S. Dey P. Motlícek Trung H. Bui Franck Dernoncourt 43 13 0 08 Aug 2019
SANTLR: Speech Annotation Toolkit for Low Resource Languages Xinjian Li Zhong Zhou Siddharth Dalmia A. Black Florian Metze 49 5 0 02 Aug 2019
Multilingual Speech Recognition with Corpus Relatedness Sampling Xinjian Li Siddharth Dalmia A. Black Florian Metze 43 17 0 02 Aug 2019
Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement Alzahra Badi Sangwook Park D. Han Hanseok Ko 28 7 0 26 Jul 2019
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition Yonatan Belinkov Ahmed M. Ali James R. Glass 93 33 0 09 Jul 2019
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion Suyoun Kim Siddharth Dalmia Florian Metze 94 23 0 27 Jun 2019
End-to-End ASR for Code-switched Hindi-English Speech B. M. L. Srivastava Basil Abraham Sunayana Sitaram Rupeshkumar Mehta Preethi Jyothi 28 2 0 22 Jun 2019
Multimodal Abstractive Summarization for How2 Videos Shruti Palaskar Jindrich Libovický Spandana Gella Florian Metze 72 96 0 19 Jun 2019
Multi-Stream End-to-End Speech Recognition Ruizhi Li Xiaofei Wang Sri Harish Reddy Mallidi Shinji Watanabe Takaaki Hori H. Hermansky 55 21 0 17 Jun 2019
Deep Learning-Based Automatic Downbeat Tracking: A Brief Review Bijue Jia Jiancheng Lv Dayiheng Liu 47 28 0 10 Jun 2019
Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation Byung Hoon Ahn Prannoy Pilligundla H. Esmaeilzadeh 71 20 0 30 May 2019
Sampling from Stochastic Finite Automata with Applications to CTC Decoding Martin Jansche Alexander Gutkin 33 2 0 21 May 2019
Acoustic-to-Word Models with Conversational Context Information Suyoun Kim Florian Metze 44 7 0 21 May 2019
End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System E. Tsunoo Yosuke Kashiwagi S. Asakawa Toshiyuki Kumakura 35 4 0 17 May 2019
A Hardware-Oriented and Memory-Efficient Method for CTC Decoding Siyuan Lu Jinming Lu Jun Lin Zhongfeng Wang 28 5 0 08 May 2019
Reinterpreting CTC training as iterative fitting Hongzhu Li Weiqiang Wang 21 1 0 24 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation Gakuto Kurata Kartik Audhkhasi 68 48 0 17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering Gene-Ping Yang Chao-I Tuan Hung-yi Lee Lin-Shan Lee 61 25 0 16 Apr 2019
Performance Monitoring for End-to-End Speech Recognition Ruizhi Li Gregory Sell H. Hermansky 17 2 0 09 Apr 2019
Learning Shared Encoding Representation for End-to-End Speech Recognition Models T. Nguyen Sebastian Stüker A. Waibel 45 2 0 31 Mar 2019
Attention-Augmented End-to-End Multi-Task Learning for Emotion Prediction from Speech Zixing Zhang Bingwen Wu Bjoern Schuller 70 84 0 29 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition Shiliang Zhang Ming Lei Zhijie Yan 36 16 0 27 Mar 2019