Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units

31 December 2018

Papers citing "Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units"

32 / 32 papers shown

Title
On using 2D sequence-to-sequence models for speech recognition Parnia Bahar Albert Zeyer Ralf Schluter Hermann Ney VLM 3DV 25 10 0 20 Nov 2019
Recent Progresses in Deep Learning based Acoustic Models (Updated) Dong Yu Jinyu Li VLM 49 160 0 25 Apr 2018
Developing Far-Field Speaker System Via Teacher-Student Learning Jinyu Li Rui Zhao Zhuo Chen Changliang Liu Xiong Xiao Guoli Ye Jiawei Liu 30 56 0 14 Apr 2018
Advancing Acoustic-to-Word CTC Model Jinyu Li Guoli Ye Amit Das Rui Zhao Jiawei Liu 45 97 0 15 Mar 2018
Advancing Connectionist Temporal Classification With Attention Modeling Amit Das Jinyu Li Rui Zhao Jiawei Liu 41 51 0 15 Mar 2018
On Modular Training of Neural Acoustics-to-Word Model for LVCSR Zhehuai Chen Qi Liu Hao Li Kai Yu 39 29 0 03 Mar 2018
Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer Kanishka Rao Hasim Sak Rohit Prabhavalkar AI4TS 59 346 0 02 Jan 2018
Building competitive direct acoustics-to-word models for English conversational speech recognition Kartik Audhkhasi Brian Kingsbury Bhuvana Ramabhadran G. Saon M. Picheny 51 151 0 08 Dec 2017
Improving the Performance of Online Neural Transducer Models Tara N. Sainath Chung-Cheng Chiu Rohit Prabhavalkar Anjuli Kannan Yonghui Wu Patrick Nguyen Zhiwen Chen AI4TS 60 49 0 05 Dec 2017
State-of-the-art Speech Recognition With Sequence-to-Sequence Models Chung-Cheng Chiu Tara N. Sainath Yonghui Wu Rohit Prabhavalkar Patrick Nguyen ... Katya Gonina Navdeep Jaitly Yue Liu J. Chorowski M. Bacchiani AI4TS 81 1,150 0 05 Dec 2017
Acoustic-To-Word Model Without OOV Jinyu Li Guoli Ye Rui Zhao J. Droppo Jiawei Liu 49 38 0 28 Nov 2017
Exploring Neural Transducers for End-to-End Speech Recognition Eric Battenberg Jitong Chen R. Child Adam Coates Yashesh Gaur Yi Li ... Hairong Liu S. Satheesh David Seetapun Anuroop Sriram Zhenyao Zhu AI4TS 61 230 0 24 Jul 2017
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 484 129,831 0 12 Jun 2017
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM Takaaki Hori Shinji Watanabe Yu Zhang William Chan 58 292 0 08 Jun 2017
Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition Shubham Toshniwal Hao Tang Liang Lu Karen Livescu 45 116 0 05 Apr 2017
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling Hairong Liu Zhenyao Zhu Xiangang Li S. Satheesh VLM 57 56 0 01 Mar 2017
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition H. Soltau H. Liao Hasim Sak 62 310 0 31 Oct 2016
Latent Sequence Decompositions William Chan Yu Zhang Quoc V. Le Navdeep Jaitly 33 62 0 10 Oct 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation Yonghui Wu M. Schuster Zhiwen Chen Quoc V. Le Mohammad Norouzi ... Alex Rudnick Oriol Vinyals G. Corrado Macduff Hughes J. Dean AIMat 817 6,768 0 26 Sep 2016
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning Suyoun Kim Takaaki Hori Shinji Watanabe 55 921 0 21 Sep 2016
Advances in All-Neural Speech Recognition Geoffrey Zweig Chengzhu Yu J. Droppo A. Stolcke 49 95 0 19 Sep 2016
Deep Residual Learning for Image Recognition Kaiming He Xinming Zhang Shaoqing Ren Jian Sun MedIm 1.4K 192,638 0 10 Dec 2015
Neural Machine Translation of Rare Words with Subword Units Rico Sennrich Barry Haddow Alexandra Birch 157 7,683 0 31 Aug 2015
End-to-End Attention-based Large Vocabulary Speech Recognition Dzmitry Bahdanau J. Chorowski Dmitriy Serdyuk Philemon Brakel Yoshua Bengio 55 1,149 0 18 Aug 2015
Listen, Attend and Spell William Chan Navdeep Jaitly Quoc V. Le Oriol Vinyals RALM 136 2,261 0 05 Aug 2015
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding Yajie Miao M. Gowayyed Florian Metze 81 753 0 29 Jul 2015
Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition Hasim Sak A. Senior Kanishka Rao F. Beaufays 56 435 0 24 Jul 2015
Attention-Based Models for Speech Recognition J. Chorowski Dzmitry Bahdanau Dmitriy Serdyuk Kyunghyun Cho Yoshua Bengio 103 2,605 0 24 Jun 2015
Deep Speech: Scaling up end-to-end speech recognition Awni Y. Hannun Carl Case Jared Casper Bryan Catanzaro G. Diamos ... R. Prenger S. Satheesh Shubho Sengupta Adam Coates A. Ng 161 2,119 0 17 Dec 2014
Neural Machine Translation by Jointly Learning to Align and Translate Dzmitry Bahdanau Kyunghyun Cho Yoshua Bengio AIMat 395 27,205 0 01 Sep 2014
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation Kyunghyun Cho B. V. Merrienboer Çağlar Gülçehre Dzmitry Bahdanau Fethi Bougares Holger Schwenk Yoshua Bengio AIMat 647 23,235 0 03 Jun 2014
Sequence Transduction with Recurrent Neural Networks Alex Graves 145 1,858 0 14 Nov 2012