ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.11928
  4. Cited By
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units

Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units

31 December 2018
Amit Das
Jinyu Li
Guoli Ye
Rui Zhao
Jiawei Liu
ArXivPDFHTML

Papers citing "Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units"

32 / 32 papers shown
Title
On using 2D sequence-to-sequence models for speech recognition
On using 2D sequence-to-sequence models for speech recognition
Parnia Bahar
Albert Zeyer
Ralf Schluter
Hermann Ney
VLM
3DV
25
10
0
20 Nov 2019
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Dong Yu
Jinyu Li
VLM
49
160
0
25 Apr 2018
Developing Far-Field Speaker System Via Teacher-Student Learning
Developing Far-Field Speaker System Via Teacher-Student Learning
Jinyu Li
Rui Zhao
Zhuo Chen
Changliang Liu
Xiong Xiao
Guoli Ye
Jiawei Liu
30
56
0
14 Apr 2018
Advancing Acoustic-to-Word CTC Model
Advancing Acoustic-to-Word CTC Model
Jinyu Li
Guoli Ye
Amit Das
Rui Zhao
Jiawei Liu
45
97
0
15 Mar 2018
Advancing Connectionist Temporal Classification With Attention Modeling
Advancing Connectionist Temporal Classification With Attention Modeling
Amit Das
Jinyu Li
Rui Zhao
Jiawei Liu
41
51
0
15 Mar 2018
On Modular Training of Neural Acoustics-to-Word Model for LVCSR
On Modular Training of Neural Acoustics-to-Word Model for LVCSR
Zhehuai Chen
Qi Liu
Hao Li
Kai Yu
39
29
0
03 Mar 2018
Exploring Architectures, Data and Units For Streaming End-to-End Speech
  Recognition with RNN-Transducer
Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer
Kanishka Rao
Hasim Sak
Rohit Prabhavalkar
AI4TS
59
346
0
02 Jan 2018
Building competitive direct acoustics-to-word models for English
  conversational speech recognition
Building competitive direct acoustics-to-word models for English conversational speech recognition
Kartik Audhkhasi
Brian Kingsbury
Bhuvana Ramabhadran
G. Saon
M. Picheny
51
151
0
08 Dec 2017
Improving the Performance of Online Neural Transducer Models
Improving the Performance of Online Neural Transducer Models
Tara N. Sainath
Chung-Cheng Chiu
Rohit Prabhavalkar
Anjuli Kannan
Yonghui Wu
Patrick Nguyen
Zhiwen Chen
AI4TS
60
49
0
05 Dec 2017
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
...
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
81
1,150
0
05 Dec 2017
Acoustic-To-Word Model Without OOV
Acoustic-To-Word Model Without OOV
Jinyu Li
Guoli Ye
Rui Zhao
J. Droppo
Jiawei Liu
49
38
0
28 Nov 2017
Exploring Neural Transducers for End-to-End Speech Recognition
Exploring Neural Transducers for End-to-End Speech Recognition
Eric Battenberg
Jitong Chen
R. Child
Adam Coates
Yashesh Gaur Yi Li
...
Hairong Liu
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
AI4TS
61
230
0
24 Jul 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
484
129,831
0
12 Jun 2017
Advances in Joint CTC-Attention based End-to-End Speech Recognition with
  a Deep CNN Encoder and RNN-LM
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM
Takaaki Hori
Shinji Watanabe
Yu Zhang
William Chan
58
292
0
08 Jun 2017
Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder
  Based Speech Recognition
Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition
Shubham Toshniwal
Hao Tang
Liang Lu
Karen Livescu
45
116
0
05 Apr 2017
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence
  Labelling
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
Hairong Liu
Zhenyao Zhu
Xiangang Li
S. Satheesh
VLM
57
56
0
01 Mar 2017
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large
  Vocabulary Speech Recognition
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition
H. Soltau
H. Liao
Hasim Sak
62
310
0
31 Oct 2016
Latent Sequence Decompositions
Latent Sequence Decompositions
William Chan
Yu Zhang
Quoc V. Le
Navdeep Jaitly
33
62
0
10 Oct 2016
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
817
6,768
0
26 Sep 2016
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task
  Learning
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Suyoun Kim
Takaaki Hori
Shinji Watanabe
55
921
0
21 Sep 2016
Advances in All-Neural Speech Recognition
Advances in All-Neural Speech Recognition
Geoffrey Zweig
Chengzhu Yu
J. Droppo
A. Stolcke
49
95
0
19 Sep 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.4K
192,638
0
10 Dec 2015
Neural Machine Translation of Rare Words with Subword Units
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
157
7,683
0
31 Aug 2015
End-to-End Attention-based Large Vocabulary Speech Recognition
End-to-End Attention-based Large Vocabulary Speech Recognition
Dzmitry Bahdanau
J. Chorowski
Dmitriy Serdyuk
Philemon Brakel
Yoshua Bengio
55
1,149
0
18 Aug 2015
Listen, Attend and Spell
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
136
2,261
0
05 Aug 2015
EESEN: End-to-End Speech Recognition using Deep RNN Models and
  WFST-based Decoding
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding
Yajie Miao
M. Gowayyed
Florian Metze
81
753
0
29 Jul 2015
Fast and Accurate Recurrent Neural Network Acoustic Models for Speech
  Recognition
Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition
Hasim Sak
A. Senior
Kanishka Rao
F. Beaufays
56
435
0
24 Jul 2015
Attention-Based Models for Speech Recognition
Attention-Based Models for Speech Recognition
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
103
2,605
0
24 Jun 2015
Deep Speech: Scaling up end-to-end speech recognition
Deep Speech: Scaling up end-to-end speech recognition
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
...
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
161
2,119
0
17 Dec 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
395
27,205
0
01 Sep 2014
Learning Phrase Representations using RNN Encoder-Decoder for
  Statistical Machine Translation
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
647
23,235
0
03 Jun 2014
Sequence Transduction with Recurrent Neural Networks
Sequence Transduction with Recurrent Neural Networks
Alex Graves
145
1,858
0
14 Nov 2012
1