Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.06773
Cited By
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
21 September 2016
Suyoun Kim
Takaaki Hori
Shinji Watanabe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning"
44 / 144 papers shown
Title
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Suyoun Kim
Shangguan Yuan
Jay Mahadeokar
A. Bruguier
Christian Fuegen
M. Seltzer
Duc Le
8
28
0
26 Oct 2020
How Phonotactics Affect Multilingual and Zero-shot ASR Performance
Siyuan Feng
Piotr Żelasko
Laureano Moro Velázquez
A. Abavisani
M. Hasegawa-Johnson
O. Scharenborg
Najim Dehak
26
17
0
22 Oct 2020
Joint Analysis of Sound Events and Acoustic Scenes Using Multitask Learning
Noriyuki Tonami
Keisuke Imoto
Ryosuke Yamanishi
Y. Yamashita
23
13
0
16 Oct 2020
Can Federated Learning Save The Planet?
Xinchi Qiu
Titouan Parcollet
Daniel J. Beutel
Taner Topal
Akhil Mathur
Nicholas D. Lane
23
78
0
13 Oct 2020
Improving Low Resource Code-switched ASR using Augmented Code-switched TTS
Yash Sharma
Basil Abraham
Karan Taneja
P. Jyothi
6
20
0
12 Oct 2020
A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Yerbolat Khassanov
Saida Mussakhojayeva
A. Mirzakhmetov
A. Adiyev
Mukhamet Nurpeiissov
H. A. Varol
11
30
0
22 Sep 2020
Modular End-to-end Automatic Speech Recognition Framework for Acoustic-to-word Model
Qi Liu
Zhehuai Chen
Hao Li
Mingkun Huang
Yizhou Lu
Kai Yu
16
6
0
31 Jul 2020
Robust Front-End for Multi-Channel ASR using Flow-Based Density Estimation
Xiaoyuan Yi
Hyeonseung Lee
Wenhao Li
Hyung Yong Kim
Nam Soo Kim
14
22
0
25 Jul 2020
Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset
A. Andrusenko
A. Laptev
Ivan Medennikov
VLM
16
12
0
15 Jun 2020
End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020
Marco Gaido
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
19
53
0
04 Jun 2020
Simplified Self-Attention for Transformer-based End-to-End Speech Recognition
Haoneng Luo
Shiliang Zhang
Ming Lei
Lei Xie
27
33
0
21 May 2020
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Yosuke Higuchi
Shinji Watanabe
Nanxin Chen
Tetsuji Ogawa
Tetsunori Kobayashi
17
137
0
18 May 2020
Attention-based Transducer for Online Speech Recognition
Bin Wang
Yan Yin
Hui-Ching Lin
18
4
0
18 May 2020
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition
Zhengkun Tian
Jiangyan Yi
J. Tao
Ye Bai
Shuai Zhang
Zhengqi Wen
8
54
0
16 May 2020
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Tara N. Sainath
Yanzhang He
Bo-wen Li
A. Narayanan
Ruoming Pang
...
Trevor Strohman
Mirkó Visontai
Yonghui Wu
Yu Zhang
Ding Zhao
20
215
0
28 Mar 2020
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDL
AI4TS
21
114
0
20 Feb 2020
CGCNN: Complex Gabor Convolutional Neural Network on raw speech
Paul-Gauthier Noé
Titouan Parcollet
Mohamed Morchid
14
29
0
11 Feb 2020
End-to-End Multi-speaker Speech Recognition with Transformer
Xuankai Chang
Wangyou Zhang
Y. Qian
Jonathan Le Roux
Shinji Watanabe
ViT
25
103
0
10 Feb 2020
GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition
Wenyang Hu
Xiaocong Cai
Jun Hou
Shuai Yi
Zhiping Lin
3DV
15
128
0
04 Feb 2020
TextScanner: Reading Characters in Order for Robust Scene Text Recognition
Zhaoyi Wan
Minghang He
Haoran Chen
X. Bai
Cong Yao
17
139
0
28 Dec 2019
Decoupled Attention Network for Text Recognition
Tianwei Wang
Yuanzhi Zhu
Lianwen Jin
Canjie Luo
Xiaoxue Chen
Y. Wu
Qianying Wang
Mingxiang Cai
38
252
0
21 Dec 2019
End-to-end training of time domain audio separation and recognition
Thilo von Neumann
K. Kinoshita
Lukas Drude
Christoph Boeddeker
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
22
34
0
18 Dec 2019
Deep Representations for Cross-spectral Ocular Biometrics
L. A. Zanlorensi
D. Lucio
A. Britto
Hugo Manuel Proença
David Menotti
CVBM
24
25
0
21 Nov 2019
Towards Online End-to-end Transformer Automatic Speech Recognition
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
22
32
0
25 Oct 2019
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
24
99
0
22 Oct 2019
Transformer ASR with Contextual Block Processing
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
56
64
0
16 Oct 2019
Self-Attention Transducers for End-to-End Speech Recognition
Zhengkun Tian
Jiangyan Yi
J. Tao
Ye Bai
Zhengqi Wen
AI4TS
21
70
0
28 Sep 2019
DARTS: Dialectal Arabic Transcription System
Sameer Khurana
Ahmed M. Ali
James R. Glass
6
11
0
26 Sep 2019
Two-Pass End-to-End Speech Recognition
Tara N. Sainath
Ruoming Pang
David Rybach
Yanzhang He
Rohit Prabhavalkar
...
Qiao Liang
Trevor Strohman
Yonghui Wu
Ian McGraw
Chung-Cheng Chiu
21
147
0
29 Aug 2019
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain
Johanes Effendi
Andros Tjandra
S. Sakti
Satoshi Nakamura
19
3
0
03 Jun 2019
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong
Bo Xu
27
125
0
27 May 2019
Acoustic-to-Word Models with Conversational Context Information
Suyoun Kim
Florian Metze
14
7
0
21 May 2019
End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
E. Tsunoo
Yosuke Kashiwagi
S. Asakawa
Toshiyuki Kumakura
11
4
0
17 May 2019
Joint Analysis of Acoustic Events and Scenes Based on Multitask Learning
Noriyuki Tonami
Keisuke Imoto
M. Niitsuma
Ryosuke Yamanishi
Y. Yamashita
14
13
0
27 Apr 2019
Aggregation Cross-Entropy for Sequence Recognition
Zecheng Xie
Yaoxiong Huang
Yuanzhi Zhu
Lianwen Jin
Yuliang Liu
Lele Xie
19
92
0
17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
20
25
0
16 Apr 2019
Stream attention-based multi-array end-to-end speech recognition
Xiaofei Wang
Ruizhi Li
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
25
21
0
12 Nov 2018
Hierarchical Multitask Learning for CTC-based Speech Recognition
Kalpesh Krishna
Shubham Toshniwal
Karen Livescu
11
44
0
17 Jul 2018
Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units
Zhangyu Xiao
Zhijian Ou
Wei Chu
Hui-Ching Lin
24
38
0
13 Jul 2018
End-to-End Multimodal Speech Recognition
Shruti Palaskar
Ramon Sanabria
Florian Metze
25
41
0
25 Apr 2018
Tied Multitask Learning for Neural Speech Translation
Antonios Anastasopoulos
David Chiang
100
171
0
19 Feb 2018
Towards Language-Universal End-to-End Speech Recognition
Suyoun Kim
M. Seltzer
27
68
0
06 Nov 2017
Focusing Attention: Towards Accurate Text Recognition in Natural Images
Zhanzhan Cheng
Fan Bai
Yunlu Xu
Gang Zheng
Shiliang Pu
Shuigeng Zhou
15
448
0
07 Sep 2017
Multichannel End-to-end Speech Recognition
Tsubasa Ochiai
Shinji Watanabe
Takaaki Hori
J. Hershey
19
92
0
14 Mar 2017
Previous
1
2
3