Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.01571
Cited By
Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system
3 November 2022
Li Li
Dongxing Xu
Haoran Wei
Yanhua Long
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system"
24 / 24 papers shown
Title
A Study of Transducer based End-to-End ASR with ESPnet: Architecture, Auxiliary Loss and Decoding Strategies
Florian Boyer
Yusuke Shinohara
Takaaki Ishii
Hirofumi Inaguma
Shinji Watanabe
53
35
0
14 Jan 2022
Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units
Yosuke Higuchi
Keita Karube
Tetsuji Ogawa
Tetsunori Kobayashi
37
23
0
08 Oct 2021
Relaxing the Conditional Independence Assumption of CTC-based ASR by Conditioning on Intermediate Predictions
Jumon Nozaki
Tatsuya Komatsu
60
74
0
06 Apr 2021
Multitask Learning and Joint Optimization for Transformer-RNN-Transducer Speech Recognition
J. Jeon
Eesung Kim
24
13
0
02 Nov 2020
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition
Wenyong Huang
Wenchao Hu
Y. Yeung
Xiao Chen
39
50
0
13 Aug 2020
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition
Jinyu Li
Yu-Huan Wu
Yashesh Gaur
Chengyi Wang
Rui Zhao
Shujie Liu
39
137
0
28 May 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
212
3,119
0
16 May 2020
Common Voice: A Massively-Multilingual Speech Corpus
Rosana Ardila
Megan Branson
Kelly Davis
Michael Henretty
M. Kohler
Josh Meyer
Reuben Morais
Lindsay Saunders
Francis M. Tyers
Gregor Weber
VLM
87
1,592
0
13 Dec 2019
On the Inductive Bias of Word-Character-Level Multi-Task Learning for Speech Recognition
Jan Kremer
Lasse Borgholt
Lars Maaløe
53
6
0
28 Nov 2018
Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling
Hainan Xu
Shuoyang Ding
Shinji Watanabe
62
37
0
10 Nov 2018
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
Taku Kudo
John Richardson
180
3,514
0
19 Aug 2018
Hierarchical Multi Task Learning With CTC
Ramon Sanabria
Florian Metze
60
50
0
18 Jul 2018
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese
Shiyu Zhou
Linhao Dong
Shuang Xu
Bo Xu
55
63
0
16 May 2018
A comparable study of modeling units for end-to-end Mandarin speech recognition
Wei Zou
Dongwei Jiang
Shuaijiang Zhao
Xiangang Li
52
32
0
10 May 2018
Improved training of end-to-end attention models for speech recognition
Albert Zeyer
Kazuki Irie
Ralf Schluter
Hermann Ney
VLM
65
270
0
08 May 2018
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
Taku Kudo
212
1,165
0
29 Apr 2018
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
...
Jahn Heymann
Sanjeev Khudanpur
Nanxin Chen
Adithya Renduchintala
Tsubasa Ochiai
VLM
93
1,503
0
30 Mar 2018
Advancing Acoustic-to-Word CTC Model
Jinyu Li
Guoli Ye
Amit Das
Rui Zhao
Jiawei Liu
55
98
0
15 Mar 2018
Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer
Kanishka Rao
Hasim Sak
Rohit Prabhavalkar
AI4TS
72
347
0
02 Jan 2018
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
...
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
86
1,153
0
05 Dec 2017
Acoustic-To-Word Model Without OOV
Jinyu Li
Guoli Ye
Rui Zhao
J. Droppo
Jiawei Liu
58
38
0
28 Nov 2017
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
838
27,303
0
02 Dec 2015
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
195
7,729
0
31 Aug 2015
Sequence Transduction with Recurrent Neural Networks
Alex Graves
183
1,866
0
14 Nov 2012
1