Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1911.00203
Cited By
v1
v2 (latest)
Improving Generalization of Transformer for Speech Recognition with Parallel Schedule Sampling and Relative Positional Embedding
1 November 2019
Pan Zhou
Ruchao Fan
Wei Chen
Jia Jia
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Improving Generalization of Transformer for Speech Recognition with Parallel Schedule Sampling and Relative Positional Embedding"
20 / 20 papers shown
Title
The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge
Shutong Niu
Ruoyu Wang
Jun Du
Gaobin Yang
Yanhui Tu
...
Tian Gao
Genshun Wan
Feng Ma
Jia Pan
Jianqing Gao
102
6
0
03 Sep 2024
Stock and market index prediction using Informer network
Yu-Hsin Lu
Hailong Zhang
Qiwen Guo
AIFin
AI4TS
125
5
0
22 May 2023
A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition
Ruchao Fan
Wei Chu
Peng Chang
Abeer Alwan
36
11
0
15 Apr 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Muhammad Usama
Junaid Qadir
179
48
0
21 Mar 2023
Mitigation of Spatial Nonstationarity with Vision Transformers
Lei Liu
Javier E. Santos
Mavsa Prodanović
Michael J. Pyrcz
55
4
0
09 Dec 2022
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition
Yist Y. Lin
Tao Han
Haihua Xu
Van Tung Pham
Yerbolat Khassanov
Tze Yuang Chong
Yi He
Lu Lu
Zejun Ma
74
2
0
28 Oct 2022
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition
Jinhan Wang
Xiaosu Tong
Jinxi Guo
Di He
Roland Maas
73
5
0
22 Feb 2022
Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training
J. Yang
Lei He
93
11
0
20 Jan 2022
Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Xiaodong Cui
Brian Kingsbury
G. Saon
David Haws
Zoltán Tüske
51
5
0
24 Aug 2021
From block-Toeplitz matrices to differential equations on graphs: towards a general theory for scalable masked Transformers
K. Choromanski
Han Lin
Haoxian Chen
Tianyi Zhang
Arijit Sehanobish
Valerii Likhosherstov
Jack Parker-Holder
Tamás Sarlós
Adrian Weller
Thomas Weingarten
160
34
0
16 Jul 2021
An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Ruchao Fan
Wei Chu
Peng Chang
Jing Xiao
Abeer Alwan
77
15
0
18 Jun 2021
CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings
Tatiana Likhomanenko
Qiantong Xu
Gabriel Synnaeve
R. Collobert
A. Rogozhnikov
OOD
ViT
90
60
0
06 Jun 2021
Relative Positional Encoding for Transformers with Linear Complexity
Antoine Liutkus
Ondřej Cífka
Shih-Lun Wu
Umut Simsekli
Yi-Hsuan Yang
Gaël Richard
84
48
0
18 May 2021
Estimating articulatory movements in speech production with transformer networks
Sathvik Udupa
Anwesha Roy
Abhayjeet Singh
Aravind Illa
P. Ghosh
58
16
0
11 Apr 2021
Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention
Chen Liang
Menglong Xu
Xiao-Lei Zhang
85
9
0
29 Mar 2021
End-to-end acoustic modelling for phone recognition of young readers
Lucile Gelin
Morgane Daniel
J. Pinquier
Thomas Pellegrini
60
13
0
04 Mar 2021
Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications
Y. Oh
Kiyoung Park
Jeongue Park
OffRL
85
5
0
14 Jan 2021
CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition
Ruchao Fan
Wei Chu
Peng Chang
Jing Xiao
67
37
0
28 Oct 2020
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-based LVCSR
Xinyuan Zhou
Grandee Lee
Emre Yilmaz
Yanhua Long
Jiaen Liang
Haizhou Li
72
7
0
18 Jun 2020
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition
Linhao Dong
Cheng Yi
Jianzong Wang
Shiyu Zhou
Shuang Xu
X. Jia
Bo Xu
68
17
0
20 May 2020
1