v1v2 (latest)

Improving Generalization of Transformer for Speech Recognition with Parallel Schedule Sampling and Relative Positional Embedding

1 November 2019

Jia Jia

Papers citing "Improving Generalization of Transformer for Speech Recognition with Parallel Schedule Sampling and Relative Positional Embedding"

20 / 20 papers shown

Title
The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge Shutong Niu Ruoyu Wang Jun Du Gaobin Yang Yanhui Tu ... Tian Gao Genshun Wan Feng Ma Jia Pan Jianqing Gao 102 6 0 03 Sep 2024
Stock and market index prediction using Informer network Yu-Hsin Lu Hailong Zhang Qiwen Guo AIFin AI4TS 125 5 0 22 May 2023
A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition Ruchao Fan Wei Chu Peng Chang Abeer Alwan 36 11 0 15 Apr 2023
Transformers in Speech Processing: A Survey S. Latif Aun Zaidi Heriberto Cuayáhuitl Fahad Shamshad Moazzam Shoukat Muhammad Usama Junaid Qadir 179 48 0 21 Mar 2023
Mitigation of Spatial Nonstationarity with Vision Transformers Lei Liu Javier E. Santos Mavsa Prodanović Michael J. Pyrcz 55 4 0 09 Dec 2022
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition Yist Y. Lin Tao Han Haihua Xu Van Tung Pham Yerbolat Khassanov Tze Yuang Chong Yi He Lu Lu Zejun Ma 74 2 0 28 Oct 2022
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition Jinhan Wang Xiaosu Tong Jinxi Guo Di He Roland Maas 73 5 0 22 Feb 2022
Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training J. Yang Lei He 93 11 0 20 Jan 2022
Reducing Exposure Bias in Training Recurrent Neural Network Transducers Xiaodong Cui Brian Kingsbury G. Saon David Haws Zoltán Tüske 51 5 0 24 Aug 2021
From block-Toeplitz matrices to differential equations on graphs: towards a general theory for scalable masked Transformers K. Choromanski Han Lin Haoxian Chen Tianyi Zhang Arijit Sehanobish Valerii Likhosherstov Jack Parker-Holder Tamás Sarlós Adrian Weller Thomas Weingarten 160 34 0 16 Jul 2021
An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition Ruchao Fan Wei Chu Peng Chang Jing Xiao Abeer Alwan 77 15 0 18 Jun 2021
CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings Tatiana Likhomanenko Qiantong Xu Gabriel Synnaeve R. Collobert A. Rogozhnikov OOD ViT 90 60 0 06 Jun 2021
Relative Positional Encoding for Transformers with Linear Complexity Antoine Liutkus Ondřej Cífka Shih-Lun Wu Umut Simsekli Yi-Hsuan Yang Gaël Richard 84 48 0 18 May 2021
Estimating articulatory movements in speech production with transformer networks Sathvik Udupa Anwesha Roy Abhayjeet Singh Aravind Illa P. Ghosh 58 16 0 11 Apr 2021
Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention Chen Liang Menglong Xu Xiao-Lei Zhang 85 9 0 29 Mar 2021
End-to-end acoustic modelling for phone recognition of young readers Lucile Gelin Morgane Daniel J. Pinquier Thomas Pellegrini 60 13 0 04 Mar 2021
Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications Y. Oh Kiyoung Park Jeongue Park OffRL 85 5 0 14 Jan 2021
CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition Ruchao Fan Wei Chu Peng Chang Jing Xiao 67 37 0 28 Oct 2020
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-based LVCSR Xinyuan Zhou Grandee Lee Emre Yilmaz Yanhua Long Jiaen Liang Haizhou Li 72 7 0 18 Jun 2020
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition Linhao Dong Cheng Yi Jianzong Wang Shiyu Zhou Shuang Xu X. Jia Bo Xu 68 17 0 20 May 2020