SynthASR: Unlocking Synthetic Data for Speech Recognition

14 June 2021

A. Fazel

Wei Yang

Yulan Liu

Roberto Barra-Chicote

Papers citing "SynthASR: Unlocking Synthetic Data for Speech Recognition"

28 / 28 papers shown

Title
High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR Sourav Banerjee Ayushi Agarwal Promila Ghosh 81 3 0 24 Nov 2024
Exploring the Landscape for Generative Sequence Models for Specialized Data Synthesis Mohammad Zbeeb Mohammad Ghorayeb Mariam Salman 42 0 0 04 Nov 2024
Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition Samuele Cornell Jordan Darefsky Zhiyao Duan Shinji Watanabe SyDa 68 4 0 17 Aug 2024
Handling Numeric Expressions in Automatic Speech Recognition Christian Huber Alexander Waibel 19 0 0 18 Jul 2024
Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis Cong-Thanh Do Shuhei Imai R. Doddipatla Thomas Hain 22 2 0 04 Jul 2024
Can Synthetic Audio From Generative Foundation Models Assist Audio Recognition and Speech Modeling? Tiantian Feng Dimitrios Dimitriadis Shrikanth Narayanan 40 4 0 13 Jun 2024
Contrastive Learning from Synthetic Audio Doppelgängers Manuel Cherep Nikhil Singh 40 1 0 09 Jun 2024
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality Tiantian Feng Xuan Shi Rahul Gupta Shrikanth S. Narayanan 49 0 0 27 Apr 2024
Real-Time Multimodal Cognitive Assistant for Emergency Medical Services Keshara Weerasinghe Saahith Janapati Xueren Ge Sion Kim S. Iyer John A. Stankovic H. Alemzadeh 28 2 0 11 Mar 2024
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning Rishabh Jain Peter Corcoran 20 0 0 07 Nov 2023
Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation Aman Khullar Daniel K. Nkemelu Cuong V. Nguyen Michael L. Best 37 2 0 04 Oct 2023
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS Xin Wang Taein Kwon Wei-Ning Hsu Yossi Adi Tu Nguyen D. Bohus Emmanuel Dupoux Neel Joshi Abdelrahman Mohamed 10 4 0 29 Sep 2023
Using Text Injection to Improve Recognition of Personal Identifiers in Speech Yochai Blau Rohan Agrawal Lior Madmony Gary Wang Andrew Rosenberg Zhehuai Chen Zorik Gekhman Genady Beryozkin Parisa Haghani Bhuvana Ramabhadran 46 3 0 14 Aug 2023
Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion Siyuan Shan Yang Li A. Banerjee Junier B. Oliva 26 4 0 11 Aug 2023
External Language Model Integration for Factorized Neural Transducers Michael Levit S. Parthasarathy Cem Aksoylar Mohammad Sadegh Rasooli Shuangyu Chang 29 2 0 26 May 2023
Text Generation with Speech Synthesis for ASR Data Augmentation Zhuangqun Huang Gil Keren Ziran Jiang Shashank Jain David Goss-Grubbs ... Antony DÁvirro Ethan Campbell-Taylor Jessie Salas Irina-Elena Veliche Xi Chen 13 6 0 22 May 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision Xubo Liu Egor Lakomkin Konstantinos Vougioukas Pingchuan Ma Honglie Chen ... Niko Moritz J. Kolár Stavros Petridis M. Pantic Christian Fuegen 52 19 0 30 Mar 2023
On-the-fly Text Retrieval for End-to-End ASR Adaptation Bolaji Yusuf Aditya Gourav Ankur Gandhe I. Bulyko KELM RALM 40 4 0 20 Mar 2023
Machine Learning for Synthetic Data Generation: A Review Ying-Cheng Lu Minjie Shen Huazheng Wang Xiao Wang Capucine Van Rechem Tianfan Fu Wenqi Wei SyDa 42 140 0 08 Feb 2023
Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models Rui Zhao Jian Xue P. Parthasarathy Veljko Miljanic Jinyu Li 21 13 0 05 Dec 2022
When Is TTS Augmentation Through a Pivot Language Useful? Nathaniel R. Robinson Perez Ogayo Swetha Gangu David R. Mortensen Shinji Watanabe 17 9 0 20 Jul 2022
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need Daniel Korzekwa Jaime Lorenzo-Trueba Thomas Drugman B. Kostek 23 25 0 02 Jul 2022
Building African Voices Perez Ogayo Graham Neubig A. Black 6 14 0 01 Jul 2022
On the Importance and Applicability of Pre-Training for Federated Learning Hong-You Chen Cheng-Hao Tu Zi-hua Li Hang Shen Wei-Lun Chao FedML 22 77 0 23 Jun 2022
A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data Raviraj Joshi Ashutosh Kumar Singh 12 7 0 22 Jun 2022
USTED: Improving ASR with a Unified Speech and Text Encoder-Decoder Bolaji Yusuf Ankur Gandhe Alex Sokolov 40 8 0 12 Feb 2022
Continual Learning for Monolingual End-to-End Automatic Speech Recognition Steven Vander Eeckt Hugo Van hamme CLL 17 17 0 17 Dec 2021
Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures Nick Rossenbach Mohammad Zeineldeen Benedikt Hilmes Ralf Schluter Hermann Ney 28 12 0 12 Apr 2021