Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.07803
Cited By
SynthASR: Unlocking Synthetic Data for Speech Recognition
14 June 2021
A. Fazel
Wei Yang
Yulan Liu
Roberto Barra-Chicote
Yi Meng
Roland Maas
J. Droppo
SyDa
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SynthASR: Unlocking Synthetic Data for Speech Recognition"
28 / 28 papers shown
Title
High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR
Sourav Banerjee
Ayushi Agarwal
Promila Ghosh
81
3
0
24 Nov 2024
Exploring the Landscape for Generative Sequence Models for Specialized Data Synthesis
Mohammad Zbeeb
Mohammad Ghorayeb
Mariam Salman
42
0
0
04 Nov 2024
Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition
Samuele Cornell
Jordan Darefsky
Zhiyao Duan
Shinji Watanabe
SyDa
68
4
0
17 Aug 2024
Handling Numeric Expressions in Automatic Speech Recognition
Christian Huber
Alexander Waibel
19
0
0
18 Jul 2024
Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis
Cong-Thanh Do
Shuhei Imai
R. Doddipatla
Thomas Hain
22
2
0
04 Jul 2024
Can Synthetic Audio From Generative Foundation Models Assist Audio Recognition and Speech Modeling?
Tiantian Feng
Dimitrios Dimitriadis
Shrikanth Narayanan
40
4
0
13 Jun 2024
Contrastive Learning from Synthetic Audio Doppelgängers
Manuel Cherep
Nikhil Singh
40
1
0
09 Jun 2024
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality
Tiantian Feng
Xuan Shi
Rahul Gupta
Shrikanth S. Narayanan
49
0
0
27 Apr 2024
Real-Time Multimodal Cognitive Assistant for Emergency Medical Services
Keshara Weerasinghe
Saahith Janapati
Xueren Ge
Sion Kim
S. Iyer
John A. Stankovic
H. Alemzadeh
28
2
0
11 Mar 2024
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning
Rishabh Jain
Peter Corcoran
20
0
0
07 Nov 2023
Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation
Aman Khullar
Daniel K. Nkemelu
Cuong V. Nguyen
Michael L. Best
37
2
0
04 Oct 2023
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS
Xin Wang
Taein Kwon
Wei-Ning Hsu
Yossi Adi
Tu Nguyen
D. Bohus
Emmanuel Dupoux
Neel Joshi
Abdelrahman Mohamed
10
4
0
29 Sep 2023
Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Yochai Blau
Rohan Agrawal
Lior Madmony
Gary Wang
Andrew Rosenberg
Zhehuai Chen
Zorik Gekhman
Genady Beryozkin
Parisa Haghani
Bhuvana Ramabhadran
46
3
0
14 Aug 2023
Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion
Siyuan Shan
Yang Li
A. Banerjee
Junier B. Oliva
26
4
0
11 Aug 2023
External Language Model Integration for Factorized Neural Transducers
Michael Levit
S. Parthasarathy
Cem Aksoylar
Mohammad Sadegh Rasooli
Shuangyu Chang
29
2
0
26 May 2023
Text Generation with Speech Synthesis for ASR Data Augmentation
Zhuangqun Huang
Gil Keren
Ziran Jiang
Shashank Jain
David Goss-Grubbs
...
Antony DÁvirro
Ethan Campbell-Taylor
Jessie Salas
Irina-Elena Veliche
Xi Chen
13
6
0
22 May 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Xubo Liu
Egor Lakomkin
Konstantinos Vougioukas
Pingchuan Ma
Honglie Chen
...
Niko Moritz
J. Kolár
Stavros Petridis
M. Pantic
Christian Fuegen
52
19
0
30 Mar 2023
On-the-fly Text Retrieval for End-to-End ASR Adaptation
Bolaji Yusuf
Aditya Gourav
Ankur Gandhe
I. Bulyko
KELM
RALM
40
4
0
20 Mar 2023
Machine Learning for Synthetic Data Generation: A Review
Ying-Cheng Lu
Minjie Shen
Huazheng Wang
Xiao Wang
Capucine Van Rechem
Tianfan Fu
Wenqi Wei
SyDa
42
140
0
08 Feb 2023
Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
Rui Zhao
Jian Xue
P. Parthasarathy
Veljko Miljanic
Jinyu Li
21
13
0
05 Dec 2022
When Is TTS Augmentation Through a Pivot Language Useful?
Nathaniel R. Robinson
Perez Ogayo
Swetha Gangu
David R. Mortensen
Shinji Watanabe
17
9
0
20 Jul 2022
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need
Daniel Korzekwa
Jaime Lorenzo-Trueba
Thomas Drugman
B. Kostek
23
25
0
02 Jul 2022
Building African Voices
Perez Ogayo
Graham Neubig
A. Black
6
14
0
01 Jul 2022
On the Importance and Applicability of Pre-Training for Federated Learning
Hong-You Chen
Cheng-Hao Tu
Zi-hua Li
Hang Shen
Wei-Lun Chao
FedML
22
77
0
23 Jun 2022
A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Raviraj Joshi
Ashutosh Kumar Singh
12
7
0
22 Jun 2022
USTED: Improving ASR with a Unified Speech and Text Encoder-Decoder
Bolaji Yusuf
Ankur Gandhe
Alex Sokolov
40
8
0
12 Feb 2022
Continual Learning for Monolingual End-to-End Automatic Speech Recognition
Steven Vander Eeckt
Hugo Van hamme
CLL
17
17
0
17 Dec 2021
Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures
Nick Rossenbach
Mohammad Zeineldeen
Benedikt Hilmes
Ralf Schluter
Hermann Ney
28
12
0
12 Apr 2021
1