SynthASR: Unlocking Synthetic Data for Speech Recognition

14 June 2021
A. Fazel
Wei Yang
Yulan Liu
Roberto Barra-Chicote
Yi Meng
Roland Maas
J. Droppo
    SyDa
arXiv: 2106.07803

Papers citing "SynthASR: Unlocking Synthetic Data for Speech Recognition"

28 / 28 papers shown
High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR
Sourav Banerjee
Ayushi Agarwal
Promila Ghosh
81
3
0
24 Nov 2024
Exploring the Landscape for Generative Sequence Models for Specialized Data Synthesis
Mohammad Zbeeb
Mohammad Ghorayeb
Mariam Salman
42
0
0
04 Nov 2024
Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition
Samuele Cornell
Jordan Darefsky
Zhiyao Duan
Shinji Watanabe
SyDa
68
4
0
17 Aug 2024
Handling Numeric Expressions in Automatic Speech Recognition
Christian Huber
Alexander Waibel
19
0
0
18 Jul 2024
Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis
Cong-Thanh Do
Shuhei Imai
R. Doddipatla
Thomas Hain
22
2
0
04 Jul 2024
Can Synthetic Audio From Generative Foundation Models Assist Audio Recognition and Speech Modeling?
Tiantian Feng
Dimitrios Dimitriadis
Shrikanth Narayanan
40
4
0
13 Jun 2024
Contrastive Learning from Synthetic Audio Doppelgängers
Manuel Cherep
Nikhil Singh
40
1
0
09 Jun 2024
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality
Tiantian Feng
Xuan Shi
Rahul Gupta
Shrikanth S. Narayanan
49
0
0
27 Apr 2024
Real-Time Multimodal Cognitive Assistant for Emergency Medical Services
Keshara Weerasinghe
Saahith Janapati
Xueren Ge
Sion Kim
S. Iyer
John A. Stankovic
H. Alemzadeh
28
2
0
11 Mar 2024
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning
Rishabh Jain
Peter Corcoran
20
0
0
07 Nov 2023
Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation
Aman Khullar
Daniel K. Nkemelu
Cuong V. Nguyen
Michael L. Best
37
2
0
04 Oct 2023
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS
Xin Wang
Taein Kwon
Wei-Ning Hsu
Yossi Adi
Tu Nguyen
D. Bohus
Emmanuel Dupoux
Neel Joshi
Abdelrahman Mohamed
10
4
0
29 Sep 2023
Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Yochai Blau
Rohan Agrawal
Lior Madmony
Gary Wang
Andrew Rosenberg
Zhehuai Chen
Zorik Gekhman
Genady Beryozkin
Parisa Haghani
Bhuvana Ramabhadran
46
3
0
14 Aug 2023
Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion
Siyuan Shan
Yang Li
A. Banerjee
Junier B. Oliva
26
4
0
11 Aug 2023
External Language Model Integration for Factorized Neural Transducers
Michael Levit
S. Parthasarathy
Cem Aksoylar
Mohammad Sadegh Rasooli
Shuangyu Chang
29
2
0
26 May 2023
Text Generation with Speech Synthesis for ASR Data Augmentation
Zhuangqun Huang
Gil Keren
Ziran Jiang
Shashank Jain
David Goss-Grubbs
...
Antony D'Avirro
Ethan Campbell-Taylor
Jessie Salas
Irina-Elena Veliche
Xi Chen
13
6
0
22 May 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Xubo Liu
Egor Lakomkin
Konstantinos Vougioukas
Pingchuan Ma
Honglie Chen
...
Niko Moritz
J. Kolár
Stavros Petridis
M. Pantic
Christian Fuegen
52
19
0
30 Mar 2023
On-the-fly Text Retrieval for End-to-End ASR Adaptation
Bolaji Yusuf
Aditya Gourav
Ankur Gandhe
I. Bulyko
KELM
RALM
40
4
0
20 Mar 2023
Machine Learning for Synthetic Data Generation: A Review
Ying-Cheng Lu
Minjie Shen
Huazheng Wang
Xiao Wang
Capucine Van Rechem
Tianfan Fu
Wenqi Wei
SyDa
42
140
0
08 Feb 2023
Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
Rui Zhao
Jian Xue
P. Parthasarathy
Veljko Miljanic
Jinyu Li
21
13
0
05 Dec 2022
When Is TTS Augmentation Through a Pivot Language Useful?
Nathaniel R. Robinson
Perez Ogayo
Swetha Gangu
David R. Mortensen
Shinji Watanabe
17
9
0
20 Jul 2022
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need
Daniel Korzekwa
Jaime Lorenzo-Trueba
Thomas Drugman
B. Kostek
23
25
0
02 Jul 2022
Building African Voices
Perez Ogayo
Graham Neubig
A. Black
6
14
0
01 Jul 2022
On the Importance and Applicability of Pre-Training for Federated Learning
Hong-You Chen
Cheng-Hao Tu
Zi-hua Li
Hang Shen
Wei-Lun Chao
FedML
22
77
0
23 Jun 2022
A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Raviraj Joshi
Ashutosh Kumar Singh
12
7
0
22 Jun 2022
USTED: Improving ASR with a Unified Speech and Text Encoder-Decoder
Bolaji Yusuf
Ankur Gandhe
Alex Sokolov
40
8
0
12 Feb 2022
Continual Learning for Monolingual End-to-End Automatic Speech Recognition
Steven Vander Eeckt
Hugo Van hamme
CLL
17
17
0
17 Dec 2021
Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures
Nick Rossenbach
Mohammad Zeineldeen
Benedikt Hilmes
Ralf Schlüter
Hermann Ney
28
12
0
12 Apr 2021