arXiv:1811.00707
Training Neural Speech Recognition Systems with Synthetic Speech Augmentation

2 November 2018
Jason Chun Lok Li
R. Gadde
Boris Ginsburg
Vitaly Lavrukhin

Papers citing "Training Neural Speech Recognition Systems with Synthetic Speech Augmentation"

31 / 31 papers shown
Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR
Christoph Minixhofer
Ondřej Klejch
Peter Bell
80
0
0
16 Oct 2024
Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR
Savitha Murthy
D. Sitaram
49
0
0
16 Mar 2024
Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge
Tanel Alumäe
Jiaming Kong
Daniil Robnikov
29
2
0
26 Oct 2023
Accurate synthesis of Dysarthric Speech for ASR data augmentation
M. Soleymanpour
Michael T. Johnson
Rahim Soleymanpour
J. Berry
81
3
0
16 Aug 2023
Two-Stage Voice Anonymization for Enhanced Privacy
F. Nespoli
Daniel Barreda
Joerg Bitzer
Patrick A. Naylor
58
3
0
28 Jun 2023
Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content
Tiantian Feng
Digbalay Bose
Xuan Shi
Shrikanth Narayanan
60
1
0
13 Jun 2023
GPT-FL: Generative Pre-trained Model-Assisted Federated Learning
Tuo Zhang
Tiantian Feng
Samiul Alam
Dimitrios Dimitriadis
Sunwoo Lee
Mi Zhang
Shrikanth S. Narayanan
Salman Avestimehr
FedML
77
29
0
03 Jun 2023
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners
Yonglong Tian
Lijie Fan
Phillip Isola
Huiwen Chang
Dilip Krishnan
VLM, DiffM
145
153
0
01 Jun 2023
OLISIA: a Cascade System for Spoken Dialogue State Tracking
Léo Jacqmin
Lucas Druart
Yannick Esteve
Benoit Favre
L. Rojas-Barahona
Valentin Vielzeuf
87
3
0
20 Apr 2023
AraSpot: Arabic Spoken Command Spotting
Mahmoud Salhab
H. Harmanani
56
0
0
29 Mar 2023
Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator
Vladimir Bataev
Roman Korostik
Evgeny Shabalin
Vitaly Lavrukhin
Boris Ginsburg
VLM
78
15
0
27 Feb 2023
MAC: A unified framework boosting low resource automatic speech recognition
Zeping Min
Qian Ge
Zhong Li
Weinan E
116
1
0
05 Feb 2023
SpeeChain: A Speech Toolkit for Large-Scale Machine Speech Chain
Heli Qi
Sashi Novitasari
Andros Tjandra
S. Sakti
Satoshi Nakamura
75
3
0
08 Jan 2023
Evaluating and reducing the distance between synthetic and real speech distributions
Christoph Minixhofer
Ondřej Klejch
P. Bell
82
8
0
29 Nov 2022
Exploring the Impact of Noise and Degradations on Heart Sound Classification Models
Davoud Shariat Panah
Andrew Hines
Susan Mckeever
10
7
0
14 Nov 2022
A Policy-based Approach to the SpecAugment Method for Low Resource E2E ASR
Rui Li
Guodong Ma
Dexin Zhao
Ranran Zeng
Xiaoyu Li
Haolin Huang
69
2
0
16 Oct 2022
When Is TTS Augmentation Through a Pivot Language Useful?
Nathaniel R. Robinson
Perez Ogayo
Swetha Gangu
David R. Mortensen
Shinji Watanabe
77
10
0
20 Jul 2022
End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting
Thierry Desot
François Portet
Michel Vacher
54
12
0
17 Jul 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL, AI4TS
285
368
0
21 May 2022
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
Edresson Casanova
C. Shulby
Alexander Korolev
Arnaldo Cândido Júnior
A. S. Soares
S. Aluísio
M. Ponti
117
14
0
29 Mar 2022
Differentially Private Speaker Anonymization
Ali Shahin Shamsabadi
B. M. L. Srivastava
A. Bellet
Nathalie Vauquier
Emmanuel Vincent
Mohamed Maouche
Marc Tommasi
Nicolas Papernot
MIACV
145
35
0
23 Feb 2022
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition
M. Soleymanpour
Michael T. Johnson
Rahim Soleymanpour
J. Berry
82
30
0
27 Jan 2022
Data Incubation -- Synthesizing Missing Data for Handwriting Recognition
Jen-Hao Rick Chang
Martin Bresler
Youssouf Chherawala
Adrien Delaye
Thomas Deselaers
Ryan S. Dixon
Oncel Tuzel
SyDa
60
2
0
13 Oct 2021
Injecting Text in Self-Supervised Speech Pretraining
Zhehuai Chen
Yu Zhang
Andrew Rosenberg
Bhuvana Ramabhadran
Gary Wang
Pedro J. Moreno
SSL
88
36
0
27 Aug 2021
Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems
Xianrui Zheng
Yulan Liu
Deniz Gunceler
D. Willett
129
79
0
23 Nov 2020
You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation
A. Laptev
Roman Korostik
A. Svischev
A. Andrusenko
Ivan Medennikov
S. Rybin
81
61
0
14 May 2020
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior
Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuan Cao
Heiga Zen
Andrew Rosenberg
Bhuvana Ramabhadran
Yonghui Wu
DiffM
98
93
0
06 Feb 2020
Training Keyword Spotters with Limited and Synthesized Speech Data
James Lin
Kevin Kilgour
Dominik Roblek
Matthew Sharifi
63
58
0
31 Jan 2020
Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems
Nick Rossenbach
Albert Zeyer
Ralf Schluter
Hermann Ney
95
84
0
19 Dec 2019
Speech Recognition with Augmented Synthesized Speech
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Ye Jia
Pedro J. Moreno
Yonghui Wu
Zelin Wu
67
128
0
25 Sep 2019
Synthetic Data for Deep Learning
Sergey I. Nikolenko
149
358
0
25 Sep 2019