ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.14036
  4. Cited By
Text-only domain adaptation for end-to-end ASR using integrated
  text-to-mel-spectrogram generator

Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator

27 February 2023
Vladimir Bataev
Roman Korostik
Evgeny Shabalin
Vitaly Lavrukhin
Boris Ginsburg
    VLM
ArXivPDFHTML

Papers citing "Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator"

12 / 12 papers shown
Title
Effective Text Adaptation for LLM-based ASR through Soft Prompt
  Fine-Tuning
Effective Text Adaptation for LLM-based ASR through Soft Prompt Fine-Tuning
Yingyi Ma
Zhe Liu
Ozlem Kalinli
70
0
0
09 Dec 2024
AMPS: ASR with Multimodal Paraphrase Supervision
AMPS: ASR with Multimodal Paraphrase Supervision
Amruta Parulekar
Abhishek Gupta
Sameep Chattopadhyay
P. Jyothi
75
0
0
27 Nov 2024
Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot
  TTS and LLM
Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM
Jiawei Yu
Yongqian Li
Xiaosong Qiao
Huan Zhao
Xiaofeng Zhao
Wei Tang
M. Zhang
Hao Yang
Jinsong Su
80
0
0
20 Nov 2024
Parameter-efficient Adaptation of Multilingual Multimodal Models for
  Low-resource ASR
Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR
Abhishek Gupta
Amruta Parulekar
Sameep Chattopadhyay
P. Jyothi
VLM
33
0
0
17 Oct 2024
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech
  Recognition
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition
Hsuan Su
Hua Farn
Fan-Yun Sun
Shang-Tse Chen
Hung-yi Lee
MoMe
31
2
0
05 Jun 2024
Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR
  Customization
Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR Customization
Alexandra Antonova
33
0
0
29 Sep 2023
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large
  Language Models
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models
Hsuan Su
Ting-Yao Hu
H. Koppula
Raviteja Vemulapalli
Jen-Hao Rick Chang
Karren D. Yang
G. Mantena
Oncel Tuzel
SyDa
41
1
0
18 Sep 2023
Text Injection for Capitalization and Turn-Taking Prediction in Speech
  Models
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Shaan Bijwadia
Shuo-yiin Chang
Weiran Wang
Zhong Meng
Hao Zhang
Tara N. Sainath
24
1
0
14 Aug 2023
Text-only Domain Adaptation using Unified Speech-Text Representation in
  Transducer
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer
Lu Huang
Yangqiu Song
Jun Zhang
Lu Lu
Zejun Ma
29
2
0
07 Jun 2023
Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition
Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition
Ting-Yao Hu
Mohammadreza Armandpour
A. Shrivastava
Jen-Hao Rick Chang
H. Koppula
Oncel Tuzel
SyDa
52
42
0
21 Oct 2021
CTC Variations Through New WFST Topologies
CTC Variations Through New WFST Topologies
A. Laptev
Somshubra Majumdar
Boris Ginsburg
34
20
0
06 Oct 2021
NeMo: a toolkit for building AI applications using Neural Modules
NeMo: a toolkit for building AI applications using Neural Modules
Oleksii Kuchaiev
Jason Chun Lok Li
Huyen Nguyen
Oleksii Hrinchuk
Ryan Leary
...
Jack Cook
P. Castonguay
Mariya Popova
Jocelyn Huang
Jonathan M. Cohen
208
292
0
14 Sep 2019
1