ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.02417
  4. Cited By
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic
  Speech Recognition

Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition

4 January 2024
David M. Chan
Shalini Ghosh
Hitesh Tulsiani
Ariya Rastrow
Björn Hoffmeister
ArXivPDFHTML

Papers citing "Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition"

12 / 12 papers shown
Title
SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented
  Dialogue Agents
SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents
Shuzheng Si
Wen-Cheng Ma
Haoyu Gao
Yuchuan Wu
Ting-En Lin
Yinpei Dai
Hangyu Li
Rui Yan
Fei Huang
Yongbin Li
AuLLM
73
31
0
22 May 2023
Contextual Adapters for Personalized Speech Recognition in Neural
  Transducers
Contextual Adapters for Personalized Speech Recognition in Neural Transducers
Kanthashree Mysore Sathyendra
Thejaswi Muniyappa
Feng-Ju Chang
Jing Liu
Jinru Su
Grant P. Strimel
Athanasios Mouchtaris
Siegfried Kunzmann
29
75
0
26 May 2022
Content-Context Factorized Representations for Automated Speech
  Recognition
Content-Context Factorized Representations for Automated Speech Recognition
David M. Chan
Shalini Ghosh
38
11
0
19 May 2022
Unified Modeling of Multi-Domain Multi-Device ASR Systems
Soumyajit Mitra
Swayambhu Nath Ray
Bharat Padi
Arunasish Sen
Raghavendra Bilgi
Harish Arsikere
Shalini Ghosh
A. Srinivasamurthy
Sri Garimella
45
3
0
13 May 2022
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice
  Conversion for everyone
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Edresson Casanova
Julian Weber
C. Shulby
Arnaldo Cândido Júnior
Eren Golge
M. Ponti
200
391
0
04 Dec 2021
Combined Scaling for Zero-shot Transfer Learning
Combined Scaling for Zero-shot Transfer Learning
Hieu H. Pham
Zihang Dai
Golnaz Ghiasi
Kenji Kawaguchi
Hanxiao Liu
...
Yi-Ting Chen
Minh-Thang Luong
Yonghui Wu
Mingxing Tan
Quoc V. Le
VLM
28
197
0
19 Nov 2021
Multi-Modal Pre-Training for Automated Speech Recognition
Multi-Modal Pre-Training for Automated Speech Recognition
David M. Chan
Shalini Ghosh
D. Chakrabarty
Björn Hoffmeister
SSL
30
16
0
12 Oct 2021
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal
  Conversations
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Satwik Kottur
Seungwhan Moon
A. Geramifard
Babak Damavandi
47
88
0
18 Apr 2021
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
101
5,677
0
20 Jun 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
176
3,082
0
16 May 2020
NeMo: a toolkit for building AI applications using Neural Modules
NeMo: a toolkit for building AI applications using Neural Modules
Oleksii Kuchaiev
Jason Chun Lok Li
Huyen Nguyen
Oleksii Hrinchuk
Ryan Leary
...
Jack Cook
P. Castonguay
Mariya Popova
Jocelyn Huang
Jonathan M. Cohen
221
300
0
14 Sep 2019
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for
  Task-Oriented Dialogue Modelling
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
Paweł Budzianowski
Tsung-Hsien Wen
Bo-Hsiang Tseng
I. Casanueva
Stefan Ultes
Osman Ramadan
Milica Gasic
114
1,306
0
29 Sep 2018
1