ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.11123
  4. Cited By
Improved Long-Form Speech Recognition by Jointly Modeling the Primary
  and Non-primary Speakers

Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers

18 December 2023
Guru Prakash Arumugam
Shuo-yiin Chang
Tara N. Sainath
Rohit Prabhavalkar
Quan Wang
Shaan Bijwadia
ArXivPDFHTML

Papers citing "Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers"

3 / 3 papers shown
Title
Input Length Matters: Improving RNN-T and MWER Training for Long-form
  Telephony Speech Recognition
Input Length Matters: Improving RNN-T and MWER Training for Long-form Telephony Speech Recognition
Zhiyun Lu
Yanwei Pan
Thibault Doutre
Parisa Haghani
Liangliang Cao
Rohit Prabhavalkar
C. Zhang
Trevor Strohman
AuLLM
83
14
0
08 Oct 2021
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer
  Transducer Speaker Turn Detection
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection
Wei Xia
Han Lu
Quan Wang
Anshuman Tripathi
Yiling Huang
Ignacio López Moreno
Hasim Sak
41
51
0
23 Sep 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
274
327
0
24 Jan 2021
1