Title |
---|
![]() Large Language Model Based Generative Error Correction: A Challenge and
Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition Chao-Han Huck Yang Taejin Park Yuan Gong Yuanchao Li Zhehuai Chen ...E. Chng Peter Bell Catherine Lai Shinji Watanabe A. Stolcke |
![]() Sortformer: Seamless Integration of Speaker Diarization and ASR by
Bridging Timestamps and Tokens Taejin Park Ivan Medennikov Kunal Dhawan Weiqing Wang He Huang Nithin Rao Koluguri Krishna C. Puvvada Jagadeesh Balam Boris Ginsburg |
![]() Resource-Efficient Adaptation of Speech Foundation Models for
Multi-Speaker ASR Weiqing Wang Kunal Dhawan Taejin Park Krishna C. Puvvada Ivan Medennikov Somshubra Majumdar He Huang Jagadeesh Balam Boris Ginsburg |