ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.06482
  4. Cited By
Augmenting Transformer-Transducer Based Speaker Change Detection With
  Token-Level Training Loss

Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

11 November 2022
Guanlong Zhao
Quan Wang
Han Lu
Yiling Huang
Ignacio López Moreno
ArXivPDFHTML

Papers citing "Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss"

11 / 11 papers shown
Title
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Siddhant Arora
Zhiyun Lu
Chung-Cheng Chiu
Ruoming Pang
Shinji Watanabe
43
2
0
03 Mar 2025
Leveraging Speaker Embeddings in End-to-End Neural Diarization for
  Two-Speaker Scenarios
Leveraging Speaker Embeddings in End-to-End Neural Diarization for Two-Speaker Scenarios
Juan Ignacio Alvarez-Trejos
Beltrán Labrador
Alicia Lozano-Diez
35
1
0
01 Jul 2024
SCDNet: Self-supervised Learning Feature-based Speaker Change Detection
SCDNet: Self-supervised Learning Feature-based Speaker Change Detection
Yue Li
Xinsheng Wang
Li Zhang
Lei Xie
42
1
0
12 Jun 2024
LLM-based speaker diarization correction: A generalizable approach
LLM-based speaker diarization correction: A generalizable approach
Georgios Efstathiadis
Vijay Yadav
Anzar Abbas
45
3
0
07 Jun 2024
DiarizationLM: Speaker Diarization Post-Processing with Large Language
  Models
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
Quan Wang
Yiling Huang
Guanlong Zhao
Evan Clark
Wei Xia
Hank Liao
AuLLM
30
8
0
07 Jan 2024
Improved Long-Form Speech Recognition by Jointly Modeling the Primary
  and Non-primary Speakers
Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers
Guru Prakash Arumugam
Shuo-yiin Chang
Tara N. Sainath
Rohit Prabhavalkar
Quan Wang
Shaan Bijwadia
21
3
0
18 Dec 2023
Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary
  Network
Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network
Yiling Huang
Weiran Wang
Guanlong Zhao
Hank Liao
Wei Xia
Quan Wang
24
4
0
15 Sep 2023
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained
  Foundation Models
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models
Guanlong Zhao
Yongqiang Wang
Jason W. Pelecanos
Yu Zhang
Hank Liao
Yiling Huang
Han Lu
Quan Wang
19
4
0
14 Sep 2023
Encoder-decoder multimodal speaker change detection
Encoder-decoder multimodal speaker change detection
Jee-weon Jung
Soonshin Seo
Hee-Soo Heo
Geon-min Kim
You Jin Kim
Youngki Kwon
Min-Ji Lee
Bong-Jin Lee
37
2
0
01 Jun 2023
Highly Efficient Real-Time Streaming and Fully On-Device Speaker
  Diarization with Multi-Stage Clustering
Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Quan Wang
Yiling Huang
Han Lu
Guanlong Zhao
Ignacio López Moreno
27
11
0
25 Oct 2022
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer
  Transducer Speaker Turn Detection
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection
Wei Xia
Han Lu
Quan Wang
Anshuman Tripathi
Yiling Huang
Ignacio López Moreno
Hasim Sak
41
51
0
23 Sep 2021
1