Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech

14 May 2022
Joonas Kalda
Tanel Alumäe

Papers citing "Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech"

9 / 9 papers shown
Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation
Juan Manuel Coria
H. Bredin
Sahar Ghannay
Sophie Rosset
71
30
0
14 Sep 2021
DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding
Neil Zeghidour
O. Teboul
David Grangier
52
13
0
28 May 2021
End-to-end speaker segmentation for overlap-aware resegmentation
H. Bredin
Antoine Laurent
332
168
0
08 Apr 2021
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks
Federico Landini
Jan Profant
Mireia Díez
L. Burget
257
207
0
29 Dec 2020
CN-CELEB: a challenging Chinese speaker recognition dataset
Yue Fan
Jiawen Kang
Lantian Li
Keliang Li
Haolin Chen
Sitong Cheng
Pengyuan Zhang
Ziya Zhou
Yunqi Cai
Dong Wang
72
205
0
31 Oct 2019
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Kenji Nagamatsu
Shinji Watanabe
192
252
0
12 Sep 2019
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
356
2,279
0
14 Jun 2018
VoxCeleb: a large-scale speaker identification dataset
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
127
2,274
0
26 Jun 2017
MUSAN: A Music, Speech, and Noise Corpus
David Snyder
Guoguo Chen
Daniel Povey
81
1,350
0
28 Oct 2015