Multi-Speaker and Wide-Band Simulated Conversations as Training Data for
End-to-End Neural Diarization

12 November 2022

Federico Landini

Alicia Lozano-Diez

Papers citing "Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization"

Title
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization Natsuo Yamashita Shota Horiguchi Takeshi Homma 51 17 0 24 Apr 2022
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization Federico Landini Alicia Lozano-Diez Mireia Díez Lukáš Burget 41 36 0 02 Apr 2022
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge Naijun Zheng Na Li Xixin Wu Lingwei Meng Jiawen Kang Haibin Wu Chao Weng Dan Su Helen Meng 45 10 0 04 Feb 2022
End-to-end Neural Diarization: From Transformer to Conformer Yi Y. Liu Eunjung Han Chul Lee A. Stolcke 99 40 0 14 Jun 2021
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio Guoguo Chen Shuzhou Chai Guan-Bo Wang Jiayu Du Weiqiang Zhang ... Xuchen Yao Yongqing Wang Yujun Wang Zhao You Zhiyong Yan 86 360 0 13 Jun 2021
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings Soumi Maiti Hakan Erdogan K. Wilson Scott Wisdom Shinji Watanabe J. Hershey 41 22 0 05 May 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation Changhan Wang M. Rivière Ann Lee Anne Wu Chaitanya Talnikar Daniel Haziza Mary Williamson J. Pino Emmanuel Dupoux SSL 64 477 0 02 Jan 2021
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks Federico Landini Jan Profant Mireia Díez L. Burget 239 205 0 29 Dec 2020
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds K. Kinoshita Marc Delcroix Naohiro Tawara 27 81 0 26 Oct 2020
Spot the conversation: speaker diarisation in the wild Joon Son Chung Jaesung Huh Arsha Nagrani Triantafyllos Afouras Andrew Zisserman VGen 47 144 0 02 Jul 2020
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors Shota Horiguchi Yusuke Fujita Shinji Watanabe Yawen Xue Kenji Nagamatsu 109 189 0 20 May 2020
End-to-End Neural Speaker Diarization with Permutation-Free Objectives Yusuke Fujita Naoyuki Kanda Shota Horiguchi Kenji Nagamatsu Shinji Watanabe 186 248 0 12 Sep 2019
VoxCeleb2: Deep Speaker Recognition Joon Son Chung Arsha Nagrani Andrew Zisserman 344 2,261 0 14 Jun 2018
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 506 129,831 0 12 Jun 2017
MUSAN: A Music, Speech, and Noise Corpus David Snyder Guoguo Chen Daniel Povey 61 1,345 0 28 Oct 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 1.1K 149,474 0 22 Dec 2014