ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.13802
  4. Cited By
DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding

DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding

28 May 2021
Neil Zeghidour
O. Teboul
David Grangier
ArXivPDFHTML

Papers citing "DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding"

22 / 22 papers shown
Title
A Review of Speaker Diarization: Recent Advances with Deep Learning
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
317
331
0
24 Jan 2021
End-to-End Speaker Diarization as Post-Processing
End-to-End Speaker Diarization as Post-Processing
Shota Horiguchi
Leibny Paola García-Perera
Yusuke Fujita
Shinji Watanabe
Kenji Nagamatsu
70
42
0
18 Dec 2020
Integrating end-to-end neural and clustering-based diarization: Getting
  the best of both worlds
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds
K. Kinoshita
Marc Delcroix
Naohiro Tawara
47
81
0
26 Oct 2020
Online End-to-End Neural Diarization with Speaker-Tracing Buffer
Online End-to-End Neural Diarization with Speaker-Tracing Buffer
Yawen Xue
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Kenji Nagamatsu
31
45
0
04 Jun 2020
Neural Speaker Diarization with Speaker-Wise Chain Rule
Neural Speaker Diarization with Speaker-Wise Chain Rule
Yusuke Fujita
Shinji Watanabe
Shota Horiguchi
Yawen Xue
Jing Shi
Kenji Nagamatsu
58
45
0
02 Jun 2020
End-to-End Speaker Diarization for an Unknown Number of Speakers with
  Encoder-Decoder Based Attractors
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Kenji Nagamatsu
118
189
0
20 May 2020
CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for
  Unsegmented Recordings
CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Shinji Watanabe
Michael I. Mandel
Jon Barker
Emmanuel Vincent
Ashish Arora
...
Emmanuel Vincent
Shota Horiguchi
Naoyuki Kanda
Takuya Yoshioka
Neville Ryant
50
306
0
20 Apr 2020
End-to-End Neural Diarization: Reformulating Speaker Diarization as
  Simple Multi-label Classification
End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-label Classification
Yusuke Fujita
Shinji Watanabe
Shota Horiguchi
Yawen Xue
Kenji Nagamatsu
42
49
0
24 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
80
263
0
20 Feb 2020
Discriminative Neural Clustering for Speaker Diarisation
Discriminative Neural Clustering for Speaker Diarisation
Qiujia Li
Florian Kreyssig
Chao Zhang
P. Woodland
30
45
0
22 Oct 2019
End-to-End Neural Speaker Diarization with Self-attention
End-to-End Neural Speaker Diarization with Self-attention
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
217
239
0
13 Sep 2019
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Kenji Nagamatsu
Shinji Watanabe
190
251
0
12 Sep 2019
LSTM based Similarity Measurement with Spectral Clustering for Speaker
  Diarization
LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization
Qingjian Lin
Ruiqing Yin
Ming Li
H. Bredin
C. Barras
47
91
0
23 Jul 2019
Fully Supervised Speaker Diarization
Fully Supervised Speaker Diarization
Aonan Zhang
Quan Wang
Zhenyao Zhu
John Paisley
Chong-Jun Wang
BDL
61
218
0
10 Oct 2018
Speaker Diarization with LSTM
Speaker Diarization with LSTM
Quan Wang
Carlton Downey
Li Wan
Philip Mansfield
Ignacio López Moreno
52
316
0
28 Oct 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
682
131,414
0
12 Jun 2017
WaveNet: A Generative Model for Raw Audio
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
393
7,389
0
12 Sep 2016
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
399
10,481
0
21 Jul 2016
Permutation Invariant Training of Deep Models for Speaker-Independent
  Multi-talker Speech Separation
Permutation Invariant Training of Deep Models for Speaker-Independent Multi-talker Speech Separation
Dong Yu
Morten Kolbæk
Zheng-Hua Tan
Jesper Jensen
89
854
0
01 Jul 2016
MUSAN: A Music, Speech, and Noise Corpus
MUSAN: A Music, Speech, and Noise Corpus
David Snyder
Guoguo Chen
Daniel Povey
78
1,347
0
28 Oct 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on
  ImageNet Classification
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
VLM
320
18,609
0
06 Feb 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.7K
150,006
0
22 Dec 2014
1