ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.05952
  4. Cited By
End-to-End Neural Speaker Diarization with Permutation-Free Objectives

End-to-End Neural Speaker Diarization with Permutation-Free Objectives

12 September 2019
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Kenji Nagamatsu
Shinji Watanabe
ArXivPDFHTML

Papers citing "End-to-End Neural Speaker Diarization with Permutation-Free Objectives"

50 / 55 papers shown
Title
USED: Universal Speaker Extraction and Diarization
USED: Universal Speaker Extraction and Diarization
Junyi Ao
Mehmet Sinan Yildirim
Ruijie Tao
Mengyao Ge
Shuai Wang
Yan-min Qian
Haizhou Li
41
5
0
17 Jan 2025
Language-based Audio Moment Retrieval
Language-based Audio Moment Retrieval
Hokuto Munakata
Taichi Nishimura
Shota Nakada
Tatsuya Komatsu
38
1
0
24 Sep 2024
An Investigation Into Explainable Audio Hate Speech Detection
An Investigation Into Explainable Audio Hate Speech Detection
Jinmyeong An
Wonjun Lee
Yejin Jeon
Jungseul Ok
Yunsu Kim
Gary Geunbae Lee
28
2
0
12 Aug 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep
  Speaker Representation Learning
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
39
4
0
21 Jul 2024
AG-LSEC: Audio Grounded Lexical Speaker Error Correction
AG-LSEC: Audio Grounded Lexical Speaker Error Correction
Rohit Paturi
Xiang Li
S. Srinivasan
33
1
0
25 Jun 2024
SCDNet: Self-supervised Learning Feature-based Speaker Change Detection
SCDNet: Self-supervised Learning Feature-based Speaker Change Detection
Yue Li
Xinsheng Wang
Li Zhang
Lei Xie
45
1
0
12 Jun 2024
LLM-based speaker diarization correction: A generalizable approach
LLM-based speaker diarization correction: A generalizable approach
Georgios Efstathiadis
Vijay Yadav
Anzar Abbas
45
3
0
07 Jun 2024
Speaker Embeddings With Weakly Supervised Voice Activity Detection For
  Efficient Speaker Diarization
Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization
Jenthe Thienpondt
Kris Demuynck
41
2
0
15 May 2024
A Large-Scale Evaluation of Speech Foundation Models
A Large-Scale Evaluation of Speech Foundation Models
Shu-Wen Yang
Heng-Jui Chang
Zili Huang
Andy T. Liu
Cheng-I Jeff Lai
...
Kushal Lakhotia
Shang-Wen Li
Abdelrahman Mohamed
Shinji Watanabe
Hung-yi Lee
38
19
0
15 Apr 2024
Continuous Target Speech Extraction: Enhancing Personalized Diarization
  and Extraction on Complex Recordings
Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings
He Zhao
Hangting Chen
Jianwei Yu
Yuehai Wang
51
0
0
29 Jan 2024
Fine-Tuned Self-Supervised Speech Representations for Language
  Diarization in Multilingual Code-Switched Speech
Fine-Tuned Self-Supervised Speech Representations for Language Diarization in Multilingual Code-Switched Speech
Geoffrey T. Frost
Emily Morris
Joshua Jansen van Vüren
T. Niesler
28
2
0
15 Dec 2023
End-to-end Online Speaker Diarization with Target Speaker Tracking
End-to-end Online Speaker Diarization with Target Speaker Tracking
Weiqing Wang
Ming Li
33
5
0
12 Oct 2023
Frame-wise streaming end-to-end speaker diarization with
  non-autoregressive self-attention-based attractors
Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors
Di Liang
Nian Shao
Xiaofei Li
33
4
0
25 Sep 2023
Attention-based Encoder-Decoder End-to-End Neural Diarization with
  Embedding Enhancer
Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer
Zhengyang Chen
Bing Han
Shuai Wang
Yan-min Qian
28
18
0
13 Sep 2023
An Experimental Review of Speaker Diarization methods with application
  to Two-Speaker Conversational Telephone Speech recordings
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
A. Brutti
S. Squartini
44
9
0
29 May 2023
Unified Modeling of Multi-Talker Overlapped Speech Recognition and
  Diarization with a Sidecar Separator
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator
Lingwei Meng
Jiawen Kang
Mingyu Cui
Haibin Wu
Xixin Wu
Helen M. Meng
39
10
0
25 May 2023
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker
  Diarization with Target Speaker Attractor
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor
Zhengyang Chen
Bing Han
Shuai Wang
Yan-min Qian
34
15
0
18 May 2023
Neural Diarization with Non-autoregressive Intermediate Attractors
Neural Diarization with Non-autoregressive Intermediate Attractors
Yusuke Fujita
Tatsuya Komatsu
Robin Scheibler
Yusuke Kida
Tetsuji Ogawa
35
11
0
13 Mar 2023
Speaker Overlap-aware Neural Diarization for Multi-party Meeting
  Analysis
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis
Zhihao Du
Shiliang Zhang
Siqi Zheng
Zhijie Yan
24
14
0
18 Nov 2022
Multi-Speaker and Wide-Band Simulated Conversations as Training Data for
  End-to-End Neural Diarization
Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization
Federico Landini
Mireia Díez
Alicia Lozano-Diez
L. Burget
37
15
0
12 Nov 2022
BER: Balanced Error Rate For Speaker Diarization
BER: Balanced Error Rate For Speaker Diarization
Tao Liu
K. Yu
20
4
0
08 Nov 2022
High-resolution embedding extractor for speaker diarisation
High-resolution embedding extractor for speaker diarisation
Hee-Soo Heo
Youngki Kwon
Bong-Jin Lee
You Jin Kim
Jee-weon Jung
29
5
0
08 Nov 2022
DiaCorrect: End-to-end error correction for speaker diarization
Jiangyu Han
Yuhang Cao
Heng Lu
Yanhua Long
42
0
0
31 Oct 2022
In search of strong embedding extractors for speaker diarisation
In search of strong embedding extractors for speaker diarisation
Jee-weon Jung
Hee-Soo Heo
Bong-Jin Lee
Jaesung Huh
A. Brown
Youngki Kwon
Shinji Watanabe
Joon Son Chung
44
16
0
26 Oct 2022
Highly Efficient Real-Time Streaming and Fully On-Device Speaker
  Diarization with Multi-Stage Clustering
Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Quan Wang
Yiling Huang
Han Lu
Guanlong Zhao
Ignacio López Moreno
27
11
0
25 Oct 2022
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT
K. Kinoshita
Thilo von Neumann
Marc Delcroix
Christoph Boeddeker
Reinhold Haeb-Umbach
38
4
0
28 Jul 2022
Sequence-level Speaker Change Detection with Difference-based Continuous
  Integrate-and-fire
Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire
Zhiyun Fan
Linhao Dong
Meng Cai
Zejun Ma
Bo Xu
28
4
0
27 Jun 2022
Online Neural Diarization of Unlimited Numbers of Speakers Using Global
  and Local Attractors
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yuki Takashima
Y. Kawaguchi
39
23
0
06 Jun 2022
PRISM: Pre-trained Indeterminate Speaker Representation Model for
  Speaker Diarization and Speaker Verification
PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification
Siqi Zheng
Hongbin Suo
Qian Chen
35
4
0
16 May 2022
Collar-aware Training for Streaming Speaker Change Detection in
  Broadcast Speech
Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech
Joonas Kalda
Tanel Alumäe
13
3
0
14 May 2022
Self-supervised Speaker Diarization
Self-supervised Speaker Diarization
Yehoshua Dissen
Felix Kreuk
Joseph Keshet
13
4
0
08 Apr 2022
From Simulated Mixtures to Simulated Conversations as Training Data for
  End-to-End Neural Diarization
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization
Federico Landini
Alicia Lozano-Diez
Mireia Díez
Lukávs Burget
25
34
0
02 Apr 2022
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech
  Separation for Flexible Number of Speakers
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers
Soumi Maiti
Yushi Ueda
Shinji Watanabe
Chunlei Zhang
Meng Yu
Shi-Xiong Zhang
Yong-mei Xu
34
32
0
31 Mar 2022
Generation of Speaker Representations Using Heterogeneous Training Batch
  Assembly
Generation of Speaker Representations Using Heterogeneous Training Batch Assembly
Yu-Huai Peng
Hung-Shin Lee
Pin-Tuan Huang
Hsin-Min Wang
16
0
0
30 Mar 2022
Tight integration of neural- and clustering-based diarization through
  deep unfolding of infinite Gaussian mixture model
Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model
K. Kinoshita
Marc Delcroix
Tomoharu Iwata
BDL
22
19
0
14 Feb 2022
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party
  meeting transcription (M2MeT) challenge
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge
Maokui He
Xiang Lv
Weilin Zhou
Jingjing Yin
Xiaoqi Zhang
...
Shutong Niu
Yuhang Cao
Heng Lu
Jun Du
Chin-Hui Lee
46
7
0
10 Feb 2022
End-to-end speaker diarization with transformer
End-to-end speaker diarization with transformer
Yongquan Lai
Xin Tang
Yuanyuan Fu
Rui Fang
31
1
0
14 Dec 2021
Speaker Embedding-aware Neural Diarization for Flexible Number of
  Speakers with Textual Information
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information
Zhihao Du
Shiliang Zhang
Siqi Zheng
Weilong Huang
Ming Lei
BDL
16
1
0
28 Nov 2021
Multiset-Equivariant Set Prediction with Approximate Implicit
  Differentiation
Multiset-Equivariant Set Prediction with Approximate Implicit Differentiation
Yan Zhang
David W. Zhang
Simon Lacoste-Julien
Gertjan J. Burghouts
Cees G. M. Snoek
BDL
37
21
0
23 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech
  Processing
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
110
1,704
0
26 Oct 2021
Multi-Channel End-to-End Neural Diarization with Distributed Microphones
Multi-Channel End-to-End Neural Diarization with Distributed Microphones
Shota Horiguchi
Yuki Takashima
Leibny Paola García-Perera
Shinji Watanabe
Y. Kawaguchi
26
17
0
10 Oct 2021
Location-based training for multi-channel talker-independent speaker
  separation
Location-based training for multi-channel talker-independent speaker separation
H. Taherian
Ke Tan
DeLiang Wang
27
10
0
08 Oct 2021
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer
  Transducer Speaker Turn Detection
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection
Wei Xia
Han Lu
Quan Wang
Anshuman Tripathi
Yiling Huang
Ignacio López Moreno
Hasim Sak
41
51
0
23 Sep 2021
BeamTransformer: Microphone Array-based Overlapping Speech Detection
BeamTransformer: Microphone Array-based Overlapping Speech Detection
Siqi Zheng
Shiliang Zhang
Weilong Huang
Qian Chen
Hongbin Suo
Ming Lei
Jinwei Feng
Zhijie Yan
35
7
0
09 Sep 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Leibny Paola García-Perera
37
64
0
20 Jun 2021
End-to-End Diarization for Variable Number of Speakers with Local-Global
  Networks and Discriminative Speaker Embeddings
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings
Soumi Maiti
Hakan Erdogan
K. Wilson
Scott Wisdom
Shinji Watanabe
J. Hershey
27
21
0
05 May 2021
Combination of Deep Speaker Embeddings for Diarisation
Combination of Deep Speaker Embeddings for Diarisation
Guangzhi Sun
Chao Zhang
P. Woodland
19
20
0
22 Oct 2020
Microsoft Speaker Diarization System for the VoxCeleb Speaker
  Recognition Challenge 2020
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020
Xiong Xiao
Naoyuki Kanda
Zhuo Chen
Tianyan Zhou
Takuya Yoshioka
...
Yu-Huan Wu
Jian Wu
Shujie Liu
Jinyu Li
Jiawei Liu
27
62
0
22 Oct 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
43
20
0
25 Jun 2020
End-to-End Speaker Diarization for an Unknown Number of Speakers with
  Encoder-Decoder Based Attractors
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Kenji Nagamatsu
37
185
0
20 May 2020
12
Next