ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.12831
  4. Cited By
Target Active Speaker Detection with Audio-visual Cues

Target Active Speaker Detection with Audio-visual Cues

22 May 2023
Yiding Jiang
Ruijie Tao
Zexu Pan
Haizhou Li
ArXivPDFHTML

Papers citing "Target Active Speaker Detection with Audio-visual Cues"

10 / 10 papers shown
Title
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker
  Verification
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification
Tianchi Liu
Kong Aik Lee
Qiongqiong Wang
Haizhou Li
VLM
68
13
0
06 Dec 2023
TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive
  Learning
TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning
Chaeyoung Jung
Suyeon Lee
Kihyun Nam
Kyeongha Rho
You Jin Kim
Youngjoon Jang
Joon Son Chung
15
9
0
21 Sep 2023
Audio-Visual Active Speaker Extraction for Sparsely Overlapped
  Multi-talker Speech
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech
Jun Yu Li
Ruijie Tao
Zexu Pan
Meng Ge
Shuai Wang
Haizhou Li
24
5
0
15 Sep 2023
Attention-based Encoder-Decoder End-to-End Neural Diarization with
  Embedding Enhancer
Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer
Zhengyang Chen
Bing Han
Shuai Wang
Yan-min Qian
24
18
0
13 Sep 2023
EEG-Derived Voice Signature for Attended Speaker Detection
EEG-Derived Voice Signature for Attended Speaker Detection
Hongxu Zhu
Siqi Cai
Yidi Jiang
Qiquan Zhang
Haizhou Li
16
0
0
28 Aug 2023
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker
  Diarization with Target Speaker Attractor
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor
Zhengyang Chen
Bing Han
Shuai Wang
Yan-min Qian
32
15
0
18 May 2023
A Light Weight Model for Active Speaker Detection
A Light Weight Model for Active Speaker Detection
Junhua Liao
Haihan Duan
Kanghui Feng
Wanbing Zhao
Yanbing Yang
Liangyin Chen
30
36
0
08 Mar 2023
LoCoNet: Long-Short Context Network for Active Speaker Detection
LoCoNet: Long-Short Context Network for Active Speaker Detection
Xizi Wang
Feng Cheng
Gedas Bertasius
David J. Crandall
24
15
0
19 Jan 2023
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse
  Positive Pairs
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Ruijie Tao
Kong Aik Lee
Rohan Kumar Das
Ville Hautamaki
Haizhou Li
SSL
27
8
0
27 Oct 2022
MAAS: Multi-modal Assignation for Active Speaker Detection
MAAS: Multi-modal Assignation for Active Speaker Detection
Juan Carlos León Alcázar
Fabian Caba Heilbron
Ali K. Thabet
Bernard Ghanem
59
51
0
11 Jan 2021
1