Using Active Speaker Faces for Diarization in TV shows

30 March 2022
Rahul Sharma
Shrikanth Narayanan
    CVBM

Papers citing "Using Active Speaker Faces for Diarization in TV shows"

9 / 9 papers shown
Character-aware audio-visual subtitling in context
Jaesung Huh
Andrew Zisserman
41
0
0
14 Oct 2024
Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges
Victoria Mingote
Alfonso Ortega
A. Miguel
Eduardo Lleida
30
0
0
09 Sep 2024
Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization
Huan Zhao
Li Lyna Zhang
Yuehong Li
Yannan Wang
Hongji Wang
Wei Rao
Qing Wang
Lei Xie
8
0
0
07 Dec 2023
Speaker Diarization of Scripted Audiovisual Content
Yogesh Virkar
Brian Thompson
Rohit Paturi
S. Srinivasan
Marcello Federico
24
1
0
04 Aug 2023
STHG: Spatial-Temporal Heterogeneous Graph Learning for Advanced Audio-Visual Diarization
Kyle Min
31
5
0
18 Jun 2023
Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection
Rahul Sharma
Shrikanth Narayanan
37
8
0
01 Dec 2022
Unsupervised active speaker detection in media content using cross-modal information
Rahul Sharma
Shrikanth Narayanan
19
3
0
24 Sep 2022
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
274
327
0
24 Jan 2021
Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap
Tae Jin Park
Kyu Jeong Han
Manoj Kumar
Shrikanth Narayanan
128
116
0
05 Mar 2020