ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.11368
  4. Cited By
Audio visual character profiles for detecting background characters in
  entertainment media

Audio visual character profiles for detecting background characters in entertainment media

21 March 2022
Rahul Sharma
Shrikanth Narayanan
ArXiv (abs)PDFHTML

Papers citing "Audio visual character profiles for detecting background characters in entertainment media"

10 / 10 papers shown
Title
MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
Alejandro Pardo
Fabian Caba Heilbron
Juan Carlos León Alcázar
Ali K. Thabet
Guohao Li
VGen
68
28
0
12 Sep 2021
Face, Body, Voice: Video Person-Clustering with Multiple Modalities
Face, Body, Voice: Video Person-Clustering with Multiple Modalities
Andrew Brown
Vicky Kalogeiton
Andrew Zisserman
CVBM
61
30
0
20 May 2021
Cross modal video representations for weakly supervised active speaker
  localization
Cross modal video representations for weakly supervised active speaker localization
Rahul Sharma
Krishna Somandepalli
Shrikanth Narayanan
46
8
0
09 Mar 2020
pyannote.audio: neural building blocks for speaker diarization
pyannote.audio: neural building blocks for speaker diarization
H. Bredin
Ruiqing Yin
Juan Manuel Coria
G. Gelly
Pavel Korshunov
Marvin Lavechin
D. Fustes
Hadrien Titeux
Wassim Bouaziz
Marie-Philippe Gill
229
325
0
04 Nov 2019
AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection
AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection
Joseph Roth
Sourish Chaudhuri
Ondˇrej Klejch
Radhika Marvin
Andrew C. Gallagher
...
S. Ramaswamy
Arkadiusz Stopczynski
Cordelia Schmid
Zhonghua Xi
C. Pantofaru
55
144
0
05 Jan 2019
VoxCeleb2: Deep Speaker Recognition
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
353
2,279
0
14 Jun 2018
Generalized End-to-End Loss for Speaker Verification
Generalized End-to-End Loss for Speaker Verification
Li Wan
Quan Wang
Alan Papir
Ignacio López Moreno
VLM
68
927
0
28 Oct 2017
VGGFace2: A dataset for recognising faces across pose and age
VGGFace2: A dataset for recognising faces across pose and age
Qiong Cao
Li Shen
Weidi Xie
Omkar M. Parkhi
Andrew Zisserman
CVBM
95
2,630
0
23 Oct 2017
Learning Deep Features for Discriminative Localization
Learning Deep Features for Discriminative Localization
Bolei Zhou
A. Khosla
Àgata Lapedriza
A. Oliva
Antonio Torralba
SSLSSegFAtt
250
9,326
0
14 Dec 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
373
13,161
0
12 Mar 2015
1