ResearchTrend.AI
arXiv:1706.00079
Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers

31 May 2017
K. Hoover
Sourish Chaudhuri
C. Pantofaru
M. Slaney
Ian Sturdy
    CVBM

Papers citing "Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers"

3 / 3 papers shown
A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!
Chenglizhao Chen
Mengke Song
Wenfeng Song
Li Guo
Muwei Jian
35
25
0
20 Jun 2022
Look Who's Talking: Active Speaker Detection in the Wild
You Jin Kim
Hee-Soo Heo
Soyeon Choe
Soo-Whan Chung
Yoohwan Kwon
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
41
20
0
17 Aug 2021
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
31
156
0
14 Jan 2020