1706.00079
Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers
31 May 2017
K. Hoover
Sourish Chaudhuri
C. Pantofaru
M. Slaney
Ian Sturdy
CVBM
Papers citing "Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers"
3 / 3 papers shown
Title
A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!
Chenglizhao Chen
Mengke Song
Wenfeng Song
Li Guo
Muwei Jian
35
25
0
20 Jun 2022
Look Who's Talking: Active Speaker Detection in the Wild
You Jin Kim
Hee-Soo Heo
Soyeon Choe
Soo-Whan Chung
Yoohwan Kwon
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
41
20
0
17 Aug 2021
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
31
156
0
14 Jan 2020