Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers

31 May 2017

Papers citing "Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers"

3 / 3 papers shown

Title
A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key! Chenglizhao Chen Mengke Song Wenfeng Song Li Guo Muwei Jian 35 25 0 20 Jun 2022
Look Who's Talking: Active Speaker Detection in the Wild You Jin Kim Hee-Soo Heo Soyeon Choe Soo-Whan Chung Yoohwan Kwon Bong-Jin Lee Youngki Kwon Joon Son Chung 41 20 0 17 Aug 2021
Deep Audio-Visual Learning: A Survey Hao Zhu Mandi Luo Rui Wang A. Zheng Ran He 31 156 0 14 Jan 2020