Self-supervised object detection from audio-visual correspondence

Self-supervised object detection from audio-visual correspondence

13 April 2021

Triantafyllos Afouras

Francois Fagan

Andrea Vedaldi

Papers citing "Self-supervised object detection from audio-visual correspondence"

15 / 15 papers shown

Title
Passive Deepfake Detection Across Multi-modalities: A Comprehensive Survey Hong-Hanh Nguyen-Le Van-Tuan Tran Dinh-Thuc Nguyen Nhien-An Le-Khac AAML 110 1 0 26 Nov 2024
Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models David Kurzendörfer Otniel-Bogdan Mercea A. Sophia Koepke Zeynep Akata VLM CLIP 33 2 0 09 Apr 2024
POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images Antonín Vobecký Oriane Siméoni David Hurych Spyros Gidaris Andrei Bursuc Patrick Pérez Josef Sivic 40 33 0 17 Jan 2024
Cross-modal Cognitive Consensus guided Audio-Visual Segmentation Zhaofeng Shi Qingbo Wu Fanman Meng Linfeng Xu Hongliang Li VOS 30 3 0 10 Oct 2023
Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research Davide Berghi M. Volino Philip J. B. Jackson VGen 19 6 0 04 Dec 2022
Egocentric Audio-Visual Noise Suppression Roshan S. Sharma Weipeng He Ju Lin Egor Lakomkin Yang Liu Kaustubh Kalgaonkar EgoV 24 1 0 07 Nov 2022
Temporal and cross-modal attention for audio-visual zero-shot learning Otniel-Bogdan Mercea Thomas Hummel A. Sophia Koepke Zeynep Akata 38 25 0 20 Jul 2022
Learning Music-Dance Representations through Explicit-Implicit Rhythm Synchronization Jiashuo Yu Junfu Pu Ying Cheng Rui Feng Ying Shan 21 5 0 07 Jul 2022
ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound Yan-Bo Lin Jie Lei Joey Tianyi Zhou Gedas Bertasius 41 39 0 06 Apr 2022
Self-Supervised Moving Vehicle Detection from Audio-Visual Cues Jannik Zürn Wolfram Burgard SSL 31 8 0 30 Jan 2022
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking Yidi Li Hong Liu Hao Tang 17 20 0 14 Dec 2021
Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals Wouter Van Gansbeke Simon Vandenhende Stamatios Georgoulis Luc Van Gool SSL 188 250 0 11 Feb 2021
UFO $^2$ : A Unified Framework towards Omni-supervised Object Detection Zhongzheng Ren Zhiding Yu Xiaodong Yang Xuan Li A. Schwing Jan Kautz ObjD 193 35 0 21 Oct 2020
Self-supervised Co-training for Video Representation Learning Tengda Han Weidi Xie Andrew Zisserman SSL 215 309 0 19 Oct 2020
Confidence Regularized Self-Training Yang Zou Zhiding Yu Xiaofeng Liu B. Kumar Jinsong Wang 233 789 0 26 Aug 2019