A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!

20 June 2022

Chenglizhao Chen

Papers citing "A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!"

Title
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization Ling Xing Hongyu Qu Rui Yan Xiangbo Shu Jinhui Tang 66 2 0 12 Sep 2024
Depth-Cooperated Trimodal Network for Video Salient Object Detection Yukang Lu Dingyao Min Keren Fu Qijun Zhao MDE 37 13 0 12 Feb 2022
Full-Duplex Strategy for Video Object Segmentation Ge-Peng Ji Deng-Ping Fan Keren Fu Zhe Wu Jianbing Shen Ling Shao VOS 97 131 0 06 Aug 2021
Localizing Visual Sounds the Hard Way Honglie Chen Weidi Xie Triantafyllos Afouras Arsha Nagrani Andrea Vedaldi Andrew Zisserman ObjD 29 185 0 06 Apr 2021
Unsupervised Sound Localization via Iterative Contrastive Learning Yan-Bo Lin Hung-Yu Tseng Hsin-Ying Lee Yen-Yu Lin Ming-Hsuan Yang SSL 55 35 0 01 Apr 2021
Themes Informed Audio-visual Correspondence Learning Runze Su Fei Tao Xudong Liu Haoran Wei Xiaorong Mei Z. Duan Lei Yuan Ji Liu Yuying Xie 24 5 0 14 Sep 2020
Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning Ying Cheng Ruize Wang Zhihao Pan Rui Feng Yuejie Zhang SSL 100 107 0 13 Aug 2020
Audio-Visual Instance Discrimination with Cross-Modal Agreement Pedro Morgado Nuno Vasconcelos Ishan Misra SSL 52 271 0 27 Apr 2020
Unified Image and Video Saliency Modeling Richard Droste Jianbo Jiao J. A. Noble 72 157 0 11 Mar 2020
Focus on Semantic Consistency for Cross-domain Crowd Understanding Tao Han Junyu Gao Yuan Yuan Qi Wang 26 45 0 20 Feb 2020
Deep Audio-Visual Learning: A Survey Hao Zhu Mandi Luo Rui Wang A. Zheng Ran He 49 157 0 14 Jan 2020
STAViS: Spatio-Temporal AudioVisual Saliency Network A. Tsiami Petros Koutras Petros Maragos 35 73 0 09 Jan 2020
Listen to Look: Action Recognition by Previewing Audio Ruohan Gao Tae-Hyun Oh Kristen Grauman Lorenzo Torresani VLM 45 251 0 10 Dec 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering Humam Alwassel D. Mahajan Bruno Korbar Lorenzo Torresani Guohao Li Du Tran SSL 51 429 0 28 Nov 2019
Self-supervised Moving Vehicle Tracking with Stereo Sound Chuang Gan Hang Zhao Peihao Chen David D. Cox Antonio Torralba 21 147 0 25 Oct 2019
The Sound of Motions Hang Zhao Chuang Gan Wei-Chiu Ma Antonio Torralba 46 252 0 11 Apr 2019
Multi-source weak supervision for saliency detection Yu Zeng Yunzhi Zhuge Huchuan Lu Lulu Zhang Mingyang Qian Yizhou Yu 39 169 0 01 Apr 2019
Revisiting Video Saliency: A Large-scale Benchmark and a New Model Wenguan Wang Jianbing Shen Fang Guo Ming-Ming Cheng Ali Borji VLM 25 264 0 23 Jan 2018
Objects that Sound Relja Arandjelović Andrew Zisserman ObjD VOS 59 529 0 18 Dec 2017
Multimodal Machine Learning: A Survey and Taxonomy T. Baltrušaitis Chaitanya Ahuja Louis-Philippe Morency 60 2,890 0 26 May 2017
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model Marcella Cornia Lorenzo Baraldi G. Serra Rita Cucchiara 55 549 0 29 Nov 2016
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting Xingjian Shi Zhourong Chen Hao Wang Dit-Yan Yeung W. Wong W. Woo 440 7,952 0 13 Jun 2015