2206.13390
A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!
20 June 2022
Chenglizhao Chen
Mengke Song
Wenfeng Song
Li Guo
Muwei Jian
Papers citing
"A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!"
22 / 22 papers shown
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
66
2
0
12 Sep 2024
Depth-Cooperated Trimodal Network for Video Salient Object Detection
Yukang Lu
Dingyao Min
Keren Fu
Qijun Zhao
MDE
37
13
0
12 Feb 2022
Full-Duplex Strategy for Video Object Segmentation
Ge-Peng Ji
Deng-Ping Fan
Keren Fu
Zhe Wu
Jianbing Shen
Ling Shao
VOS
97
131
0
06 Aug 2021
Localizing Visual Sounds the Hard Way
Honglie Chen
Weidi Xie
Triantafyllos Afouras
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
ObjD
29
185
0
06 Apr 2021
Unsupervised Sound Localization via Iterative Contrastive Learning
Yan-Bo Lin
Hung-Yu Tseng
Hsin-Ying Lee
Yen-Yu Lin
Ming-Hsuan Yang
SSL
55
35
0
01 Apr 2021
Themes Informed Audio-visual Correspondence Learning
Runze Su
Fei Tao
Xudong Liu
Haoran Wei
Xiaorong Mei
Z. Duan
Lei Yuan
Ji Liu
Yuying Xie
24
5
0
14 Sep 2020
Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning
Ying Cheng
Ruize Wang
Zhihao Pan
Rui Feng
Yuejie Zhang
SSL
100
107
0
13 Aug 2020
Audio-Visual Instance Discrimination with Cross-Modal Agreement
Pedro Morgado
Nuno Vasconcelos
Ishan Misra
SSL
52
271
0
27 Apr 2020
Unified Image and Video Saliency Modeling
Richard Droste
Jianbo Jiao
J. A. Noble
72
157
0
11 Mar 2020
Focus on Semantic Consistency for Cross-domain Crowd Understanding
Tao Han
Junyu Gao
Yuan Yuan
Qi Wang
26
45
0
20 Feb 2020
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
49
157
0
14 Jan 2020
STAViS: Spatio-Temporal AudioVisual Saliency Network
A. Tsiami
Petros Koutras
Petros Maragos
35
73
0
09 Jan 2020
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
45
251
0
10 Dec 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Humam Alwassel
D. Mahajan
Bruno Korbar
Lorenzo Torresani
Bernard Ghanem
Du Tran
SSL
51
429
0
28 Nov 2019
Self-supervised Moving Vehicle Tracking with Stereo Sound
Chuang Gan
Hang Zhao
Peihao Chen
David D. Cox
Antonio Torralba
21
147
0
25 Oct 2019
The Sound of Motions
Hang Zhao
Chuang Gan
Wei-Chiu Ma
Antonio Torralba
46
252
0
11 Apr 2019
Multi-source weak supervision for saliency detection
Yu Zeng
Yunzhi Zhuge
Huchuan Lu
Lulu Zhang
Mingyang Qian
Yizhou Yu
39
169
0
01 Apr 2019
Revisiting Video Saliency: A Large-scale Benchmark and a New Model
Wenguan Wang
Jianbing Shen
Fang Guo
Ming-Ming Cheng
Ali Borji
VLM
25
264
0
23 Jan 2018
Objects that Sound
Relja Arandjelović
Andrew Zisserman
ObjD
VOS
59
529
0
18 Dec 2017
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
60
2,890
0
26 May 2017
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model
Marcella Cornia
Lorenzo Baraldi
G. Serra
Rita Cucchiara
55
549
0
29 Nov 2016
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
440
7,952
0
13 Jun 2015