ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.13390
  4. Cited By
A Comprehensive Survey on Video Saliency Detection with Auditory
  Information: the Audio-visual Consistency Perceptual is the Key!

A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!

20 June 2022
Chenglizhao Chen
Mengke Song
Wenfeng Song
Li Guo
Muwei Jian
ArXivPDFHTML

Papers citing "A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!"

22 / 22 papers shown
Title
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
66
2
0
12 Sep 2024
Depth-Cooperated Trimodal Network for Video Salient Object Detection
Depth-Cooperated Trimodal Network for Video Salient Object Detection
Yukang Lu
Dingyao Min
Keren Fu
Qijun Zhao
MDE
37
13
0
12 Feb 2022
Full-Duplex Strategy for Video Object Segmentation
Full-Duplex Strategy for Video Object Segmentation
Ge-Peng Ji
Deng-Ping Fan
Keren Fu
Zhe Wu
Jianbing Shen
Ling Shao
VOS
97
131
0
06 Aug 2021
Localizing Visual Sounds the Hard Way
Localizing Visual Sounds the Hard Way
Honglie Chen
Weidi Xie
Triantafyllos Afouras
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
ObjD
29
185
0
06 Apr 2021
Unsupervised Sound Localization via Iterative Contrastive Learning
Unsupervised Sound Localization via Iterative Contrastive Learning
Yan-Bo Lin
Hung-Yu Tseng
Hsin-Ying Lee
Yen-Yu Lin
Ming-Hsuan Yang
SSL
55
35
0
01 Apr 2021
Themes Informed Audio-visual Correspondence Learning
Themes Informed Audio-visual Correspondence Learning
Runze Su
Fei Tao
Xudong Liu
Haoran Wei
Xiaorong Mei
Z. Duan
Lei Yuan
Ji Liu
Yuying Xie
24
5
0
14 Sep 2020
Look, Listen, and Attend: Co-Attention Network for Self-Supervised
  Audio-Visual Representation Learning
Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning
Ying Cheng
Ruize Wang
Zhihao Pan
Rui Feng
Yuejie Zhang
SSL
100
107
0
13 Aug 2020
Audio-Visual Instance Discrimination with Cross-Modal Agreement
Audio-Visual Instance Discrimination with Cross-Modal Agreement
Pedro Morgado
Nuno Vasconcelos
Ishan Misra
SSL
52
271
0
27 Apr 2020
Unified Image and Video Saliency Modeling
Unified Image and Video Saliency Modeling
Richard Droste
Jianbo Jiao
J. A. Noble
72
157
0
11 Mar 2020
Focus on Semantic Consistency for Cross-domain Crowd Understanding
Focus on Semantic Consistency for Cross-domain Crowd Understanding
Tao Han
Junyu Gao
Yuan. Yuan
Qi. Wang
26
45
0
20 Feb 2020
Deep Audio-Visual Learning: A Survey
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
49
157
0
14 Jan 2020
STAViS: Spatio-Temporal AudioVisual Saliency Network
STAViS: Spatio-Temporal AudioVisual Saliency Network
A. Tsiami
Petros Koutras
Petros Maragos
35
73
0
09 Jan 2020
Listen to Look: Action Recognition by Previewing Audio
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
45
251
0
10 Dec 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Humam Alwassel
D. Mahajan
Bruno Korbar
Lorenzo Torresani
Guohao Li
Du Tran
SSL
51
429
0
28 Nov 2019
Self-supervised Moving Vehicle Tracking with Stereo Sound
Self-supervised Moving Vehicle Tracking with Stereo Sound
Chuang Gan
Hang Zhao
Peihao Chen
David D. Cox
Antonio Torralba
21
147
0
25 Oct 2019
The Sound of Motions
The Sound of Motions
Hang Zhao
Chuang Gan
Wei-Chiu Ma
Antonio Torralba
46
252
0
11 Apr 2019
Multi-source weak supervision for saliency detection
Multi-source weak supervision for saliency detection
Yu Zeng
Yunzhi Zhuge
Huchuan Lu
Lulu Zhang
Mingyang Qian
Yizhou Yu
39
169
0
01 Apr 2019
Revisiting Video Saliency: A Large-scale Benchmark and a New Model
Revisiting Video Saliency: A Large-scale Benchmark and a New Model
Wenguan Wang
Jianbing Shen
Fang Guo
Ming-Ming Cheng
Ali Borji
VLM
25
264
0
23 Jan 2018
Objects that Sound
Objects that Sound
Relja Arandjelović
Andrew Zisserman
ObjD
VOS
59
529
0
18 Dec 2017
Multimodal Machine Learning: A Survey and Taxonomy
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
60
2,890
0
26 May 2017
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive
  Model
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model
Marcella Cornia
Lorenzo Baraldi
G. Serra
Rita Cucchiara
55
549
0
29 Nov 2016
Convolutional LSTM Network: A Machine Learning Approach for
  Precipitation Nowcasting
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
440
7,952
0
13 Jun 2015
1