ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.14267
  4. Cited By
Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction

Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction

19 April 2025
Li Yu
Xuanzhe Sun
Wei Zhou
Moncef Gabbouj
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction"

9 / 9 papers shown
Title
Few-shot Learner Parameterization by Diffusion Time-steps
Few-shot Learner Parameterization by Diffusion Time-steps
Zhongqi Yue
Pan Zhou
Richang Hong
Hanwang Zhang
Qianru Sun
103
12
0
05 Mar 2024
CASP-Net: Rethinking Video Saliency Prediction from an
  Audio-VisualConsistency Perceptual Perspective
CASP-Net: Rethinking Video Saliency Prediction from an Audio-VisualConsistency Perceptual Perspective
Jun Xiong
Gang Wang
Peng Zhang
Wei Huang
Yufei Zha
Guangtao Zhai
50
14
0
11 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLMMLLM
432
4,656
0
30 Jan 2023
Temporal-Spatial Feature Pyramid for Video Saliency Detection
Temporal-Spatial Feature Pyramid for Video Saliency Detection
Qinyao Chang
Shiping Zhu
78
27
0
10 May 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
1.0K
29,926
0
26 Feb 2021
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
770
18,408
0
19 Jun 2020
STAViS: Spatio-Temporal AudioVisual Saliency Network
STAViS: Spatio-Temporal AudioVisual Saliency Network
A. Tsiami
Petros Koutras
Petros Maragos
93
73
0
09 Jan 2020
Revisiting Video Saliency: A Large-scale Benchmark and a New Model
Revisiting Video Saliency: A Large-scale Benchmark and a New Model
Wenguan Wang
Jianbing Shen
Fang Guo
Ming-Ming Cheng
Ali Borji
VLM
55
266
0
23 Jan 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in
  Video Classification
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
155
1,333
0
13 Dec 2017
1