ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.20693
  4. Cited By
Boosting Audio Visual Question Answering via Key Semantic-Aware Cues

Boosting Audio Visual Question Answering via Key Semantic-Aware Cues

30 July 2024
Guangyao Li
Henghui Du
Di Hu
ArXivPDFHTML

Papers citing "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues"

3 / 3 papers shown
Title
Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
Jinxing Zhou
Dan Guo
Yuxin Mao
Yiran Zhong
Xiaojun Chang
Meng Wang
38
12
0
11 Jul 2024
CM-PIE: Cross-modal perception for interactive-enhanced audio-visual
  video parsing
CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing
Yaru Chen
Ruohao Guo
Xubo Liu
Peipei Wu
Guangyao Li
Zhenbo Li
Wenwu Wang
34
7
0
11 Oct 2023
CATR: Combinatorial-Dependence Audio-Queried Transformer for
  Audio-Visual Video Segmentation
CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation
Kexin Li
Zongxin Yang
Lei Chen
Yezhou Yang
Jun Xiao
VOS
39
51
0
18 Sep 2023
1