ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.10131
  4. Cited By
Leveraging Topics and Audio Features with Multimodal Attention for Audio
  Visual Scene-Aware Dialog

Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog

20 December 2019
Shachi H. Kumar
Eda Okur
Saurav Sahay
Jonathan Huang
L. Nachman
ArXivPDFHTML

Papers citing "Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog"

3 / 3 papers shown
Title
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results
  for Video Question Answering
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering
Yueqian Wang
Yuxuan Wang
Kai Chen
Dongyan Zhao
33
2
0
08 Jan 2024
VGNMN: Video-grounded Neural Module Network to Video-Grounded Language
  Tasks
VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks
Hung Le
Nancy F. Chen
Guosheng Lin
MLLM
26
19
0
16 Apr 2021
Learning Reasoning Paths over Semantic Graphs for Video-grounded
  Dialogues
Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues
Hung Le
Nancy F. Chen
Guosheng Lin
36
14
0
01 Mar 2021
1