ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.03131
  4. Cited By
Video Question Answering Using CLIP-Guided Visual-Text Attention
v1v2 (latest)

Video Question Answering Using CLIP-Guided Visual-Text Attention

6 March 2023
Shuhong Ye
Weikai Kong
Chenglin Yao
Jianfeng Ren
Xudong Jiang
ArXiv (abs)PDFHTML

Papers citing "Video Question Answering Using CLIP-Guided Visual-Text Attention"

2 / 2 papers shown
Title
MMRL: Multi-Modal Representation Learning for Vision-Language Models
MMRL: Multi-Modal Representation Learning for Vision-Language Models
Yuncheng Guo
Xiaodong Gu
VLMOffRL
448
3
0
11 Mar 2025
Variational Information Pursuit with Large Language and Multimodal
  Models for Interpretable Predictions
Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions
Kwan Ho Ryan Chan
Aditya Chattopadhyay
B. Haeffele
René Vidal
57
0
0
24 Aug 2023
1