ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.13432
  4. Cited By
Hierarchical Object-oriented Spatio-Temporal Reasoning for Video
  Question Answering

Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering

25 June 2021
Long Hoang Dang
T. Le
Vuong Le
T. Tran
ArXivPDFHTML

Papers citing "Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering"

24 / 24 papers shown
Title
Leveraging Static Relationships for Intra-Type and Inter-Type Message Passing in Video Question Answering
Leveraging Static Relationships for Intra-Type and Inter-Type Message Passing in Video Question Answering
Lili Liang
Guanglu Sun
50
0
0
03 Apr 2025
Align and Aggregate: Compositional Reasoning with Video Alignment and
  Answer Aggregation for Video Question-Answering
Align and Aggregate: Compositional Reasoning with Video Alignment and Answer Aggregation for Video Question-Answering
Zhaohe Liao
Jiangtong Li
Li Niu
Liqing Zhang
CoGe
37
3
0
03 Jul 2024
Ranking Distillation for Open-Ended Video Question Answering with
  Insufficient Labels
Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels
Tianming Liang
Chaolei Tan
Beihao Xia
Wei-Shi Zheng
Jianfang Hu
36
1
0
21 Mar 2024
BDIQA: A New Dataset for Video Question Answering to Explore Cognitive
  Reasoning through Theory of Mind
BDIQA: A New Dataset for Video Question Answering to Explore Cognitive Reasoning through Theory of Mind
Yuanyuan Mao
Xin Lin
Qin Ni
Liang He
29
3
0
12 Feb 2024
Glance and Focus: Memory Prompting for Multi-Event Video Question
  Answering
Glance and Focus: Memory Prompting for Multi-Event Video Question Answering
Ziyi Bai
Ruiping Wang
Xilin Chen
97
8
0
03 Jan 2024
Answering from Sure to Uncertain: Uncertainty-Aware Curriculum Learning
  for Video Question Answering
Answering from Sure to Uncertain: Uncertainty-Aware Curriculum Learning for Video Question Answering
Haopeng Li
Qiuhong Ke
Mingming Gong
Tom Drummond
39
1
0
03 Jan 2024
Object-aware Adaptive-Positivity Learning for Audio-Visual Question
  Answering
Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering
Zhangbin Li
Dan Guo
Jinxing Zhou
Jing Zhang
Meng Wang
29
11
0
20 Dec 2023
Visual Commonsense based Heterogeneous Graph Contrastive Learning
Visual Commonsense based Heterogeneous Graph Contrastive Learning
Zongzhao Li
Xiangyu Zhu
Xi Zhang
Zhaoxiang Zhang
Zhen Lei
21
1
0
11 Nov 2023
ATM: Action Temporality Modeling for Video Question Answering
ATM: Action Temporality Modeling for Video Question Answering
Junwen Chen
Jie Zhu
Yu Kong
24
1
0
05 Sep 2023
Redundancy-aware Transformer for Video Question Answering
Redundancy-aware Transformer for Video Question Answering
Yicong Li
Xun Yang
An Zhang
Chun Feng
Xiang Wang
Tat-Seng Chua
17
15
0
07 Aug 2023
Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question
  Answering
Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering
Yi Cheng
Hehe Fan
Dongyun Lin
Ying Sun
Mohan S. Kankanhalli
J. Lim
40
4
0
25 Jul 2023
Discovering Spatio-Temporal Rationales for Video Question Answering
Discovering Spatio-Temporal Rationales for Video Question Answering
Yicong Li
Junbin Xiao
Chun Feng
Xiang Wang
Tat-Seng Chua
25
13
0
22 Jul 2023
Learning Situation Hyper-Graphs for Video Question Answering
Learning Situation Hyper-Graphs for Video Question Answering
Aisha Urooj Khan
Hilde Kuehne
Bo Wu
Kim Chheu
Walid Bousselham
Chuang Gan
N. Lobo
M. Shah
34
15
0
18 Apr 2023
Contrastive Video Question Answering via Video Graph Transformer
Contrastive Video Question Answering via Video Graph Transformer
Junbin Xiao
Pan Zhou
Angela Yao
Yicong Li
Richang Hong
Shuicheng Yan
Tat-Seng Chua
ViT
27
35
0
27 Feb 2023
Locate before Answering: Answer Guided Question Localization for Video
  Question Answering
Locate before Answering: Answer Guided Question Localization for Video Question Answering
Tianwen Qian
Ran Cui
Jingjing Chen
Pai Peng
Xiao-Wei Guo
Yu-Gang Jiang
29
17
0
05 Oct 2022
Equivariant and Invariant Grounding for Video Question Answering
Equivariant and Invariant Grounding for Video Question Answering
Yicong Li
Xiang Wang
Junbin Xiao
Tat-Seng Chua
20
25
0
26 Jul 2022
Video Graph Transformer for Video Question Answering
Video Graph Transformer for Video Question Answering
Junbin Xiao
Pan Zhou
Tat-Seng Chua
Shuicheng Yan
ViT
156
75
0
12 Jul 2022
Video Dialog as Conversation about Objects Living in Space-Time
Video Dialog as Conversation about Objects Living in Space-Time
H. Pham
T. Le
Vuong Le
Tu Minh Phuong
T. Tran
41
11
0
08 Jul 2022
Invariant Grounding for Video Question Answering
Invariant Grounding for Video Question Answering
Yicong Li
Xiang Wang
Junbin Xiao
Wei Ji
Tat-Seng Chua
OOD
15
95
0
06 Jun 2022
Multilevel Hierarchical Network with Multiscale Sampling for Video
  Question Answering
Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering
Min Peng
Chongyang Wang
Yuan Gao
Yu Shi
Xiang-Dong Zhou
29
24
0
09 May 2022
Video Question Answering: Datasets, Algorithms and Challenges
Video Question Answering: Datasets, Algorithms and Challenges
Yaoyao Zhong
Junbin Xiao
Wei Ji
Yicong Li
Wei Deng
Tat-Seng Chua
27
85
0
02 Mar 2022
(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
A. Cherian
Chiori Hori
Tim K. Marks
Jonathan Le Roux
24
35
0
18 Feb 2022
Video as Conditional Graph Hierarchy for Multi-Granular Question
  Answering
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering
Junbin Xiao
Angela Yao
Zhiyuan Liu
Yicong Li
Wei Ji
Tat-Seng Chua
30
111
0
12 Dec 2021
Simple Online and Realtime Tracking with a Deep Association Metric
Simple Online and Realtime Tracking with a Deep Association Metric
N. Wojke
Alex Bewley
Dietrich Paulus
VOT
240
3,465
0
21 Mar 2017
1