ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.00392
  4. Cited By
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning

Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning

1 March 2020
Shizhe Chen
Yida Zhao
Qin Jin
Qi Wu
ArXivPDFHTML

Papers citing "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning"

13 / 163 papers shown
Title
A Comprehensive Review of the Video-to-Text Problem
A Comprehensive Review of the Video-to-Text Problem
Jesus Perez-Martin
B. Bustos
S. Guimarães
I. Sipiran
Jorge A. Pérez
Grethel Coello Said
13
17
0
27 Mar 2021
On Semantic Similarity in Video Retrieval
On Semantic Similarity in Video Retrieval
Michael Wray
Hazel Doughty
Dima Damen
33
66
0
18 Mar 2021
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual
  Transfer of Vision-Language Models
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models
Po-Yao (Bernie) Huang
Mandela Patrick
Junjie Hu
Graham Neubig
Florian Metze
Alexander G. Hauptmann
MLLM
VLM
24
56
0
16 Mar 2021
A Universal Model for Cross Modality Mapping by Relational Reasoning
A Universal Model for Cross Modality Mapping by Relational Reasoning
Zun Li
Congyan Lang
Liqian Liang
Tao Wang
Songhe Feng
Jun Wu
Yidong Li
22
2
0
26 Feb 2021
Graph Neural Networks: Taxonomy, Advances and Trends
Graph Neural Networks: Taxonomy, Advances and Trends
Yu Zhou
Haixia Zheng
Xin Huang
Shufeng Hao
Dengao Li
Jumin Zhao
AI4TS
27
117
0
16 Dec 2020
SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries
SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries
Xirong Li
Fangming Zhou
Chaoxi Xu
Jiaqi Ji
Gang Yang
14
52
0
24 Nov 2020
Watch and Learn: Mapping Language and Noisy Real-world Videos with
  Self-supervision
Watch and Learn: Mapping Language and Noisy Real-world Videos with Self-supervision
Yujie Zhong
Linhai Xie
Sen Wang
Lucia Specia
Yishu Miao
SSL
11
0
0
19 Nov 2020
COOT: Cooperative Hierarchical Transformer for Video-Text Representation
  Learning
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
Simon Ging
Mohammadreza Zolfaghari
Hamed Pirsiavash
Thomas Brox
ViT
CLIP
31
168
0
01 Nov 2020
Support-set bottlenecks for video-text representation learning
Support-set bottlenecks for video-text representation learning
Mandela Patrick
Po-Yao (Bernie) Huang
Yuki M. Asano
Florian Metze
Alexander G. Hauptmann
João Henriques
Andrea Vedaldi
22
244
0
06 Oct 2020
A Simple Yet Effective Method for Video Temporal Grounding with
  Cross-Modality Attention
A Simple Yet Effective Method for Video Temporal Grounding with Cross-Modality Attention
Binjie Zhang
Yu Li
Chun Yuan
D. Xu
Pin Jiang
Ying Shan
13
5
0
23 Sep 2020
Dual Encoding for Video Retrieval by Text
Dual Encoding for Video Retrieval by Text
Jianfeng Dong
Xirong Li
Chaoxi Xu
Xun Yang
Gang Yang
Xun Wang
Meng Wang
24
2
0
10 Sep 2020
Text-based Localization of Moments in a Video Corpus
Text-based Localization of Moments in a Video Corpus
Sudipta Paul
Niluthpol Chowdhury Mithun
Amit K. Roy-Chowdhury
10
14
0
20 Aug 2020
The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020)
The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020)
Samuel Albanie
Yang Liu
Arsha Nagrani
Antoine Miech
Ernesto Coto
...
Kaixu Cui
Hui Liu
Chen Wang
Yudong Jiang
Xiaoshuai Hao
34
9
0
03 Aug 2020
Previous
1234