Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.03038
Cited By
Dual-Stream Transformer for Generic Event Boundary Captioning
7 July 2022
Xin Gu
Hanhua Ye
Guang Chen
Yufei Wang
Libo Zhang
Longyin Wen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dual-Stream Transformer for Generic Event Boundary Captioning"
4 / 4 papers shown
Title
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding
Xin Gu
Yaojie Shen
Chenxi Luo
Tiejian Luo
Yan Huang
Yuewei Lin
Heng Fan
L. Zhang
73
1
0
16 Feb 2025
Text with Knowledge Graph Augmented Transformer for Video Captioning
Xin Gu
G. Chen
Yufei Wang
Libo Zhang
Tiejian Luo
Longyin Wen
32
47
0
22 Mar 2023
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Florian Metze Luke Zettlemoyer Christoph Feichtenhofer
CLIP
VLM
259
561
0
28 Sep 2021
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
301
39,238
0
01 Sep 2014
1