Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.11851
Cited By
Transcript to Video: Efficient Clip Sequencing from Texts
25 July 2021
Yu Xiong
Fabian Caba Heilbron
Dahua Lin
CLIP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Transcript to Video: Efficient Clip Sequencing from Texts"
8 / 8 papers shown
Title
Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows
Anyi Rao
Xuekun Jiang
Sichen Wang
Yuwei Guo
Zihao Liu
Bo Dai
Long Pang
Xiaoyu Wu
Dahua Lin
Libiao Jin
21
6
0
17 Oct 2022
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
Mattia Soldan
Alejandro Pardo
Juan Carlos León Alcázar
Fabian Caba Heilbron
Chen Zhao
Silvio Giancola
Guohao Li
VGen
44
95
0
01 Dec 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Florian Metze Luke Zettlemoyer Christoph Feichtenhofer
CLIP
VLM
259
558
0
28 Sep 2021
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
Huaishao Luo
Lei Ji
Ming Zhong
Yang Chen
Wen Lei
Nan Duan
Tianrui Li
CLIP
VLM
317
780
0
18 Apr 2021
Video Representation Learning by Recognizing Temporal Transformations
Simon Jenni
Givi Meishvili
Paolo Favaro
131
133
0
21 Jul 2020
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
424
596
0
21 Jul 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
119
275
0
24 Jan 2020
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
272
31,267
0
16 Jan 2013
1