ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.11851
  4. Cited By
Transcript to Video: Efficient Clip Sequencing from Texts

Transcript to Video: Efficient Clip Sequencing from Texts

25 July 2021
Yu Xiong
Fabian Caba Heilbron
Dahua Lin
    CLIP
ArXivPDFHTML

Papers citing "Transcript to Video: Efficient Clip Sequencing from Texts"

8 / 8 papers shown
Title
Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows
Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows
Anyi Rao
Xuekun Jiang
Sichen Wang
Yuwei Guo
Zihao Liu
Bo Dai
Long Pang
Xiaoyu Wu
Dahua Lin
Libiao Jin
21
6
0
17 Oct 2022
MAD: A Scalable Dataset for Language Grounding in Videos from Movie
  Audio Descriptions
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
Mattia Soldan
Alejandro Pardo
Juan Carlos León Alcázar
Fabian Caba Heilbron
Chen Zhao
Silvio Giancola
Guohao Li
VGen
44
95
0
01 Dec 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text
  Understanding
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Florian Metze Luke Zettlemoyer Christoph Feichtenhofer
CLIP
VLM
259
558
0
28 Sep 2021
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip
  Retrieval
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
Huaishao Luo
Lei Ji
Ming Zhong
Yang Chen
Wen Lei
Nan Duan
Tianrui Li
CLIP
VLM
317
780
0
18 Apr 2021
Video Representation Learning by Recognizing Temporal Transformations
Video Representation Learning by Recognizing Temporal Transformations
Simon Jenni
Givi Meishvili
Paolo Favaro
131
133
0
21 Jul 2020
Multi-modal Transformer for Video Retrieval
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
424
596
0
21 Jul 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
119
275
0
24 Jan 2020
Efficient Estimation of Word Representations in Vector Space
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
272
31,267
0
16 Jan 2013
1