ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.00650
  4. Cited By
CLIP-It! Language-Guided Video Summarization

CLIP-It! Language-Guided Video Summarization

1 July 2021
Medhini Narasimhan
Anna Rohrbach
Trevor Darrell
    CLIP
ArXivPDFHTML

Papers citing "CLIP-It! Language-Guided Video Summarization"

18 / 68 papers shown
Title
Contrastive Losses Are Natural Criteria for Unsupervised Video
  Summarization
Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization
Zongshang Pang
Yuta Nakashima
Mayu Otani
Hajime Nagahara
25
6
0
18 Nov 2022
Unsupervised Audio-Visual Lecture Segmentation
Unsupervised Audio-Visual Lecture Segmentation
Darshan Singh
Anchit Gupta
C. V. Jawahar
Makarand Tapaswi
VOS
24
4
0
29 Oct 2022
Reconstructing Action-Conditioned Human-Object Interactions Using
  Commonsense Knowledge Priors
Reconstructing Action-Conditioned Human-Object Interactions Using Commonsense Knowledge Priors
Xi Wang
Gengyan Li
Yen-Ling Kuo
Muhammed Kocabas
Emre Aksan
Otmar Hilliges
56
29
0
06 Sep 2022
Visual Subtitle Feature Enhanced Video Outline Generation
Visual Subtitle Feature Enhanced Video Outline Generation
Qi Lv
Ziqiang Cao
Wenrui Xie
Derui Wang
Jingwen Wang
...
Yuan-Fang Li
Min Cao
Wenjie Li
Sujian Li
Guohong Fu
VGen
21
0
0
24 Aug 2022
TL;DW? Summarizing Instructional Videos with Task Relevance &
  Cross-Modal Saliency
TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency
Medhini Narasimhan
Arsha Nagrani
Chen Sun
Michael Rubinstein
Trevor Darrell
Anna Rohrbach
Cordelia Schmid
20
34
0
14 Aug 2022
Multimodal Frame-Scoring Transformer for Video Summarization
Multimodal Frame-Scoring Transformer for Video Summarization
Jeiyoon Park
Kiho Kwoun
Chanhee Lee
Heuiseok Lim
ViT
30
6
0
05 Jul 2022
An Empirical Survey on Long Document Summarization: Datasets, Models and
  Metrics
An Empirical Survey on Long Document Summarization: Datasets, Models and Metrics
Huan Yee Koh
Jiaxin Ju
Ming Liu
Shirui Pan
83
122
0
03 Jul 2022
Backbones-Review: Feature Extraction Networks for Deep Learning and Deep
  Reinforcement Learning Approaches
Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches
O. Elharrouss
Y. Akbari
Noor Almaadeed
S. Al-Maadeed
27
69
0
16 Jun 2022
Multimodal Learning with Transformers: A Survey
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
72
530
0
13 Jun 2022
Language-Bridged Spatial-Temporal Interaction for Referring Video Object
  Segmentation
Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Zihan Ding
Tianrui Hui
Junshi Huang
Xiaoming Wei
Jizhong Han
Si Liu
VOS
33
52
0
08 Jun 2022
Prompt-based Learning for Unpaired Image Captioning
Prompt-based Learning for Unpaired Image Captioning
Peipei Zhu
Tianlin Li
Lin Zhu
Zhenglong Sun
Weishi Zheng
Yaowei Wang
Chia-Ju Chen
VLM
27
31
0
26 May 2022
MHSCNet: A Multimodal Hierarchical Shot-aware Convolutional Network for
  Video Summarization
MHSCNet: A Multimodal Hierarchical Shot-aware Convolutional Network for Video Summarization
Wujiang Xu
Bing Han
jeff little Guo
Shaoshuai Li
Qiongxu Ma
Yunan Zhao
Shengting Guo
Yifei Xu
Junchi Yan
17
8
0
18 Apr 2022
Progressive Video Summarization via Multimodal Self-supervised Learning
Progressive Video Summarization via Multimodal Self-supervised Learning
Haopeng Li
Qiuhong Ke
Mingming Gong
Tom Drummond
AI4TS
39
18
0
07 Jan 2022
A Simple Long-Tailed Recognition Baseline via Vision-Language Model
A Simple Long-Tailed Recognition Baseline via Vision-Language Model
Teli Ma
Shijie Geng
Mengmeng Wang
Jing Shao
Jiasen Lu
Hongsheng Li
Peng Gao
Yu Qiao
VLM
40
46
0
29 Nov 2021
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP
Andreas Fürst
Elisabeth Rumetshofer
Johannes Lehner
Viet-Hung Tran
Fei Tang
...
David P. Kreil
Michael K Kopp
Günter Klambauer
Angela Bitto-Nemling
Sepp Hochreiter
VLM
CLIP
207
102
0
21 Oct 2021
Planning to Chronicle
Planning to Chronicle
Hazhar Rahmani
Dylan A. Shell
J. O’Kane
9
6
0
04 Nov 2020
Multi-modal Transformer for Video Retrieval
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
430
596
0
21 Jul 2020
Query-Focused Extractive Video Summarization
Query-Focused Extractive Video Summarization
Aidean Sharghi
Boqing Gong
M. Shah
74
121
0
18 Jul 2016
Previous
12